Commit Graph

14 Commits

Author SHA1 Message Date
chenxiaodong 993e062068 update environment and agents for improved performance
Refactors the environment and agent modules to improve performance and readability:
- Standardizes the order of arguments in solar step function calls.
- Introduces a percentage-based action voltage adjustment in the module
  step function for more flexible control.
- Updates PPO and related files to use Generalized Advantage Estimation
  (GAE) by default, for lower-variance advantage estimates and more stable training.
2024-07-09 10:49:26 +08:00
chenxiaodong 741fba6cd5 update logic 2024-07-05 15:39:57 +08:00
chenxiaodong f26024c8d4 train 2024-06-26 15:44:30 +08:00
chenxiaodong f66de9fb54 nothing 2024-06-25 15:08:50 +08:00
chenxiaodong 9d0b220b54 plot 2024-06-25 14:07:04 +08:00
chenxiaodong d7feebab50 llm 2024-06-25 11:02:53 +08:00
chenxiaodong 2c672d9569 llm 2024-06-25 09:11:30 +08:00
chenxiaodong 21be0966dc nothing 2024-06-20 09:19:22 +08:00
chenxiaodong b5a1842147 nothing 2024-06-20 09:00:38 +08:00
chenxiaodong 63f145467b nothing 2024-06-19 16:01:00 +08:00
chenxiaodong 53d3ac9ca8 edit the layer normalization 2024-06-19 14:36:11 +08:00
chenxiaodong 25a5da0e00 test 2024-06-18 14:54:23 +08:00
chenxiaodong 488a893354 test 2024-06-18 11:24:43 +08:00
chenxiaodong 5fd1ac5e14 'init' 2024-06-18 10:49:43 +08:00
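
The most recent commit (993e062068) mentions switching PPO to Generalized Advantage Estimation by default. The repository's own implementation is not shown in this log, so the following is only a minimal sketch of the standard GAE recursion. The function name `compute_gae`, its parameters, and the default values of `gamma` and `lam` are assumptions for illustration, not names taken from the repository.

```python
from typing import List, Sequence, Tuple


def compute_gae(
    rewards: Sequence[float],
    values: Sequence[float],
    dones: Sequence[bool],
    last_value: float,
    gamma: float = 0.99,  # discount factor (assumed default)
    lam: float = 0.95,    # GAE lambda (assumed default)
) -> Tuple[List[float], List[float]]:
    """Return (advantages, returns) for one rollout.

    values[t] is the critic's estimate for the state at step t;
    last_value is the estimate for the state after the final step.
    dones[t] is True when the episode ended at step t.
    """
    advantages = [0.0] * len(rewards)
    gae = 0.0
    next_value = last_value
    # Walk the rollout backwards, accumulating discounted TD errors.
    for t in reversed(range(len(rewards))):
        not_done = 0.0 if dones[t] else 1.0
        # One-step TD error; bootstrapping is cut at episode ends.
        delta = rewards[t] + gamma * next_value * not_done - values[t]
        gae = delta + gamma * lam * not_done * gae
        advantages[t] = gae
        next_value = values[t]
    # Value targets for the critic are advantage plus baseline.
    returns = [a + v for a, v in zip(advantages, values)]
    return advantages, returns
```

With `lam=0` this reduces to the one-step TD error, and with `lam=1` it becomes the discounted Monte Carlo return minus the value baseline. Intermediate values trade bias against variance.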