Commit Graph

7 Commits

Author SHA1 Message Date
chenxiaodong 993e062068 update environment and agents for improved performance
refactors the code for the environment and agent modules to optimize performance and readability.
- Standardizing the order of arguments in solar step function calls.
- Introducing a percentage-based action voltage adjustment in the module
  step function to enhance control flexibility.
- Updating PPO and related files to use Generalized Advantage Estimation
  (GAE) by default, improving reward shaping and stability.
2024-07-09 10:49:26 +08:00
chenxiaodong 82ccfa7076 update logic 2024-07-05 15:40:21 +08:00
chenxiaodong f26024c8d4 train 2024-06-26 15:44:30 +08:00
chenxiaodong d7feebab50 llm 2024-06-25 11:02:53 +08:00
chenxiaodong 2c672d9569 llm 2024-06-25 09:11:30 +08:00
chenxiaodong 2806fc645e llm 2024-06-24 16:30:47 +08:00
chenxiaodong d990687099 llm 2024-06-24 16:29:27 +08:00