Commit Graph

4 Commits

Author SHA1 Message Date
chenxiaodong ecae092534 Random training to match llm_action 2024-08-27 10:20:32 +08:00
chenxiaodong 012e4e0fd4 8.22 2024-08-22 10:06:26 +08:00
chenxiaodong 993e062068 Update environment and agents for improved performance
Refactors the environment and agent modules to improve performance and readability.
- Standardizing the order of arguments in solar step function calls.
- Introducing a percentage-based action voltage adjustment in the module
  step function to enhance control flexibility.
- Updating PPO and related files to use Generalized Advantage Estimation
  (GAE) by default, improving reward shaping and stability.
2024-07-09 10:49:26 +08:00
chenxiaodong 741fba6cd5 update logic 2024-07-05 15:39:57 +08:00
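
Commit 993e062068 mentions switching PPO to Generalized Advantage Estimation (GAE) by default. The repository's actual implementation is not shown on this page, so the following is only a minimal sketch of the standard GAE recursion. The function name `compute_gae`, its parameters, and the default values of `gamma` and `lam` are assumptions made for illustration, not identifiers from the repository.

```python
from typing import List, Sequence


def compute_gae(
    rewards: Sequence[float],
    values: Sequence[float],
    dones: Sequence[bool],
    gamma: float = 0.99,  # discount factor (assumed default)
    lam: float = 0.95,    # GAE lambda (assumed default)
) -> List[float]:
    """Return GAE advantages for one rollout.

    `values` must hold len(rewards) + 1 entries; the last entry is the
    bootstrap value of the state reached after the final step.
    """
    if len(values) != len(rewards) + 1:
        raise ValueError("values needs one more entry than rewards")

    advantages = [0.0] * len(rewards)
    gae = 0.0
    # Walk the rollout backwards so each step can reuse the next step's estimate.
    for t in reversed(range(len(rewards))):
        not_done = 0.0 if dones[t] else 1.0
        # One-step TD error; the bootstrap term is dropped at episode ends.
        delta = rewards[t] + gamma * values[t + 1] * not_done - values[t]
        gae = delta + gamma * lam * not_done * gae
        advantages[t] = gae
    return advantages
```

With `lam=1.0` this reduces to the discounted Monte Carlo return minus the value baseline, and with `lam=0.0` to the one-step TD error; values in between trade bias against variance.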