
9 Commits

Author SHA1 Message Date
chenxiaodong ecae092534 Random training matching llm_action 2024-08-27 10:20:32 +08:00
chenxiaodong 012e4e0fd4 8.22 2024-08-22 10:06:26 +08:00
chenxiaodong 993e062068 Update environment and agents for improved performance
Refactors the environment and agent modules to improve performance and readability:
- Standardizing the order of arguments in solar step function calls.
- Introducing a percentage-based action voltage adjustment in the module
  step function to enhance control flexibility.
- Updating PPO and related files to use Generalized Advantage Estimation
  (GAE) by default, improving advantage estimation and training stability.
2024-07-09 10:49:26 +08:00
chenxiaodong 69fe33deec nothing 2024-06-20 09:11:09 +08:00
chenxiaodong 63f145467b nothing 2024-06-19 16:01:00 +08:00
chenxiaodong 88bbddbb7f nothing 2024-06-19 15:55:41 +08:00
chenxiaodong 25a5da0e00 test 2024-06-18 14:54:23 +08:00
chenxiaodong 488a893354 test 2024-06-18 11:24:43 +08:00
chenxiaodong 5fd1ac5e14 'init' 2024-06-18 10:49:43 +08:00