Commit Graph

21 Commits

Author SHA1 Message Date
chenxiaodong ecae092534 随机训练匹配llm_action 2024-08-27 10:20:32 +08:00
chenxiaodong 012e4e0fd4 8.22 2024-08-22 10:06:26 +08:00
chenxiaodong 94f447c711 meeting 2024-07-30 14:11:23 +08:00
chenxiaodong 7c186de43d meeting 2024-07-30 13:31:13 +08:00
chenxiaodong 16eccf95b6 meeting 2024-07-30 11:33:07 +08:00
chenxiaodong 9056fbdc79 meeting 2024-07-30 10:38:39 +08:00
chenxiaodong 23c1ad592d meeting 2024-07-30 09:05:32 +08:00
chenxiaodong 993e062068 update environment and agents for improved performance
refactors the code for the environment and agent modules to optimize performance and readability.
- Standardizing the order of arguments in solar step function calls.
- Introducing a percentage-based action voltage adjustment in the module
  step function to enhance control flexibility.
- Updating PPO and related files to use Generalized Advantage Estimation
  (GAE) by default, improving reward shaping and stability.
2024-07-09 10:49:26 +08:00
chenxiaodong 741fba6cd5 update logic 2024-07-05 15:39:57 +08:00
chenxiaodong f26024c8d4 train 2024-06-26 15:44:30 +08:00
chenxiaodong f66de9fb54 nothing 2024-06-25 15:08:50 +08:00
chenxiaodong 9d0b220b54 plot 2024-06-25 14:07:04 +08:00
chenxiaodong d7feebab50 llm 2024-06-25 11:02:53 +08:00
chenxiaodong 2c672d9569 llm 2024-06-25 09:11:30 +08:00
chenxiaodong 21be0966dc nothing 2024-06-20 09:19:22 +08:00
chenxiaodong b5a1842147 nothing 2024-06-20 09:00:38 +08:00
chenxiaodong 63f145467b nothing 2024-06-19 16:01:00 +08:00
chenxiaodong 53d3ac9ca8 edit the layer normalization 2024-06-19 14:36:11 +08:00
chenxiaodong 25a5da0e00 test 2024-06-18 14:54:23 +08:00
chenxiaodong 488a893354 test 2024-06-18 11:24:43 +08:00
chenxiaodong 5fd1ac5e14 'init' 2024-06-18 10:49:43 +08:00