{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":563755409,"defaultBranch":"main","name":"RLProject","ownerLogin":"Guosy0506","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2022-11-09T09:11:16.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/86408789?v=4","public":true,"private":false,"isOrgOwned":false},"refInfo":{"name":"","listCacheKey":"v0:1667987834.525144","currentOid":""},"activityList":{"items":[{"before":"da82a62478ec26ad83498671ff76821044727230","after":"bfdebaede743d84978698ba29fa59ea81f528885","ref":"refs/heads/main","pushedAt":"2024-03-18T07:55:01.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"debug","shortMessageHtmlLink":"debug"}},{"before":"c3d6701d1290842024504d6753ef5c7e1751a1d6","after":"da82a62478ec26ad83498671ff76821044727230","ref":"refs/heads/main","pushedAt":"2024-03-18T06:30:58.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"Adjust evaluation frequency, total steps, and sample pool size,\nadd a progress indicator text box","shortMessageHtmlLink":"Adjust evaluation frequency, total steps, and sample pool size,"}},{"before":"4b30b7eb35ca4d4fdd421241ced1cbc872738fa2","after":"c3d6701d1290842024504d6753ef5c7e1751a1d6","ref":"refs/heads/main","pushedAt":"2024-03-16T11:44:23.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"Adjust: add speed reward","shortMessageHtmlLink":"Adjust: add speed reward"}},{"before":"57e8d69d65c38be22451ef765da8f5986bf23bda","after":"4b30b7eb35ca4d4fdd421241ced1cbc872738fa2","ref":"refs/heads/main","pushedAt":"2024-03-16T11:31:10.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0
506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"Adjust step settings: max training steps 1e6, evaluation frequency 2.5e4, experience pool size 1e5, replay pool size 5e4\nAdjust: end the episode when the score reaches 5000\nAdjust: 1/5 of the experience pool always retains the initial random samples.","shortMessageHtmlLink":"Adjust step settings: max training steps 1e6, evaluation frequency 2.5e4, experience pool size 1e5, replay pool size 5e4"}},{"before":"1bcc2f20e28be91a0df01b35a87d0161a529b8f4","after":"57e8d69d65c38be22451ef765da8f5986bf23bda","ref":"refs/heads/main","pushedAt":"2024-03-14T12:43:14.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"Fix reward function (no per-step penalty; reward for collecting a tile is multiplied by 10)\naction repeat 8->2\nTermination condition changed to an average penalty of more than 10 points over 50 steps\nEntering grass costs 200 points per step\nAdded a check on the step count","shortMessageHtmlLink":"Fix reward function (no per-step penalty; reward for collecting a tile is multiplied by 10)"}},{"before":"71dfe838a568941b188e705a05325fbfd44db41d","after":"1bcc2f20e28be91a0df01b35a87d0161a529b8f4","ref":"refs/heads/main","pushedAt":"2024-03-14T12:21:33.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"debug","shortMessageHtmlLink":"debug"}},{"before":"f9f4bd2e36c4b7f0b4dbc0beb3083a7c40231225","after":"71dfe838a568941b188e705a05325fbfd44db41d","ref":"refs/heads/main","pushedAt":"2024-03-12T10:59:24.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"debug","shortMessageHtmlLink":"debug"}},{"before":"d63cb4d76903b027faa569b14537a42f738010b4","after":"f9f4bd2e36c4b7f0b4dbc0beb3083a7c40231225","ref":"refs/heads/main","pushedAt":"2024-03-08T13:55:47.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"debug","shortMessageHtmlLink":"debug"}},{"before":
"e78b62dc5abaf25bcf78b413f942db3d9950b07d","after":"d63cb4d76903b027faa569b14537a42f738010b4","ref":"refs/heads/main","pushedAt":"2024-03-08T10:51:01.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"Print the working directory on each run, change all path-related code to use relative paths, and tune the sampling steps, evaluation steps, and total training steps","shortMessageHtmlLink":"Print the working directory on each run, change all path-related code to use relative paths, and tune the sampling steps, evaluation steps, and total training steps"}},{"before":"64e653d3492ea2419bbfb5c757eab58d921adc10","after":"e78b62dc5abaf25bcf78b413f942db3d9950b07d","ref":"refs/heads/main","pushedAt":"2024-03-07T14:18:42.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"debug","shortMessageHtmlLink":"debug"}},{"before":"4db0539250f0a4e16f1d16407687c64be955c753","after":"64e653d3492ea2419bbfb5c757eab58d921adc10","ref":"refs/heads/main","pushedAt":"2024-03-07T13:08:04.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"Fix the data_train path\nReduce total training iterations 3e6->1e5\nAdd a grass penalty of -10","shortMessageHtmlLink":"Fix the data_train path"}},
{"before":"46de4346ceeec51bc878bd2205535674c1518769","after":"4db0539250f0a4e16f1d16407687c64be955c753","ref":"refs/heads/main","pushedAt":"2024-03-05T13:09:46.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"Path debug\nDebug attempt at backpropagating gradients multiple times","shortMessageHtmlLink":"Path debug"}},{"before":"00650dc05d990b1688b06a94b2877a7f2576f23a","after":"46de4346ceeec51bc878bd2205535674c1518769","ref":"refs/heads/main","pushedAt":"2024-03-05T11:05:48.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"Rename some variables,\nthe bug was that the image input was not preprocessed by the convolutional network before the critic network was called at line 252; now fixed.","shortMessageHtmlLink":"Rename some variables,"}},{"before":"48f107cc0ebbe492c29a6fdcda3fd1cd65658b1c","after":"00650dc05d990b1688b06a94b2877a7f2576f23a","ref":"refs/heads/main","pushedAt":"2024-03-02T16:01:41.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"debug","shortMessageHtmlLink":"debug"}},{"before":"d83c3fc5434483fbaf651b3895d732e1cfeb24fd","after":"48f107cc0ebbe492c29a6fdcda3fd1cd65658b1c","ref":"refs/heads/main","pushedAt":"2024-03-02T15:44:55.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"Create new directory SAC_baseline and adjust the code as follows:\n1. Remove the speed reward and grass penalty terms from env\n2. Shrink the convolutional stack from 6 to 4 layers, with a 64-dimensional output vector\n3. Action repeat per output 8->4 steps\n4. Keep the convolutional layers in the actor network, pass the convolved s to the critic network via _,_,s_cnn=actor(s), and remove the corresponding parts from the critic network.\n5. Since it is unclear whether the optim steps interfere with each other, remove the code related to Freeze critic networks parameters (268-270)","shortMessageHtmlLink":"Create new directory SAC_baseline and adjust the code as follows:"}},
{"before":"825fbddb9af90a47b9fc4c7c6504ae7d22b2f0b6","after":"d83c3fc5434483fbaf651b3895d732e1cfeb24fd","ref":"refs/heads/main","pushedAt":"2024-03-02T08:28:36.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"Guosy0506","name":null,"path":"/Guosy0506","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/86408789?s=80&v=4"},"commit":{"message":"'Version alignment'","shortMessageHtmlLink":"'Version alignment'"}}],"hasNextPage":false,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEGENXTQA","startCursor":null,"endCursor":null}},"title":"Activity · Guosy0506/RLProject"}