training performance of intrinsic module different from mlagents icm module #11

mchivuku · 2023-07-13T16:51:26Z

Hello,

I used the example code provided: https://github.com/yuanmingqi/rl-exploration-baselines/blob/main/examples/ppo_re3_bullet.py
to compute intrinsic rewards using ICM module. I found the results were different from what I have got from running mlagents icm module. This is against the custom unity game wrapped in gym wrapper. Can you please let me know what could be different that I must be missing? Thank you so much for the help. I can also share the code.

yuanmingqi · 2024-02-29T22:01:07Z

Hello! We've published a big update that provides more reasonable implementations of these intrinsic rewrads.

Now you may get the same performance of the ICM.

If you have any other questions, please don't hesitate to ask here.

@mchivuku

mchivuku · 2024-02-29T22:55:16Z

Thank you. I will work on taking the update and applying it on my problem.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

training performance of intrinsic module different from mlagents icm module #11

training performance of intrinsic module different from mlagents icm module #11

mchivuku commented Jul 13, 2023

yuanmingqi commented Feb 29, 2024

mchivuku commented Feb 29, 2024

training performance of intrinsic module different from mlagents icm module #11

training performance of intrinsic module different from mlagents icm module #11

Comments

mchivuku commented Jul 13, 2023

yuanmingqi commented Feb 29, 2024

mchivuku commented Feb 29, 2024