extrinsic and intrinsic combination

Hello, I am trying to implement ICM in PPO with both extrinsic and intrinsic combination. I have seen in few repos where they weight out an extrinsic reward more than intrinsic i.e.   `combine_reward = (1-int_coef) * rewards + int_coef * intrinsic_reward` where` int_coeff =  0.01 `which reduces the effect of intrinsic rewards significantly. Seeing your paper, you have nowhere mentioned this sort of equation for both the rewards. I wonder if you can tell me that the equation mentioned above can be implemented for a dual reward setting.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

extrinsic and intrinsic combination #44

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

extrinsic and intrinsic combination #44

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions