I want to load an additional reward model (such as CLIP or a smaller-scale model) to calculate rewards for the response. How should I proceed?
I want to load an additional reward model (such as CLIP or a smaller-scale model) to calculate rewards for the response. How should I proceed?