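# Import GeneralPreferenceRewardTrainer from the sibling module so it is available at this package level.
from .rm_trainer_general_preference import GeneralPreferenceRewardTrainer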