I studied that in the stage 2 RL, the data source name is codeio-forward-incomplete-depth2. But for your default setting the mp reward class would raise not impemented error for this data source name. Is there anything that I should do before running stage 2 RL to avoid this happen?