Questions about the reward model and critique model #2

@liumingzhu6060

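Hello, I looked at the datasets and they are all in English. Can a reward model and critique model trained on English data be used for Chinese? Also, will a Chinese RLHF dataset be open-sourced in the future?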
