What's the hyperparameter setting to obtain Windy0822/ultrainteract_math_rollout train dataset?

Thank you for your impressive work! However, I'd like to ask is there code available to generate the training data Windy0822/ultrainteract_math_rollout? I am interested in the followings:

- What are the generation hyperparameters (e.g. temperature, top_p, max_new_tokens ...)? What's the instruction (like "reason step by step") given to the model?
- How to split the steps? Is there code or explicit rules available?
- How to evaluate on whether a reasoning path is correct or not? Is it the same as your prior work https://github.com/OpenBMB/Eurus/blob/main/eval/Math/math/evaluate_math_cot.py ?

Thank you and wish you all the best!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

What's the hyperparameter setting to obtain Windy0822/ultrainteract_math_rollout train dataset? #15

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

What's the hyperparameter setting to obtain Windy0822/ultrainteract_math_rollout train dataset? #15

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions