Fortrain/qw/open_r1
2025-03-31 15:56:36 +08:00
..
__pycache__ qw和gemma3 grpo 2025-03-31 15:56:36 +08:00
trainer qw和gemma3 grpo 2025-03-31 15:56:36 +08:00
utils qw和gemma3 grpo 2025-03-31 15:56:36 +08:00
vlm_modules qw和gemma3 grpo 2025-03-31 15:56:36 +08:00
__init__.py qw和gemma3 grpo 2025-03-31 15:56:36 +08:00
configs.py qw和gemma3 grpo 2025-03-31 15:56:36 +08:00
evaluate.py qw和gemma3 grpo 2025-03-31 15:56:36 +08:00
generate.py qw和gemma3 grpo 2025-03-31 15:56:36 +08:00
grpo_jsonl.py qw和gemma3 grpo 2025-03-31 15:56:36 +08:00
grpo_rec.py qw和gemma3 grpo 2025-03-31 15:56:36 +08:00
grpo.py qw和gemma3 grpo 2025-03-31 15:56:36 +08:00
sft.py qw和gemma3 grpo 2025-03-31 15:56:36 +08:00