Skip to content

argentos-fine-tuning-with-trl

Fine-tune LLMs using reinforcement learning with TRL - SFT for

Repository Source folder

Details

Path
skills/hermes/mlops/training/trl-fine-tuning/SKILL.md
License
MIT
Dependencies
4

FAQ