DPO on Pythia-2.8B
Stanford UniversityCZ Biohub NetworkLanguage modeling/generationQuestion answering
DPO on Pythia-2.8B is language modeling/generation model published by Stanford University,CZ Biohub Network in 2023 featuring 2800000000.0 parameters.
About DPO on Pythia-2.8B
While large-scale unsupervised language models (LMs) learn broad world knowledge and some reasoning skills, achieving precise control of their behavior is difficult due to the completely unsupervised nature of their training. Existing methods for gai
Details
- Provider
- Stanford University,CZ Biohub Network
- Task
- Language modeling/generation,Question answering
- Parameters
- 2800000000.0
- Released
- 2023-05-29
- Open weights
- No