Collection of models from the third LLM course homework. It containes three LLMs fine-tuned using LoRA, QLoRA, and DoRA.
Sergey Pankevich
spankevich
AI & ML interests
None yet
Organizations
None yet
models 9
spankevich/output
1.26M • Updated
• 1
spankevich/llm-course-hw3-tinyllamma-qlora
Updated
spankevich/llm-course-hw3-dora
Text Generation • 0.3B • Updated
• 1
spankevich/llm-course-hw3-lora
Text Generation • 0.3B • Updated
• 1
spankevich/llm-hw-2-ppo
Text Generation • 0.1B • Updated
• 1
spankevich/trainer_output
Text Classification • 0.1B • Updated
spankevich/llm-hw-2-dpo
Text Generation • 0.1B • Updated
spankevich/llm-hw-2
Updated
spankevich/llm-course-hw1
Text Generation • Updated
datasets 0
None public yet