Collection of models from the third LLM course homework. It containes three LLMs fine-tuned using LoRA, QLoRA, and DoRA.
Sergey Pankevich
spankevich
AI & ML interests
None yet
Organizations
None yet
models
9
spankevich/output
1.26M
•
Updated
•
3
spankevich/llm-course-hw3-tinyllamma-qlora
Updated
spankevich/llm-course-hw3-dora
Text Generation
•
0.3B
•
Updated
•
1
spankevich/llm-course-hw3-lora
Text Generation
•
0.3B
•
Updated
•
5
spankevich/llm-hw-2-ppo
Text Generation
•
0.1B
•
Updated
•
3
spankevich/trainer_output
Text Classification
•
0.1B
•
Updated
•
8
spankevich/llm-hw-2-dpo
Text Generation
•
0.1B
•
Updated
•
3
spankevich/llm-hw-2
Updated
spankevich/llm-course-hw1
Text Generation
•
Updated
•
7
datasets
0
None public yet