Solo

Model Details

Base Model: Qwen/Qwen3-0.6B
Method: LoRA (PEFT)
Parameters: 0.6B
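
As a rough illustration of what the LoRA method means for parameter count, the sketch below compares a frozen dense weight with the two low-rank factors trained in its place. It uses the rank and alpha listed on this card (r = 4, alpha = 4) and the standard LoRA scaling of alpha / r. The 1024 x 1024 layer shape and all function names are hypothetical, chosen only for the example; they are not taken from the base model's configuration or from the PEFT API.

```python
# Minimal LoRA sketch in plain Python (no PEFT dependency).
# The layer shape below is hypothetical and chosen only for illustration.


def lora_trainable_params(d_in: int, d_out: int, r: int) -> int:
    """Parameters in the two low-rank factors A (r x d_in) and B (d_out x r)."""
    return r * d_in + d_out * r


def full_params(d_in: int, d_out: int) -> int:
    """Parameters in the frozen dense weight W (d_out x d_in)."""
    return d_in * d_out


def lora_scaling(lora_alpha: float, r: int) -> float:
    """Standard LoRA scaling of the update: W + (lora_alpha / r) * B @ A."""
    return lora_alpha / r


# Hypothetical 1024 x 1024 projection, with r and alpha as listed on this card.
D_IN, D_OUT, R, ALPHA = 1024, 1024, 4, 4

adapter = lora_trainable_params(D_IN, D_OUT, R)
dense = full_params(D_IN, D_OUT)
scaling = lora_scaling(ALPHA, R)

print(f"adapter params: {adapter}")
print(f"dense params:   {dense}")
print(f"ratio:          {adapter / dense:.4%}")
print(f"scaling:        {scaling}")
```

With equal rank and alpha the scaling factor is 1, so the low-rank update is added to the frozen weight unscaled.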

Training Hyperparameters

Epochs: 1
Max Steps: 100
Batch Size: 4
Gradient Accumulation: 4
Learning Rate: 0.0002
LoRA r: 4
LoRA Alpha: 4
Max Sequence Length: 2048
Training Duration: 8m 49s
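
The hyperparameters above imply a few derived quantities, worked out in the short sketch below. It assumes a single training device (no data-parallel multiplier) and that Max Steps counts optimizer steps, neither of which the card states. The token figure is an upper bound, since it treats every sequence as filling the full 2048-token limit. The variable names are illustrative only.

```python
# Derived quantities from the hyperparameters listed on this card.
# Assumptions (not stated on the card): one device, and "Max Steps"
# counts optimizer steps rather than micro-batches.

BATCH_SIZE = 4
GRAD_ACCUM = 4
MAX_STEPS = 100
MAX_SEQ_LEN = 2048
DURATION_S = 8 * 60 + 49  # "8m 49s" expressed in seconds

# Sequences contributing to each optimizer step.
effective_batch = BATCH_SIZE * GRAD_ACCUM

# Sequences processed if training runs for the full step budget.
sequences_seen = effective_batch * MAX_STEPS

# Upper bound on tokens processed: every sequence at maximum length.
max_tokens = sequences_seen * MAX_SEQ_LEN

# Average wall-clock time per optimizer step.
seconds_per_step = DURATION_S / MAX_STEPS

print(f"effective batch size: {effective_batch}")
print(f"sequences seen:       {sequences_seen}")
print(f"token upper bound:    {max_tokens}")
print(f"seconds per step:     {seconds_per_step:.2f}")
```

If the single epoch over the dataset ends before step 100, the actual number of sequences seen would be lower than this estimate.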

Dataset

GetSoloTech/Code-Reasoning


Trained with Solo

Safetensors
Model size: 0.6B params
Tensor type: BF16

Model tree for zeeshaan-ai/GetSoloTech

Adapter of Qwen/Qwen3-0.6B
