-
smcleish/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-3
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-2
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data
Updated
Sean McLeish PRO
smcleish
AI & ML interests
None yet
Recent Activity
updated
a model about 4 hours ago
smcleish/0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov256 updated
a collection
about 11 hours ago
compression published
a model about 11 hours ago
smcleish/0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov256 Organizations
compression
-
smcleish/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-3
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-2
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data
Updated
Diff Datasets
Datasets containing github diffs
models 57
smcleish/0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov256
Updated
smcleish/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-3
Updated
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-2
Updated
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data
Updated
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-attn-mlp-ov256-chunksize-8
Updated
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-attn-mlp-ov256
Updated
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-300-chkpt-step-100
Text Generation • 2B • Updated
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-300-chkpt-step-200
Text Generation • 2B • Updated
• 1
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-300-chkpt-step-300
Text Generation • 2B • Updated
• 3