zhentao tan

tzt

4 2 23

tzt101

AI & ML interests

Computer Vision

Recent Activity

submitted a paper 12 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

authored a paper 12 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

upvoted a paper 12 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

View all activity

Organizations

None yet

submitted a paper to Daily Papers 12 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

Paper • 2606.20097 • Published 17 days ago • 18

authored a paper 12 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

Paper • 2606.20097 • Published 17 days ago • 18

upvoted a paper 12 days ago

HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

Paper • 2606.20097 • Published 17 days ago • 18

liked a model 3 months ago

ATH-MaaS/Marco-Mini-Instruct

Text Generation • 17B • Updated Apr 10 • 907 • 47

liked a dataset 4 months ago

vaishali/spider-tableQA

Viewer • Updated Feb 21, 2024 • 7.7k • 49 • 11

upvoted a collection 6 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 23 days ago • 176

New activity in allenai/OLMoE-1B-7B-0125-Instruct 8 months ago

Tokenizer Question

#5 opened 8 months ago by

tzt

liked a dataset 9 months ago

allenai/SciRIFF-train-mix

Viewer • Updated Jun 13, 2024 • 70.7k • 47 • 10

liked a model over 1 year ago

aaghaazkhan/Qwen2.5-3B-law-instruct

Text Generation • Updated Nov 17, 2025 • 7 • 2

liked 4 datasets over 1 year ago

updated a collection over 1 year ago

LLMs reasoning

Collection

2 items • Updated Mar 27, 2025

liked 2 models over 1 year ago

allenai/Llama-3.1-Tulu-3.1-8B

Text Generation • 8B • Updated Feb 10, 2025 • 676 • • 39

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 541k • 1.61k

liked 2 datasets over 1 year ago

instruction-pretrain/medicine-instruction-augmented-corpora

Preview • Updated Mar 2 • 271 • 13

casinca/PUBMED_title_abstracts_2019_baseline

Viewer • Updated May 17, 2024 • 3.68M • 164 • 9

liked a model over 1 year ago

m-a-p/FineFineWeb-bert

Updated Dec 19, 2024 • 6

liked a dataset over 1 year ago

datajuicer/the-pile-pubmed-central-refined-by-data-juicer

Viewer • Updated Oct 23, 2023 • 100 • 12 • 2

zhentao tan

AI & ML interests

Recent Activity

Organizations

tzt's activity

Tokenizer Question