Max

ZhMax

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 25 days ago

Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models

Paper • 2604.02340 • Published Apr 11 • 9

upvoted a paper 3 months ago

ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression

Paper • 2602.11008 • Published Feb 11 • 18

updated a dataset 4 months ago

ZhMax/osworld_cache

Updated Jan 26 • 8

published a dataset 4 months ago

ZhMax/osworld_cache

Updated Jan 26 • 8

upvoted an article 7 months ago

Smol2Operator: Post-Training GUI Agents for Computer Use

Article • A-Mahla, merve, sergiopaniego, reach-vb, lewtun • Sep 23, 2025 • 138

liked 2 datasets 11 months ago

OpenCoder-LLM/opc-sft-stage2

Viewer • Updated Nov 24, 2024 • 436k • 2.12k • 103

nvidia/OpenCodeGeneticInstruct

Viewer • Updated May 23, 2025 • 15.1M • 578 • 20

upvoted a paper 12 months ago

Risk-Averse Reinforcement Learning with Itakura-Saito Loss

Paper • 2505.16925 • Published May 22, 2025 • 26

updated 2 models over 1 year ago

ZhMax/llama-2-13b-ebft-sparsegpt-outlier-wiki-block-outlier

Text Generation • 13B • Updated Dec 19, 2024 • 2

ZhMax/llama-2-7b-ebft-sparsegpt-outlier-wiki-block-outlier

Text Generation • 7B • Updated Dec 15, 2024 • 3

updated 4 datasets over 1 year ago

upvoted a paper over 1 year ago

GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs

Paper • 2408.15300 • Published Aug 27, 2024 • 3

updated a model almost 2 years ago

ZhMax/Llama-2-7B-admm-50pr-quik-8bit

Text Generation • 7B • Updated Aug 8, 2024 • 8

updated a dataset almost 2 years ago

ZhMax/commonsense_170k_sft

Viewer • Updated Aug 3, 2024 • 170k • 11

liked a model almost 2 years ago

apple/DCLM-7B

7B • Updated Jul 26, 2024 • 214 • 834

updated 2 models almost 2 years ago

ZhMax/Llama-3-8B-quikoutliers-random11

Text Generation • 8B • Updated Jun 24, 2024

ZhMax/Llama-3-8B-quikoutliers-random10

Text Generation • 8B • Updated Jun 24, 2024
