Xuan Yang's picture

Xuan Yang

TorresYang

·

https://torresyangx.github.io/

AI & ML interests

LLM reasoning, agent

Recent Activity

updated a collection 4 days ago

updated a collection 4 days ago

new activity 4 days ago

Miaow-Lab/RUT-Bench:Add task categories and link to paper

View all activity

Organizations

updated 2 collections 4 days ago

RUT-Bench

Benchmark data in "Beyond Ideal Instruction: A Comprehensive Framework for Evaluating LLMs in Realistic Interactions". • 2 items • Updated 4 days ago

RUT-Bench

Benchmark data in "Beyond Ideal Instruction: A Comprehensive Framework for Evaluating LLMs in Realistic Interactions". • 2 items • Updated 4 days ago

New activity in Miaow-Lab/RUT-Bench 4 days ago

Add task categories and link to paper

#1 opened 4 days ago by

updated 2 collections 5 days ago

RUT-Bench

Benchmark data in "Beyond Ideal Instruction: A Comprehensive Framework for Evaluating LLMs in Realistic Interactions". • 2 items • Updated 4 days ago

RUT-Bench

Benchmark data in "Beyond Ideal Instruction: A Comprehensive Framework for Evaluating LLMs in Realistic Interactions". • 2 items • Updated 4 days ago

updated a dataset 5 days ago

Miaow-Lab/RUT-Bench

Viewer • Updated 4 days ago • 1.64k • 61

upvoted a collection 5 days ago

SSAE

Training and evaluation dataset, model checkpoints in 'Step-Level Sparse Autoencoder for Reasoning Process Interpretation' • 3 items • Updated Mar 4 • 3

published a dataset 5 days ago

Miaow-Lab/RUT-Bench

Viewer • Updated 4 days ago • 1.64k • 61

authored a paper 3 months ago

Step-Level Sparse Autoencoder for Reasoning Process Interpretation

Paper • 2603.03031 • Published Mar 3

updated 2 collections 3 months ago

SSAE

Training and evaluation dataset, model checkpoints in 'Step-Level Sparse Autoencoder for Reasoning Process Interpretation' • 3 items • Updated Mar 4 • 3

SSAE

Training and evaluation dataset, model checkpoints in 'Step-Level Sparse Autoencoder for Reasoning Process Interpretation' • 3 items • Updated Mar 4

updated a model 3 months ago

Miaow-Lab/SSAE-Checkpoints

Feature Extraction • Updated Mar 4

updated a dataset 3 months ago

Miaow-Lab/SSAE-Dataset

Viewer • Updated Mar 4 • 1.28M • 63

updated 2 collections 3 months ago

SSAE

Training and evaluation dataset, model checkpoints in 'Step-Level Sparse Autoencoder for Reasoning Process Interpretation' • 3 items • Updated Mar 4 • 3

SSAE

Training and evaluation dataset, model checkpoints in 'Step-Level Sparse Autoencoder for Reasoning Process Interpretation' • 3 items • Updated Mar 4

published a model 3 months ago

Miaow-Lab/SSAE-Checkpoints

Feature Extraction • Updated Mar 4

published a dataset 3 months ago

Miaow-Lab/SSAE-Dataset

Viewer • Updated Mar 4 • 1.28M • 63