J's picture

J

dashfunnydashdash

·

AI & ML interests

Electric Sheep

Recent Activity

upvoted a paper 7 days ago

Unified Latents (UL): How to train your latents

upvoted a paper 17 days ago

When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning

upvoted a paper 21 days ago

ERNIE 5.0 Technical Report

View all activity

Organizations

None yet

upvoted a paper 7 days ago

Unified Latents (UL): How to train your latents

Paper • 2602.17270 • Published 7 days ago • 53

upvoted a paper 17 days ago

When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning

Paper • 2602.08236 • Published 18 days ago • 9

upvoted a paper 21 days ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published 22 days ago • 260

upvoted 4 papers about 2 months ago

NitroGen: An Open Foundation Model for Generalist Gaming Agents

Paper • 2601.02427 • Published Jan 4 • 45

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 154

End-to-End Test-Time Training for Long Context

Paper • 2512.23675 • Published Dec 29, 2025 • 24

Shape of Thought: When Distribution Matters More than Correctness in Reasoning Tasks

Paper • 2512.22255 • Published Dec 24, 2025 • 6

upvoted 2 papers 2 months ago

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Paper • 2512.16915 • Published Dec 18, 2025 • 38

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Paper • 2512.10739 • Published Dec 11, 2025 • 47

upvoted 5 papers 3 months ago

EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing

Paper • 2512.06065 • Published Dec 5, 2025 • 29

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 158

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 258

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published Nov 20, 2025 • 113

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Paper • 2511.15065 • Published Nov 19, 2025 • 77

upvoted 6 papers 4 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 211

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10, 2025 • 106

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 240

Towards Robust Mathematical Reasoning

Paper • 2511.01846 • Published Nov 3, 2025 • 10

PHUMA: Physically-Grounded Humanoid Locomotion Dataset

Paper • 2510.26236 • Published Oct 30, 2025 • 30

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Paper • 2510.22115 • Published Oct 25, 2025 • 85