1 97 177

Unknown Entity

unknownentity

AI & ML interests

None yet

Recent Activity

liked a model 18 days ago

sapientinc/HRM-Text-1B

liked a model 21 days ago

ideogram-ai/ideogram-4-fp8

liked a model 28 days ago

zhen-nan/L2P

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published May 4 • 355

upvoted 6 papers 2 months ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published Mar 23 • 125

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Paper • 2604.08995 • Published Apr 10 • 51

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Paper • 2604.14268 • Published Apr 15 • 126

CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation

Paper • 2604.19636 • Published Apr 21 • 88

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 248

ELT: Elastic Looped Transformers for Visual Generation

Paper • 2604.09168 • Published Apr 10 • 24

upvoted a collection 3 months ago

UnifoLM_WBT_Dataset

Collection

14 items • Updated May 18 • 84

upvoted an article 3 months ago

Article

The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics

nvidia

•

Mar 16

• 31

upvoted a paper 3 months ago

Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Paper • 2603.12255 • Published Mar 12 • 91

upvoted a collection 4 months ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.69k

upvoted a paper 4 months ago

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published Mar 13 • 55

upvoted 6 papers 6 months ago

PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence

Paper • 2512.16793 • Published Dec 18, 2025 • 76

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published Dec 15, 2025 • 76

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards

Paper • 2512.00473 • Published Nov 29, 2025 • 27

InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

Paper • 2512.08829 • Published Dec 9, 2025 • 23

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published Dec 10, 2025 • 51

Composing Concepts from Images and Videos via Concept-prompt Binding

Paper • 2512.09824 • Published Dec 10, 2025 • 28

upvoted 2 papers 7 months ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published Dec 10, 2025 • 74

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

Paper • 2512.07951 • Published Dec 8, 2025 • 51

Unknown Entity

AI & ML interests

Recent Activity

Organizations

unknownentity's activity

The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics