Raul

lefutureman

3 19

rally12

AI & ML interests

LLM, CV

Recent Activity

liked a model 19 days ago

Qwen/Qwen-AgentWorld-35B-A3B

upvoted a collection 2 months ago

DeepSeek-V4

upvoted a paper 2 months ago

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

View all activity

Organizations

None yet

liked a model 19 days ago

Qwen/Qwen-AgentWorld-35B-A3B

Text Generation • 35B • Updated 21 days ago • 101k • 595

upvoted a collection 2 months ago

DeepSeek-V4

Collection

6 items • Updated 19 days ago • 736

upvoted a paper 2 months ago

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Paper • 2603.19312 • Published Mar 13 • 50

liked 2 models 5 months ago

Qwen/Qwen3-Coder-30B-A3B-Instruct

Text Generation • 31B • Updated Dec 3, 2025 • 1.75M • • 1.16k

Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

Text-to-Speech • 2B • Updated Jan 29 • 2.36M • 1.71k

upvoted a paper 6 months ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published Jan 23 • 40

liked 2 models 6 months ago

nvidia/personaplex-7b-v1

Audio-to-Audio • 8B • Updated Mar 2 • 324k • • 2.6k

numind/NuMarkdown-8B-Thinking

Image-to-Text • 8B • Updated Jun 5 • 31.4k • 477

liked a Space 9 months ago

The Smol Training Playbook

📚

3.24k

The secrets to building world-class LLMs

liked 2 models 9 months ago

nvidia/DLER-R1-1.5B-Research

2B • Updated Oct 25, 2025 • 105 • 19

ibm-granite/granite-docling-258M

Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 72.1k • 1.22k

liked a Space 11 months ago

The Ultra-Scale Playbook

🌌

3.94k

The ultimate guide to training LLM on large GPU Clusters

liked a model about 1 year ago

Babelscape/t5-base-summarization-claim-extractor

0.2B • Updated Jan 22 • 4.15k • 15

liked a dataset about 1 year ago

HPLT/HPLT2.0_cleaned

Updated Jun 11 • 93.9k • 43

liked 2 models over 1 year ago

Qwen/Qwen2.5-3B-Instruct

Text Generation • 3B • Updated Sep 25, 2024 • 7.01M • • 535

Weyaxi/Qwen-72B-Llama

Text Generation • 72B • Updated Feb 2, 2024 • 87 • 12

liked a model about 2 years ago

facebook/blenderbot-400M-distill

Updated Mar 30, 2023 • 7.97k • 468

liked 2 models over 2 years ago

Yukang/Llama-2-13b-chat-longlora-32k-sft

Text Generation • 13B • Updated Oct 13, 2023 • 91 • 22

Yukang/Llama-2-70b-longlora-32k

Text Generation • Updated Oct 25, 2023 • 6 • 18

liked a dataset over 2 years ago

uonlp/CulturaX

Viewer • Updated Dec 16, 2024 • 7.18B • 20.3k • 649