Open to Work

Nate

n-ate

AI & ML interests

None yet

Recent Activity

liked a model 17 days ago

unsloth/Qwen3.5-122B-A10B-GGUF

updated a collection 22 days ago

Local develoment

Organizations

None yet

liked a model 17 days ago

unsloth/Qwen3.5-122B-A10B-GGUF

Image-Text-to-Text • 122B • Updated Mar 5 • 190k • 255

updated a collection 22 days ago

Local develoment

Collection

3 items • Updated 22 days ago

liked a model about 1 month ago

unsloth/Qwen3-Coder-Next-GGUF

Text Generation • 80B • Updated Mar 6 • 210k • 611

liked a model 2 months ago

ggml-org/embeddinggemma-300M-GGUF

0.3B • Updated 6 days ago • 543k • 28

liked a model 3 months ago

nvidia/personaplex-7b-v1

Audio-to-Audio • Updated Mar 2 • 491k • 2.48k

reacted to retronic's post with 😎 about 1 year ago

Post

4662

Colox, a reasoning AI model. I am currently working on a model smarter than GPT o1 that thinks before it speaks. It is coming tomorrow in the afternoon.

7 replies

reacted to sebblers's post with 😔 about 1 year ago

Post

2204

Subscribed to Pro a month ago because I wanted to get 25 minutes of ZeroGPU quota.

I get error messages saying that I have exceeded quota on ALL spaces on this site.

I haven't even used any quota. It says I have 25 minutes left to use. I can't try anything out!

Been like this for a whole month now. What is this!? What did I sign up for exactly?

10 replies

reacted to grimjim's post with 😎 about 1 year ago

Post

2651

I've made yet another merge of reasoning models with incremental gains on the current Open LLM leaderboard.
open-llm-leaderboard/open_llm_leaderboard

Merging the DeepSeek R1 distillation of Llama 3.1 8B (at 10% task arithmetic weight, using the Llama 3.1 8B base model as the base rather than the instruct model) into a prior best merge resulted in a slightly lower IFEval score, but a higher result in every other benchmark except MMLU-PRO, which went down only marginally. MATH Lvl5 and GPQA went up palpably.
grimjim/DeepSauerHuatuoSkywork-R1-o1-Llama-3.1-8B

This result is currently my best Llama 3.1 8B merge result to date. The actual R1 distillation itself scored quite badly, so this would seem to be another case of unexpected formatting (reflected in IFEval) hurting the evaluation results, obscuring the strength of a model.

It is also possible to use the text generation feature of this model to generate roleplay completions. Based on informal testing, this model's bias toward problem-solving will subtly impact narration.
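
For readers unfamiliar with the merge described in the post above, here is a minimal sketch of task arithmetic under the usual formulation: the difference between a fine-tuned model and its base (the "task vector") is scaled and added to a target model. This is an illustration only, not grimjim's actual merge recipe or tooling. The function name `task_arithmetic_merge`, the parameter name `layer.weight`, and the toy values are all hypothetical, and real merges operate on full model tensors rather than short lists.

```python
def task_arithmetic_merge(target, finetuned, base, weight=0.10):
    """Add a scaled task vector (finetuned - base) to target, parameter by parameter.

    All three arguments are dicts mapping a parameter name to a flat list of
    floats. This toy representation is an assumption made for illustration.
    """
    merged = {}
    for name, target_values in target.items():
        # Task vector: what fine-tuning changed relative to the base model.
        task_vector = [f - b for f, b in zip(finetuned[name], base[name])]
        # Blend a fraction of that change into the target model.
        merged[name] = [t + weight * d for t, d in zip(target_values, task_vector)]
    return merged


# Hypothetical toy parameters standing in for real model weights.
base = {"layer.weight": [1.0, 2.0, 3.0]}          # stands in for the base model
r1_distill = {"layer.weight": [2.0, 2.0, 1.0]}    # stands in for the R1 distillation
prior_merge = {"layer.weight": [1.5, 2.5, 3.5]}   # stands in for the prior best merge

merged = task_arithmetic_merge(prior_merge, r1_distill, base, weight=0.10)
print(merged)
```

With `weight=0.10`, only a tenth of the distillation's change relative to the base is applied, which matches the post's description of a small, incremental contribution to the prior merge.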

reacted to lin-tan's post with 👍 about 1 year ago

Post

3384

🚀 Excited to share that our paper, "SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models", has been accepted to #ICRA2025! 🔗 Preprint: https://arxiv.org/pdf/2409.19471

We introduce SELP (Safe Efficient LLM Planner), a novel approach for generating plans that adhere to user-specified constraints while optimizing for time-efficient execution. By leveraging linear temporal logic (LTL) to interpret natural language commands, SELP effectively handles complex commands and long-horizon tasks. 🤖

💡SELP presents three key insights:
1️⃣ Equivalence Voting: Ensures robust translations from natural language instructions into LTL specifications.
2️⃣ Constrained Decoding: Uses the generated LTL formula to guide the autoregressive inference of plans, ensuring the generated plans conform to the LTL.
3️⃣ Domain-Specific Fine-Tuning: Customizes LLMs for specific robotic tasks, boosting both safety and efficiency.

📊 Experiment: Our experiments demonstrate SELP’s effectiveness and generalizability across diverse tasks. In drone navigation, SELP outperforms state-of-the-art LLM planners by 10.8% in safety rate and by 19.8% in plan efficiency. For robot manipulation, SELP achieves a 20.4% improvement in safety rate.

@yiwu @jiang719

#ICRA2025 #LLM #Robotics #Agent #LLMPlanner