Daniel Bis
danielbis
AI & ML interests
https://scholar.google.com/citations?user=ArMgXHYAAAAJ&hl=en
Recent Activity
updated a collection 3 days ago
agents
updated a collection 3 days ago
agents
updated a collection 3 days ago
agents
Organizations
None yet
agents
- AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
  Paper • 2402.15506 • Published • 18
- API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs
  Paper • 2402.15491 • Published • 15
- Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks
  Paper • 2510.12635 • Published • 17
- GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment
  Paper • 2605.19577 • Published • 56
cpt
safety
- agentlans/prompt-safety-classification
  Viewer • Updated • 72.1k • 62
- Jammies-io/safety-refusal
  Viewer • Updated • 100 • 58
- RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models
  Paper • 2510.10390 • Published • 5
- nvidia/Aegis-AI-Content-Safety-Dataset-2.0
  Viewer • Updated • 33.4k • 8.29k • 93
Datasets
agents
decoding
cpt