DiagnosticIQ: A Benchmark for LLM-Based Industrial Maintenance Action Recommendation from Symbolic Rules Paper • 2605.08614 • Published 13 days ago • 7
Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds Paper • 2605.18827 • Published 10 days ago • 6
Evaluating Temporal Semantic Caching and Workflow Optimization in Agentic Plan-Execute Pipelines Paper • 2605.20630 • Published 2 days ago • 10
Evaluating Temporal Semantic Caching and Workflow Optimization in Agentic Plan-Execute Pipelines Paper • 2605.20630 • Published 2 days ago • 10
Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds Paper • 2605.18827 • Published 10 days ago • 6
SPIRAL: Symbolic LLM Planning via Grounded and Reflective Search Paper • 2512.23167 • Published Dec 29, 2025 • 1
IndustryAssetEQA: A Neurosymbolic Operational Intelligence System for Embodied Question Answering in Industrial Asset Maintenance Paper • 2604.23446 • Published 27 days ago • 4
MCP-Cosmos: World Model-Augmented Agents for Complex Task Execution in MCP Environments Paper • 2605.09131 • Published 13 days ago • 55
Results and Retrospective Analysis of the CODS 2025 AssetOpsBench Challenge Paper • 2605.08518 • Published 14 days ago • 10
SPIN: Structural LLM Planning via Iterative Navigation for Industrial Tasks Paper • 2605.14051 • Published 9 days ago • 1
DiagnosticIQ: A Benchmark for LLM-Based Industrial Maintenance Action Recommendation from Symbolic Rules Paper • 2605.08614 • Published 13 days ago • 7
SPIN: Structural LLM Planning via Iterative Navigation for Industrial Tasks Paper • 2605.14051 • Published 9 days ago • 1
Results and Retrospective Analysis of the CODS 2025 AssetOpsBench Challenge Paper • 2605.08518 • Published 14 days ago • 10
MCP-Cosmos: World Model-Augmented Agents for Complex Task Execution in MCP Environments Paper • 2605.09131 • Published 13 days ago • 55
IndustryAssetEQA: A Neurosymbolic Operational Intelligence System for Embodied Question Answering in Industrial Asset Maintenance Paper • 2604.23446 • Published 27 days ago • 4
From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents Paper • 2603.22386 • Published Mar 23 • 57
A Transformer-based Framework for Multivariate Time Series Representation Learning Paper • 2010.02803 • Published Oct 6, 2020
AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance Paper • 2506.03828 • Published Jun 4, 2025 • 20
FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes Paper • 2506.03278 • Published Jun 3, 2025 • 7