🤖 AI Agent (9 items)
Latest AI agent research papers, analysis, and improvement insights
ClawTrace: Cost-Aware Tracing for LLM Agent Skill Distillation
A platform for per-step agent cost tracing and skill distillation
Revisable by Design: A Theory of Streaming LLM Agent Execution
A streaming-based agent execution paradigm that allows user intervention
Seeing the Whole Elephant: A Benchmark for Failure Attribution in LLM-based Multi-Agent Systems
A benchmark for failure root-cause analysis and responsibility attribution in multi-agent systems
BenchGuard: Who Guards the Benchmarks? Automated Auditing of LLM Agent Benchmarks
An automated auditing framework in which LLMs check benchmarks for their own errors
ITAS: A Multi-Agent Architecture for LLM-Based Intelligent Tutoring
An expert-agent-based tutoring system deployed in real university courses
CUJBench: Benchmarking LLM-Agent on Cross-Modal Failure Diagnosis from Browser to Backend
An agent benchmark for browser-to-backend cross-modal failure diagnosis
LLM-Powered Multi-Agent Framework for Automated PPV Prediction in Tunnel Blasting
Building an automated ML pipeline through multi-agent collaboration
Cutscene Agent: An LLM Agent Framework for Automated 3D Cutscene Generation
An LLM agent framework for automated generation of 3D game cutscenes
How Personal Characteristics Shape User Exploration of Diverse Movie Recommendations with a LLM-Based Multi-Agent System
An analysis of how personal characteristics shape user exploration in a multi-agent recommender system