🤖 AI Agent (24 items)
Latest AI agent research papers, analysis, and improvement insights
Agentic Software Engineering Book
SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards
Show HN: Emdash – Open-source agentic development environment
Learning to Rewrite Tool Descriptions for Reliable LLM-Agent Tool Use
Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training
LLM Agent-based Shilling Attack on Recommender Systems
Graph-driven embedding reinforcement and traceable LLM agent for reliable element alignment in construction report generation
Grid-Mind: An LLM-Orchestrated Multi-Fidelity Agent for Automated Connection Impact Assessment
HieraMAS: Optimizing Intra-Node LLM Mixtures and Inter-Node Topology for Multi-Agent Systems
NeuroWise: A Multi-Agent LLM "Glass-Box" System for Practicing Double-Empathy Communication with Autistic Partners
MIRA: Memory-Integrated Reinforcement Learning Agent with Limited LLM Guidance
End-to-End PET/CT Interpretation and Quantification with an LLM-Orchestrated AI Agent: A Real-World Pilot Study
OpenAI announces Frontier Alliance Partners
Our First Proof submissions
Launch HN: Cardboard (YC W26) – Agentic video editor
You don’t know what your agent will do until it’s in production
How we built Agent Builder’s memory system
Agent Observability Powers Agent Evaluation
An AI Agent Published a Hit Piece on Me – The Operator Came Forward
The Missing Semester of Your CS Education – Revised for 2026