← PrevSat, Feb 21Next →

🤖 AI Agent23 items

Latest AI agent research papers, analysis, and improvement insights

arxivcs.LG

MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning

LLM 추론을 위한 견고한 그래디언트 활용 최적화

arxivcs.AI

AutoNumerics: An Autonomous, PDE-Agnostic Multi-Agent Pipeline for Scientific Computing

과학 계산용 PDE 해결 다중 에이전트 자동 설계

arxivcs.CL

Modeling Distinct Human Interaction in Web Agents

웹 에이전트의 인간 개입 패턴 모델링 및 협력

arxivcs.AI

KLong: Training LLM Agent for Extremely Long-horizon Tasks

궤적 분할 SFT와 점진적 RL로 장기 작업 LLM 에이전트 훈련

semantic_scholarresearch

MedClarify: An information-seeking AI agent for medical diagnosis with case-specific follow-up questions

반복적 질문으로 의료 진단 불확실성 해결하는 에이전트

semantic_scholarresearch

KLong: Training LLM Agent for Extremely Long-horizon Tasks

궤적 분할 SFT와 점진적 RL로 장기 작업 LLM 에이전트 훈련

semantic_scholarresearch

What Makes a Good LLM Agent for Real-world Penetration Testing?

침투 테스트 LLM 에이전트의 실패 원인 분석

arxivcs.LG

FAMOSE: A ReAct Approach to Automated Feature Discovery

ReAct 기반 자동 특성 공학 프레임워크

arxivcs.LG

When to Trust the Cheap Check: Weak and Strong Verification for Reasoning

LLM 추론의 약약/강 검증 신뢰도 및 비용 분석

arxivcs.LG

Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs

LLM 비동기 강화학습의 분산 제어 기법

arxivcs.AI

Evaluating Chain-of-Thought Reasoning through Reusability and Verifiability

다중 에이전트 CoT 추론의 재사용성과 검증 평가

semantic_scholarresearch

NeuDiff Agent: A Governed AI Workflow for Single-Crystal Neutron Crystallography

결정학 자동 분석 도구 사용 AI 워크플로우

semantic_scholarresearch

Testing BDI-based Multi-Agent Systems using Discrete Event Simulation

BDI 다중 에이전트 시스템 시뮬레이션 테스트

arxivcs.AI

AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games

인간 게임으로 AI 일반 지능 평가 벤치마크

arxivmath.OC

Adaptive Decentralized Composite Optimization via Three-Operator Splitting

네트워크 분산 에이전트의 적응적 최적화 알고리즘

arxivphysics.acc-ph

Toward a Fully Autonomous, AI-Native Particle Accelerator

자율 입자가속기 운영을 위한 AI 공동 설계

semantic_scholarresearch

Unveiling the augmented human agent: How communication style mitigates the heuristic barriers of human-AI collaboration in online customer service encounters

고객 서비스의 인간-AI 협력 커뮤니케이션 연구

blog_openairesearch

Our First Proof submissions

수학 증명 챌린지에서 AI 모델의 추론 능력 시연

arxivcs.AI

CLEF HIPE-2026: Evaluating Accurate and Efficient Person-Place Relation Extraction from Multilingual Historical Texts

다국어 역사 텍스트에서 인물-장소 관계 추출 평가

arxivcs.CL

Unmasking the Factual-Conceptual Gap in Persian Language Models

페르시아 언어 모델의 사실-개념 격차 진단

Page 1 of 2Next