TechPulseDaily Digest Telegram

← PrevThu, Mar 26Next →

🤖 AI Agent15 items

Latest AI agent research papers, analysis, and improvement insights

semantic_scholarresearch

AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling

AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling

semantic_scholarresearch

Agent Audit: A Security Analysis System for LLM Agent Applications

Agent Audit: A Security Analysis System for LLM Agent Applications

semantic_scholarresearch

DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles

DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles

semantic_scholarresearch

A Framework for Formalizing LLM Agent Security

A Framework for Formalizing LLM Agent Security

semantic_scholarresearch

AdaRubric: Task-Adaptive Rubrics for LLM Agent Evaluation

AdaRubric: Task-Adaptive Rubrics for LLM Agent Evaluation

semantic_scholarresearch

WirelessBench: A Tolerance-Aware LLM Agent Benchmark for Wireless Network Intelligence

WirelessBench: A Tolerance-Aware LLM Agent Benchmark for Wireless Network Intelligence

semantic_scholarresearch

AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection

AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection

semantic_scholarresearch

DALI: LLM-Agent Enhanced Dual-Stream Adaptive Leadership Identification for Group Recommendations

DALI: LLM-Agent Enhanced Dual-Stream Adaptive Leadership Identification for Group Recommendations

semantic_scholarresearch

Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents

Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents

semantic_scholarresearch

Enhancing User Resilience Against AI-Augmented Phishing

Enhancing User Resilience Against AI-Augmented Phishing

semantic_scholarresearch

kRAIG: A Natural Language-Driven Agent for Automated DataOps Pipeline Generation

kRAIG: A Natural Language-Driven Agent for Automated DataOps Pipeline Generation

semantic_scholarresearch

SoK: The Attack Surface of Agentic AI -- Tools, and Autonomy

SoK: The Attack Surface of Agentic AI -- Tools, and Autonomy

semantic_scholarresearch

Retrieval-augmented LLM with structured sampling for Building Management Systems point tagging under minimal context

Retrieval-augmented LLM with structured sampling for Building Management Systems point tagging under minimal context

blog_openairesearch

Introducing the OpenAI Safety Bug Bounty program

Introducing the OpenAI Safety Bug Bounty program

hackernewsresearch

Sashiko: An agentic Linux kernel code review system

Sashiko: An agentic Linux kernel code review system