GitHub - Lyz103/LLM-Agent-Paper-daily: Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)

[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]

Updated on 2026.04.13

Usage instructions: here

Table of Contents

Agents

Agents

Publish Date	Title	Authors	PDF	Code
2025-07-23	DataWink: Reusing and Adapting SVG-based Visualization Examples with Large Multimodal Models	Liwenhan Xie et.al.	2507.17734	null
2025-07-23	BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems	Malsha Ashani Mahawatta Dona et.al.	2507.17722	null
2025-07-23	Symbiotic Agents: A Novel Paradigm for Trustworthy AGI-driven Networks	Ilias Chatzistefanidis et.al.	2507.17695	null
2025-07-23	Simulating multiple human perspectives in socio-ecological systems using large language models	Yongchao Zeng et.al.	2507.17680	null
2025-07-23	LTLZinc: a Benchmarking Framework for Continual Learning and Neuro-Symbolic Temporal Reasoning	Luca Salvatore Lorello et.al.	2507.17482	null
2025-07-23	ERMV: Editing 4D Robotic Multi-view images to enhance embodied agents	Chang Nie et.al.	2507.17462	null
2025-07-23	IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird's-Eye View Perception	Haichuan Li et.al.	2507.17445	null
2025-07-23	Fair Compromises in Participatory Budgeting: a Multi-Agent Deep Reinforcement Learning Approach	Hugh Adams et.al.	2507.17433	null
2025-07-23	CAPRI-CT: Causal Analysis and Predictive Reasoning for Image Quality Optimization in Computed Tomography	Sneha George Gnanakalavathy et.al.	2507.17420	null
2025-07-23	Residual Prophet Inequalities	Jose Correa et.al.	2507.17391	null
2025-07-23	DynaSearcher: Dynamic Knowledge Graph Augmented Search Agent via Multi-Reward Reinforcement Learning	Chuzhan Hao et.al.	2507.17365	null
2025-07-23	DeMo++: Motion Decoupling for Autonomous Driving	Bozhou Zhang et.al.	2507.17342	null
2025-07-23	HuNavSim 2.0	Miguel Escudero-Jiménez et.al.	2507.17317	null
2025-07-23	EarthLink: Interpreting Climate Signals with Self-Evolving AI Agents	Zijie Guo et.al.	2507.17311	null
2025-07-23	Compliance Brain Assistant: Conversational Agentic AI for Assisting Compliance Tasks in Enterprise Environments	Shitong Zhu et.al.	2507.17289	null
2025-07-23	Leveraging Knowledge Graphs and LLM Reasoning to Identify Operational Bottlenecks for Warehouse Planning Assistance	Rishi Parekh et.al.	2507.17273	null
2025-07-23	Agent Identity Evals: Measuring Agentic Identity	Elija Perrier et.al.	2507.17257	null
2025-07-23	LLM Meets the Sky: Heuristic Multi-Agent Reinforcement Learning for Secure Heterogeneous UAV Networks	Lijie Zheng et.al.	2507.17188	null
2025-07-23	Optimal Calibrated Signaling in Digital Auctions	Zhicheng Du et.al.	2507.17187	null
2025-07-23	FinGAIA: An End-to-End Benchmark for Evaluating AI Agents in Finance	Lingfeng Zeng et.al.	2507.17186	null
2025-07-23	Regret Minimization in Population Network Games: Vanishing Heterogeneity and Convergence to Equilibria	Die Hu et.al.	2507.17183	null
2025-07-23	JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction	Fangze Lin et.al.	2507.17152	null
2025-07-23	CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards	Cheng Liu et.al.	2507.17147	null
2025-07-23	Resilient Multi-Agent Negotiation for Medical Supply Chains:Integrating LLMs and Blockchain for Transparent Coordination	Mariam ALMutairi et.al.	2507.17134	null
2025-07-23	Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance	Yufei He et.al.	2507.17131	null
2025-07-23	Stochastically Structured Reservoir Computers for Financial and Economic System Identification	Lendy Banegas et.al.	2507.17115	null
2025-07-22	Deformable Cluster Manipulation via Whole-Arm Policy Learning	Jayadeep Jacob et.al.	2507.17085	null
2025-07-22	VL-CLIP: Enhancing Multimodal Recommendations via Visual Grounding and LLM-Augmented CLIP Embeddings	Ramin Giahi et.al.	2507.17080	null
2025-07-22	Approximation Techniques for the Reconstruction of the Probability Measure and the Coupling Parameters in a Curie-Weiss Model for Large Populations	Miguel Ballesteros et.al.	2507.17073	null
2025-07-22	Risk In Context: Benchmarking Privacy Leakage of Foundation Models in Synthetic Tabular Data Generation	Jessup Byun et.al.	2507.17066	null
2025-07-22	Parallelism Meets Adaptiveness: Scalable Documents Understanding in Multi-Agent LLM Systems	Chengxuan Xia et.al.	2507.17061	null
2025-07-22	Shared Control of Holonomic Wheelchairs through Reinforcement Learning	Jannis Bähler et.al.	2507.17055	null
2025-07-22	New Mechanisms in Flex Distribution for Bounded Suboptimal Multi-Agent Path Finding	Shao-Hung Chan et.al.	2507.17054	null
2025-07-22	Evaluating Uncertainty and Quality of Visual Language Action-enabled Robots	Pablo Valle et.al.	2507.17049	null
2025-07-22	Modeling for the Growth of Unorganized Retailing in the Presence of Organized and E-Retailing in Indian Pharmaceutical Industry	Koushik Mondal et.al.	2507.17023	null
2025-07-22	Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?	Arduin Findeis et.al.	2507.17015	null
2025-07-22	Quantitative convergence for displacement monotone Mean Field Games of control	Joe Jackson et.al.	2507.17014	null
2025-07-22	Towards Autonomous Sustainability Assessment via Multimodal AI Agents	Zhihan Zhang et.al.	2507.17012	null
2025-07-22	On-chip stencil lithography for superconducting qubits	Roudy Hanna et.al.	2507.17005	null
2025-07-22	Hierarchical Reinforcement Learning Framework for Adaptive Walking Control Using General Value Functions of Lower-Limb Sensor Signals	Sonny T. Jones et.al.	2507.16983	null
2025-07-22	Text-to-SPARQL Goes Beyond English: Multilingual Question Answering Over Knowledge Graphs through Human-Inspired Reasoning	Aleksandr Perevalov et.al.	2507.16971	null
2025-07-22	Fundamental limits of distributed covariance matrix estimation via a conditional strong data processing inequality	Mohammad Reza Rahmani et.al.	2507.16953	null
2025-07-22	Multi-agent Reinforcement Learning for Robotized Coral Reef Sample Collection	Daniel Correa et.al.	2507.16941	null
2025-07-22	AURA: A Multi-Modal Medical Agent for Understanding, Reasoning & Annotation	Nima Fathi et.al.	2507.16940	null
2025-07-22	Budget Allocation Policies for Real-Time Multi-Agent Path Finding	Raz Beck et.al.	2507.16874	null
2025-07-21	Reinforcement Learning in hyperbolic space for multi-step reasoning	Tao Xu et.al.	2507.16864	null
2025-07-21	MobileUse: A GUI Agent with Hierarchical Reflection for Autonomous Mobile Operation	Ning Li et.al.	2507.16853	null
2025-07-21	Dynamic Simulation Framework for Disinformation Dissemination and Correction With Social Bots	Boyu Qiao et.al.	2507.16848	null
2025-07-22	ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning	Chi-Pin Huang et.al.	2507.16815	null
2025-07-22	LingBench++: A Linguistically-Informed Benchmark and Reasoning Framework for Multi-Step and Cross-Cultural Inference with LLMs	Da-Chen Lian et.al.	2507.16809	null
2025-07-23	Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning	Yanjun Zheng et.al.	2507.16802	null
2025-07-23	Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent	Xiaoyu Zhan et.al.	2507.16799	null
2025-07-22	Uncertainty-Aware Knowledge Transformers for Peer-to-Peer Energy Trading with Multi-Agent Reinforcement Learning	Mian Ibad Ali Shah et.al.	2507.16796	null
2025-07-22	Generalized non-reciprocal phase transitions in multipopulation systems	Cheyne Weis et.al.	2507.16763	null
2025-07-22	AI-enhanced conversational agents for personalized asthma support Factors for engagement, value and efficacy	Laura Moradbakhti et.al.	2507.16735	null
2025-07-23	Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with constraints	Zhenyun Yin et.al.	2507.16727	null
2025-07-22	RAVine: Reality-Aligned Evaluation for Agentic Search	Yilong Xu et.al.	2507.16725	null
2025-07-22	Screen2AX: Vision-Based Approach for Automatic macOS Accessibility Generation	Viktor Muryn et.al.	2507.16704	null
2025-07-22	FOGNITE: Federated Learning-Enhanced Fog-Cloud Architecture	Somayeh Sobati-M et.al.	2507.16668	null
2025-07-22	Hybrid Reward-Driven Reinforcement Learning for Efficient Quantum Circuit Synthesis	Sara Giordano et.al.	2507.16641	null
2025-07-22	Novel Multi-Agent Action Masked Deep Reinforcement Learning for General Industrial Assembly Lines Balancing Problems	Ali Mohamed Ali et.al.	2507.16635	null
2025-07-22	Augmenting Von Neumann's Architecture for an Intelligent Future	Rajpreet Singh et.al.	2507.16628	null
2025-07-22	CTSL: Codebook-based Temporal-Spatial Learning for Accurate Non-Contrast Cardiac Risk Prediction Using Cine MRIs	Haoyang Su et.al.	2507.16612	null
2025-07-22	Smooth Games of Configuration in the Linear-Quadratic Setting	Jesse Milzman et.al.	2507.16611	null
2025-07-22	Pyramid Hierarchical Masked Diffusion Model for Imaging Synthesis	Xiaojiao Xiao et.al.	2507.16579	null
2025-07-22	Evaluating Social Acceptance of eXtended Reality (XR) Agent Technology: A User Study (Extended Version)	Megha Quamara et.al.	2507.16562	null
2025-07-22	A Distributed Actor-Critic Algorithm for Fixed-Time Consensus in Nonlinear Multi-Agent Systems	Aria Delshad et.al.	2507.16520	null
2025-07-22	Analogy making as amortised model construction	David G. Nagy et.al.	2507.16511	null
2025-07-22	Agentic RAG with Knowledge Graphs for Complex Multi-Hop Reasoning in Real-World Applications	Jean Lelong et.al.	2507.16507	null
2025-07-22	Arbitrage Tactics in the Local Markets via Hierarchical Multi-agent Reinforcement Learning	Haoyang Zhang et.al.	2507.16479	null
2025-07-22	Adaptive Bayesian Single-Shot Quantum Sensing	Ivana Nikoloska et.al.	2507.16477	null
2025-07-22	Towards Enforcing Company Policy Adherence in Agentic Workflows	Naama Zwerdling et.al.	2507.16459	null
2025-07-22	Distributed Oscillatory Guidance for Formation Flight of Fixed-Wing Drones	Yang Xu et.al.	2507.16458	null
2025-07-23	RIS-aided Latent Space Alignment for Semantic Channel Equalization	Tomás Hüttebräucker et.al.	2507.16450	null
2025-07-22	From model-based learning to model-free behaviour with Meta-Interpretive Learning	Stassa Patsantzis et.al.	2507.16434	null
2025-07-22	LLM-Driven Collaborative Model for Untangling Commits via Explicit and Implicit Dependency Reasoning	Bo Hou et.al.	2507.16395	null
2025-07-22	Application of LLM Guided Reinforcement Learning in Formation Control with Collision Avoidance	Chenhao Yao et.al.	2507.16382	null
2025-07-22	COMPASS: Cooperative Multi-Agent Persistent Monitoring using Spatio-Temporal Attention Network	Xingjian Zhang et.al.	2507.16306	null
2025-07-22	ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry	Tianze Xu et.al.	2507.16280	null
2025-07-22	Multi-Agent Reinforcement Learning for Sample-Efficient Deep Neural Network Mapping	Srivatsan Krishnan et.al.	2507.16249	null
2025-07-22	FinResearchBench: A Logic Tree based Agent-as-a-Judge Evaluation Framework for Financial Research Agents	Run Sun et.al.	2507.16248	null
2025-07-22	Voice-based AI Agents: Filling the Economic Gaps in Digital Health Delivery	Bo Wen et.al.	2507.16229	null
2025-07-22	Unbeatable imitation of a friend	Masahiko Ueda et.al.	2507.16221	null
2025-07-22	Best-of-Both-Worlds Guarantees with Fairer Endings	Telikepalli Kavitha et.al.	2507.16209	null
2025-07-22	CHIMERA: Compressed Hybrid Intelligence for Twin-Model Enhanced Multi-Agent Deep Reinforcement Learning for Multi-Functional RIS-Assisted Space-Air-Ground Integrated Networks	Li-Hsiang Shen et.al.	2507.16204	null
2025-07-22	SVAgent: AI Agent for Hardware Security Verification Assertion	Rui Guo et.al.	2507.16203	null
2025-07-22	RealBench: Benchmarking Verilog Generation Models with Real-World IP Designs	Pengwei Jin et.al.	2507.16200	null
2025-07-22	Do Large Language Models Have a Planning Theory of Mind? Evidence from MindGames: a Multi-Step Persuasion Task	Jared Moore et.al.	2507.16196	null
2025-07-22	Emergent Cognitive Convergence via Implementation: A Structured Loop Reflecting Four Theories of Mind (A Position Paper)	Myung Ho Kim et.al.	2507.16184	null
2025-07-22	Benchmarking LLM Privacy Recognition for Social Robot Decision Making	Dakota Sullivan et.al.	2507.16124	null
2025-07-21	Expert-Guided LLM Reasoning for Battery Discovery: From AI-Driven Hypothesis to Synthesis and Characterization	Shengchao Liu et.al.	2507.16110	null
2025-07-21	Deep Researcher with Test-Time Diffusion	Rujun Han et.al.	2507.16075	null
2025-07-21	Asymptotic consensus with transmission and reaction delay: an overview	Jan Haskovec et.al.	2507.16072	null
2025-07-21	Is memory all you need? Data-driven Mori-Zwanzig modeling of Lagrangian particle dynamics in turbulent flows	Xander de Wit et.al.	2507.16058	null
2025-07-23	Making REST APIs Agent-Ready: From OpenAPI to Model Context Protocol Servers for Tool-Augmented LLMs	Meriem Mastouri et.al.	2507.16044	null
2025-07-21	A Pilot Study on LLM-Based Agentic Translation from Android to iOS: Pitfalls and Insights	Zhili Zeng et.al.	2507.16037	null
2025-07-21	Minor Embedding for Quantum Annealing with Reinforcement Learning	Riccardo Nembrini et.al.	2507.16004	null
2025-07-21	Automated Design of Structured Variational Quantum Circuits with Reinforcement Learning	Gloria Turati et.al.	2507.16001	null
2025-07-21	Red Supergiant Mass Loss and Mass-Loss Rates	Jacco Th. van Loon et.al.	2507.15971	null
2025-07-23	HyDRA: A Hybrid-Driven Reasoning Architecture for Verifiable Knowledge Graphs	Adrian Kaiser et.al.	2507.15917	null
2025-07-21	Towards Mitigation of Hallucination for LLM-empowered Agents: Progressive Generalization Bound Exploration and Watchdog Monitor	Siyuan Liu et.al.	2507.15903	null
2025-07-21	Advancing Responsible Innovation in Agentic AI: A study of Ethical Frameworks for Household Automation	Joydeep Chandra et.al.	2507.15901	null
2025-07-20	Integrating Reason-Based Moral Decision-Making in the Reinforcement Learning Architecture	Lisa Dargasz et.al.	2507.15895	null
2025-07-20	StaAgent: An Agentic Framework for Testing Static Analyzers	Elijah Nnorom et.al.	2507.15892	null
2025-07-19	AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?	Ori Press et.al.	2507.15887	null
2025-07-18	ADEPTS: A Capability Framework for Human-Centered Agent Design	Pierluca D'Oro et.al.	2507.15885	null
2025-07-21	LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra	Seth Karten et.al.	2507.15815	null
2025-07-21	Density control of multi-agent swarms via bio-inspired leader-follower plasticity	Gian Carlo Maffettone et.al.	2507.15781	null
2025-07-21	A Framework for Analyzing Abnormal Emergence in Service Ecosystems Through LLM-based Agent Intention Mining	Yifan Shen et.al.	2507.15770	null
2025-07-21	GasAgent: A Multi-Agent Framework for Automated Gas Optimization in Smart Contracts	Jingyi Zheng et.al.	2507.15761	null
2025-07-21	Towards physician-centered oversight of conversational diagnostic AI	Elahe Vedadi et.al.	2507.15743	null
2025-07-21	General Matching Games	Felipe Garrido-Lucero et.al.	2507.15737	null
2025-07-21	Competitive Algorithms for Cooperative Multi-Agent Ski-Rental Problems	Xuchuang Wang et.al.	2507.15727	null
2025-07-21	Agentic AI for autonomous anomaly management in complex systems	Reza Vatankhah Barenji et.al.	2507.15676	null
2025-07-21	BugScope: Learn to Find Bugs Like Human	Jinyao Guo et.al.	2507.15671	null
2025-07-21	Asynchronous Collective Tree Exploration: a Distributed Algorithm, and a new Lower Bound	Romain Cosson et.al.	2507.15658	null
2025-07-21	Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training	Kailai Yang et.al.	2507.15640	null
2025-07-21	TacticCraft: Natural Language-Driven Tactical Adaptation for StarCraft II	Weiyu Ma et.al.	2507.15618	null
2025-07-21	Why can't Epidemiology be automated (yet)?	David Bann et.al.	2507.15617	null
2025-07-21	DHEvo: Data-Algorithm Based Heuristic Evolution for Generalizable MILP Solving	Zhihao Zhang et.al.	2507.15615	null
2025-07-21	Red-Team Multi-Agent Reinforcement Learning for Emergency Braking Scenario	Yinsong Chen et.al.	2507.15587	null
2025-07-21	FlowForge: Guiding the Creation of Multi-agent Workflows with Design Space Visualization as a Thinking Scaffold	Pan Hao et.al.	2507.15559	null
2025-07-21	PhysGym: Benchmarking LLMs in Interactive Physics Discovery with Controlled Priors	Yimeng Chen et.al.	2507.15550	null
2025-07-21	HAMLET: Hyperadaptive Agent-based Modeling for Live Embodied Theatrics	Sizhou Chen et.al.	2507.15518	null
2025-07-21	The Constitutional Controller: Doubt-Calibrated Steering of Compliant Agents	Simon Kohaut et.al.	2507.15478	null
2025-07-21	The Emergence of Deep Reinforcement Learning for Path Planning	Thanh Thi Nguyen et.al.	2507.15469	null
2025-07-23	Solving nonconvex Hamilton--Jacobi--Isaacs equations with PINN-based policy iteration	Hee Jun Yang et.al.	2507.15455	null
2025-07-21	EgoPrune: Efficient Token Pruning for Egomotion Video Reasoning in Embodied Agent	Jiaao Li et.al.	2507.15428	null
2025-07-21	PhishIntentionLLM: Uncovering Phishing Website Intentions through Multi-Agent Retrieval-Augmented Generation	Wenhao Li et.al.	2507.15419	null
2025-07-21	RAD: Retrieval High-quality Demonstrations to Enhance Decision-making	Lu Guo et.al.	2507.15356	null
2025-07-21	One Step is Enough: Multi-Agent Reinforcement Learning based on One-Step Policy Optimization for Order Dispatch on Ride-Sharing Platforms	Zijian Zhao et.al.	2507.15351	null
2025-07-21	QSAF: A Novel Mitigation Framework for Cognitive Degradation in Agentic AI	Hammad Atta et.al.	2507.15330	null
2025-07-21	Strategically Robust Game Theory via Optimal Transport	Nicolas Lanzetti et.al.	2507.15325	null
2025-07-21	Butterfly Effects in Toolchains: A Comprehensive Analysis of Failed Parameter Filling in LLM Tool-Agent Systems	Qian Xiong et.al.	2507.15296	null
2025-07-21	Mixture of Autoencoder Experts Guidance using Unlabeled and Incomplete Data for Exploration in Reinforcement Learning	Elias Malomgré et.al.	2507.15287	null
2025-07-21	Event-Triggered Resilient Consensus of Networked Euler-Lagrange Systems Under Byzantine Attacks	Yuliang Fu et.al.	2507.15283	null
2025-07-21	IM-Chat: A Multi-agent LLM-based Framework for Knowledge Transfer in Injection Molding Industry	Junhyeong Lee et.al.	2507.15268	null
2025-07-21	SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search	Xiaofeng Shi et.al.	2507.15245	null
2025-07-21	FaultLine: Automated Proof-of-Vulnerability Generation Using LLM Agents	Vikram Nitin et.al.	2507.15241	null
2025-07-21	Solving Formal Math Problems by Decomposition and Iterative Reflection	Yichi Zhou et.al.	2507.15225	null
2025-07-21	EchoVoices: Preserving Generational Voices and Memories for Seniors and Children	Haiying Xu et.al.	2507.15221	null
2025-07-21	PromptArmor: Simple yet Effective Prompt Injection Defenses	Tianneng Shi et.al.	2507.15219	null
2025-07-21	1H Polarization above 60% at room temperature by triplet dynamic nuclear polarization	Kenichiro Tateishi et.al.	2507.15217	null
2025-07-21	Personalized 3D Myocardial Infarct Geometry Reconstruction from Cine MRI with Explicit Cardiac Motion Modeling	Yilin Lyu et.al.	2507.15194	null
2025-07-21	Joint-Local Grounded Action Transformation for Sim-to-Real Transfer in Multi-Agent Traffic Control	Justin Turnau et.al.	2507.15174	null
2025-07-20	STL-GO: Spatio-Temporal Logic with Graph Operators for Distributed Systems with Multiple Network Topologies	Yiqi Zhao et.al.	2507.15147	null
2025-07-20	Can We Move Freely in NEOM's The Line? An Agent-Based Simulation of Human Mobility in a Futuristic Smart City	Abderaouf Bahi et.al.	2507.15143	null
2025-07-20	Statistical state dynamics based study of turbulent Eady fronts. Part 2. Finite amplitude equilibria	Eojin Kim et.al.	2507.15134	null
2025-07-20	Initialization-driven neural generation and training for high-dimensional optimal control and first-order mean field games	Mouhcine Assouli et.al.	2507.15126	null
2025-07-20	From Kicking to Causality: Simulating Infant Agency Detection with a Robust Intrinsic Reward	Xia Xu et.al.	2507.15106	null
2025-07-20	Search-Based Autonomous Vehicle Motion Planning Using Game Theory	Pouya Panahandeh et.al.	2507.15088	null
2025-07-20	WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization	Zhengwei Tao et.al.	2507.15061	null
2025-07-20	LibLMFuzz: LLM-Augmented Fuzz Target Generation for Black-box Libraries	Ian Hardgrove et.al.	2507.15058	null
2025-07-20	EduThink4AI: Translating Educational Critical Thinking into Multi-Agent LLM Systems	Xinmeng Hou et.al.	2507.15015	null
2025-07-20	The Rise of AI Teammates in Software Engineering (SE) 3.0: How Autonomous Coding Agents Are Reshaping Software Engineering	Hao Li et.al.	2507.15003	null
2025-07-20	LLM-Enhanced Multi-Agent Reinforcement Learning with Expert Workflow for Real-Time P2P Energy Trading	Chengwei Lou et.al.	2507.14995	null
2025-07-20	Think Like an Engineer: A Neuro-Symbolic Collaboration Agent for Generative Software Requirements Elicitation and Self-Review	Sai Zhang et.al.	2507.14969	null
2025-07-20	STEPC: A Pixel-wise Nonuniformity Correction Framework for Photon-Counting CT in Multi-material Imaging Scenarios	Enze Zhou et.al.	2507.14963	null
2025-07-20	Probing EFX via PMMS: (Non-)Existence Results in Discrete Fair Division	Jarosław Byrka et.al.	2507.14957	null
2025-07-20	Echoes of the Land: An Interactive Installation Based on Physical Model of Earthquake	Ivan C. H. Liu et.al.	2507.14947	null
2025-07-20	Byzantine-Robust Decentralized Coordination of LLM Agents	Yongrae Jo et.al.	2507.14928	null
2025-07-20	Redefining Elderly Care with Agentic AI: Challenges and Opportunities	Ruhul Amin Khalil et.al.	2507.14912	null
2025-07-20	TriCLIP-3D: A Unified Parameter-Efficient Framework for Tri-Modal 3D Visual Grounding based on CLIP	Fan Li et.al.	2507.14904	null
2025-07-20	Learning Nonlinear Causal Reductions to Explain Reinforcement Learning Policies	Armin Kekić et.al.	2507.14901	null
2025-07-20	InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis	Jiale Liu et.al.	2507.14899	null
2025-07-20	AgentFly: Extensible and Scalable Reinforcement Learning for LM Agents	Renxi Wang et.al.	2507.14897	null
2025-07-20	Hierarchical Multi-Agent Reinforcement Learning with Control Barrier Functions for Safety-Critical Autonomous Systems	H. M. Sabbir Ahmad et.al.	2507.14850	null
2025-07-20	Manipulating LLM Web Agents with Indirect Prompt Injection Attack via HTML Accessibility Tree	Sam Johnson et.al.	2507.14799	null
2025-07-19	Towards AI Urban Planner in the Age of GenAI, LLMs, and Agentic AI	Yanjie Fu et.al.	2507.14730	null
2025-07-19	Simulating Chirality: Solving Distance- $k$ -Dispersion on an 1-Interval Connected Ring	Brati Mondal et.al.	2507.14723	null
2025-07-19	Configurable multi-agent framework for scalable and realistic testing of llm-based agents	Sai Wang et.al.	2507.14705	null
2025-07-19	WSI-Agents: A Collaborative Multi-Agent System for Multi-Modal Whole Slide Image Analysis	Xinheng Lyu et.al.	2507.14680	null
2025-07-19	When Autonomy Goes Rogue: Preparing for Risks of Multi-Agent Collusion in Social Systems	Qibing Ren et.al.	2507.14660	null
2025-07-19	Learning to Communicate in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence	Faizan Contractor et.al.	2507.14658	null
2025-07-19	Agentic Satellite-Augmented Low-Altitude Economy and Terrestrial Networks: A Survey on Generative Approaches	Xiaozheng Gao et.al.	2507.14633	null
2025-07-19	Towards a Proactive Autoscaling Framework for Data Stream Processing at the Edge using GRU and Transfer Learning	Eugene Armah et.al.	2507.14597	null
2025-07-19	Amico: An Event-Driven Modular Framework for Persistent and Embedded Autonomy	Hongyi Yang et.al.	2507.14513	null
2025-07-19	Federated Reinforcement Learning in Heterogeneous Environments	Ukjo Hwang et.al.	2507.14487	null
2025-07-22	Routine: A Structural Planning Framework for LLM Agent System in Enterprise	Guancheng Zeng et.al.	2507.14447	null
2025-07-18	NetIntent: Leveraging Large Language Models for End-to-End Intent-Based SDN Automation	Md. Kamrul Hossain et.al.	2507.14398	null
2025-07-18	Adaptive Multi-Agent Reasoning via Automated Workflow Generation	Humza Sami et.al.	2507.14393	null
2025-07-18	Text-to-SQL for Enterprise Data Analytics	Albert Chen et.al.	2507.14372	null
2025-07-18	Stable matchings with switching costs	Boris Pittel et.al.	2507.14362	null
2025-07-18	FedStrategist: A Meta-Learning Framework for Adaptive and Robust Aggregation in Federated Learning	Md Rafid Haque et.al.	2507.14322	null
2025-07-18	Semantic Segmentation based Scene Understanding in Autonomous Vehicles	Ehsan Rassekh et.al.	2507.14303	null
2025-07-18	Distributed consensus-based observer design for target state estimation with bearing measurements	Marcelo Jacinto et.al.	2507.14300	null
2025-07-18	Age of Information Minimization in UAV-Enabled Integrated Sensing and Communication Systems	Yu Bai et.al.	2507.14299	null
2025-07-18	WebGuard: Building a Generalizable Guardrail for Web Agents	Boyuan Zheng et.al.	2507.14293	null
2025-07-18	DREAMS: Density Functional Theory Based Research Engine for Agentic Materials Simulation	Ziqi Wang et.al.	2507.14267	null
2025-07-18	Beyond DNS: Unlocking the Internet of AI Agents via the NANDA Index and Verified AgentFacts	Ramesh Raskar et.al.	2507.14263	null
2025-07-17	Towards an ABM on Proactive Community Adaptation for Climate Change	Önder Gürcan et.al.	2507.14233	null
2025-07-17	Intent-Based Network for RAN Management with Large Language Models	Fransiscus Asisi Bimo et.al.	2507.14230	null
2025-07-18	DPMT: Dual Process Multi-scale Theory of Mind Framework for Real-time Human-AI Collaboration	Xiyun Li et.al.	2507.14088	null
2025-07-18	Collaborative Rational Speech Act: Pragmatic Reasoning for Multi-Turn Dialog	Lautaro Estienne et.al.	2507.14063	null
2025-07-23	Well-posedness and propagation of chaos for multi-agent models with strategies and diffusive effects	Alessandro Baldi et.al.	2507.14058	null
2025-07-18	Online MMS Allocation for Chores	Jiaxin Song et.al.	2507.14039	null
2025-07-18	Architecting Human-AI Cocreation for Technical Services -- Interaction Modes and Contingency Factors	Jochen Wulf et.al.	2507.14034	null
2025-07-18	Byzantine-resilient federated online learning for Gaussian process regression	Xu Zhang et.al.	2507.14021	null
2025-07-18	DreamScene: 3D Gaussian-based End-to-end Text-to-3D Scene Generation	Haoran Li et.al.	2507.13985	null
2025-07-18	A Multi-Objective Optimization framework for Decentralized Learning with coordination constraints	Roberto Morales et.al.	2507.13983	null
2025-07-18	Bottom-up Domain-specific Superintelligence: A Reliable Knowledge Graph is What We Need	Bhishma Dedhia et.al.	2507.13966	null
2025-07-18	NeHMO: Neural Hamilton-Jacobi Reachability Learning for Decentralized Safe Multi-Agent Motion Planning	Qingyi Chen et.al.	2507.13940	null
2025-07-18	Marcel: A Lightweight and Open-Source Conversational Agent for University Student Support	Jan Trienes et.al.	2507.13937	null
2025-07-18	Reframing attention as a reinforcement learning problem for causal discovery	Turan Orujlu et.al.	2507.13920	null
2025-07-18	Advanced X-rays techniques for research-oriented high-resolution imaging of articular cartilage: a scoping review	Simone Fantoni et.al.	2507.13854	null
2025-07-18	Impact of homophily in adherence to anti-epidemic measures on the spread of infectious diseases in social networks	Piotr Bentkowski et.al.	2507.13848	null
2025-07-18	Causal Knowledge Transfer for Multi-Agent Reinforcement Learning in Dynamic Environments	Kathrin Korte et.al.	2507.13846	null
2025-07-18	Principles and Reasons Behind Automated Vehicle Decisions in Ethically Ambiguous Everyday Scenarios	Lucas Elbert Suryana et.al.	2507.13837	null
2025-07-18	Conformal Data Contamination Tests for Trading or Sharing of Data	Martin V. Vejling et.al.	2507.13835	null
2025-07-18	Scalable Submodular Policy Optimization via Pruned Submodularity Graph	Aditi Anand et.al.	2507.13834	null
2025-07-18	CodeEdu: A Multi-Agent Collaborative Platform for Personalized Coding Education	Jianing Zhao et.al.	2507.13814	null
2025-07-18	From Extraction to Synthesis: Entangled Heuristics for Agent-Augmented Strategic Reasoning	Renato Ghisellini et.al.	2507.13768	null
2025-07-21	Navigating the Lobbying Landscape: Insights from Opinion Dynamics Models	Daniele Giachini et.al.	2507.13767	null
2025-07-18	AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework	Yu Yao et.al.	2507.13729	null
2025-07-18	CogniQ-H: A Soft Hierarchical Reinforcement Learning Paradigm for Automated Data Preparation	Jing Chang et.al.	2507.13710	null
2025-07-18	Minimum Clustering of Matrices Based on Phase Alignment	Honghao Wu et.al.	2507.13678	null
2025-07-18	Improved particle swarm optimization algorithm: multi-target trajectory optimization for swarm drones	Minze Li et.al.	2507.13647	null
2025-07-18	Differential Privacy in Kernelized Contextual Bandits via Random Projections	Nikola Pavlovic et.al.	2507.13639	null
2025-07-17	Evolving Neural Controllers for Xpilot-AI Racing Using Neuroevolution of Augmenting Topologies	Jim O'Connor et.al.	2507.13549	null
2025-07-17	Human-Like Trajectories Generation via Receding Horizon Tracking Applied to the TickTacking Interface	Daniele Masti et.al.	2507.13528	null
2025-07-17	Humans learn to prefer trustworthy AI over human partners	Yaomin Jiang et.al.	2507.13524	null
2025-07-17	GraphTrafficGPT: Enhancing Traffic Management Through Graph-Based AI Agent Coordination	Nabil Abdelaziz Ferhat Taleb et.al.	2507.13511	null
2025-07-17	Model-free Reinforcement Learning for Model-based Control: Towards Safe, Interpretable and Sample-efficient Agents	Thomas Banker et.al.	2507.13491	null
2025-07-17	LightAutoDS-Tab: Multi-AutoML Agentic System for Tabular Data	Aleksey Lapin et.al.	2507.13413	null
2025-07-21	A Survey of Context Engineering for Large Language Models	Lingrui Mei et.al.	2507.13334	null
2025-07-17	N Bugs on a Circle	Josh Briley et.al.	2507.13333	null
2025-07-17	Multi-Agent Synergy-Driven Iterative Visual Narrative Synthesis	Wang Xi et.al.	2507.13285	null
2025-07-20	Analysis Theory of Data Economy: Dataization, Technological Progress and Dynamic General Equilibrium	Yongheng Hu et.al.	2507.13274	null
2025-07-17	RemVerse: Supporting Reminiscence Activities for Older Adults through AI-Assisted Virtual Reality	Ruohao Li et.al.	2507.13247	null
2025-07-17	GEMMAS: Graph-based Evaluation Metrics for Multi Agent Systems	Jisoo Lee et.al.	2507.13190	null
2025-07-17	Black Box Deployed -- Functional Criteria for Artificial Moral Agents in the LLM Era	Matthew E. Brophy et.al.	2507.13175	null
2025-07-17	Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback	Suzie Kim et.al.	2507.13171	null
2025-07-17	Prompt Injection 2.0: Hybrid AI Threats	Jeremy McHugh et.al.	2507.13169	null
2025-07-17	SE-VLN: A Self-Evolving Vision-Language Navigation Framework Based on Multimodal Large Language Models	Xiangyu Dong et.al.	2507.13152	null
2025-07-17	RIDAS: A Multi-Agent Framework for AI-RAN with Representation- and Intention-Driven Agents	Kuiyuan Ding et.al.	2507.13140	null
2025-07-17	Governance, productivity and economic development	Cuong Le Van et.al.	2507.13099	null
2025-07-17	iReDev: A Knowledge-Driven Multi-Agent Framework for Intelligent Requirements Development	Dongming Jin et.al.	2507.13081	null
2025-07-17	Intelligent Virtual Sonographer (IVS): Enhancing Physician-Robot-Patient Communication	Tianyu Song et.al.	2507.13052	null
2025-07-17	What Can Robots Teach Us About Trust and Reliance? An interdisciplinary dialogue between Social Sciences and Social Robotics	Julien Wacquez et.al.	2507.13041	null
2025-07-17	MAD-Spear: A Conformity-Driven Prompt Injection Attack on Multi-Agent Debate Systems	Yu Cui et.al.	2507.13038	null
2025-07-17	Lower Bound for Online MMS Assignment of Indivisible Chores	Masoud Seddighin et.al.	2507.12984	null
2025-07-17	Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning	Giwon Lee et.al.	2507.12977	null
2025-07-21	LaViPlan : Language-Guided Visual Path Planning with RLVR	Hayeon Oh et.al.	2507.12911	null
2025-07-17	Autonomous Resource Management in Microservice Systems via Reinforcement Learning	Yujun Zou et.al.	2507.12879	null
2025-07-20	Information-Theoretic Aggregation of Ethical Attributes in Simulated-Command	Taylan Akay et.al.	2507.12862	null
2025-07-17	Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering	Muhammad Fadhil Ginting et.al.	2507.12846	null
2025-07-17	Machine-Readable Ads: Accessibility and Trust Patterns for AI Web Agents interacting with Online Advertisements	Joel Nitu et.al.	2507.12844	null
2025-07-22	Assessing Adaptive World Models in Machines with Novel Games	Lance Ying et.al.	2507.12821	null
2025-07-17	From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning	Gaurav Chaudhary et.al.	2507.12815	null
2025-07-17	MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models	Zhiwei Liu et.al.	2507.12806	null
2025-07-17	Imitating Mistakes in a Learning Companion AI Agent for Online Peer Learning	Sosui Moribe et.al.	2507.12801	null
2025-07-17	City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning	Penglei Sun et.al.	2507.12795	null
2025-07-17	A Comprehensive Survey of Electronic Health Record Modeling: From Deep Learning Approaches to Large Language Models	Weijieying Ren et.al.	2507.12774	null
2025-07-17	Autonomy for Older Adult-Agent Interaction	Jiaxin An et.al.	2507.12767	null
2025-07-17	Public Evaluation on Potential Social Impacts of Fully Autonomous Cybernetic Avatars for Physical Support in Daily-Life Environments: Large-Scale Demonstration and Survey at Avatar Land	Lotfi El Hafi et.al.	2507.12741	null
2025-07-17	Competition Erases Simplicity: Tight Regret Bounds for Uniform Pricing with Multiple Buyers	Houshuang Chen et.al.	2507.12733	null
2025-07-17	Strategy Adaptation in Large Language Model Werewolf Agents	Fuya Nakamori et.al.	2507.12732	null
2025-07-17	Identification of Authoritative Nodes and Dismantling of Illicit Networks Using a Novel Metric for Measuring Strength of a Graph	Kartikeya Kansal et.al.	2507.12711	null
2025-07-16	Fly, Fail, Fix: Iterative Game Repair with Reinforcement Learning and Large Multimodal Models	Alex Zook et.al.	2507.12666	null
2025-07-16	NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting	Kuangshi Ai et.al.	2507.12621	null
2025-07-16	A Survey of Explainable Reinforcement Learning: Targets, Methods and Needs	Léo Saulières et.al.	2507.12599	null
2025-07-16	The Impact of Social Attractiveness on Casual Group Formation: Power-Law Group Sizes and Suppressed Percolation	Matheus S. Mariano et.al.	2507.12585	null
2025-07-20	Can Mental Imagery Improve the Thinking Capabilities of AI Systems?	Slimane Larabi et.al.	2507.12555	null
2025-07-15	FOUNDER: Grounding Foundation Models in World Models for Open-Ended Embodied Decision Making	Yucen Wang et.al.	2507.12496	null
2025-07-15	MR-LDM -- The Merge-Reactive Longitudinal Decision Model: Game Theoretic Human Decision Modeling for Interactive Sim Agents	Dustin Holley et.al.	2507.12494	null
2025-07-15	On multiagent online problems with predictions	Gabriel Istrate et.al.	2507.12486	null
2025-07-14	AI-Powered Math Tutoring: Platform for Personalized and Adaptive Education	Jarosław A. Chudziak et.al.	2507.12484	null
2025-07-16	Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data	Chandana Cheerla et.al.	2507.12425	null
2025-07-16	Modeling Feasible Locomotion of Nanobots for Cancer Detection and Treatment	Noble Harasha et.al.	2507.12400	null
2025-07-16	Beyond Single Models: Enhancing LLM Detection of Ambiguity in Requests through Debate	Ana Davila et.al.	2507.12370	null
2025-07-21	GitChameleon 2.0: Evaluating AI Code Generation Against Python Library Version Incompatibilities	Diganta Misra et.al.	2507.12367	null
2025-07-16	Social polarization promoted by sparse higher-order interactions	Hugo Pérez-Martínez et.al.	2507.12325	null
2025-07-17	Next-Gen Museum Guides: Autonomous Navigation and Visitor Interaction with an Agentic Robot	Luca Garello et.al.	2507.12273	null
2025-07-16	Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes	Johann Frei et.al.	2507.12261	null
2025-07-16	Toward a Behavioural Translation Style Space: Simulating the Temporal Dynamics of Affect, Behaviour, and Cognition in Human Translation Production	Michael Carl et.al.	2507.12208	null
2025-07-16	BenchRL-QAS: Benchmarking reinforcement learning algorithms for quantum architecture search	Azhar Ikhtiarudin et.al.	2507.12189	null
2025-07-16	Fast and Scalable Game-Theoretic Trajectory Planning with Intentional Uncertainties	Zhenmin Huang et.al.	2507.12174	null
2025-07-16	Convergence Rate of Generalized Nash Equilibrium Learning in Strongly Monotone Games with Linear Constraints	Tatiana Tatarenko et.al.	2507.12112	null
2025-07-16	Topology Enhanced MARL for Multi-Vehicle Cooperative Decision-Making of CAVs	Ye Han et.al.	2507.12110	null
2025-07-16	Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics	Muleilan Pei et.al.	2507.12083	null
2025-07-16	Evaluating the Ability of Large Language Models to Reason about Cardinal Directions, Revisited	Anthony G Cohn et.al.	2507.12059	null
2025-07-16	Contracting with a Mechanism Designer	Tian Bai et.al.	2507.12054	null
2025-07-16	ARRC: Explainable, Workflow-Integrated Recommender for Sustainable Resource Optimization Across the Edge-Cloud Continuum	Brian-Frederik Jahnke et.al.	2507.12032	null
2025-07-16	QAS-QTNs: Curriculum Reinforcement Learning-Driven Quantum Architecture Search for Quantum Tensor Networks	Siddhant Dutta et.al.	2507.12013	null
2025-07-16	Understanding visual attention beehind bee-inspired UAV navigation	Pranav Rajbhandari et.al.	2507.11992	null
2025-07-17	Aime: Towards Fully-Autonomous Multi-Agent Framework	Yexuan Shi et.al.	2507.11988	null
2025-07-16	Value-Based Large Language Model Agent Simulation for Mutual Evaluation of Trust and Interpersonal Closeness	Yuki Sakamoto et.al.	2507.11979	null
2025-07-16	Online Training and Pruning of Deep Reinforcement Learning Networks	Valentin Frank Ingmar Guenter et.al.	2507.11975	null
2025-07-16	Graph Representations for Reading Comprehension Analysis using Large Language Model and Eye-Tracking Biomarker	Yuhong Zhang et.al.	2507.11972	null
2025-07-16	IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving	Kanghyun Ryu et.al.	2507.11940	null
2025-07-16	From Generative to Episodic: Sample-Efficient Replicable Reinforcement Learning	Max Hopkins et.al.	2507.11926	null
2025-07-16	Hybrid Conformal Prediction-based Risk-Aware Model Predictive Planning in Dense, Uncertain Environments	Jeongyong Yang et.al.	2507.11920	null
2025-07-16	CoCre-Sam (Kokkuri-san): Modeling Ouija Board as Collective Langevin Dynamics Sampling from Fused Language Models	Tadahiro Taniguchi et.al.	2507.11906	null
2025-07-16	Extremal Testing for Network Software using LLMs	Rathin Singha et.al.	2507.11898	null
2025-07-16	Generative Intelligence Systems in the Flow of Group Emotions	Fernando Koch et.al.	2507.11831	null
2025-07-16	The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and Scientist	Haoxuan Zhang et.al.	2507.11810	null
2025-07-16	New allocation rule based on graph structures and their application to economic phenomena	Taiki Yamada et.al.	2507.11808	null
2025-07-15	Large-scale distributed synchronization systems, using a cancel-on-completion redundancy mechanism	Alexander Stolyar et.al.	2507.11779	null
2025-07-15	A Cellular Automata Approach to Donation Game	Marcin Kowalik et.al.	2507.11744	null
2025-07-15	Let's Think in Two Steps: Mitigating Agreement Bias in MLLMs with Self-Grounded Verification	Moises Andrade et.al.	2507.11662	null
2025-07-15	STAGED: A Multi-Agent Neural Network for Learning Cellular Interaction Dynamics	Joao F. Rocha et.al.	2507.11660	null
2025-07-15	VISTA: Monocular Segmentation-Based Mapping for Appearance and View-Invariant Global Localization	Hannah Shafferman et.al.	2507.11653	null
2025-07-15	General Modular Harness for LLM Agents in Multi-Turn Gaming Environments	Yuxuan Zhang et.al.	2507.11633	null
2025-07-15	AI, Humans, and Data Science: Optimizing Roles Across Workflows and the Workforce	Richard Timpone et.al.	2507.11597	null
2025-07-14	Consumer Law for AI Agents	Christoph Busch et.al.	2507.11567	null
2025-07-14	Emergent Heterogeneous Swarm Control Through Hebbian Learning	Fuda van Diggelen et.al.	2507.11566	null
2025-07-14	A Model Aware AIGC Task Offloading Algorithm in IIoT Edge Computing	Xin Wang et.al.	2507.11560	null
2025-07-15	DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering	Yinsheng Li et.al.	2507.11527	null
2025-07-15	Opinion dynamics: Statistical physics and beyond	Michele Starnini et.al.	2507.11521	null
2025-07-15	AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air	Shiyi Yang et.al.	2507.11515	null
2025-07-15	On the Complexity of the Optimal Correlated Equilibria in Extensive-Form Games	Vincent Cheval et.al.	2507.11509	null
2025-07-15	LF: Online Multi-Robot Path Planning Meets Optimal Trajectory Control	Ajay Shankar et.al.	2507.11464	null
2025-07-15	EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes	LG AI Research et.al.	2507.11407	null
2025-07-15	From Production Logistics to Smart Manufacturing: The Vision for a New RoboCup Industrial League	Supun Dissanayaka et.al.	2507.11402	null
2025-07-20	Dr.Copilot: A Multi-Agent Prompt Optimized Assistant for Improving Patient-Doctor Communication in Romanian	Andrei Niculae et.al.	2507.11299	null
2025-07-15	Taming Uncertainty via Automation: Observing, Analyzing, and Optimizing Agentic AI Systems	Dany Moshkovich et.al.	2507.11277	null
2025-07-15	An Empirical Study of Multi-Agent RAG for Real-World University Admissions Counseling	Anh Nguyen-Duc et.al.	2507.11272	null
2025-07-15	Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound	Tal Fiskus et.al.	2507.11269	null
2025-07-15	An Agentic Flow for Finite State Machine Extraction using Prompt Chaining	Fares Wael et.al.	2507.11222	null
2025-07-15	Fair Contracts	Matteo Castiglioni et.al.	2507.11214	null
2025-07-15	Role-Playing LLM-Based Multi-Agent Support Framework for Detecting and Addressing Family Communication Bias	Rushia Harada et.al.	2507.11210	null
2025-07-15	Temperature and Persona Shape LLM Agent Consensus With Minimal Accuracy Gains in Qualitative Coding	Conrad Borchers et.al.	2507.11198	null
2025-07-15	Quantized Rank Reduction: A Communications-Efficient Federated Learning Scheme for Network-Critical Applications	Dimitrios Kritsiolis et.al.	2507.11183	null
2025-07-15	AI Agent Architecture for Decentralized Trading of Alternative Assets	Ailiya Borjigin et.al.	2507.11117	null
2025-07-15	Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander	Li Wang et.al.	2507.11079	null
2025-07-17	SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks	Pavel Adamenko et.al.	2507.11059	null
2025-07-16	Journalism-Guided Agentic In-Context Learning for News Stance Detection	Dahyun Lee et.al.	2507.11049	null
2025-07-15	Value of History in Social Learning: Applications to Markets for History	Hiroto Sato et.al.	2507.11029	null
2025-07-15	DS@GT at eRisk 2025: From prompts to predictions, benchmarking early depression detection with conversational agent based assessments and temporal attention models	Anthony Miyaguchi et.al.	2507.10958	null
2025-07-15	A Learning Framework For Cooperative Collision Avoidance of UAV Swarms Leveraging Domain Knowledge	Shuangyao Huang et.al.	2507.10913	null
2025-07-15	Lessons Learned from Evaluation of LLM based Multi-agents in Safer Therapy Recommendation	Yicong Wu et.al.	2507.10911	null
2025-07-15	NavComposer: Composing Language Instructions for Navigation Trajectories through Action-Scene-Object Modularization	Zongtao He et.al.	2507.10894	null
2025-07-15	Start from the End: A Framework for Computational Policy Exploration to Inform Effective and Geospatially Consistent Interventions applied to COVID-19 in St. Louis	David O'Gara et.al.	2507.10870	null
2025-07-14	LLM-Guided Agentic Object Detection for Open-World Understanding	Furkan Mumcu et.al.	2507.10844	null
2025-07-14	Past, Present and Future: Exploring Adaptive AI in Software Development Bots	Omar Elsisi et.al.	2507.10822	null
2025-07-14	Semantic Context for Tool Orchestration	Robert Müller et.al.	2507.10820	null
2025-07-14	Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection	Huiyi Wang et.al.	2507.10814	null
2025-07-14	React to This (RTT): A Nonverbal Turing Test for Embodied AI	Chuxuan Zhang et.al.	2507.10812	null
2025-07-14	Warehouse Spatial Question Answering with LLM Agent	Hsiang-Wei Huang et.al.	2507.10778	null
2025-07-14	RCG: Safety-Critical Scenario Generation for Robust Autonomous Driving via Real-World Crash Grounding	Benjamin Stoler et.al.	2507.10749	null
2025-07-14	Ground-Compose-Reinforce: Tasking Reinforcement Learning Agents through Formal Language	Andrew C. Li et.al.	2507.10741	null
2025-07-14	Bridging Brains and Machines: A Unified Frontier in Neuroscience, Artificial Intelligence, and Neuromorphic Systems	Sohan Shankar et.al.	2507.10722	null
2025-07-14	Exploring User Security and Privacy Attitudes and Concerns Toward the Use of General-Purpose LLM Chatbots for Mental Health	Jabari Kwesi et.al.	2507.10695	null
2025-07-14	Vision Language Action Models in Robotic Manipulation: A Systematic Review	Muhayy Ud Din et.al.	2507.10672	null
2025-07-16	From Semantic Web and MAS to Agentic AI: A Unified Narrative of the Web of Agents	Tatiana Petrova et.al.	2507.10644	null
2025-07-14	Enhancing the Capabilities of Large Language Models for API calls through Knowledge Graphs	Ye Yang et.al.	2507.10630	null
2025-07-14	Game Theory Meets LLM and Agentic AI: Reimagining Cybersecurity for the Age of Intelligent Threats	Quanyan Zhu et.al.	2507.10621	null
2025-07-13	Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks	Oluwaseyi Giwa et.al.	2507.10619	null
2025-07-13	LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents	Zihe Yan et.al.	2507.10610	null
2025-07-12	Emergence of Hierarchical Emotion Organization in Large Language Models	Bo Zhao et.al.	2507.10599	null
2025-07-11	ARPaCCino: An Agentic-RAG for Policy as Code Compliance	Francesco Romeo et.al.	2507.10584	null
2025-07-11	An Offline Mobile Conversational Agent for Mental Health Support: Learning from Emotional Dialogues and Psychological Texts with Student-Centered Evaluation	Vimaleswar A et.al.	2507.10580	null
2025-07-16	Truth Sleuth and Trend Bender: AI Agents to fact-check YouTube videos and influence opinions	Cécile Logé et.al.	2507.10577	null
2025-07-14	EmbRACE-3K: Embodied Reasoning and Action in Complex Environments	Mingxian Lin et.al.	2507.10548	null
2025-07-14	Graph World Model	Tao Feng et.al.	2507.10539	null
2025-07-14	DeepResearch $^{\text{Eco}}$ : A Recursive Agentic Workflow for Complex Scientific Question Answering in Ecology	Jennifer D'Souza et.al.	2507.10522	null
2025-07-14	An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments	Mikko Korkiakoski et.al.	2507.10469	null
2025-07-14	Logic layer Prompt Control Injection (LPCI): A Novel Security Vulnerability Class in Agentic Systems	Hammad Atta et.al.	2507.10457	null
2025-07-14	Negative entropy and non-equilibrium Euclidean shell	Yang An et.al.	2507.10450	null
2025-07-14	Am I on the Right Track? What Can Predicted Query Performance Tell Us about the Search Behaviour of Agentic RAG	Fangzheng Tian et.al.	2507.10411	null
2025-07-14	Machine-Learning to Trust	Ran Spiegler et.al.	2507.10363	null
2025-07-14	Toolsuite for Implementing Multiagent Systems Based on Communication Protocols	Amit K. Chopra et.al.	2507.10324	null
2025-07-14	Prompt Informed Reinforcement Learning for Visual Coverage Path Planning	Venkat Margapuri et.al.	2507.10284	null
2025-07-14	Toward Real-World Table Agents: Capabilities, Workflows, and Design Principles for LLM-based Table Intelligence	Jiaming Tian et.al.	2507.10281	null
2025-07-14	ToMacVF : Temporal Macro-action Value Factorization for Asynchronous Multi-Agent Reinforcement Learning	Wenjing Zhang et.al.	2507.10251	null
2025-07-14	Should We Ever Prefer Decision Transformer for Offline Reinforcement Learning?	Yumi Omori et.al.	2507.10174	null
2025-07-14	Play Style Identification Using Low-Level Representations of Play Traces in MicroRTS	Ruizhe Yu Xia et.al.	2507.10172	null
2025-07-14	Simulating Biases for Interpretable Fairness in Offline and Online Classifiers	Ricardo Inácio et.al.	2507.10154	null
2025-07-14	Adaptability in Multi-Agent Reinforcement Learning: A Framework and Unified Review	Siyi Hu et.al.	2507.10142	null
2025-07-16	A PBN-RL-XAI Framework for Discovering a "Hit-and-Run" Therapeutic Strategy in Melanoma	Zhonglin Liu et.al.	2507.10136	null
2025-07-14	Towards High Supervised Learning Utility Training Data Generation: Data Pruning and Column Reordering	Tung Sum Thomas Kwok et.al.	2507.10088	null
2025-07-14	Cultural Bias in Large Language Models: Evaluating AI Agents through Moral Questionnaires	Simon Münker et.al.	2507.10073	null
2025-07-14	Finetuning Deep Reinforcement Learning Policies with Evolutionary Strategies for Control of Underactuated Robots	Marco Calì et.al.	2507.10030	null
2025-07-14	The Man Behind the Sound: Demystifying Audio Private Attribute Profiling via Multimodal Large Language Model Agents	Lixu Wang et.al.	2507.10016	null
2025-07-14	On The Role of Intentionality in Knowledge Representation: Analyzing Scene Context for Cognitive Agents with a Tiny Language Model	Mark Burgess et.al.	2507.10000	null
2025-07-17	Predictive & Trust-based Multi-Agent Coordination	Venkatraman Renganathan et.al.	2507.09997	null
2025-07-14	Evolution of Fear and Social Rewards in Prey-Predator Relationship	Yuji Kanagawa et.al.	2507.09992	null
2025-07-14	Improving monotonic optimization in heterogeneous multi-agent reinforcement learning with optimal marginal deterministic policy gradient	Xiaoyang Yu et.al.	2507.09989	null
2025-07-14	Quantum measurement of work in mesoscopic systems	Anant Vijay Varma et.al.	2507.09977	null
2025-07-14	Generalized Quantal Response Equilibrium: Existence and Efficient Learning	Apurv Shukla et.al.	2507.09928	null
2025-07-14	Intelligent Task Management via Dynamic Multi-region Division in LEO Satellite Networks	Zixuan Song et.al.	2507.09926	null
2025-07-14	Energy-Stable Swarm-Based Inertial Algorithms for Optimization	Xuelong Gu et.al.	2507.09909	null
2025-07-14	Large Population Models	Ayush Chopra et.al.	2507.09901	null
2025-07-14	Towards Realistic and Interpretable Market Simulations: Factorizing Financial Power Law using Optimal Transport	Ryuji Hashimoto et.al.	2507.09863	null
2025-07-14	Multi-residual Mixture of Experts Learning for Cooperative Control in Multi-vehicle Systems	Vindula Jayawardana et.al.	2507.09836	null
2025-07-20	Active Probing with Multimodal Predictions for Motion Planning	Darshan Gadginmath et.al.	2507.09822	null
2025-07-13	An infinitesimal generator approach on weak convergence of regulated multi-class matching systems	Bowen Xie et.al.	2507.09789	null
2025-07-13	TinyTroupe: An LLM-powered Multiagent Persona Simulation Toolkit	Paulo Salem et.al.	2507.09788	null
2025-07-13	Toward accurate RUL and SOH estimation using reinforced graph-based PINNs enhanced with dynamic weights	Mohamadreza Akbari Pour et.al.	2507.09766	null
2025-07-13	IteraOptiRacing: A Unified Planning-Control Framework for Real-time Autonomous Racing for Iterative Optimal Performance	Yifan Zeng et.al.	2507.09714	null
2025-07-13	Token Compression Meets Compact Vision Transformers: A Survey and Comparative Evaluation for Edge AI	Phat Nguyen et.al.	2507.09702	null
2025-07-13	Networked Information Aggregation via Machine Learning	Michael Kearns et.al.	2507.09683	null
2025-07-13	Negotiating Comfort: Simulating Personality-Driven LLM Agents in Shared Residential Social Networks	Ann Nedime Nese Rende et.al.	2507.09657	null
2025-07-13	humancompatible.interconnect: Testing Properties of Repeated Uses of Interconnections of AI Systems	Rodion Nazarov et.al.	2507.09626	null
2025-07-13	On the existence of EFX allocations for goods	Ujjwal Kumar et.al.	2507.09600	null
2025-07-17	THOR: Transformer Heuristics for On-Demand Retrieval	Isaac Shi et.al.	2507.09592	null
2025-07-13	eSapiens: A Platform for Secure and Auditable Retrieval-Augmented Generation	Isaac Shi et.al.	2507.09588	null
2025-07-13	AICrypto: A Comprehensive Benchmark For Evaluating Cryptography Capabilities of Large Language Models	Yu Wang et.al.	2507.09580	null
2025-07-13	On Probabilistic Assignment Rules	Sreedurga Gogulapati et.al.	2507.09550	null
2025-07-13	Existence of Fair and Efficient Allocation of Indivisible Chores	Ryoga Mahara et.al.	2507.09544	null
2025-07-13	Learning to Control Dynamical Agents via Spiking Neural Networks and Metropolis-Hastings Sampling	Ali Safa et.al.	2507.09540	null
2025-07-13	Self-supervised Pretraining for Integrated Prediction and Planning of Automated Vehicles	Yangang Ren et.al.	2507.09537	null
2025-07-13	TruckV2X: A Truck-Centered Perception Dataset	Tenghui Xie et.al.	2507.09505	null
2025-07-13	GoalfyMax: A Protocol-Driven Multi-Agent System for Intelligent Experience Entities	Siyi Wu et.al.	2507.09497	null
2025-07-13	GenAI-based Multi-Agent Reinforcement Learning towards Distributed Agent Intelligence: A Generative-RL Agent Perspective	Hang Wang et.al.	2507.09495	null
2025-07-13	Evaluating LLMs on Sequential API Call Through Automated Test Generation	Yuheng Huang et.al.	2507.09481	null
2025-07-16	Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs	Yangning Li et.al.	2507.09477	null
2025-07-13	Incentive-Aware Dynamic Resource Allocation under Long-Term Cost Constraints	Yan Dai et.al.	2507.09473	null
2025-07-13	MobiWorld: World Models for Mobile Wireless Network	Haoye Chai et.al.	2507.09462	null
2025-07-13	Intermediate Interaction Strategies for Collective Behavior	Y. Kikuchi et.al.	2507.09457	null
2025-07-13	Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions	Yuanhong Zheng et.al.	2507.09446	null
2025-07-12	Contracting a crowd of heterogeneous agents	Guillermo Alonso Alvarez et.al.	2507.09415	null
2025-07-12	Adaptive Social Learning using Theory of Mind	Lance Ying et.al.	2507.09409	null
2025-07-12	LLM-Stackelberg Games: Conjectural Reasoning Equilibria and Their Applications to Spearphishing	Quanyan Zhu et.al.	2507.09407	null
2025-07-12	Knowledge Conceptualization Impacts RAG Efficacy	Chris Davis Jaldi et.al.	2507.09389	null
2025-07-12	Constrained Style Learning from Imperfect Demonstrations under Task Optimality	Kehan Wen et.al.	2507.09371	null
2025-07-15	Simulation for All: A Step-by-Step Cookbook for Developing Human-Centered Multi-Agent Transportation Simulators	Shiva Azimi et.al.	2507.09367	null
2025-07-12	When Developer Aid Becomes Security Debt: A Systematic Analysis of Insecure Behaviors in LLM Coding Agents	Matous Kozak et.al.	2507.09329	null
2025-07-12	StockSim: A Dual-Mode Order-Level Simulator for Evaluating Multi-Agent LLMs in Financial Markets	Charidimos Papadakis et.al.	2507.09255	null
2025-07-12	Hide-and-Shill: A Reinforcement Learning Framework for Market Manipulation Detection in Symphony-a Decentralized Multi-Agent System	Ronghua Shi et.al.	2507.09179	null
2025-07-12	Continual Reinforcement Learning by Planning with Online World Models	Zichen Liu et.al.	2507.09177	null
2025-07-12	RAMA: Retrieval-Augmented Multi-Agent Framework for Misinformation Detection in Multimodal Fact-Checking	Shuo Yang et.al.	2507.09174	null
2025-07-12	Tactile-VLA: Unlocking Vision-Language-Action Model's Physical Knowledge for Tactile Generalization	Jialei Huang et.al.	2507.09160	null
2025-07-12	Egalitarian-equivalent and strategy-proof mechanisms in homogeneous multi-object allocation problems	Hinata Kurashita et.al.	2507.09152	null
2025-07-12	A Study of Value-Aware Eigenoptions	Harshil Kotamreddy et.al.	2507.09127	null
2025-07-12	Proactive AI-and-RAN Workload Orchestration in O-RAN Architectures for 6G Networks	Syed Danial Ali Shah et.al.	2507.09124	null
2025-07-12	AInsight: Augmenting Expert Decision-Making with On-the-Fly Insights Grounded in Historical Data	Mohammad Abolnejadian et.al.	2507.09100	null
2025-07-12	Transformer based Collaborative Reinforcement Learning for Fluid Antenna System (FAS)-enabled 3D UAV Positioning	Xiaoren Xu et.al.	2507.09094	null
2025-07-12	Learning from Synthetic Labs: Language Models as Auction Participants	Anand Shah et.al.	2507.09083	null
2025-07-11	Infinite Video Understanding	Dell Zhang et.al.	2507.09068	null
2025-07-11	SetupBench: Assessing Software Engineering Agents' Ability to Bootstrap Development Environments	Avi Arora et.al.	2507.09063	null
2025-07-11	Behavioral Exploration: Learning to Explore via In-Context Adaptation	Andrew Wagenmaker et.al.	2507.09041	null
2025-07-11	Accelerating Drug Discovery Through Agentic AI: A Multi-Agent Approach to Laboratory Automation in the DMTA Cycle	Yao Fehlis et.al.	2507.09023	null
2025-07-11	How to Train a Leader: Hierarchical Reasoning in Multi-Agent LLMs	Andrew Estornell et.al.	2507.08960	null
2025-07-15	Bridging Literature and the Universe Via A Multi-Agent Large Language Model System	Xiaowen Zhang et.al.	2507.08958	null
2025-07-11	Optimizing Sequential Multi-Step Tasks with Parallel LLM Agents	Enhao Zhang et.al.	2507.08944	null
2025-07-10	AirScape: An Aerial Generative World Model with Motion Controllability	Baining Zhao et.al.	2507.08885	null
2025-07-10	Agent-based visualization of streaming text	Jordan Riley Benson et.al.	2507.08884	null
2025-07-11	NeuralOS: Towards Simulating Operating Systems via Neural Generative Models	Luke Rivard et.al.	2507.08800	null
2025-07-11	SPLASH! Sample-efficient Preference-based inverse reinforcement learning for Long-horizon Adversarial tasks from Suboptimal Hierarchical demonstrations	Peter Crowley et.al.	2507.08707	null
2025-07-11	elsciRL: Integrating Language Solutions into Reinforcement Learning Problem Settings	Philip Osborne et.al.	2507.08705	null
2025-07-11	Introspection of Thought Helps AI Agents	Haoran Sun et.al.	2507.08664	null
2025-07-11	Safe Deep Reinforcement Learning for Resource Allocation with Peak Age of Information Violation Guarantees	Berire Gunes Reyhan et.al.	2507.08653	null
2025-07-11	DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images	Haoran Sun et.al.	2507.08648	null
2025-07-11	OnlineBEV: Recurrent Temporal Fusion in Bird's Eye View Representations for Multi-Camera 3D Perception	Junho Koh et.al.	2507.08644	null
2025-07-11	Agentic Large Language Models for Conceptual Systems Engineering and Design	Soheyl Massoudi et.al.	2507.08619	null
2025-07-11	AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs	Florian Grötschla et.al.	2507.08616	null
2025-07-11	Emergent Natural Language with Communication Games for Improving Image Captioning Capabilities without Additional Data	Parag Dutta et.al.	2507.08610	null
2025-07-11	Unlocking Speech Instruction Data Potential with Query Rewriting	Yonghua Hei et.al.	2507.08603	null
2025-07-11	To Trade or Not to Trade: An Agentic Approach to Estimating Market Risk Improves Trading Decisions	Dimitrios Emmanoulopoulos et.al.	2507.08584	null
2025-07-11	SAM2RL: Towards Reinforcement Learning Memory Control in Segment Anything Model 2	Alen Adamyan et.al.	2507.08548	null
2025-07-11	Recursive Reward Aggregation	Yuting Tang et.al.	2507.08537	null
2025-07-11	Occlusion-Guided Feature Purification Learning via Reinforced Knowledge Distillation for Occluded Person Re-Identification	Yufei Zheng et.al.	2507.08520	null
2025-07-11	The stability of bi-polarization on dynamical directed graphs: an emergent game perspective	Yakun Wang et.al.	2507.08449	null
2025-07-11	Finding Common Ground: Using Large Language Models to Detect Agreement in Multi-Agent Decision Conferences	Selina Heller et.al.	2507.08440	null
2025-07-11	Age of Information Optimization in Laser-charged UAV-assisted IoT Networks: A Multi-agent Deep Reinforcement Learning Method	Geng Sun et.al.	2507.08429	null
2025-07-11	A Survey of Large Language Models in Discipline-specific Research: Challenges, Methods and Opportunities	Lu Xiang et.al.	2507.08425	null
2025-07-11	Temperature Measurement in Agent Systems	Christoph J. Börner et.al.	2507.08394	null
2025-07-11	Multi-Agent LLMs as Ethics Advocates in AI-Based Systems	Asma Yamani et.al.	2507.08392	null
2025-07-11	Online Pre-Training for Offline-to-Online Reinforcement Learning	Yongjae Shin et.al.	2507.08387	null
2025-07-11	Exploring Design of Multi-Agent LLM Dialogues for Research Ideation	Keisuke Ueda et.al.	2507.08350	null
2025-07-11	What Factors Affect LLMs and RLLMs in Financial Question Answering?	Peng Wang et.al.	2507.08339	null
2025-07-11	MK2 at PBIG Competition: A Prompt Generation Solution	Yuzheng Xu et.al.	2507.08335	null
2025-07-11	CRMAgent: A Multi-Agent LLM System for E-Commerce CRM Message Template Generation	Yinzhu Quan et.al.	2507.08325	null
2025-07-15	KAT-V1: Kwai-AutoThink Technical Report	Zizheng Zhan et.al.	2507.08297	null
2025-07-11	Agent Safety Alignment via Reinforcement Learning	Zeyang Sha et.al.	2507.08270	null
2025-07-11	Giving AI Agents Access to Cryptocurrency and Smart Contracts Creates New Vectors of AI Harm	Bill Marino et.al.	2507.08249	null
2025-07-11	Advancing AI Capabilities and Evolving Labor Outcomes	Jacob Dominski et.al.	2507.08244	null
2025-07-10	Effect of Static vs. Conversational AI-Generated Messages on Colorectal Cancer Screening Intent: a Randomized Controlled Trial	Neil K. R. Sehgal et.al.	2507.08211	null
2025-07-10	From Curiosity to Competence: How World Models Interact with the Dynamics of Exploration	Fryderyk Mantiuk et.al.	2507.08210	null
2025-07-10	Reasoning and Behavioral Equilibria in LLM-Nash Games: From Mindsets to Actions	Quanyan Zhu et.al.	2507.08208	null
2025-07-10	A Dynamic Stackelberg Game Framework for Agentic AI Defense Against LLM Jailbreaking	Zhengye Han et.al.	2507.08207	null
2025-07-10	KP-A: A Unified Network Knowledge Plane for Catalyzing Agentic Network Intelligence	Yun Tang et.al.	2507.08164	null
2025-07-10	Code with Me or for Me? How Increasing AI Automation Transforms Developer Workflows	Valerie Chen et.al.	2507.08149	null
2025-07-10	AI for NONMEM Coding in Pharmacometrics Research and Education: Shortcut or Pitfall?	Wenhao Zheng et.al.	2507.08144	null
2025-07-10	Noise-Enabled Goal Attainment in Crowded Collectives	Lucy Liu et.al.	2507.08100	null
2025-07-10	Multi-Scale Network Dynamics and Systemic Risk: A Model Context Protocol Approach to Financial Markets	Avishek Bhandari et.al.	2507.08065	null
2025-07-10	MCPmed: A Call for MCP-Enabled Bioinformatics Web Services for LLM-Driven Discovery	Matthias Flotho et.al.	2507.08055	null
2025-07-09	AblationBench: Evaluating Automated Planning of Ablations in Empirical AI Research	Talor Abramovich et.al.	2507.08038	null
2025-07-14	PyVision: Agentic Vision with Dynamic Tooling	Shitian Zhao et.al.	2507.07998	null
2025-07-10	OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding	JingLi Lin et.al.	2507.07984	null
2025-07-15	Reinforcement Learning with Action Chunking	Qiyang Li et.al.	2507.07969	null
2025-07-10	MIRIX: Multi-Agent Memory System for LLM-Based Agents	Yu Wang et.al.	2507.07957	null
2025-07-10	Agentic Retrieval of Topics and Insights from Earnings Calls	Anant Gupta et.al.	2507.07906	null
2025-07-11	The Trust Fabric: Decentralized Interoperability and Economic Coordination for the Agentic Web	Sree Bhargavi Balija et.al.	2507.07901	null
2025-07-10	Automating MD simulations for Proteins using Large language Models: NAMD-Agent	Achuth Chandrasekhar et.al.	2507.07887	null
2025-07-10	DocCHA: Towards LLM-Augmented Interactive Online diagnosis System	Xinyi Liu et.al.	2507.07870	null
2025-07-10	"So, Tell Me About Your Policy...": Distillation of interpretable policies from Deep Reinforcement Learning agents	Giovanni Dispoto et.al.	2507.07848	null
2025-07-10	Perceptual Distortions and Autonomous Representation Learning in a Minimal Robotic System	David Warutumo et.al.	2507.07845	null
2025-07-10	BEAVER: Building Environments with Assessable Variation for Evaluating Multi-Objective Reinforcement Learning	Ruohong Liu et.al.	2507.07769	null
2025-07-10	Beyond Connectivity: Higher-Order Network Framework for Capturing Memory-Driven Mobility Dynamics	Chen Zhang et.al.	2507.07727	null
2025-07-10	Multi-agent Reinforcement Learning-based In-place Scaling Engine for Edge-cloud Systems	Jovan Prodanov et.al.	2507.07671	null
2025-07-10	Upper Expected Meeting Times for Interdependent Stochastic Agents	Marco Sangalli et.al.	2507.07626	null
2025-07-10	Position: We Need An Algorithmic Understanding of Generative AI	Oliver Eberle et.al.	2507.07544	null
2025-07-10	Toward Real-World Chinese Psychological Support Dialogues: CPsDD Dataset and a Co-Evolving Multi-Agent System	Yuanchen Shi et.al.	2507.07509	null
2025-07-10	The Pandora's Box Problem with Sequential Inspections	Ali Aouad et.al.	2507.07508	null
2025-07-15	Hallucination Stations: On Some Basic Limitations of Transformer-Based Language Models	Varin Sikka et.al.	2507.07505	null
2025-07-11	StarDojo: Benchmarking Open-Ended Behaviors of Agentic Multimodal LLMs in Production-Living Simulations with Stardew Valley	Weihao Tan et.al.	2507.07445	null
2025-07-10	SAND: Boosting LLM Agents with Self-Taught Action Deliberation	Yu Xia et.al.	2507.07441	null
2025-07-12	DrugMCTS: a drug repurposing framework combining multi-agent, RAG and Monte Carlo Tree Search	Zerui Yang et.al.	2507.07426	null
2025-07-10	KVFlow: Efficient Prefix Caching for Accelerating LLM-Based Multi-Agent Workflows	Zaifeng Pan et.al.	2507.07400	null
2025-07-10	PILOC: A Pheromone Inverse Guidance Mechanism and Local-Communication Framework for Dynamic Target Search of Multi-Agent in Unknown Environments	Hengrui Liu et.al.	2507.07376	null
2025-07-11	FLoRA: An Advanced AI-Powered Engine to Facilitate Hybrid Human-AI Regulated Learning	Xinyu Li et.al.	2507.07362	null
2025-07-09	Optimizing Model Splitting and Device Task Assignment for Deceptive Signal Assisted Private Multi-hop Split Learning	Dongyu Wei et.al.	2507.07323	null
2025-07-09	Optimizing Communication and Device Clustering for Clustered Federated Learning with Differential Privacy	Dongyu Wei et.al.	2507.07320	null
2025-07-09	Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation	Anirban Saha Anik et.al.	2507.07307	null
2025-07-09	ViDove: A Translation Agent System with Multimodal Context and Memory-Augmented Reasoning	Yichen Lu et.al.	2507.07306	null
2025-07-09	Application of LLMs to Multi-Robot Path Planning and Task Allocation	Ashish Kumar et.al.	2507.07302	null
2025-07-09	LangNavBench: Evaluation of Natural Language Understanding in Semantic Navigation	Sonia Raychaudhuri et.al.	2507.07299	null
2025-07-09	The Impact of Background Speech on Interruption Detection in Collaborative Groups	Mariah Bradford et.al.	2507.07280	null
2025-07-09	Convergence and Robustness Bounds for Distributed Asynchronous Shortest-Path	Jared Miller et.al.	2507.07263	null
2025-07-11	Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery	Licong Xu et.al.	2507.07257	null
2025-07-09	Combining Pre-Trained Models for Enhanced Feature Representation in Reinforcement Learning	Elia Piccoli et.al.	2507.07197	null
2025-07-09	Evaluating Retrieval-Augmented Generation Agents for Autonomous Scientific Discovery in Astrophysics	Xueqing Xu et.al.	2507.07155	null
2025-07-09	4KAgent: Agentic Any Image to 4K Super-Resolution	Yushen Zuo et.al.	2507.07105	null
2025-07-09	Graph-Based Complexity Metrics for Multi-Agent Curriculum Learning: A Validated Approach to Task Ordering in Cooperative Coordination Environments	Farhaan Ebadulla et.al.	2507.07074	null
2025-07-09	Robust signal decompositions on the circle	Aral Kose et.al.	2507.07007	null
2025-07-09	Federated Learning-based MARL for Strengthening Physical-Layer Security in B5G Networks	Deemah H. Tashman et.al.	2507.06997	null
2025-07-09	The User-Centric Geo-Experience: An LLM-Powered Framework for Enhanced Planning, Navigation, and Dynamic Adaptation	Jieren Deng et.al.	2507.06993	null
2025-07-09	Optimizing Cognitive Networks: Reinforcement Learning Meets Energy Harvesting Over Cascaded Channels	Deemah H. Tashman et.al.	2507.06981	null
2025-07-09	Exploring LLMs for Predicting Tutor Strategy and Student Outcomes in Dialogues	Fareya Ikram et.al.	2507.06910	null
2025-07-09	MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection	Ziyan Liu et.al.	2507.06908	null
2025-07-09	SemRaFiner: Panoptic Segmentation in Sparse and Noisy Radar Point Clouds	Matthias Zeller et.al.	2507.06906	null
2025-07-09	Designing Adaptive Algorithms Based on Reinforcement Learning for Dynamic Optimization of Sliding Window Size in Multi-Dimensional Data Streams	Abolfazl Zarghani et.al.	2507.06901	null
2025-07-09	VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation	Ziang Ye et.al.	2507.06899	null
2025-07-09	Toward Neurodivergent-Aware Productivity: A Systems and AI-Based Human-in-the-Loop Framework for ADHD-Affected Professionals	Raghavendra Deshmukh et.al.	2507.06864	null
2025-07-11	The Dark Side of LLMs Agent-based Attacks for Complete Computer Takeover	Matteo Lupinacci et.al.	2507.06850	null
2025-07-10	Artificial Generals Intelligence: Mastering Generals.io with Reinforcement Learning	Matej Straka et.al.	2507.06825	null
2025-07-09	Comparing Dialectical Systems: Contradiction and Counterexample in Belief Change (Extended Version)	Uri Andrews et.al.	2507.06798	null
2025-07-09	Multi-Task Multi-Agent Reinforcement Learning via Skill Graphs	Guobin Zhu et.al.	2507.06690	null
2025-07-09	Peer influence breaks ergodicity in an opinion dynamics model with external information	Federica De Domenico et.al.	2507.06661	null
2025-07-09	Growing Trees with an Agent: Accelerating RRTs with Learned, Multi-Step Episodic Exploration	Xinyu Wu et.al.	2507.06605	null
2025-07-09	Generalization in Reinforcement Learning for Radio Access Networks	Burak Demirel et.al.	2507.06602	null
2025-07-15	A Mathematical Theory of Discursive Networks	Juan B. Gutiérrez et.al.	2507.06565	null
2025-07-09	SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments	Tianshun Li et.al.	2507.06564	null
2025-07-09	On the Hardness of Unsupervised Domain Adaptation: Optimal Learners and Information-Theoretic Perspective	Zhiyi Dong et.al.	2507.06552	null
2025-07-09	ILNet: Trajectory Prediction with Inverse Learning Attention for Enhancing Intention Capture	Mingjin Zeng et.al.	2507.06531	null
2025-07-09	InvestAlign: Overcoming Data Scarcity in Aligning Large Language Models with Investor Decision-Making Processes under Herd Behavior	Huisheng Wang et.al.	2507.06528	null
2025-07-09	Gradientsys: A Multi-Agent LLM Scheduler with ReAct Orchestration	Xinyuan Song et.al.	2507.06520	null
2025-07-13	Prediction-Augmented Mechanism Design for Weighted Facility Location	Yangguang Shi et.al.	2507.06509	null
2025-07-09	Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings	Russell Taylor et.al.	2507.06506	null
2025-07-09	Learning To Communicate Over An Unknown Shared Network	Shivangi Agarwal et.al.	2507.06499	null
2025-07-09	Learning Japanese with Jouzu: Interaction Outcomes with Stylized Dialogue Fictional Agents	Zackary Rackauckas et.al.	2507.06483	null
2025-07-09	Foundation Model Self-Play: Open-Ended Strategy Innovation via Foundation Models	Aaron Dharna et.al.	2507.06466	null
2025-07-08	Eyes on the Road, Mind Beyond Vision: Context-Aware Multi-modal Enhanced Risk Anticipation	Jiaxun Zhang et.al.	2507.06444	null
2025-07-08	Distributed Optimization of Finite Condition Number for Laplacian Matrix in Multi-Agent Systems	Yicheng Xu et.al.	2507.06440	null
2025-07-08	Experience-Centric Resource Management in ISAC Networks: A Digital Agent-Assisted Approach	Xinyu Huang et.al.	2507.06436	null
2025-07-08	Representing Prompting Patterns with PDL: Compliance Agent Case Study	Mandana Vaziri et.al.	2507.06396	null
2025-07-08	VoI-aware Scheduling Schemes for Multi-Agent Formation Control	Federico Chiariotti et.al.	2507.06392	null
2025-07-08	Solving the Constrained Random Disambiguation Path Problem via Lagrangian Relaxation and Graph Reduction	Li Zhou et.al.	2507.06346	null
2025-07-08	Bridging AI and Software Security: A Comparative Vulnerability Assessment of LLM Agent Deployment Paradigms	Tarek Gasmi et.al.	2507.06323	null
2025-07-08	Too Human to Model:The Uncanny Valley of LLMs in Social Simulation -- When Generative Language Agents Misalign with Modelling Principles	Yongchao Zeng et.al.	2507.06310	null
2025-07-08	A Survey of Multi Agent Reinforcement Learning: Federated Learning and Cooperative and Noncooperative Decentralized Regimes	Kemboi Cheruiyot et.al.	2507.06278	null
2025-07-11	Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities	Gheorghe Comanici et.al.	2507.06261	null
2025-07-10	Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving	Xiangru Tang et.al.	2507.06229	null
2025-07-08	Aligned Textual Scoring Rules	Yuxuan Lu et.al.	2507.06221	null
2025-07-08	Evaluation of Habitat Robotics using Large Language Models	William Li et.al.	2507.06157	null
2025-07-08	OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety	Sanidhya Vijayvargiya et.al.	2507.06134	null
2025-07-08	A Directed Lazy Random Walk Model to Three-Way Dynamic Matching Problem	Souvik Roy et.al.	2507.06126	null
2025-07-08	On Lockean beliefs that are deductively closed and minimal change	Tommaso Flaminio et.al.	2507.06042	null
2025-07-08	Conditional Multi-Stage Failure Recovery for Embodied Agents	Youmna Farag et.al.	2507.06016	null
2025-07-08	From General Relation Patterns to Task-Specific Decision-Making in Continual Multi-Agent Coordination	Chang Yao et.al.	2507.06004	null
2025-07-08	Multi-Agent Debate Strategies to Enhance Requirements Engineering with Large Language Models	Marc Oriol et.al.	2507.05981	null
2025-07-08	CogniPlay: a work-in-progress Human-like model for General Game Playing	Aloïs Rautureau et.al.	2507.05868	null
2025-07-08	Constella: Supporting Storywriters' Interconnected Character Creation through LLM-based Multi-Agents	Syemin Park et.al.	2507.05820	null
2025-07-08	Just Say Better or Worse: A Human-AI Collaborative Framework for Medical Image Segmentation Without Manual Annotations	Yizhe Zhang et.al.	2507.05815	null
2025-07-10	GTA1: GUI Test-time Scaling Agent	Yan Yang et.al.	2507.05791	null
2025-07-08	On the detection of medium inhomogeneity by contrast agent: wave scattering models and numerical implementations	Zhe Wang et.al.	2507.05773	null
2025-07-08	An autonomous agent for auditing and improving the reliability of clinical AI models	Lukas Kuhn et.al.	2507.05755	null
2025-07-08	An efficiency ordering of k-price auctions under complete information	Sumit Goel et.al.	2507.05738	null
2025-07-08	Large Language Models for Agent-Based Modelling: Current and possible uses across the modelling cycle	Loïs Vanhée et.al.	2507.05723	null
2025-07-08	MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment	Yucheng Shi et.al.	2507.05720	null
2025-07-08	Agentic-R1: Distilled Dual-Strategy Reasoning	Weihua Du et.al.	2507.05707	null
2025-07-08	R-VLM: Region-Aware Vision Language Model for Precise GUI Grounding	Joonhyung Park et.al.	2507.05673	null
2025-07-08	ECom-Bench: Can LLM Agent Resolve Real-World E-commerce Customer Support Issues?	Haoxin Wang et.al.	2507.05639	null
2025-07-08	LLMs are Introvert	Litian Zhang et.al.	2507.05638	null
2025-07-08	How Not to Detect Prompt Injections with an LLM	Sarthak Choudhary et.al.	2507.05630	null
2025-07-08	Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study	Ibne Farabi Shihab et.al.	2507.05619	null
2025-07-08	Density Discontinuity Regression	Surya T Tokdar et.al.	2507.05581	null
2025-07-08	Preemptive Solving of Future Problems: Multitask Preplay in Humans and Machines	Wilka Carvalho et.al.	2507.05561	null
2025-07-09	AI Agent Smart Contract Exploit Generation	Arthur Gervais et.al.	2507.05558	null
2025-07-07	Evolutionary and Coevolutionary Multi-Agent Design Choices and Dynamics	Erik Hemberg et.al.	2507.05534	null
2025-07-07	Conversational Education at Scale: A Multi-LLM Agent Workflow for Procedural Learning and Pedagogic Quality Assessment	Jiahuan Pei et.al.	2507.05528	null
2025-07-07	Cultivating Multimodal Intelligence: Interpretive Reasoning and Agentic RAG Approaches to Dermatological Diagnosis	Karishma Thakrar et.al.	2507.05520	null
2025-07-09	Empowering Healthcare Practitioners with Language Models: Structuring Speech Transcripts in Two Real-World Clinical Applications	Jean-Philippe Corbeil et.al.	2507.05517	null
2025-07-07	Deep Research Comparator: A Platform For Fine-grained Human Annotations of Deep Research Agents	Prahaladh Chandrahasan et.al.	2507.05495	null
2025-07-07	Constraint Hypergraphs as a Unifying Framework for Digital Twins	John Morris et.al.	2507.05494	null
2025-07-07	Inaugural MOASEI Competition at AAMAS'2025: A Technical Report	Ceferino Patino et.al.	2507.05469	null
2025-07-07	2048: Reinforcement Learning in a Delayed Reward Environment	Prady Saligram et.al.	2507.05465	null
2025-07-07	A Systematization of Security Vulnerabilities in Computer Use Agents	Daniel Jones et.al.	2507.05445	null
2025-07-07	Motion Generation: A Survey of Generative Approaches and Benchmarks	Aliasghar Khani et.al.	2507.05419	null
2025-07-07	MindFlow: Revolutionizing E-commerce Customer Support with Multimodal LLM Agents	Ming Gong et.al.	2507.05330	null
2025-07-07	AGACCI : Affiliated Grading Agents for Criteria-Centric Interface in Educational Coding Contexts	Kwangsuk Park et.al.	2507.05321	null
2025-07-07	OASBuilder: Generating OpenAPI Specifications from Online API Documentation with Large Language Models	Koren Lazar et.al.	2507.05316	null
2025-07-10	Fuzzy Classification Aggregation for a Continuum of Agents	Zijun Meng et.al.	2507.05297	null
2025-07-05	A LLM-Driven Multi-Agent Systems for Professional Development of Mathematics Teachers	Kaiqi Yang et.al.	2507.05292	null
2025-07-03	A Fuzzy Supervisor Agent Design for Clinical Reasoning Assistance in a Multi-Agent Educational Clinical Scenario Simulation	Weibing Zheng et.al.	2507.05275	null
2025-07-07	Spatio-Temporal LLM: Reasoning about Environments and Actions	Haozhen Zheng et.al.	2507.05258	null
2025-07-07	Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions	Yuanzhe Hu et.al.	2507.05257	null
2025-07-07	From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving	Fabian Konstantinidis et.al.	2507.05254	null
2025-07-07	Action Space Reduction Strategies for Reinforcement Learning in Autonomous Driving	Elahe Delavari et.al.	2507.05251	null
2025-07-07	Modeling Latent Partner Strategies for Adaptive Zero-Shot Human-Agent Collaboration	Benjamin Li et.al.	2507.05244	null
2025-07-08	SciMaster: Towards General-Purpose Scientific AI Agents, Part I. X-Master as Foundation: Can We Lead on Humanity's Last Exam?	Jingyi Chai et.al.	2507.05241	null
2025-07-07	StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling	Meng Wei et.al.	2507.05240	null
2025-07-12	MedGemma Technical Report	Andrew Sellergren et.al.	2507.05201	null
2025-07-07	CREW-WILDFIRE: Benchmarking Agentic Multi-Agent Collaborations at Scale	Jonathan Hyun et.al.	2507.05178	null
2025-07-07	Vector Cost Bimatrix Games with Applications to Autonomous Racing	Benjamin R. Toaz et.al.	2507.05171	null
2025-07-07	Critiques of World Models	Eric Xing et.al.	2507.05169	null
2025-07-07	Macroscopic Structural Light Absorbers	Jan M. Kaster et.al.	2507.05152	null
2025-07-07	Effects of Unplanned Incoming Flights on Airport Relief Processes after a Major Natural Disaster	Luka Van de Sype et.al.	2507.05150	null
2025-07-07	LERa: Replanning with Visual Feedback in Instruction Following	Svyatoslav Pchelintsev et.al.	2507.05135	null
2025-07-07	Optimal Consumption-Investment for General Utility with a Drawdown Constraint over a Finite-Time Horizon	Chonghu Guan et.al.	2507.05115	null
2025-07-07	Beyond Features: How Dataset Design Influences Multi-Agent Trajectory Prediction Performance	Tobias Demmler et.al.	2507.05098	null
2025-07-07	Perspectives on How Sociology Can Advance Theorizing about Human-Chatbot Interaction and Developing Chatbots for Social Good	Celeste Campos-Castillo et.al.	2507.05030	null
2025-07-07	Linking Homeostasis to Reinforcement Learning: Internal State Control of Motivated Behavior	Naoto Yoshida et.al.	2507.04998	null
2025-07-07	From Autonomy to Agency: Agentic Vehicles for Human-Centered Mobility Systems	Jiangbo Yu et.al.	2507.04996	null
2025-07-07	Leadership Detection via Time-Lagged Correlation-Based Network Inference	Thayanne França da Silva et.al.	2507.04917	null
2025-07-07	MARBLE: A Multi-Agent Rule-Based LLM Reasoning Engine for Accident Severity Prediction	Kaleem Ullah Qasim et.al.	2507.04893	null
2025-07-07	Fine-tuning on simulated data outperforms prompting for agent tone of voice	Ingo Marquardt et.al.	2507.04889	null
2025-07-07	Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning	Giwon Lee et.al.	2507.04790	null
2025-07-07	Training-free Generation of Temporally Consistent Rewards from VLMs	Yinuo Zhao et.al.	2507.04789	null
2025-07-07	FurniMAS: Language-Guided Furniture Decoration using Multi-Agent System	Toan Nguyen et.al.	2507.04770	null
2025-07-07	Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking	Maria Damanaki et.al.	2507.04762	null
2025-07-07	LLM-based Question-Answer Framework for Sensor-driven HVAC System Interaction	Sungmin Lee et.al.	2507.04748	null
2025-07-07	Who's the Mole? Modeling and Detecting Intention-Hiding Malicious Agents in LLM-Based Multi-Agent Systems	Yizhe Xie et.al.	2507.04724	null
2025-07-07	UrbanMind: Towards Urban General Intelligence via Tool-Enhanced Retrieval-Augmented Generation and Multilevel Optimization	Kai Yang et.al.	2507.04706	null
2025-07-07	Interpretable Reward Modeling with Active Concept Bottlenecks	Sonia Laguna et.al.	2507.04695	null
2025-07-07	Quantitative Single-particle Profiling of Extracellular Vesicles via Fluorescent Nanoparticle Tracking Analysis	Yiting Liu et.al.	2507.04655	null
2025-07-07	LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction	Yixin Yan et.al.	2507.04634	null
2025-07-07	Equilibrium Strategies for the N-agent Mean-Variance Investment Problem over a Random Horizon	Xiaoqing Liang et.al.	2507.04611	null
2025-07-07	VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents	Rui Meng et.al.	2507.04590	null
2025-07-08	Greedy Dynamic Matching	Nick Arnosti et.al.	2507.04551	null
2025-07-06	Grounded Gesture Generation: Language, Motion, and Space	Anna Deichler et.al.	2507.04522	null
2025-07-09	Constant-Approximate and Constant-Strategyproof Two-Facility Location	Elijah Journey Fullerton et.al.	2507.04485	null
2025-07-06	Agentic Distributed Computing	Ajay D. Kshemkalyani et.al.	2507.04459	null
2025-07-06	"Hi AirStar, Guide Me to the Badminton Court."	Ziqin Wang et.al.	2507.04430	null
2025-07-06	MOMENTS: A Comprehensive Multimodal Benchmark for Theory of Mind	Emilio Villa-Cueva et.al.	2507.04415	null
2025-07-06	Multimedia Verification Through Multi-Agent Deep Research Multimodal Large Language Models	Huy Hoan Le et.al.	2507.04410	null
2025-07-06	Inverse Reinforcement Learning using Revealed Preferences and Passive Stochastic Optimization	Vikram Krishnamurthy et.al.	2507.04396	null
2025-07-08	MOD-X: A Modular Open Decentralized eXchange Framework proposal for Heterogeneous Interoperable Artificial Intelligence Agents	Georgios Ioannides et.al.	2507.04376	null
2025-07-06	Adaptive Malware Detection using Sequential Feature Selection: A Dueling Double Deep Q-Network (D3QN) Framework for Intelligent Classification	Naseem Khan et.al.	2507.04372	null
2025-07-06	WebSynthesis: World-Model-Guided MCTS for Efficient WebUI-Trajectory Synthesis	Yifei Gao et.al.	2507.04370	null
2025-07-06	Mission-Aligned Learning-Informed Control of Autonomous Systems: Formulation and Foundations	Vyacheslav Kungurtsev et.al.	2507.04356	null
2025-07-06	Wavelet Policy: Lifting Scheme for Policy Learning in Long-Horizon Tasks	Hao Huang et.al.	2507.04331	null
2025-07-06	Covalently Integrated CNT@rGO for Superior Conductivity and Cycling Stability in Lithium-Ion Batterie	Junwen Tang et.al.	2507.04296	null
2025-07-06	SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement	Liwen Xiao et.al.	2507.04263	null
2025-07-06	Hijacking JARVIS: Benchmarking Mobile GUI Agents against Unprivileged Third Parties	Guohong Liu et.al.	2507.04227	null
2025-07-05	Gathering Teams of Bounded Memory Agents on a Line	Younan Gao et.al.	2507.04172	null
2025-07-05	Comparative Evaluation of VR-Enabled Robots and Human Operators for Targeted Disease Management in Vineyards	Hasan Seyyedhasani et.al.	2507.04167	null
2025-07-08	Adaptive Two-sided Assortment Optimization: Revenue Maximization	Mohammadreza Ahmadnejadsaein et.al.	2507.04156	null
2025-07-05	Learning Humanoid Arm Motion via Centroidal Momentum Regularized Multi-Agent Reinforcement Learning	Ho Jae Lee et.al.	2507.04140	null
2025-07-05	BYOKG-RAG: Multi-Strategy Graph Retrieval for Knowledge Graph Question Answering	Costas Mavromatis et.al.	2507.04127	null
2025-07-05	Enhancing Robustness of LLM-Driven Multi-Agent Systems through Randomized Smoothing	Jinwei Hu et.al.	2507.04105	null
2025-07-05	How to Train Your LLM Web Agent: A Statistical Diagnosis	Dheeraj Vattikonda et.al.	2507.04103	null
2025-07-05	Dynamic Asset Pricing with α-MEU Model	Jiacheng Fan et.al.	2507.04093	null
2025-07-05	Accurate and Efficient World Modeling with Masked Latent Transformers	Maxime Burchi et.al.	2507.04075	null
2025-07-05	Efficiency through Evolution, A Darwinian Approach to Agent-Based Economic Forecast Modeling	Martin Jaraiz et.al.	2507.04074	null
2025-07-05	HAWK: A Hierarchical Workflow Framework for Multi-Agent Collaboration	Yuyang Cheng et.al.	2507.04067	null
2025-07-05	TopoMAS: Large Language Model Driven Topological Materials Multiagent System	Baohua Zhang et.al.	2507.04053	null
2025-07-05	Breaking Imitation Bottlenecks: Reinforced Diffusion Powers Diverse Trajectory Generation	Ziying Song et.al.	2507.04049	null
2025-07-05	Move to Understand a 3D Scene: Bridging Visual Grounding and Exploration for Efficient and Versatile Embodied Navigation	Ziyu Zhu et.al.	2507.04047	null
2025-07-05	Ready Jurist One: Benchmarking Language Agents for Legal Intelligence in Dynamic Environments	Zheng Jia et.al.	2507.04037	null
2025-07-05	PresentAgent: Multimodal Agent for Presentation Video Generation	Jingwei Shi et.al.	2507.04036	null
2025-07-05	Exploring a Gamified Personality Assessment Method through Interaction with Multi-Personality LLM Agents	Baiqiao Zhang et.al.	2507.04005	null
2025-07-05	MalVol-25: A Diverse, Labelled and Detailed Volatile Memory Dataset for Malware Detection and Response Testing and Validation	Dipo Dunsin et.al.	2507.03993	null
2025-07-05	Fair and Efficient Allocation of Indivisible Mixed Manna	Siddharth Barman et.al.	2507.03946	null
2025-07-05	CortexDebate: Debating Sparsely and Equally for Multi-Agent Debate	Yiliu Sun et.al.	2507.03928	null
2025-07-05	Agent Exchange: Shaping the Future of AI Agent Economics	Yingxuan Yang et.al.	2507.03904	null
2025-07-05	Uncovering Systemic and Environment Errors in Autonomous Systems Using Differential Testing	Rahil P Mehta et.al.	2507.03870	null
2025-07-04	Participatory Evolution of Artificial Life Systems via Semantic Feedback	Shuowen Li et.al.	2507.03839	null
2025-07-04	Leveraging Large Language Models for Tacit Knowledge Discovery in Organizational Contexts	Gianlucca Zuin et.al.	2507.03811	null
2025-07-04	Generating Novelty in Open-World Multi-Agent Strategic Board Games	Mayank Kejriwal et.al.	2507.03802	null
2025-07-04	Learning Dark Souls Combat Through Pixel Input With Neuroevolution	Jim O'Connor et.al.	2507.03793	null
2025-07-04	Less is More: Empowering GUI Agent with Context-Aware Simplification	Gongwei Chen et.al.	2507.03730	null
2025-07-04	Agent-Based Detection and Resolution of Incompleteness and Ambiguity in Interactions with Large Language Models	Riya Naik et.al.	2507.03726	null
2025-07-09	Can LLMs Play Ô Ăn Quan Game? A Study of Multi-Step Planning and Decision Making	Sang Quang Nguyen et.al.	2507.03711	null
2025-07-04	Towards Machine Theory of Mind with Large Language Model-Augmented Inverse Planning	Rebekah A. Gelpí et.al.	2507.03682	null
2025-07-04	STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and Benchmarking	Tek Raj Chhetri et.al.	2507.03674	null
2025-07-04	Recon, Answer, Verify: Agents in Search of Truth	Satyam Shukla et.al.	2507.03671	null
2025-07-04	When Does Diversity Matter? A Unified Framework for Binary-Choice Dynamics	Arkadiusz Jędrzejewski et.al.	2507.03665	null
2025-07-04	Is It Time To Treat Prompts As Code? A Multi-Use Case Study For Prompt Optimization Using DSPy	Francisca Lemos et.al.	2507.03620	null
2025-07-04	EvoAgentX: An Automated Framework for Evolving Agentic Workflows	Yingxu Wang et.al.	2507.03616	null
2025-07-04	On characterization and existence of a constrained correlated equilibria in Markov games	Tingting Ni et.al.	2507.03502	null
2025-07-09	Reinforcement Learning-based Feature Generation Algorithm for Scientific Data	Meng Xiao et.al.	2507.03498	null
2025-07-04	AI-VaxGuide: An Agentic RAG-Based LLM for Vaccination Decisions	Abdellah Zeggai et.al.	2507.03493	null
2025-07-04	Explainable Information Retrieval in the Audit Domain	Alexander Frummet et.al.	2507.03479	null
2025-07-04	REAL: Benchmarking Abilities of Large Language Models for Housing Transactions and Services	Kexin Zhu et.al.	2507.03477	null
2025-07-04	Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis	Weitong Zhang et.al.	2507.03460	null
2025-07-04	ElliottAgents: A Natural Language-Driven Multi-Agent System for Stock Market Analysis and Prediction	Jarosław A. Chudziak et.al.	2507.03435	null
2025-07-04	Lessons from a Chimp: AI "Scheming" and the Quest for Ape Language	Christopher Summerfield et.al.	2507.03409	null
2025-07-04	Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky	Ashutosh Hathidara et.al.	2507.03336	null
2025-07-04	Mirror in the Model: Ad Banner Image Generation via Reflective Multi-LLM and Multi-modal Agents	Zhao Wang et.al.	2507.03326	null
2025-07-04	GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation	Himanshu Dutta et.al.	2507.03311	null
2025-07-04	Dyn-O: Building Structured World Models with Object-Centric Representations	Zizhao Wang et.al.	2507.03298	null
2025-07-04	LTLCrit: A Temporal Logic-based LLM Critic for Safe and Efficient Embodied Agents	Anand Gokhale et.al.	2507.03293	null
2025-07-04	Conformal Information Pursuit for Interactively Guiding Large Language Models	Kwan Ho Ryan Chan et.al.	2507.03279	null
2025-07-04	GDGB: A Benchmark for Generative Dynamic Text-Attributed Graph Learning	Jie Peng et.al.	2507.03267	null
2025-07-04	Coalitional stability under myopic expectations and externalities	Agustin G. Bonifacio et.al.	2507.03259	null
2025-07-04	CodeAgents: A Token-Efficient Framework for Codified Multi-Agent Reasoning in LLMs	Bruce Yang et.al.	2507.03254	null
2025-07-03	SI-Agent: An Agentic Framework for Feedback-Driven Generation and Tuning of Human-Readable System Instructions for Large Language Models	Jeshwanth Challagundla et.al.	2507.03223	null
2025-07-03	In vivo imaging of central nervous system fluid spaces using synchrotron radiation-based micro computed tomography	Marta Girona Alarcón et.al.	2507.03186	null
2025-07-03	Last-Iterate Convergence of No-Regret Learning for Equilibria in Bargaining Games	Serafina Kamp et.al.	2507.03150	null
2025-07-03	RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents	Peisong Wang et.al.	2507.03112	null
2025-07-03	From Turing to Tomorrow: The UK's Approach to AI Regulation	Oliver Ritchie et.al.	2507.03050	null
2025-07-02	Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains	Abhishek Verma et.al.	2507.03026	null
2025-07-02	OpenTable-R1: A Reinforcement Learning Augmented Tool Agent for Open-Domain Table Question Answering	Zipeng Qiu et.al.	2507.03018	null
2025-07-10	Establishing Best Practices for Building Rigorous Agentic Benchmarks	Yuxuan Zhu et.al.	2507.02825	null
2025-07-03	Moral Responsibility or Obedience: What Do We Want from AI?	Joseph Boland et.al.	2507.02788	null
2025-07-06	KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs	Yuzhang Xie et.al.	2507.02773	null
2025-07-03	Knowledge Protocol Engineering: A New Paradigm for AI in Domain-Specific Knowledge Work	Guangwei Zhang et.al.	2507.02760	null
2025-07-03	Defining and classifying models of groups: The social ontology of higher-order networks	Jonathan St-Onge et.al.	2507.02758	null
2025-07-03	Multi-agent Auditory Scene Analysis	Caleb Rascon et.al.	2507.02755	null
2025-07-03	Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks	Sizhe Chen et.al.	2507.02735	null
2025-07-03	Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving	Matthieu Zimmer et.al.	2507.02726	null
2025-07-03	A Forget-and-Grow Strategy for Deep Reinforcement Learning Scaling in Continuous Control	Zilin Kang et.al.	2507.02712	null
2025-07-03	Fluid Democracy in Federated Data Aggregation	Aditya Vema Reddy Kesari et.al.	2507.02710	null
2025-07-03	Control at Stake: Evaluating the Security Landscape of LLM-Driven Email Agents	Jiangrong Wu et.al.	2507.02699	null
2025-07-03	Multi-Agent Reinforcement Learning for Dynamic Pricing in Supply Chains: Benchmarking Strategic Agent Behaviours under Realistically Simulated Market Conditions	Thomas Hazenberg et.al.	2507.02698	null
2025-07-03	On the Convergence of Large Language Model Optimizer for Black-Box Network Management	Hoon Lee et.al.	2507.02689	null
2025-07-03	TUC-PPO: Team Utility-Constrained Proximal Policy Optimization for Spatial Public Goods Games	Zhaoqilin Yang et.al.	2507.02675	null
2025-07-03	Hey AI, Generate Me a Hardware Code! Agentic AI-based Hardware Design & Verification	Deepak Narayan Gadde et.al.	2507.02660	null
2025-07-03	Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search	Jiajie Jin et.al.	2507.02652	null
2025-07-03	On Efficient Bayesian Exploration in Model-Based Reinforcement Learning	Alberto Caron et.al.	2507.02639	null
2025-07-03	VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement Learning	Siran Chen et.al.	2507.02626	null
2025-07-03	Strategic Intelligence in Large Language Models: Evidence from evolutionary Game Theory	Kenneth Payne et.al.	2507.02618	null
2025-07-03	DynamiCare: A Dynamic Multi-Agent Framework for Interactive and Open-Ended Medical Decision-Making	Tianqi Shang et.al.	2507.02616	null
2025-07-03	WebSailor: Navigating Super-human Reasoning for Web Agent	Kuan Li et.al.	2507.02592	null
2025-07-03	AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench	Edan Toledo et.al.	2507.02554	null
2025-07-03	Are You Listening to Me? Fine-Tuning Chatbots for Empathetic Dialogue	Paulo Ricardo Knob et.al.	2507.02537	null
2025-07-03	A Late Collaborative Perception Framework for 3D Multi-Object and Multi-Source Association and Fusion	Maryem Fadili et.al.	2507.02430	null
2025-07-03	CyberRAG: An agentic RAG cyber attack classification and reporting tool	Francesco Blefari et.al.	2507.02424	null
2025-07-03	Improving Consistency in Vehicle Trajectory Prediction Through Preference Optimization	Caio Azevedo et.al.	2507.02406	null
2025-07-03	Deep Reinforcement Learning-Based DRAM Equalizer Parameter Optimization Using Latent Representations	Muhammad Usama et.al.	2507.02365	null
2025-07-03	OMS: On-the-fly, Multi-Objective, Self-Reflective Ad Keyword Generation via LLM Agent	Bowen Chen et.al.	2507.02353	null
2025-07-03	CineMyoPS: Segmenting Myocardial Pathologies from Cine Cardiac MR	Wangbin Ding et.al.	2507.02289	null
2025-07-03	MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent	Hongli Yu et.al.	2507.02259	null
2025-07-03	SurgVisAgent: Multimodal Agentic Model for Versatile Surgical Visual Enhancement	Zeyu Lei et.al.	2507.02252	null
2025-07-04	CoInfra: A Large-Scale Cooperative Infrastructure Perception System and Dataset in Adverse Weather	Minghao Ning et.al.	2507.02245	null
2025-07-04	Dilution, Diffusion and Symbiosis in Spatial Prisoner's Dilemma with Reinforcement Learning	Gustavo C. Mangold et.al.	2507.02211	null
2025-07-08	Average Action Efficiency Rises Monotonically in Self-Organizing Systems via Stochastic Least-Action Dynamics	Georgi Yordanov Georgiev et.al.	2507.02209	null
2025-07-02	Operator-Theoretic Methods for Differential Games	Craig Bakker et.al.	2507.02203	null
2025-07-02	Do Role-Playing Agents Practice What They Preach? Belief-Behavior Consistency in LLM-Based Simulations of Human Trust	Amogh Mannekote et.al.	2507.02197	null
2025-07-02	Enhancing COBOL Code Explanations: A Multi-Agents Approach Using Large Language Models	Fangjian Lei et.al.	2507.02182	null
2025-07-02	Towards Bio-Inspired Robotic Trajectory Planning via Self-Supervised RNN	Miroslav Cibula et.al.	2507.02171	null
2025-07-02	Synergizing Logical Reasoning, Knowledge Management and Collaboration in Multi-Agent LLM System	Adam Kostka et.al.	2507.02170	null
2025-07-02	The optimal degree for maximizing rumor spreading on a ring lattice	Ana C. Díaz Bacca et.al.	2507.02141	null
2025-07-02	PAL: Designing Conversational Agents as Scalable, Cooperative Patient Simulators for Palliative-Care Training	Neil K. R. Sehgal et.al.	2507.02122	null
2025-07-02	What Neuroscience Can Teach AI About Learning in Continuously Changing Environments	Daniel Durstewitz et.al.	2507.02103	null
2025-07-02	The Future is Agentic: Definitions, Perspectives, and Open Challenges of Multi-Agent Recommender Systems	Reza Yousefi Maragheh et.al.	2507.02097	null
2025-07-02	Measuring Scientific Capabilities of Language Models with a Systems Biology Dry Lab	Haonan Duan et.al.	2507.02083	null
2025-07-02	Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs	Mohammad Ali Alomrani et.al.	2507.02076	null
2025-07-05	RoboBrain 2.0 Technical Report	BAAI RoboBrain Team et.al.	2507.02029	null
2025-07-01	STELLA: Self-Evolving LLM Agent for Biomedical Research	Ruofan Jin et.al.	2507.02004	null
2025-07-01	Dynamic Strategy Adaptation in Multi-Agent Environments with Large Language Models	Shaurya Mallampati et.al.	2507.02002	null
2025-07-04	Towards a Playground to Democratize Experimentation and Benchmarking of AI Agents for Network Troubleshooting	Zhihao Wang et.al.	2507.01997	null
2025-06-29	Integrating Large Language Models in Financial Investments and Market Analysis: A Survey	Sedigheh Mahdavi et.al.	2507.01990	null
2025-07-02	The Thin Line Between Comprehension and Persuasion in LLMs	Adrian de Wynter et.al.	2507.01936	null
2025-07-03	Decision-Oriented Text Evaluation	Yu-Shiang Huang et.al.	2507.01923	null
2025-07-02	An in-silico lung phantom to assess the performance of pulmonary artery segmentation using angiogram	Sunder Neelakantan et.al.	2507.01867	null
2025-07-02	Bridging UI Design and chatbot Interactions: Applying Form-Based Principles to Conversational Agents	Sanjay Krishna Anbalagan et.al.	2507.01862	null
2025-07-02	TD-MPC-Opt: Distilling Model-Based Multi-Task Reinforcement Learning Agents	Dmytro Kuzmenko et.al.	2507.01823	null
2025-07-06	AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction	Bin Rao et.al.	2507.01801	null
2025-07-02	ECCV 2024 W-CODA: 1st Workshop on Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving	Kai Chen et.al.	2507.01735	null
2025-07-02	Agent Ideate: A Framework for Product Idea Generation from Patents Using Agentic AI	Gopichand Kanumolu et.al.	2507.01717	null
2025-07-02	Using Machine Learning to Compute Constrained Optimal Carbon Tax Rules	Felix Kübler et.al.	2507.01704	null
2025-07-02	AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness	Zixin Chen et.al.	2507.01702	null
2025-07-02	Exploring Advanced LLM Multi-Agent Systems Based on Blackboard Architecture	Bochen Han et.al.	2507.01701	null
2025-07-02	Quantum reinforcement learning in dynamic environments	Oliver Sefrin et.al.	2507.01691	null
2025-07-02	What does really matter in image goal navigation?	Gianluca Monaci et.al.	2507.01667	null
2025-07-02	Data Agent: A Holistic Architecture for Orchestrating Data+AI Ecosystems	Zhaoyan Sun et.al.	2507.01599	null
2025-07-02	Emotionally Intelligent Task-oriented Dialogue Systems: Architecture, Representation, and Optimisation	Shutong Feng et.al.	2507.01594	null
2025-07-02	Vision-Aided ISAC in Low-Altitude Economy Networks via De-Diffused Visual Priors	Yulan Gao et.al.	2507.01574	null
2025-07-02	Time-Varying Coverage Control: A Distributed Tracker-Planner MPC Framework	Patrick Benito Eberhard et.al.	2507.01567	null
2025-07-02	Chargax: A JAX Accelerated EV Charging Simulator	Koen Ponse et.al.	2507.01522	null
2025-07-02	Agent-as-Tool: A Study on the Hierarchical Decision Making with Reinforcement Learning	Yanfei Zhang et.al.	2507.01489	null
2025-07-02	BioMARS: A Multi-Agent Robotic System for Autonomous Biological Experiments	Yibo Qiu et.al.	2507.01485	null
2025-07-02	Using multi-agent architecture to mitigate the risk of LLM hallucinations	Abd Elrahman Amer et.al.	2507.01446	null
2025-07-02	Reinforcement Learning for Discrete-time LQG Mean Field Social Control Problems with Unknown Dynamics	Hanfang Zhang et.al.	2507.01420	null
2025-07-02	Evaluating LLM Agent Collusion in Double Auctions	Kushal Agrawal et.al.	2507.01413	null
2025-07-02	RALLY: Role-Adaptive LLM-Driven Yoked Navigation for Agentic UAV Swarms	Ziyao Wang et.al.	2507.01378	null
2025-07-02	AI Agents and Agentic AI-Navigating a Plethora of Concepts for Future Manufacturing	Yinwang Ren et.al.	2507.01376	null
2025-07-02	Context-Aware Code Wiring Recommendation with LLM-based Agent	Taiming Wang et.al.	2507.01315	null
2025-07-02	LANet: A Lane Boundaries-Aware Approach For Robust Trajectory Prediction	Muhammad Atta ur Rahman et.al.	2507.01308	null
2025-07-02	Optimal Dispersion Under Asynchrony	Debasish Pattanayak et.al.	2507.01298	null
2025-07-05	Frustratingly Simple Retrieval Improves Challenging, Reasoning-Intensive Benchmarks	Xinxi Lyu et.al.	2507.01297	null
2025-07-02	GAIus: Combining Genai with Legal Clauses Retrieval for Knowledge-based Assistant	Michał Matak et.al.	2507.01259	null
2025-07-02	AIGVE-MACS: Unified Multi-Aspect Commenting and Scoring Model for AI-Generated Video Evaluation	Xiao Liu et.al.	2507.01255	null
2025-07-01	Rethinking the Illusion of Thinking	Iñaki Dellibarda Varela et.al.	2507.01231	null
2025-07-01	SonoGym: High Performance Simulation for Challenging Surgical Tasks with Robotic Ultrasound	Yunke Ao et.al.	2507.01152	null
2025-07-01	Agentic AI in Product Management: A Co-Evolutionary Model	Nishant A. Parikh et.al.	2507.01069	null
2025-06-30	Epitome: Pioneering an Experimental Platform for AI-Social Science Integration	Jingjing Qu et.al.	2507.01061	null
2025-06-30	Optimizing Conversational Product Recommendation via Reinforcement Learning	Kang Liu et.al.	2507.01060	null
2025-06-29	Automated Vehicles Should be Connected with Natural Language	Xiangbo Gao et.al.	2507.01059	null
2025-07-01	Running Quantum Computers in Discovery Mode	Benedikt Placke et.al.	2507.01013	null
2025-07-02	GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning	GLM-V Team et.al.	2507.01006	null
2025-07-01	RTMap: Real-Time Recursive Mapping with Change Detection and Localization	Yuheng Du et.al.	2507.00980	null
2025-07-01	Enhancing LLM Agent Safety via Causal Influence Prompting	Dongyoon Hahm et.al.	2507.00979	null
2025-07-01	Decentralised Multi-Manager Fund Framework	Arman Abgaryan et.al.	2507.00978	null
2025-07-01	Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact	Rizwan Qureshi et.al.	2507.00951	null
2025-07-01	WebArXiv: Evaluating Multimodal Agents on Time-Invariant arXiv Tasks	Zihao Sun et.al.	2507.00938	null
2025-07-01	A Survey: Learning Embodied Intelligence from Physical Simulators and World Models	Xiaoxiao Long et.al.	2507.00917	null
2025-07-01	Large Language Model Powered Intelligent Urban Agents: Concepts, Capabilities, and Applications	Jindong Han et.al.	2507.00914	null
2025-07-01	MemeCMD: An Automatically Generated Chinese Multi-turn Dialogue Dataset with Contextually Retrieved Memes	Yuheng Wang et.al.	2507.00891	null
2025-07-01	TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation	Xi Xuan et.al.	2507.00875	null
2025-07-01	The Evolution of Altruistic Rationality Provides a Solution to Social Dilemmas via Rational Reciprocity	Mohammad Salahshour et.al.	2507.00858	null
2025-07-01	Enhancing Vehicular Platooning with Wireless Federated Learning: A Resource-Aware Control Framework	Beining Wu et.al.	2507.00856	null
2025-07-01	Ranking Quantilized Mean-Field Games with an Application to Early-Stage Venture Investments	Rinel Foguen Tchuendom et.al.	2507.00853	null
2025-07-01	SafeMobile: Chain-level Jailbreak Detection and Automated Evaluation for Multimodal Mobile Agents	Siyuan Liang et.al.	2507.00841	null
2025-07-01	Many LLMs Are More Utilitarian Than One	Anita Keshmirian et.al.	2507.00814	null
2025-07-02	Leveraging Genetic Algorithms for Efficient Demonstration Generation in Real-World Reinforcement Learning Environments	Tom Maus et.al.	2507.00762	null
2025-07-01	Generative Exaggeration in LLM Social Agents: Consistency, Bias, and Toxicity	Jacopo Nudo et.al.	2507.00657	null
2025-07-01	ChatHLS: Towards Systematic Design Automation and Optimization for High-Level Synthesis	Runkai Li et.al.	2507.00642	null
2025-07-04	Horus: A Protocol for Trustless Delegation Under Uncertainty	David Shi et.al.	2507.00631	null
2025-07-01	Quantum Circuit Structure Optimization for Quantum Reinforcement Learning	Seok Bin Son et.al.	2507.00589	null
2025-07-01	Collaborative Multi-Agent Reinforcement Learning Approach for Elastic Cloud Resource Scaling	Bruce Fang et.al.	2507.00550	null
2025-07-01	Rethinking Group Recommender Systems in the Era of Generative AI: From One-Shot Recommendations to Agentic Group Decision Support	Dietmar Jannach et.al.	2507.00535	null
2025-07-01	PNAct: Crafting Backdoor Attacks in Safe Reinforcement Learning	Weiran Guo et.al.	2507.00485	null
2025-07-01	ARIG: Autoregressive Interactive Head Generation for Real-time Conversations	Ying Guo et.al.	2507.00472	null
2025-07-01	Best Agent Identification for General Game Playing	Matthew Stephenson et.al.	2507.00451	null
2025-07-01	Novel Pigeon-inspired 3D Obstacle Detection and Avoidance Maneuver for Multi-UAV Systems	Reza Ahmadvand et.al.	2507.00443	null
2025-07-01	Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning	Maggie Huan et.al.	2507.00432	null
2025-07-01	Multi-Agent Coordination under Poisson Observations: A Global Game Approach	Marcos M. Vasconcelos et.al.	2507.00424	null
2025-07-01	Evolutionary Dynamics with Self-Interaction Learning in Networked Systems	Ziyan Zeng et.al.	2507.00422	null
2025-07-01	Minimal Construction of Graphs with Maximum Robustness	Haejoon Lee et.al.	2507.00415	null
2025-07-01	iPanda: An Intelligent Protocol Testing and Debugging Agent for Conformance Testing	Xikai Sun et.al.	2507.00378	null
2025-07-01	VTS-Guided AI Interaction Workflow for Business Insights	Sun Ding et.al.	2507.00347	null
2025-06-30	Control-Optimized Deep Reinforcement Learning for Artificially Intelligent Autonomous Systems	Oren Fivel et.al.	2507.00268	null
2025-06-30	Examining Reject Relations in Stimulus Equivalence Simulations	Alexis Carrillo et.al.	2507.00265	null
2025-06-30	Endogenous Network Structures with Precision and Dimension Choices	Nikhil Kumar et.al.	2507.00249	null
2025-06-30	LineRetriever: Planning-Aware Observation Reduction for Web Agents	Imene Kerboua et.al.	2507.00210	null
2025-06-30	BlackBoxToBlueprint: Extracting Interpretable Logic from Legacy Systems using Reinforcement Learning and Counterfactual Analysis	Vidhi Rathore et.al.	2507.00180	null
2025-06-30	AI-Governed Agent Architecture for Web-Trustworthy Tokenization of Alternative Assets	Ailiya Borjigin et.al.	2507.00096	null
2025-06-30	State and Memory is All You Need for Robust and Reliable AI Agents	Matthew Muhoberac et.al.	2507.00081	null
2025-06-29	VoyagerVision: Investigating the Role of Multi-modal Information for Open-ended Learning Systems	Ethan Smyth et.al.	2507.00079	null
2025-07-01	SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning	Bo Liu et.al.	2506.24119	null
2025-06-30	Protocol insecurity with finitely many sessions and XOR	R Ramanujam et.al.	2506.24072	null
2025-06-30	Agent.xpu: Efficient Scheduling of Agentic LLM Workloads on Heterogeneous SoC	Xinming Wei et.al.	2506.24045	null
2025-06-30	Ella: Embodied Social Agents with Lifelong Memory	Hongxin Zhang et.al.	2506.24019	null
2025-06-30	Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning	Seungjun Yi et.al.	2506.23998	null
2025-06-30	Harnessing AI Agents to Advance Research on Refugee Child Mental Health	Aditya Shrivastava et.al.	2506.23992	null
2025-06-30	LLM Agents Are the Antidote to Walled Gardens	Samuele Marro et.al.	2506.23978	null
2025-06-30	Flexible Moral Hazard Problems with Adverse Selection	Siwen Liu et.al.	2506.23954	null
2025-06-30	Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice	Akshit Kumar et.al.	2506.23924	null
2025-06-30	A Survey on Autonomy-Induced Security Risks in Large Model-Based Agents	Hang Su et.al.	2506.23844	null
2025-06-30	Sociophysics models inspired by the Ising model	Pratik Mullick et.al.	2506.23837	null
2025-06-30	Towards the "Digital Me": A vision of authentic Conversational Agents powered by personal Human Digital Twins	Lluís C. Coll et.al.	2506.23826	null
2025-06-30	Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning	Anton Andreychuk et.al.	2506.23793	null
2025-07-01	Synthetically Expressive: Evaluating gesture and voice for emotion and empathy in VR and 2D scenarios	Haoyang Du et.al.	2506.23777	null
2025-06-30	Leveraging a Multi-Agent LLM-Based System to Educate Teachers in Hate Incidents Management	Ewelina Gajewska et.al.	2506.23774	null
2025-06-30	A Survey of LLM-based Automated Program Repair: Taxonomies, Design Paradigms, and Applications	Boyang Yang et.al.	2506.23749	null
2025-06-30	DABstep: Data Agent Benchmark for Multi-step Reasoning	Alex Egg et.al.	2506.23719	null
2025-06-30	Agent4S: The Transformation of Research Paradigms from the Perspective of Large Language Models	Boyuan Zheng et.al.	2506.23692	null
2025-06-30	PokéAI: A Goal-Generating, Battle-Optimizing Multi-agent System for Pokemon Red	Zihao Liu et.al.	2506.23689	null
2025-06-30	Efficient Interleaved Speech Modeling through Knowledge Distillation	Mohammadmahdi Nouriborji et.al.	2506.23670	null
2025-06-30	L0: Reinforcement Learning to Become General Agents	Junjie Zhang et.al.	2506.23667	null
2025-06-30	Self-correcting Reward Shaping via Language Models for Reinforcement Learning Agents in Games	António Afonso et.al.	2506.23626	null
2025-06-30	Evaluating the Simulation of Human Personality-Driven Susceptibility to Misinformation with LLMs	Manuel Pratelli et.al.	2506.23610	null
2025-06-30	Evaluating Multi-Agent Defences Against Jailbreaking Attacks on Large Language Models	Maria Carolina Cornelia Wit et.al.	2506.23576	null
2025-06-30	CooT: Learning to Coordinate In-Context with Coordination Transformers	Huai-Chih Wang et.al.	2506.23549	null
2025-06-30	Thought-Augmented Planning for LLM-Powered Interactive Recommender Agent	Haocheng Yu et.al.	2506.23485	null
2025-06-30	NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments	Xuan Yao et.al.	2506.23468	null
2025-06-30	Accessible Data Access and Analysis by People who are Blind or Have Low Vision	Samuel Reinders et.al.	2506.23443	null
2025-06-29	Do LLMs Dream of Discrete Algorithms?	Claudionor Coelho Jr et.al.	2506.23408	null
2025-06-29	ATGen: A Framework for Active Text Generation	Akim Tsvigun et.al.	2506.23342	null
2025-06-29	IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering	Parker Liu et.al.	2506.23329	null
2025-06-29	InfGen: Scenario Generation as Next Token Group Prediction	Zhenghao Peng et.al.	2506.23316	null
2025-06-29	GATSim: Urban Mobility Simulation with Generative Agents	Qi Liu et.al.	2506.23306	null
2025-06-29	Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games	David Guzman Piedrahita et.al.	2506.23276	null
2025-06-29	FinStat2SQL: A Text2SQL Pipeline for Financial Statement Analysis	Quang Hung Nguyen et.al.	2506.23273	null
2025-06-29	From Prompt Injections to Protocol Exploits: Threats in LLM-Powered AI Agents Workflows	Mohamed Amine Ferrag et.al.	2506.23260	null
2025-06-29	Mode Collapse Happens: Evaluating Critical Interactions in Joint Trajectory Prediction Models	Maarten Hugenholtz et.al.	2506.23164	null
2025-06-29	Benchmarking Deep Search over Heterogeneous Enterprise Data	Prafulla Kumar Choubey et.al.	2506.23139	null
2025-06-29	Learning Motion Skills with Adaptive Assistive Curriculum Force in Humanoid Robots	Zhanxiang Cao et.al.	2506.23125	null
2025-06-29	Curious Causality-Seeking Agents Learn Meta Causal World	Zhiyu Zhao et.al.	2506.23068	null
2025-06-29	AURA: Agent for Understanding, Reasoning, and Automated Tool Use in Voice-Driven Tasks	Leander Melroy Maben et.al.	2506.23049	null
2025-06-29	SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions	Xianzhe Fan et.al.	2506.23046	null
2025-06-28	Fragile, Robust, and Antifragile: A Perspective from Parameter Responses in Reinforcement Learning Under Stress	Zain ul Abdeen et.al.	2506.23036	null
2025-06-28	A "Good" Regulator May Provide a World Model for Intelligent Systems	Bradly Alicea et.al.	2506.23032	null
2025-06-28	Scenario-Based Hierarchical Reinforcement Learning for Automated Driving Decision Making	M. Youssef Abdelhamid et.al.	2506.23023	null
2025-06-28	A Reinforcement Learning Approach for Optimal Control in Microgrids	Davide Salaorni et.al.	2506.22995	null
2025-06-28	Resilient-Native and Intelligent Next-Generation Wireless Systems: Key Enablers, Foundations, and Applications	Mehdi Bennis et.al.	2506.22991	null
2025-06-28	Agent-to-Agent Theory of Mind: Testing Interlocutor Awareness among Large Language Models	Younwoo Choi et.al.	2506.22957	null
2025-06-28	GamerAstra: Enhancing Video Game Accessibility for Blind and Low-Vision Players through a Multi-Agent AI Framework	Tianrun Qiu et.al.	2506.22937	null
2025-06-28	Safe Reinforcement Learning with a Predictive Safety Filter for Motion Planning and Control: A Drifting Vehicle Example	Bei Zhou et.al.	2506.22894	null
2025-06-28	Agentic Enterprise: AI-Centric User to User-Centric AI	Arpit Narechania et.al.	2506.22893	null
2025-06-28	CP-Guard: A Unified, Probability-Agnostic, and Adaptive Framework for Malicious Agent Detection and Defense in Multi-Agent Embodied Perception Systems	Senkang Hu et.al.	2506.22890	null
2025-06-28	Cooperation as Black Box: Conceptual Fluctuation and Diagnostic Tools for Misalignment in MAS	Shayak Nandi et.al.	2506.22876	null
2025-06-28	Momentum-based Accelerated Algorithm for Distributed Optimization under Sector-Bound Nonlinearity	Mohammadreza Doostmohammadian et.al.	2506.22855	null
2025-07-02	DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues	Kyochul Jang et.al.	2506.22853	null
2025-06-28	Knowledge Augmented Finetuning Matters in both RAG and Agent Based Dialog Systems	Yucheng Cai et.al.	2506.22852	null
2025-06-28	Actively induced supercoiling can slow down plasmid solutions by trapping the threading entanglements	Roman Staňo et.al.	2506.22842	null
2025-06-28	Memory as a Service (MaaS): Rethinking Contextual Memory as Service-Oriented Modules for Collaborative Agents	Haichang Li et.al.	2506.22815	null
2025-06-28	BayesLoRA: Task-Specific Uncertainty in Low-Rank Adapters	Cooper Doyle et.al.	2506.22809	null
2025-06-28	Trusted Routing for Blockchain-Enabled Low-Altitude Intelligent Networks	Sijie He et.al.	2506.22745	null
2025-06-28	Questions as cognitive filters	Willem Conradie et.al.	2506.22735	null
2025-06-28	FairMarket-RL: LLM-Guided Fairness Shaping for Multi-Agent Reinforcement Learning in Peer-to-Peer Markets	Shrenik Jadhav et.al.	2506.22708	null
2025-06-28	General Autonomous Cybersecurity Defense: Learning Robust Policies for Dynamic Topologies and Diverse Attackers	Arun Ramamurthy et.al.	2506.22706	null
2025-06-27	Knowledge-Guided Multi-Agent Framework for Automated Requirements Development: A Vision	Jiangping Huang et.al.	2506.22656	null
2025-06-27	URSA: The Universal Research and Scientific Agent	Michael Grosskopf et.al.	2506.22653	null
2025-06-27	QoS-aware State-Augmented Learnable Algorithm for Wireless Coexistence Parameter Management	Mohammad Reza Fasihi et.al.	2506.22652	null
2025-06-27	Entropy Regularized Belief Reporting	Elchin Suleymanov et.al.	2506.22649	null
2025-06-27	Ludax: A GPU-Accelerated Domain Specific Language for Board Games	Graham Todd et.al.	2506.22609	null
2025-06-27	RExBench: Can coding agents autonomously implement AI research extensions?	Nicholas Edwards et.al.	2506.22598	null
2025-06-27	Capacity Planning in Stable Matching with Truthful or Strategic Preference Uncertainty	Maria Bazotte et.al.	2506.22560	null
2025-07-01	Seamless Interaction: Dyadic Audiovisual Motion Modeling and Large-Scale Dataset	Vasu Agrawal et.al.	2506.22554	null
2025-06-26	Integrated Multimodal Sensing and Communication: Challenges, Technologies, and Architectures	Yubo Peng et.al.	2506.22507	null
2025-06-30	The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements	Bingchen Zhao et.al.	2506.22419	null
2025-06-27	Why Are Parsing Actions for Understanding Message Hierarchies Not Random?	Daichi Kato et.al.	2506.22366	null
2025-06-27	Reinforcement Learning with Physics-Informed Symbolic Program Priors for Zero-Shot Wireless Indoor Navigation	Tao Li et.al.	2506.22365	null
2025-07-03	Embodied AI Agents: Modeling the World	Pascale Fung et.al.	2506.22355	null
2025-06-27	Agent-based modeling and the sociology of money: some suggestions for refining monetary theory using social simulation	Eduardo Coltre Ferraciolli et.al.	2506.22318	null
2025-06-27	Artificial Intelligent Disobedience: Rethinking the Agency of Our Artificial Teammates	Reuth Mirsky et.al.	2506.22276	null
2025-06-27	Exploring Modularity of Agentic Systems for Drug Discovery	Laura van Weesep et.al.	2506.22189	null
2025-06-27	Autonomic Microservice Management via Agentic AI and MAPE-K Integration	Matteo Esposito et.al.	2506.22185	null
2025-06-27	A Different Approach to AI Safety: Proceedings from the Columbia Convening on Openness in Artificial Intelligence and AI Safety	Camille François et.al.	2506.22183	null
2025-06-27	ASVSim (AirSim for Surface Vehicles): A High-Fidelity Simulation Framework for Autonomous Surface Vehicle Research	Bavo Lesy et.al.	2506.22174	null
2025-06-27	Learning Distributed Safe Multi-Agent Navigation via Infinite-Horizon Optimal Graph Control	Fenglan Wang et.al.	2506.22117	null
2025-06-27	Flocking with random non-reciprocal interactions	Jiwon Choi et.al.	2506.22060	null
2025-06-27	Universal Retrieval for Multimodal Trajectory Modeling	Xuan Zhang et.al.	2506.22056	null
2025-06-27	TROFI: Trajectory-Ranked Offline Inverse Reinforcement Learning	Alessandro Sestini et.al.	2506.22008	null
2025-06-27	A MILP-Based Solution to Multi-Agent Motion Planning and Collision Avoidance in Constrained Environments	Akshay Jaitly et.al.	2506.21982	null
2025-06-27	SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model	Shuhan Tan et.al.	2506.21976	null
2025-06-27	Don't Trust Generative Agents to Mimic Communication on Social Networks Unless You Benchmarked their Empirical Realism	Simon Münker et.al.	2506.21974	null
2025-06-27	More Vulnerable than You Think: On the Stability of Tool-Integrated LLM Agents	Weimin Xiong et.al.	2506.21967	null
2025-06-27	CAL-RAG: Retrieval-Augmented Multi-Agent Generation for Content-Aware Layout Design	Najmeh Forouzandehmehr et.al.	2506.21934	null
2025-06-27	ARAG: Agentic Retrieval Augmented Generation for Personalized Recommendation	Reza Yousefi Maragheh et.al.	2506.21931	null
2025-06-27	SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding	Zhao Jin et.al.	2506.21924	null
2025-06-27	Advancements and Challenges in Continual Reinforcement Learning: A Comprehensive Review	Amara Zuffer et.al.	2506.21899	null
2025-06-27	Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation	Qiyue Gao et.al.	2506.21876	null
2025-06-27	A Survey of Continual Reinforcement Learning	Chaofan Pan et.al.	2506.21872	null
2025-06-27	GenEscape: Hierarchical Multi-Agent Generation of Escape Room Puzzles	Mengyi Shan et.al.	2506.21839	null
2025-06-26	When Networks Mislead: How Partisan Communication Undermines Democratic Decision-Making	Hsuan-Wei Lee et.al.	2506.21820	null
2025-06-26	CitySim: Modeling Urban Behaviors and City Dynamics with Large-Scale LLM-Driven Agent Simulation	Nicolas Bougie et.al.	2506.21805	null
2025-06-26	Adaptive Multipath-Based SLAM for Distributed MIMO Systems	Xuhong Li et.al.	2506.21798	null
2025-06-26	MobiVerse: Scaling Urban Mobility Simulation with Hybrid Lightweight Domain-Specific Generator and Large Language Models	Yifan Liu et.al.	2506.21784	null
2025-06-26	Simultaneously Fair Allocation of Indivisible Items Across Multiple Dimensions	Yasushi Kawase et.al.	2506.21727	null
2025-06-26	SEEA-R1: Tree-Structured Reinforcement Fine-Tuning for Self-Evolving Embodied Agents	Wanxin Tian et.al.	2506.21669	null
2025-06-26	Monetary Macro Accounting Theory	Renéee Menéndez et.al.	2506.21651	null
2025-06-23	TrajTok: Technical Report for 2025 Waymo Open Sim Agents Challenge	Zhiyuan Zhang et.al.	2506.21618	null
2025-06-26	Whole-Body Conditioned Egocentric Video Prediction	Yutong Bai et.al.	2506.21552	null
2025-06-26	PsyLite Technical Report	Fangjun Ding et.al.	2506.21536	null
2025-07-03	Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge	Boyu Gou et.al.	2506.21506	null
2025-06-26	From multi-allocations to allocations, with subadditive valuations	Uriel Feige et.al.	2506.21493	null
2025-06-29	Ad-Hoc Human-AI Coordination Challenge	Tin Dizdarević et.al.	2506.21490	null
2025-06-26	Reinforcement Learning for Optimal Control of Spin Magnetometers	Logan W. Cooke et.al.	2506.21475	null
2025-06-26	Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents	Tianyi Men et.al.	2506.21252	null
2025-06-26	Dynamic Risk-Aware MPPI for Mobile Robots in Crowds via Efficient Monte Carlo Approximations	Elia Trevisan et.al.	2506.21205	null
2025-06-26	Artificial Delegates Resolve Fairness Issues in Perpetual Voting with Partial Turnout	Apurva Shah et.al.	2506.21186	null
2025-06-26	Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4	Jongyeon Park et.al.	2506.21174	null
2025-06-26	Curriculum-Guided Antifragile Reinforcement Learning for Secure UAV Deconfliction under Observation-Space Attacks	Deepak Kumar Panda et.al.	2506.21129	null
2025-06-26	GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction	Muleilan Pei et.al.	2506.21121	null
2025-06-26	Homogenization of Multi-agent Learning Dynamics in Finite-state Markov Games	Yann Kerzreho et.al.	2506.21079	null
2025-06-26	RL-Selector: Reinforcement Learning-Guided Data Selection via Redundancy Assessment	Suorong Yang et.al.	2506.21037	null
2025-06-26	Evidence-based diagnostic reasoning with multi-agent copilot for human pathology	Chengkuan Chen et.al.	2506.20964	null
2025-06-26	Beyond Reactive Safety: Risk-Aware LLM Alignment via Long-Horizon Simulation	Chenkai Sun et.al.	2506.20949	null
2025-06-26	ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks	Joshua H. Davis et.al.	2506.20938	null
2025-06-26	Quantum Reinforcement Learning Trading Agent for Sector Rotation in the Taiwan Stock Market	Chi-Sheng Chen et.al.	2506.20930	null
2025-06-26	LLM-guided Chemical Process Optimization with a Multi-Agent Approach	Tong Zeng et.al.	2506.20921	null
2025-06-26	*FaSTA $^$ : Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing**	Advait Gupta et.al.	2506.20911	null
2025-06-26	Smoothness Meets Autobidding: Tight Price of Anarchy Bounds for Simultaneous First-Price Auctions	Riccardo Colini-Baldeschi et.al.	2506.20908	null
2025-06-25	Complex Model Transformations by Reinforcement Learning with Uncertain Human Guidance	Kyanna Dagenais et.al.	2506.20883	null
2025-06-28	Decide less, communicate more: On the construct validity of end-to-end fact-checking in medicine	Sebastian Joseph et.al.	2506.20876	null
2025-06-25	GPU Kernel Scientist: An LLM-Driven Framework for Iterative Kernel Optimization	Martin Andrews et.al.	2506.20807	null
2025-06-25	Poster: Enhancing GNN Robustness for Network Intrusion Detection via Agent-based Analysis	Zhonghao Zhan et.al.	2506.20806	null
2025-06-25	A Survey of AI for Materials Science: Foundation Models, LLM Agents, Datasets, and Tools	Minh-Hao Van et.al.	2506.20743	null
2025-06-25	MAGPIE: A dataset for Multi-AGent contextual PrIvacy Evaluation	Gurusha Juneja et.al.	2506.20737	null
2025-06-25	MMSearch-R1: Incentivizing LMMs to Search	Jinming Wu et.al.	2506.20670	null
2025-06-25	The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind	Andrei Lupu et.al.	2506.20664	null
2025-06-25	Memento: Note-Taking for Your Future Self	Chao Wan et.al.	2506.20642	null
2025-06-25	Towards Community-Driven Agents for Machine Learning Engineering	Sijie Li et.al.	2506.20640	null
2025-06-25	Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm	Baixiang Huang et.al.	2506.20606	null
2025-06-25	Fine-Tuning and Prompt Engineering of LLMs, for the Creation of Multi-Agent AI for Addressing Sustainable Protein Production Challenges	Alexander D. Kalian et.al.	2506.20598	null
2025-06-25	An Explicit Solution for the Problem of Optimal Investment with Random Endowment	Michael Donisch et.al.	2506.20506	null
2025-06-25	Engineering Sentience	Konstantin Demin et.al.	2506.20504	null
2025-06-25	Opinion Dynamics with Highly Oscillating Opinions	Víctor A. Vargas-Pérez et.al.	2506.20472	null
2025-06-25	An Agentic System for Rare Disease Diagnosis with Traceable Reasoning	Weike Zhao et.al.	2506.20430	null
2025-06-25	SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models	Dipayan Saha et.al.	2506.20415	null
2025-06-26	TAPS: Tool-Augmented Personalisation via Structured Tagging	Ekaterina Taktasheva et.al.	2506.20409	null
2025-06-25	A Visualization Framework for Exploring Multi-Agent-Based Simulations Case Study of an Electric Vehicle Home Charging Ecosystem	Kristoffer Christensen et.al.	2506.20400	null
2025-06-27	Mobile-R1: Towards Interactive Reinforcement Learning for VLM-Based Mobile Agent via Task-Level Rewards	Jihao Gu et.al.	2506.20332	null
2025-06-26	Finding the Easy Way Through -- the Probabilistic Gap Planner for Social Robot Navigation	Malte Probst et.al.	2506.20320	null
2025-06-25	Exact and approximate maximin share allocations in multi-graphs	George Christodoulou et.al.	2506.20317	null
2025-06-25	Language Modeling by Language Models	Junyan Cheng et.al.	2506.20249	null
2025-06-25	Autonomous Cyber Resilience via a Co-Evolutionary Arms Race within a Fortified Digital Twin Sandbox	Malikussaid et.al.	2506.20102	null
2025-06-25	PSALM-V: Automating Symbolic Planning in Interactive Visual Environments with Large Language Models	Wang Bill Zhu et.al.	2506.20097	null
2025-06-25	From Conversation to Orchestration: HCI Challenges and Opportunities in Interactive Multi-Agentic Systems	Sarah Schömbs et.al.	2506.20091	null
2025-06-24	Beyond Autocomplete: Designing CopilotLens Towards Transparent and Explainable AI Coding Agents	Runlong Ye et.al.	2506.20062	null
2025-06-24	Learning Instruction-Following Policies through Open-Ended Instruction Relabeling with Large Language Models	Zhicheng Zhang et.al.	2506.20061	null
2025-06-26	Consensus-Driven Uncertainty for Robotic Grasping based on RGB Perception	Eric C. Joyce et.al.	2506.20045	null
2025-06-24	Learning Bilateral Team Formation in Cooperative Multi-Agent Reinforcement Learning	Koorosh Moslemi et.al.	2506.20039	null
2025-06-24	Automated Generation of Diverse Courses of Actions for Multi-Agent Operations using Binary Optimization and Graph Learning	Prithvi Poddar et.al.	2506.20031	null
2025-06-24	Polynomial-Time Approximation Schemes via Utility Alignment: Unit-Demand Pricing and More	Robin Bowers et.al.	2506.20030	null
2025-06-24	QHackBench: Benchmarking Large Language Models for Quantum Code Generation Using PennyLane Hackathon Challenges	Abdul Basit et.al.	2506.20008	null
2025-06-24	Can One Safety Loop Guard Them All? Agentic Guard Rails for Federated Computing	Narasimha Raghavan Veeraragavan et.al.	2506.20000	null
2025-06-24	Doc2Agent: Scalable Generation of Tool-Using Agents from API Documentation	Xinyi Ni et.al.	2506.19998	null
2025-07-02	TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design	Geonwoo Cho et.al.	2506.19997	null
2025-06-24	Prover Agent: An Agent-based Framework for Formal Mathematical Proofs	Kaito Baba et.al.	2506.19923	null
2025-06-24	JoyAgents-R1: Joint Evolution Dynamics for Versatile Multi-LLM Agents with Reinforcement Learning	Ai Han et.al.	2506.19846	null
2025-06-24	MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration	Yucheng Zhou et.al.	2506.19835	null
2025-06-24	Curating art exhibitions using machine learning	Eurico Covas et.al.	2506.19813	null
2025-06-24	LLM-Based Social Simulations Require a Boundary	Zengqing Wu et.al.	2506.19806	null
2025-06-24	Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning	Menglong Zhang et.al.	2506.19785	null
2025-06-24	SAGE: Strategy-Adaptive Generation Engine for Query Rewriting	Teng Wang et.al.	2506.19783	null
2025-06-24	A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects	Shulan Ruan et.al.	2506.19769	null
2025-06-24	From Reproduction to Replication: Evaluating Research Agents with Progressive Code Masking	Gyeongwon James Kim et.al.	2506.19724	null
2025-07-02	A Survey of LLM-Driven AI Agent Communication: Protocols, Security Risks, and Defense Countermeasures	Dezhang Kong et.al.	2506.19676	null
2025-06-24	How trust networks shape students' opinions about the proficiency of artificially intelligent assistants	Yutong Bu et.al.	2506.19655	null
2025-06-24	HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions	Mrunmai Vivek Phatak et.al.	2506.19639	null
2025-06-24	Mobile oscillators in a mobile multi-cluster network	Venceslas Nguefoue Meli et.al.	2506.19617	null
2025-06-24	Position: Intelligent Science Laboratory Requires the Integration of Cognitive and Embodied AI	Sha Zhang et.al.	2506.19613	null
2025-06-24	Robotics Under Construction: Challenges on Job Sites	Haruki Uchiito et.al.	2506.19597	null
2025-06-30	Adaptive Domain Modeling with Language Models: A Multi-Agent Approach to Task Planning	Harisankar Babu et.al.	2506.19592	null
2025-06-24	Fake or Real, Can Robots Tell? Evaluating Embodied Vision-Language Models on Real and 3D-Printed Objects	Federico Tavella et.al.	2506.19579	null
2025-06-24	KnowMap: Efficient Knowledge-Driven Task Adaptation for LLMs	Kelin Fu et.al.	2506.19527	null
2025-06-24	MATE: LLM-Powered Multi-Agent Translation Environment for Accessibility Applications	Aleksandr Algazinov et.al.	2506.19502	null
2025-06-24	NaviAgent: Bilevel Planning on Tool Dependency Graphs for Function Calling	Yan Jiang et.al.	2506.19500	null
2025-06-24	SceneCrafter: Controllable Multi-View Driving Scene Editing	Zehao Zhu et.al.	2506.19488	null
2025-06-24	Dialogic Pedagogy for Large Language Models: Aligning Conversational AI with Proven Theories of Learning	Russell Beale et.al.	2506.19484	null
2025-06-24	LLM-based Multi-Agent System for Intelligent Refactoring of Haskell Code	Shahbaz Siddeeq et.al.	2506.19481	null
2025-06-24	Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System	Lixuan He et.al.	2506.19433	null
2025-06-24	Commander-GPT: Dividing and Routing for Multimodal Sarcasm Detection	Yazhou Zhang et.al.	2506.19420	null
2025-06-24	Center of Gravity-Guided Focusing Influence Mechanism for Multi-Agent Reinforcement Learning	Yisak Park et.al.	2506.19417	null
2025-06-24	Is an object-centric representation beneficial for robotic manipulation ?	Alexandre Chapin et.al.	2506.19408	null
2025-06-24	Do cell culturing influence the radiosensitizing effect of gold nanoparticles part 1: scrutinizing recent evidence for data consistency	Hans Rabus et.al.	2506.19372	null
2025-06-24	Computing Tree Structures in Anonymous Graphs via Mobile Agents	Prabhat Kumar Chand et.al.	2506.19365	null
2025-06-24	Distributed Interview Selection for Stable Matching in Large Random Markets	Richard Cole et.al.	2506.19345	null
2025-06-26	The Autonomy of the Lightning Network: A Mathematical and Economic Proof of Structural Decoupling from BTC	Craig Steven Wright et.al.	2506.19333	null
2025-06-24	Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs	Liang Zeng et.al.	2506.19290	null
2025-06-24	Robust Behavior Cloning Via Global Lipschitz Regularization	Shili Wu et.al.	2506.19250	null
2025-06-24	Augmenting Multi-Agent Communication with State Delta Trajectory	Yichen Tang et.al.	2506.19209	null
2025-06-25	Vertex addition to a ball graph with application to reliability and area coverage in autonomous swarms	Calum Buchanan et.al.	2506.19197	null
2025-06-23	Bayesian Evolutionary Swarm Architecture: A Formal Epistemic System Grounded in Truth-Based Competition	Craig Steven Wright et.al.	2506.19191	null
2025-06-23	Distilling Tool Knowledge into Language Models via Back-Translated Traces	Xingyue Huang et.al.	2506.19171	null
2025-06-23	AgenticControl: An Automated Control Design Framework Using Large Language Models	Mohammad Narimani et.al.	2506.19160	null
2025-06-23	Model Reference Adaptive Control of Networked Systems with State and Input Delays	Moh Kamalul Wafi et.al.	2506.19138	null
2025-06-23	Emergent collective dynamics from motile photokinetic organisms	J. Morales et.al.	2506.19081	null
2025-06-23	How brains build higher order representations of uncertainty	Megan A. K. Peters et.al.	2506.19057	null
2025-06-26	From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents	Weizhi Zhang et.al.	2506.18959	null
2025-06-23	A Comment On "The Illusion of Thinking": Reframing the Reasoning Cliff as an Agentic Gap	Sheraz Khan et.al.	2506.18957	null
2025-06-23	SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications	Jinyang Li et.al.	2506.18951	null
2025-06-22	Advanced Applications of Generative AI in Actuarial Science: Case Studies Beyond ChatGPT	Simon Hatzesberger et.al.	2506.18942	null
2025-06-23	Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models	Kiymet Akdemir et.al.	2506.18900	null
2025-06-23	Steering Conceptual Bias via Transformer Latent-Subspace Activation	Vansh Sharma et.al.	2506.18887	null
2025-06-23	GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM	Annika Thomas et.al.	2506.18885	null
2025-06-23	Broad Validity of the First-Order Approach in Moral Hazard	Eduardo Azevedo et.al.	2506.18873	null
2025-06-25	Offline Goal-Conditioned Reinforcement Learning with Projective Quasimetric Planning	Anthony Kobanda et.al.	2506.18847	null
2025-06-23	Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories	Islem Bouzenia et.al.	2506.18824	null
2025-06-23	Multi-Agent Online Control with Adversarial Disturbances	Anas Barakat et.al.	2506.18814	null
2025-06-23	Fair Allocation with Money: What is Your Objective?	Noga Klein Elmalem et.al.	2506.18794	null
2025-06-23	TRIZ Agents: A Multi-Agent LLM Approach for TRIZ-Based Innovation	Kamil Szczepanik et.al.	2506.18783	null
2025-06-23	Temporal Neural Cellular Automata: Application to modeling of contrast enhancement in breast MRI	Daniel M. Lang et.al.	2506.18720	null
2025-06-23	Safety-Aware Optimal Scheduling for Autonomous Masonry Construction using Collaborative Heterogeneous Aerial Robots	Marios-Nektarios Stamatopoulos et.al.	2506.18697	null
2025-06-23	MARL-MambaContour: Unleashing Multi-Agent Deep Reinforcement Learning for Active Contour Optimization in Medical Image Segmentation	Ruicheng Zhang et.al.	2506.18679	null
2025-06-23	MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation	Tianchen Deng et.al.	2506.18678	null
2025-06-23	Dual-level Behavioral Consistency for Inter-group and Intra-group Coordination in Multi-Agent Systems	Shuocun Yang et.al.	2506.18651	null
2025-06-23	Multi-Agent Reinforcement Learning for Inverse Design in Photonic Integrated Circuits	Yannik Mahlau et.al.	2506.18627	null
2025-06-23	Reply to "Emergent LLM behaviors are observationally equivalent to data leakage"	Ariel Flint Ashery et.al.	2506.18600	null
2025-06-23	Agentic Markets: Game Dynamics and Equilibrium in Markets with Learning Agents	Martin Bichler et.al.	2506.18571	null
2025-06-23	Efficient Beam Selection for ISAC in Cell-Free Massive MIMO via Digital Twin-Assisted Deep Reinforcement Learning	Jiexin Zhang et.al.	2506.18560	null
2025-06-23	T-CPDL: A Temporal Causal Probabilistic Description Logic for Developing Logic-RAG Agent	Hong Qing Yu et.al.	2506.18559	null
2025-06-23	Unilateral determination of causal order in a cyclic process	Ilyass Mejdoub et.al.	2506.18540	null
2025-06-23	Transformer World Model for Sample Efficient Multi-Agent Reinforcement Learning	Azad Deihim et.al.	2506.18537	null
2025-06-23	Standard Applicability Judgment and Cross-jurisdictional Reasoning: A RAG-based Framework for Medical Device Compliance	Yu Han et.al.	2506.18511	null
2025-06-23	Reliability-Adjusted Prioritized Experience Replay	Leonard S. Pleiss et.al.	2506.18482	null
2025-06-23	AViLA: Asynchronous Vision-Language Agent for Streaming Multimodal Data Interaction	Gengyuan Zhang et.al.	2506.18472	null
2025-06-23	Networked pointing system: Bearing-only target localization and pointing control	Shiyao Li et.al.	2506.18460	null
2025-06-23	A Motivational Architecture for Open-Ended Learning Challenges in Robots	Alejandro Romero et.al.	2506.18454	null
2025-06-23	GraspMAS: Zero-Shot Language-driven Grasp Detection with Multi-Agent System	Quang Nguyen et.al.	2506.18448	null
2025-06-23	A Large Language Model-based Multi-Agent Framework for Analog Circuits' Sizing Relationships Extraction	Chengjie Liu et.al.	2506.18424	null
2025-06-23	Robots and Children that Learn Together : Improving Knowledge Retention by Teaching Peer-Like Interactive Robots	Imene Tarakli et.al.	2506.18365	null
2025-06-27	Dynamic Knowledge Exchange and Dual-diversity Review: Concisely Unleashing the Potential of a Multi-Agent Research Team	Weilun Yu et.al.	2506.18348	null
2025-06-23	Use Property-Based Testing to Bridge LLM Code Generation and Validation	Lehan He et.al.	2506.18315	null
2025-06-23	A stochastic model for the diffusion of competing opinions with trend-following, opposition, and indifference	Manuel González-Navarrete et.al.	2506.18313	null
2025-06-23	Advanced For-Loop for QML algorithm search	FuTe Wong et.al.	2506.18260	null
2025-06-22	Wisdom of Crowds Through Myopic Self-Confidence Adaptation	Giacomo Como et.al.	2506.18195	null
2025-06-22	Mapping The Invisible Internet: Framework and Dataset	Siddique Abubakr Muntaka et.al.	2506.18159	null
2025-06-22	Chain-of-Memory: Enhancing GUI Agents for Cross-Application Navigation	Xinzge Gao et.al.	2506.18158	null
2025-06-22	CoachGPT: A Scaffolding-based Academic Writing Assistant	Fumian Chen et.al.	2506.18149	null
2025-06-22	Decentralized Consensus Inference-based Hierarchical Reinforcement Learning for Multi-Constrained UAV Pursuit-Evasion Game	Xiang Yuming et.al.	2506.18126	null
2025-06-22	Deep Research Agents: A Systematic Examination And Roadmap	Yuxuan Huang et.al.	2506.18096	null
2025-06-27	MUPA: Towards Multi-Path Agentic Reasoning for Grounded Video Question Answering	Jisheng Dang et.al.	2506.18071	null
2025-06-26	Graphs Meet AI Agents: Taxonomy, Progress, and Future Opportunities	Yuanchen Bei et.al.	2506.18019	null
2025-06-22	Ultra-Efficient Contracts: Breaking the Substitutes Barrier in Combinatorial Contracts	Michal Feldman et.al.	2506.18008	null
2025-06-22	An Axiomatization of the Random Priority Rule	Christian Basteck et.al.	2506.17997	null
2025-06-22	Non-Euclidean Enriched Contraction Theory for Monotone Operators and Monotone Dynamical Systems	Diego Deplano et.al.	2506.17990	null
2025-06-22	GeNIE: A Generalizable Navigation System for In-the-Wild Environments	Jiaming Wang et.al.	2506.17960	null
2025-06-22	ASTER: Adaptive Spatio-Temporal Early Decision Model for Dynamic Resource Allocation	Shulun Chen et.al.	2506.17929	null
2025-06-22	Learning, Reasoning, Refinement: A Framework for Kahneman's Dual-System Intelligence in GUI Agents	Jinjie Wei et.al.	2506.17913	null
2025-06-22	Towards Robust Fact-Checking: A Multi-Agent System with Advanced Evidence Retrieval	Tam Trinh et.al.	2506.17878	null
2025-06-21	Out of Control -- Why Alignment Needs Formal Control Theory (and an Alignment Control Stack)	Elija Perrier et.al.	2506.17846	null
2025-06-21	Reflective Verbal Reward Design for Pluralistic Alignment	Carter Blair et.al.	2506.17834	null
2025-06-21	Is Your Automated Software Engineer Trustworthy?	Noble Saji Mathews et.al.	2506.17812	null
2025-06-21	Bayesian Social Deduction with Graph-Informed Language Models	Shahab Rahimirad et.al.	2506.17788	null
2025-06-21	AnyMAC: Cascading Flexible Multi-Agent Collaboration via Next-Agent Prediction	Song Wang et.al.	2506.17784	null
2025-06-21	Toward Autonomous UI Exploration: The UIExplorer Benchmark	Andrei Cristian Nica et.al.	2506.17779	null
2025-06-21	Optimizing Exploration with a New Uncertainty Framework for Active SLAM Systems	Sebastian Sansoni et.al.	2506.17775	null
2025-06-21	PAGENT: Learning to Patch Software Engineering Agents	Haoran Xue et.al.	2506.17772	null
2025-06-21	CARTS: Collaborative Agents for Recommendation Textual Summarization	Jiao Chen et.al.	2506.17765	null
2025-06-21	Experimental Evidence for the Propagation and Preservation of Machine Discoveries in Human Populations	Levin Brinkmann et.al.	2506.17741	null
2025-06-21	Distributed Butterfly Analysis using Mobile Agents	Prabhat Kumar Chand et.al.	2506.17721	null
2025-06-21	Wealth Thermalization Hypothesis	Klaus M. Frahm et.al.	2506.17720	null
2025-06-21	Beyond Syntax: Action Semantics Learning for App Agents	Bohan Tang et.al.	2506.17697	null
2025-06-21	Network Heterogeneity and Value of Information	Kota Murayama et.al.	2506.17660	null
2025-06-21	Diffusion of Tracer Particles in Early Growing Biofilms. A Computer Simulation Study	Fabian A. Garcia Daza et.al.	2506.17653	null
2025-06-21	May the Feedback Be with You! Unlocking the Power of Feedback-Driven Deep Learning Framework Fuzzing via LLMs	Shaoyu Yang et.al.	2506.17642	null
2025-06-21	JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent	Yunlong Lin et.al.	2506.17612	null
2025-06-26	Taming the Untamed: Graph-Based Knowledge Retrieval and Reasoning for MLLMs to Conquer the Unknown	Bowen Wang et.al.	2506.17589	null
2025-06-21	Towards Zero-Shot Coordination between Teams of Agents: The N-XPlay Framework	Ava Abderezaei et.al.	2506.17560	null
2025-06-24	Breaking Single-Tester Limits: Multi-Agent LLMs for Multi-User Feature Testing	Sidong Feng et.al.	2506.17539	null
2025-06-20	Kaleidoscopic Teaming in Multi Agent Simulations	Ninareh Mehrabi et.al.	2506.17514	null
2025-06-20	A Grassroots Network and Community Roadmap for Interconnected Autonomous Science Laboratories for Accelerated Discovery	Rafael Ferreira da Silva et.al.	2506.17510	null
2025-06-20	From Unstructured Communication to Intelligent RAG: Multi-Agent Automation for Supply Chain Knowledge Bases	Yao Zhang et.al.	2506.17484	null
2025-06-20	General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting	Bernard Lange et.al.	2506.17462	null
2025-06-20	OmniReflect: Discovering Transferable Constitutions for LLM agents via Neuro-Symbolic Reflections	Manasa Bharadwaj et.al.	2506.17449	null
2025-06-20	Resource Rational Contractualism Should Guide AI Alignment	Sydney Levine et.al.	2506.17434	null
2025-06-20	UProp: Investigating the Uncertainty Propagation of LLMs in Multi-Step Agentic Decision-Making	Jinhao Duan et.al.	2506.17419	null
2025-06-20	Challenges in Grounding Language in the Real World	Peter Lindes et.al.	2506.17375	null
2025-06-20	Cash or Comfort? How LLMs Value Your Inconvenience	Mateusz Cedro et.al.	2506.17367	null
2025-06-19	Advanced Game-Theoretic Frameworks for Multi-Agent AI Challenges: A 2025 Outlook	Pavel Malinovskiy et.al.	2506.17348	null
2025-06-19	Adaptive Social Metaverse Streaming based on Federated Multi-Agent Deep Reinforcement Learning	Zijian Long et.al.	2506.17342	null
2025-06-19	AI is the Strategy: From Agentic AI to Autonomous Business Models onto Strategy in the Age of AI	René Bohnsack et.al.	2506.17339	null
2025-06-24	PBFT-Backed Semantic Voting for Multi-Agent Memory Pruning	Duong Bach et.al.	2506.17338	null
2025-06-19	Privacy-Preserving LLM Interaction with Socratic Chain-of-Thought Reasoning and Homomorphically Encrypted Vector Databases	Yubeen Bae et.al.	2506.17336	link
2025-06-19	LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research	Shuo Yan et.al.	2506.17335	null
2025-06-19	Beyond Prediction -- Structuring Epistemic Integrity in Artificial Reasoning Systems	Craig Steven Wright et.al.	2506.17331	null
2025-06-18	MAARTA:Multi-Agentic Adaptive Radiology Teaching Assistant	Akash Awasthi et.al.	2506.17320	null
2025-06-18	Context manipulation attacks : Web agents are susceptible to corrupted memory	Atharv Singh Patlan et.al.	2506.17318	null
2025-06-18	Can Large Language Models Be Trusted Paper Reviewers? A Feasibility Study	Chuanlei Li et.al.	2506.17311	null
2025-06-17	SafeRL-Lite: A Lightweight, Explainable, and Constrained Reinforcement Learning Library	Satyam Mishra et.al.	2506.17297	null
2025-06-25	VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning	Zhangyang Qi et.al.	2506.17221	null
2025-06-20	Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation	Xiuyu Yang et.al.	2506.17213	link
2025-06-20	Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems	Matias Martinez et.al.	2506.17208	null
2025-06-20	Towards AI Search Paradigm	Yuchen Li et.al.	2506.17188	null
2025-06-20	Capturing Misalignment	Pierfrancesco Guarino et.al.	2506.17176	null
2025-06-20	A Note on Proper Relational Structures	Adam Bjorndahl et.al.	2506.17142	null
2025-06-20	When Can Model-Free Reinforcement Learning be Enough for Thinking?	Josiah P. Hanna et.al.	2506.17124	null
2025-06-20	A general multi-stratum model for a nanofunctionalized releasing capsule: a computational study	Elia Onofri et.al.	2506.17078	null
2025-06-20	Behavior Driven Development for 3D Games	Fernando Pastor Ricós et.al.	2506.17057	null
2025-06-20	Scalable and Reliable Multi-agent Reinforcement Learning for Traffic Assignment	Leizhen Wang et.al.	2506.17029	null
2025-06-20	A Synthetic Benchmark for Collaborative 3D Semantic Occupancy Prediction in V2X Autonomous Driving	Hanlin Wu et.al.	2506.17004	null
2025-06-20	Elevating Styled Mahjong Agents with Learning from Demonstration	Lingfeng Li et.al.	2506.16995	null
2025-06-20	RAGentA: Multi-Agent Retrieval-Augmented Generation for Attributed Question Answering	Ines Besrour et.al.	2506.16988	link
2025-06-20	Formal Control for Uncertain Systems via Contract-Based Probabilistic Surrogates (Extended Version)	Oliver Schön et.al.	2506.16971	null
2025-06-20	LunarLoc: Segment-Based Global Localization on the Moon	Annika Thomas et.al.	2506.16940	link
2025-06-20	Do You Know What I Mean? A Syntactic Representation for Differential Bounded Awareness	Ani Guerdjikova et.al.	2506.16901	null
2025-06-20	Engineering Resilience: An Energy-Based Approach to Sustainable Behavioural Interventions	Arpitha Srivathsa Malavalli et.al.	2506.16836	null
2025-06-20	Integrating Traditional Technical Analysis with AI: A Multi-Agent LLM-Based Approach to Stock Market Forecasting	Michał Wawer et.al.	2506.16813	null
2025-06-20	Distributed Affine Formation Control of Linear Multi-agent Systems with Adaptive Event-triggering	Chenjun Liu et.al.	2506.16797	null
2025-06-20	Language-Informed Synthesis of Rational Agent Models for Grounded Theory-of-Mind Reasoning On-The-Fly	Lance Ying et.al.	2506.16755	null
2025-06-20	Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation	Kosuke Nakanishi et.al.	2506.16753	link
2025-06-20	A Scalable Post-Processing Pipeline for Large-Scale Free-Space Multi-Agent Path Planning with PiBT	Arjo Chakravarty et.al.	2506.16748	link
2025-06-20	Incentivizing High-quality Participation From Federated Learning Agents	Jinlong Pang et.al.	2506.16731	null
2025-06-20	DRARL: Disengagement-Reason-Augmented Reinforcement Learning for Efficient Improvement of Autonomous Driving Policy	Weitao Zhou et.al.	2506.16720	null
2025-06-20	Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation	Chenxu Wang et.al.	2506.16718	link
2025-06-20	Mean-field and Monte Carlo Analysis of Multi-Species Dynamics of agents	Eduardo Velasco Stock et.al.	2506.16717	null
2025-06-20	Exploring Traffic Simulation and Cybersecurity Strategies Using Large Language Models	Lu Gao et.al.	2506.16699	null
2025-06-20	Interpretable Low-Dimensional Modeling of Spatiotemporal Agent States for Decision Making in Football Tactics	Kenjiro Ide et.al.	2506.16696	null
2025-06-20	Closed curve covering and multiagent TSP ratios	Travis Dillon et.al.	2506.16675	null
2025-06-19	SemAgent: A Semantics Aware Program Repair Agent	Anvith Pabba et.al.	2506.16650	null
2025-06-19	Distribution Parameter Actor-Critic: Shifting the Agent-Environment Boundary for Diverse Action Spaces	Jiamin He et.al.	2506.16608	null
2025-06-19	AI-Driven Tools in Modern Software Quality Assurance: An Assessment of Benefits, Challenges, and Future Directions	Ihor Pysmennyi et.al.	2506.16586	link
2025-06-19	ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning	Zexi Liu et.al.	2506.16499	null
2025-06-19	Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support	Sophie Chiang et.al.	2506.16473	null
2025-06-19	StoryWriter: A Multi-Agent Framework for Long Story Generation	Haotian Xia et.al.	2506.16445	null
2025-06-19	Agentic Personalisation of Cross-Channel Marketing Experiences	Sami Abboud et.al.	2506.16429	null
2025-06-19	When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework	Zhen Xu et.al.	2506.16411	null
2025-06-19	IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks	Xiaoya Lu et.al.	2506.16402	null
2025-06-19	GoalLadder: Incremental Goal Discovery with Vision-Language Models	Alexey Zakharov et.al.	2506.16396	null
2025-06-19	AGC-Drive: A Large-Scale Dataset for Real-World Aerial-Ground Collaboration in Driving Scenarios	Yunhao Hou et.al.	2506.16371	link
2025-06-19	Data-Driven Policy Mapping for Safe RL-based Energy Management Systems	Theo Zangato et.al.	2506.16352	null
2025-06-19	Improved Exploration in GFlownets via Enhanced Epistemic Neural Networks	Sajan Muhammad et.al.	2506.16313	null
2025-06-19	M-Predictive Spliner: Enabling Spatiotemporal Multi-Opponent Overtaking for Autonomous Racing	Nadine Imholz et.al.	2506.16301	null
2025-06-19	Coordination of Electrical and Heating Resources by Self-Interested Agents	Rico Schrage et.al.	2506.16277	null
2025-06-19	VideoGAN-based Trajectory Proposal for Automated Vehicles	Annajoyce Mariani et.al.	2506.16209	link
2025-06-19	Solving Zero-Sum Convex Markov Games	Fivos Kalogiannis et.al.	2506.16120	null
2025-06-19	Towards AI-Driven RANs for 6G and Beyond: Architectural Advancements and Future Horizons	Mathushaharan Rathakrishnan et.al.	2506.16070	null
2025-06-19	Human-Centered Shared Autonomy for Motor Planning, Learning, and Control Applications	MH Farhadi et.al.	2506.16044	null
2025-06-19	OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents	Reyna Abhyankar et.al.	2506.16042	null
2025-06-19	DualTHOR: A Dual-Arm Humanoid Simulation Platform for Contingency-Aware Planning	Boyu Li et.al.	2506.16012	link
2025-06-19	SimuPanel: A Novel Immersive Multi-Agent System to Simulate Interactive Expert Panel Discussion	Xiangyang He et.al.	2506.16010	null
2025-06-19	HybridRAG-based LLM Agents for Low-Carbon Optimization in Low-Altitude Economy Networks	Jinbo Wen et.al.	2506.15947	null
2025-06-19	On the optimal regret of collaborative personalized linear bandits	Bruce Huang et.al.	2506.15943	null
2025-06-19	Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues	Myke C. Cohen et.al.	2506.15928	null
2025-06-23	From RAG to Agentic: Validating Islamic-Medicine Responses with LLM Agents	Mohammad Amaan Sayeed et.al.	2506.15911	null
2025-06-18	Fair Contracts in Principal-Agent Games with Heterogeneous Types	Jakub Tłuczek et.al.	2506.15887	null
2025-06-18	Modeling society with a responsible elite	Yana Tsodikova et.al.	2506.15877	null
2025-06-18	CooperRisk: A Driving Risk Quantification Pipeline with Multi-Agent Cooperative Perception and Prediction	Mingyue Lei et.al.	2506.15868	null
2025-06-18	Understanding Online Polarization Through Human-Agent Interaction in a Synthetic LLM-Based Social Network	Tim Donkers et.al.	2506.15866	null
2025-06-18	Improving Robotic Manipulation: Techniques for Object Pose Estimation, Accommodating Positional Uncertainty, and Disassembly Tasks from Examples	Viral Rasik Galaiya et.al.	2506.15865	null
2025-06-18	Learning to Coordinate Under Threshold Rewards: A Cooperative Multi-Agent Bandit Framework	Michael Ledford et.al.	2506.15856	null
2025-06-18	MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents	Zijian Zhou et.al.	2506.15841	null
2025-06-18	Context Matters! Relaxing Goals with LLMs for Feasible 3D Scene Planning	Emanuele Musumeci et.al.	2506.15828	null
2025-06-18	Heterogeneous Federated Reinforcement Learning Using Wasserstein Barycenters	Luiz Pereira et.al.	2506.15825	null
2025-06-18	Veracity: An Open-Source AI Fact-Checking System	Taylor Lynn Curtis et.al.	2506.15794	null
2025-06-18	Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation	Ruoyu Wang et.al.	2506.15757	null
2025-06-18	RecBayes: Recurrent Bayesian Ad Hoc Teamwork in Large Partially Observable Domains	João G. Ribeiro et.al.	2506.15756	null
2025-06-23	OAgents: An Empirical Study of Building Effective Agents	He Zhu et.al.	2506.15741	null
2025-06-17	SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Agents	Jonathan Kutasov et.al.	2506.15740	null
2025-06-20	Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence	Yining Hong et.al.	2506.15677	null
2025-06-18	Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers	Tommaso Green et.al.	2506.15674	link
2025-06-18	SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence	Yao Zhang et.al.	2506.15672	null
2025-06-18	PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection	Wenhao Li et.al.	2506.15656	null
2025-06-18	FindingDory: A Benchmark to Evaluate Memory in Embodied Agents	Karmesh Yadav et.al.	2506.15635	null
2025-06-18	The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games	Lyle Goodyear et.al.	2506.15624	null
2025-06-18	Multi-Agent, Multi-Scale Systems with the Koopman Operator	Craig Bakker et.al.	2506.15589	null
2025-06-18	Learning to flock in open space by avoiding collisions and staying together	Martino Brambati et.al.	2506.15587	null
2025-06-18	Managing Complex Failure Analysis Workflows with LLM-based Reasoning and Acting Agents	Aline Dobrovsky et.al.	2506.15567	null
2025-06-18	Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning	Roger Creus Castanyer et.al.	2506.15544	link
2025-06-18	Co-Creative Learning via Metropolis-Hastings Interaction between Humans and AI	Ryota Okumura et.al.	2506.15468	null
2025-06-18	AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need	Zhouhong Gu et.al.	2506.15451	link
2025-06-18	Understanding GUI Agent Localization Biases through Logit Sharpness	Xingjian Tao et.al.	2506.15425	null
2025-06-18	Reward Models in Deep Reinforcement Learning: A Survey	Rui Yu et.al.	2506.15421	null
2025-06-18	Multi-Timescale Gradient Sliding for Distributed Optimization	Junhui Zhang et.al.	2506.15387	null
2025-06-18	Tractable Graph Structures in EFX Orientation	Václav Blažej et.al.	2506.15379	null
2025-06-18	Efficient and Generalizable Environmental Understanding for Visual Navigation	Ruoyu Wang et.al.	2506.15377	null
2025-06-18	Learning to Maximize Quantum Neural Network Expressivity via Effective Rank	Juan Yao et.al.	2506.15375	null
2025-06-18	Designing Intent: A Multimodal Framework for Human-Robot Cooperation in Industrial Workspaces	Francesco Chiossi et.al.	2506.15293	null
2025-06-18	RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments	Yuchuan Fu et.al.	2506.15253	link
2025-06-18	Joint Computation Offloading and Resource Allocation for Uncertain Maritime MEC via Cooperation of UAVs and Vessels	Jiahao You et.al.	2506.15225	null
2025-06-18	Multi-Agent Reinforcement Learning for Autonomous Multi-Satellite Earth Observation: A Realistic Case Study	Mohamad A. Hady et.al.	2506.15207	null
2025-06-18	ImprovDML: Improved Trade-off in Private Byzantine-Resilient Distributed Machine Learning	Bing Liu et.al.	2506.15181	null
2025-06-18	From LLMs to MLLMs to Agents: A Survey of Emerging Paradigms in Jailbreak Attacks and Defenses within LLM Ecosystem	Yanxu Mao et.al.	2506.15170	null
2025-06-18	Efficient reallocation of indivisible resources: Pair-efficiency versus Pareto-efficiency	Pinaki Mandal et.al.	2506.15169	null
2025-06-18	LLM Agent for Hyper-Parameter Optimization	Wanzhe Wang et.al.	2506.15167	null
2025-06-18	Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs	Jing Yang Lee et.al.	2506.15131	null
2025-06-19	Local Differential Privacy for Distributed Stochastic Aggregative Optimization with Guaranteed Optimality	Ziqin Chen et.al.	2506.15106	null
2025-06-18	DyNaVLM: Zero-Shot Vision-Language Navigation System with Dynamic Viewpoints and Self-Refining Graph Memory	Zihe Ji et.al.	2506.15096	null
2025-06-18	EmojiVoice: Towards long-term controllable expressivity in robot speech	Paige Tuttösí et.al.	2506.15085	null
2025-06-18	HEAL: An Empirical Study on Hallucinations in Embodied Agents Driven by Large Language Models	Trishna Chakraborty et.al.	2506.15065	null
2025-06-18	2BSDE with uncertain horizon and application to stochastic control in erratic environments	Alberto Gennaro et.al.	2506.15037	null
2025-06-19	Context Matters: Learning Generalizable Rewards via Calibrated Features	Alexandra Forsey-Smerek et.al.	2506.15012	null
2025-06-17	MEAL: A Benchmark for Continual Multi-Agent Reinforcement Learning	Tristan Tomilin et.al.	2506.14990	link
2025-06-17	Fair Algorithms with Probing for Multi-Agent Multi-Armed Bandits	Tianyi Xu et.al.	2506.14988	null
2025-06-17	OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents	Thomas Kuntz et.al.	2506.14866	link
2025-06-17	Cost-Efficient Serving of LLM Agents via Test-Time Plan Caching	Qizheng Zhang et.al.	2506.14852	null
2025-06-13	Recent Advances in Multi-Agent Human Trajectory Prediction: A Comprehensive Review	Céline Finet et.al.	2506.14831	null
2025-06-17	RobotSmith: Generative Robotic Tool Design for Acquisition of Complex Manipulation Skills	Chunru Lin et.al.	2506.14763	null
2025-06-17	Swarm-STL: A Framework for Motion Planning in Large-Scale, Multi-Swarm Systems	Shiyu Cheng et.al.	2506.14749	null
2025-06-17	AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes	Jiahao Qiu et.al.	2506.14728	null
2025-06-17	Linear Planar 3-SAT and Its Applications in Planning	Victorien Desbois et.al.	2506.14713	null
2025-06-17	AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions	Aishan Liu et.al.	2506.14697	null
2025-06-17	Factor-Graph-Based Passive Acoustic Navigation for Decentralized Cooperative Localization Using Bearing Elevation Depth Difference	Kalliyan Velasco et.al.	2506.14690	null
2025-06-17	Unified Software Engineering agent as AI Software Engineer	Leonhard Applis et.al.	2506.14683	null
2025-06-17	StreetLens: Enabling Human-Centered AI Agents for Neighborhood Assessment from Street View Imagery	Jina Kim et.al.	2506.14670	null
2025-06-17	SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning	Hexian Ni et.al.	2506.14648	null
2025-06-17	GenerationPrograms: Fine-grained Attribution with Executable Programs	David Wan et.al.	2506.14580	link
2025-06-17	Doppelgänger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack	Daewon Kang et.al.	2506.14539	null
2025-06-17	Automated Decision-Making on Networks with LLMs through Knowledge-Guided Evolution	Xiaohan Zheng et.al.	2506.14529	null
2025-06-17	SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks	Zijian Song et.al.	2506.14512	null
2025-06-17	Toward Safety-First Human-Like Decision Making for Autonomous Vehicles in Time-Varying Traffic Flow	Xiao Wang et.al.	2506.14502	null
2025-06-17	LLM-Powered Swarms: A New Frontier or a Conceptual Stretch?	Muhammad Atta Ur Rahman et.al.	2506.14496	null
2025-06-17	GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies	Jingqi Yang et.al.	2506.14477	link
2025-06-17	SimSpark: Interactive Simulation of Social Media Behaviors	Ziyue Lin et.al.	2506.14476	null
2025-06-17	Hamiltonian Formalism for Comparing Quantum and Classical Intelligence	Elija Perrier et.al.	2506.14456	null
2025-06-17	Active Digital Twins via Active Inference	Matteo Torzoni et.al.	2506.14453	null
2025-06-17	Adaptive Reinforcement Learning for Unobservable Random Delays	John Wikman et.al.	2506.14411	null
2025-06-17	System 0: Transforming Artificial Intelligence into a Cognitive Extension	Massimo Chiriatti et.al.	2506.14376	null
2025-06-18	ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies	Jinyan Yuan et.al.	2506.14315	null
2025-06-17	Expectation Confirmation Preference Optimization for Multi-Turn Conversational Recommendation Agent	Xueyang Feng et.al.	2506.14302	null
2025-06-17	ADRD: LLM-Driven Autonomous Driving Based on Rule-based Decision Systems	Fanzhi Zeng et.al.	2506.14299	null
2025-06-17	From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents	Seongbo Jang et.al.	2506.14285	link
2025-06-17	Mxplainer: Explain and Learn Insights by Imitating Mahjong Agents	Lingfeng Li et.al.	2506.14246	link
2025-06-17	A Novel Indicator for Quantifying and Minimizing Information Utility Loss of Robot Teams	Xiyu Zhao et.al.	2506.14237	null
2025-06-17	Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team	Md Tanzib Hosain et.al.	2506.14234	null
2025-06-17	AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents	Jingxu Xie et.al.	2506.14205	link
2025-06-17	MAS-LitEval : Multi-Agent System for Literary Translation Quality Assessment	Junghwan Kim et.al.	2506.14199	null
2025-06-17	Hierarchical Multi-Agent Reinforcement Learning-based Coordinated Spatial Reuse for Next Generation WLANs	Jiaming Yu et.al.	2506.14187	null
2025-06-17	Affective-CARA: A Knowledge Graph Driven Framework for Culturally Adaptive Emotional Intelligence in HCI	Nirodya Pussadeniya et.al.	2506.14166	null
2025-06-17	Light Aircraft Game : Basic Implementation and training results analysis	Hanzhong Cao et.al.	2506.14164	link
2025-06-17	Common Benchmarks Undervalue the Generalization Power of Programmatic Policies	Amirhossein Rajabpour et.al.	2506.14162	link
2025-06-17	StorySage: Conversational Autobiography Writing Powered by a Multi-Agent Framework	Shayan Talaei et.al.	2506.14159	null
2025-06-17	Dividing Conflicting Items Fairly	Ayumi Igarashi et.al.	2506.14149	null
2025-06-17	RadFabric: Agentic AI System with Reasoning Capability for Radiology	Wenting Chen et.al.	2506.14142	null
2025-06-17	FormGym: Doing Paperwork with Agents	Matthew Toles et.al.	2506.14079	null
2025-06-17	Comprehensive Verilog Design Problems: A Next-Generation Benchmark Dataset for Evaluating Large Language Models and Agents on RTL Design and Verification	Nathaniel Pinckney et.al.	2506.14074	link
2025-06-16	Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning	Martin Klissarov et.al.	2506.14045	null
2025-06-16	SimpleDoc: Multi-Modal Document Understanding with Dual-Cue Page Retrieval and Iterative Refinement	Chelsi Jain et.al.	2506.14035	link
2025-06-16	A Cooperative Contactless Object Transport with Acoustic Robots	Narsimlu Kemsaram et.al.	2506.13957	link
2025-06-16	ReinDSplit: Reinforced Dynamic Split Learning for Pest Recognition in Precision Agriculture	Vishesh Kumar Tanwar et.al.	2506.13935	null
2025-06-16	How Does LLM Reasoning Work for Code? A Survey and a Call to Action	Ira Ceka et.al.	2506.13932	null
2025-06-16	Spec2RTL-Agent: Automated Hardware Code Generation from Complex Specifications Using LLM Agent Systems	Zhongzhi Yu et.al.	2506.13905	null
2025-06-16	LocationReasoner: Evaluating LLMs on Real-World Site Selection Reasoning	Miho Koda et.al.	2506.13841	link
2025-06-16	Recent trends in socio-epidemic modelling: behaviours and their determinants	Daniele Proverbio et.al.	2506.13837	null
2025-06-15	The Reflexive Integrated Information Unit: A Differentiable Primitive for Artificial Consciousness	Gnankan Landry Regis N'guessan et.al.	2506.13825	link
2025-06-15	The Synthetic Mirror -- Synthetic Data at the Age of Agentic AI	Marcelle Momha et.al.	2506.13818	null
2025-06-14	DeepSeq: High-Throughput Single-Cell RNA Sequencing Data Labeling via Web Search-Augmented Agentic Generative AI Foundation Models	Saleem A. Al Dajani et.al.	2506.13817	null
2025-06-13	Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study	Sompote Youwai et.al.	2506.13811	null
2025-06-13	Causality in the human niche: lessons for machine learning	Richard D. Lange et.al.	2506.13803	null
2025-06-13	Enhancing Clinical Decision Support and EHR Insights through LLMs and the Model Context Protocol: An Open-Source MCP-FHIR Framework	Abul Ehtesham et.al.	2506.13800	null
2025-06-16	MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering	Arya Fayyazi et.al.	2506.13755	null
2025-06-16	PB $^2$ : Preference Space Exploration via Population-Based Methods in Preference-Based Reinforcement Learning	Brahim Driss et.al.	2506.13741	null
2025-06-16	The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning	Jiashun Liu et.al.	2506.13672	null
2025-06-16	We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems	Junfeng Fang et.al.	2506.13666	link
2025-06-16	Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning	Shulin Tian et.al.	2506.13654	null
2025-06-16	xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations	Kaiyuan Chen et.al.	2506.13651	null
2025-06-16	Deceptive Path Planning: A Bayesian Game Approach	Violetta Rostobaya et.al.	2506.13650	null
2025-06-16	CAMS: A CityGPT-Powered Agentic Framework for Urban Human Mobility Simulation	Yuwei Du et.al.	2506.13599	null
2025-06-16	Agent Capability Negotiation and Binding Protocol (ACNBP)	Ken Huang et.al.	2506.13590	link
2025-06-16	Non-exchangeable mean-field theory for adaptive weights: propagation of chaos and graphon sampling lemma	Datong Zhou et.al.	2506.13587	null
2025-06-16	Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes	Bernhard Hilpert et.al.	2506.13583	null
2025-06-17	A Production Scheduling Framework for Reinforcement Learning Under Real-World Constraints	Jonathan Hoss et.al.	2506.13566	link
2025-06-16	Learning Swing-up Maneuvers for a Suspended Aerial Manipulation Platform in a Hierarchical Control Framework	Hemjyoti Das et.al.	2506.13478	null
2025-06-16	Language Agents for Hypothesis-driven Clinical Decision Making with Reinforcement Learning	David Bani-Harouni et.al.	2506.13474	null
2025-06-16	A Two-stage Optimization Method for Wide-range Single-electron Quantum Magnetic Sensing	Shiqian Guo et.al.	2506.13469	null
2025-06-16	Towards a Formal Specification for Self-organized Shape Formation in Swarm Robotics	YR Darr et.al.	2506.13453	null
2025-06-16	Learning to Explore in Diverse Reward Settings via Temporal-Difference-Error Maximization	Sebastian Griesbach et.al.	2506.13345	link
2025-06-16	Towards Pervasive Distributed Agentic Generative AI -- A State of The Art	Gianni Molinari et.al.	2506.13324	null
2025-06-16	RL-Guided MPC for Autonomous Greenhouse Control	Salim Msaad et.al.	2506.13278	null
2025-06-16	Screen Reader Users in the Vibe Coding Era: Adaptation, Empowerment, and New Accessibility Landscape	Nan Chen et.al.	2506.13270	null
2025-06-16	Reconstruction-free magnetic control of DIII-D plasma with deep reinforcement learning	G. F. Subbotin et.al.	2506.13267	null
2025-06-16	COME: Adding Scene-Centric Forecasting Control to Occupancy World Model	Yining Shi et.al.	2506.13260	link
2025-06-16	On Immutable Memory Systems for Artificial Agents: A Blockchain-Indexed Automata-Theoretic Framework Using ECDH-Keyed Merkle Chains	Craig Steven Wright et.al.	2506.13246	null
2025-06-16	A Game-Theoretic Negotiation Framework for Cross-Cultural Consensus in LLMs	Guoxi Zhang et.al.	2506.13245	null
2025-06-16	Mixed-variable policy-based optimization	Jonathan Viquerat et.al.	2506.13240	null
2025-06-16	Research on Optimal Control Problem Based on Reinforcement Learning under Knightian Uncertainty	Ziyu Li et.al.	2506.13207	null
2025-06-19	Screen Hijack: Visual Poisoning of VLM Agents in Mobile Environments	Xuan Wang et.al.	2506.13205	null
2025-06-16	Querying Large Automotive Software Models: Agentic vs. Direct LLM Approaches	Lukasz Mazur et.al.	2506.13171	null
2025-06-16	Efficient Algorithms for Logistic Contextual Slate Bandits with Bandit Feedback	Tanmay Goyal et.al.	2506.13163	null
2025-06-16	Dynamic Preference Multi-Objective Reinforcement Learning for Internet Network Management	DongNyeong Heo et.al.	2506.13153	null
2025-06-16	AlphaEvolve: A coding agent for scientific and algorithmic discovery	Alexander Novikov et.al.	2506.13131	null
2025-06-16	Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning	Stella C. Dong et.al.	2506.13113	null
2025-06-16	Leveraging In-Context Learning for Language Model Agents	Shivanshu Gupta et.al.	2506.13109	null
2025-06-17	Towards the Autonomous Optimization of Urban Logistics: Training Generative AI with Scientific Tools via Agentic Digital Twins and Model Context Protocol	Haowen Xu et.al.	2506.13068	link
2025-06-16	MotiveBench: How Far Are We From Human-Like Motivational Reasoning in Large Language Models?	Xixian Yong et.al.	2506.13065	null
2025-06-16	PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue	George Shaikovski et.al.	2506.13063	null
2025-06-16	MAGIC: Multi-Agent Argumentation and Grammar Integrated Critiquer	Joaquin Jordan et.al.	2506.13037	null
2025-06-15	Discovering Coordinated Processes From Social Online Networks	Anna Kalenkova et.al.	2506.12988	link
2025-06-15	On Hierarchies of Fairness Notions in Cake Cutting: From Proportionality to Super Envy-Freeness	Arnav Mehra et.al.	2506.12950	null
2025-06-15	Scaling Test-time Compute for LLM Agents	King Zhu et.al.	2506.12928	null
2025-06-15	Sectoral Coupling in Linguistic State Space	Sebastian Dumbrava et.al.	2506.12927	null
2025-06-15	Distributed Composite Optimization with Sub-Weibull Noises	Zhan Yu et.al.	2506.12901	null
2025-06-15	Homeostatic Coupling for Prosocial Behavior	Naoto Yoshida et.al.	2506.12894	null
2025-06-15	Exploring the Potential of Metacognitive Support Agents for Human-AI Co-Creation	Frederic Gmeiner et.al.	2506.12879	null
2025-06-15	WereWolf-Plus: An Update of Werewolf Game setting Based on DSGBench	Xinyuan Xia et.al.	2506.12841	null
2025-06-15	Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models	Tung Minh Luu et.al.	2506.12822	null
2025-06-15	PDCNet: a benchmark and general deep learning framework for activity prediction of peptide-drug conjugates	Yun Liu et.al.	2506.12821	null
2025-06-15	Mastering Da Vinci Code: A Comparative Study of Transformer, LLM, and PPO-based Agents	LeCheng Zhang et.al.	2506.12801	null
2025-06-15	Resilient-native and Intelligent NextG Systems	Mehdi Bennis et.al.	2506.12795	null
2025-06-15	Revealing the Challenges of Sim-to-Real Transfer in Model-Based Reinforcement Learning via Latent Space Modeling	Zhilin Lin et.al.	2506.12735	null
2025-06-15	Multimodal Large Language Models-Enabled UAV Swarm: Towards Efficient and Intelligent Autonomous Aerial Systems	Yuqi Ping et.al.	2506.12710	null
2025-06-15	SoK: The Privacy Paradox of Large Language Models: Advancements, Privacy Risks, and Mitigation	Yashothara Shanmugarasa et.al.	2506.12699	null
2025-06-15	SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation	Xiaofeng Shi et.al.	2506.12689	null
2025-06-14	LIFELONG SOTOPIA: Evaluating Social Intelligence of Language Agents Over Lifelong Social Interactions	Hitesh Goel et.al.	2506.12666	null
2025-06-14	Behavioral Generative Agents for Energy Operations	Cong Chen et.al.	2506.12664	null
2025-06-14	Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics	Jiarui Liu et.al.	2506.12657	null
2025-06-14	Mapping Neural Signals to Agent Performance, A Step Towards Reinforcement Learning from Neural Feedback	Julia Santaniello et.al.	2506.12636	null
2025-06-14	Towards Building General Purpose Embedding Models for Industry 4.0 Agents	Christodoulos Constantinides et.al.	2506.12607	null
2025-06-17	The Rise of AI Companions: How Human-Chatbot Relationships Influence Well-Being	Yutong Zhang et.al.	2506.12605	null
2025-06-14	Trust-MARL: Trust-Based Multi-Agent Reinforcement Learning Framework for Cooperative On-Ramp Merging Control in Heterogeneous Traffic Flow	Jie Pan et.al.	2506.12600	null
2025-06-14	Moment Restrictions for Nonlinear Panel Data Models with Feedback	Stéphane Bonhomme et.al.	2506.12569	null
2025-06-17	AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task Solving	Wentao Zhang et.al.	2506.12508	link
2025-06-18	Wasserstein-Barycenter Consensus for Cooperative Multi-Agent Reinforcement Learning	Ali Baheri et.al.	2506.12497	null
2025-06-14	Tiered Agentic Oversight: A Hierarchical Multi-Agent System for AI Safety in Healthcare	Yubin Kim et.al.	2506.12482	null
2025-06-14	Generalizable Trajectory Prediction via Inverse Reinforcement Learning with Mamba-Graph Architecture	Wenyun Li et.al.	2506.12474	null
2025-06-14	Levels of Autonomy for AI Agents	K. J. Kevin Feng et.al.	2506.12469	null
2025-06-14	Adding links wisely: how an influencer seeks for leadership in opinion dynamics?	Lingfei Wang et.al.	2506.12463	null
2025-06-14	Topology-Assisted Spatio-Temporal Pattern Disentangling for Scalable MARL in Large-scale Autonomous Traffic Control	Rongpeng Li et.al.	2506.12453	null
2025-06-14	Plan Your Travel and Travel with Your Plan: Wide-Horizon Planning and Evaluation via LLM	Dongjie Yang et.al.	2506.12421	null
2025-06-14	Ghost Policies: A New Paradigm for Understanding and Learning from Failure in Deep Reinforcement Learning	Xabier Olaz et.al.	2506.12366	null
2025-06-17	Sharp Tools: How Developers Wield Agentic AI in Real Software Engineering Tasks	Aayush Kumar et.al.	2506.12347	null
2025-06-14	SheetMind: An End-to-End LLM-Powered Multi-Agent Framework for Spreadsheet Automation	Ruiyan Zhu et.al.	2506.12339	link
2025-06-14	Artificial Intelligence in Team Dynamics: Who Gets Replaced and Why?	Xienan Cheng et.al.	2506.12337	null
2025-06-14	IndoorWorld: Integrating Physical Task Solving and Social Simulation in A Heterogeneous Multi-Agent Environment	Dekun Wu et.al.	2506.12331	null
2025-06-14	Similar Formation Control of Multi-Agent Systems over Directed Acyclic Graphs via Matrix-Weighted Laplacian	Zhipeng Fan et.al.	2506.12297	null
2025-06-13	Cloud Infrastructure Management in the Age of AI Agents	Zhenning Yang et.al.	2506.12270	null
2025-06-13	The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs	Avinash Baidya et.al.	2506.12266	null
2025-06-13	Reversing the Paradigm: Building AI-First Systems with Human Guidance	Cosimo Spera et.al.	2506.12245	null
2025-06-13	Privacy Reasoning in Ambiguous Contexts	Ren Yi et.al.	2506.12241	null
2025-06-13	A Fast, Reliable, and Secure Programming Language for LLM Agents with Code Actions	Stephen Mell et.al.	2506.12202	null
2025-06-13	PRO-V: An Efficient Program Generation Multi-Agent System for Automatic RTL Verification	Yujie Zhao et.al.	2506.12200	link
2025-06-13	OSI Stack Redesign for Quantum Networks: Requirements, Technologies, Challenges, and Future Directions	Shakil Ahmed et.al.	2506.12195	null
2025-06-13	Because we have LLMs, we Can and Should Pursue Agentic Interpretability	Been Kim et.al.	2506.12152	null
2025-06-13	Eliciting Reasoning in Language Models with Cognitive Tools	Brown Ebouky et.al.	2506.12115	null
2025-06-13	EconGym: A Scalable AI Testbed with Diverse Economic Tasks	Qirui Mi et.al.	2506.12110	null
2025-06-13	DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents	Hao Li et.al.	2506.12104	link
2025-06-12	"I Hadn't Thought About That": Creators of Human-like AI Weigh in on Ethics And Neurodivergence	Naba Rizvi et.al.	2506.12098	null
2025-06-12	DoublyAware: Dual Planning and Policy Awareness for Temporal Difference Learning in Humanoid Locomotion	Khang Nguyen et.al.	2506.12095	null
2025-06-12	Military AI Cyber Agents (MAICAs) Constitute a Global Threat to Critical Infrastructure	Timothy Dubber et.al.	2506.12094	null
2025-06-13	Affogato: Learning Open-Vocabulary Affordance Grounding with Automated Data Generation at Scale	Junha Lee et.al.	2506.12009	null
2025-06-13	Upgrade or Switch: Do We Need a New Registry Architecture for the Internet of AI Agents?	Ramesh Raskar et.al.	2506.12003	null
2025-06-13	Self-Regulating Cars: Automating Traffic Control in Free Flow Road Networks	Ankit Bhardwaj et.al.	2506.11973	null
2025-06-13	Visual Pre-Training on Unlabeled Images using Reinforcement Learning	Dibya Ghosh et.al.	2506.11967	null
2025-06-13	Automated Treatment Planning for Interstitial HDR Brachytherapy for Locally Advanced Cervical Cancer using Deep Reinforcement Learning	Mohammadamin Moradi et.al.	2506.11957	null
2025-06-13	Secure API-Driven Research Automation to Accelerate Scientific Discovery	Tyler J. Skluzacek et.al.	2506.11950	null
2025-06-13	Breaking Habits: On the Role of the Advantage Function in Learning Causal State Representations	Miguel Suau et.al.	2506.11912	null
2025-06-13	Palpation Alters Auditory Pain Expressions with Gender-Specific Variations in Robopatients	Chapa Sirithunge et.al.	2506.11906	null
2025-06-13	An Explainable AI Framework for Dynamic Resource Management in Vehicular Network Slicing	Haochen Sun et.al.	2506.11882	null
2025-06-13	Your Ride, Your Rules: Psychology and Cognition Enabled Automated Driving Systems	Zhipeng Bao et.al.	2506.11842	null
2025-06-13	Mean Field Games without Rational Expectations	Benjamin Moll et.al.	2506.11838	null
2025-06-13	The Space Between Us: A Methodological Framework for Researching Bonding and Proxemics in Situated Group-Agent Interactions	Ana Müller et.al.	2506.11829	null
2025-06-13	Revealing Political Bias in LLMs through Structured Multi-Agent Debate	Aishwarya Bandaru et.al.	2506.11825	link
2025-06-13	PE-MA: Parameter-Efficient Co-Evolution of Multi-Agent Systems	Yingfan Deng et.al.	2506.11803	null
2025-06-13	Solving Inverse Problems in Stochastic Self-Organising Systems through Invariant Representations	Elias Najarro et.al.	2506.11796	link
2025-06-13	ALEA IACTA EST: A Declarative Domain-Specific Language for Manually Performable Random Experiments	Baltasar Trancón y Widemann et.al.	2506.11794	null
2025-06-13	SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks	Hwiwon Lee et.al.	2506.11791	link
2025-06-16	AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home Environments	Zikang Leng et.al.	2506.11773	null
2025-06-13	Convergence to equilibrium for a class of exchange economies	R. S. MacKay et.al.	2506.11770	null
2025-06-13	DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents	Mingxuan Du et.al.	2506.11763	null
2025-06-13	Bias and Identifiability in the Bounded Confidence Model	Claudio Borile et.al.	2506.11751	null
2025-06-13	Interaction, Process, Infrastructure: A Unified Architecture for Human-Agent Collaboration	Yun Wang et.al.	2506.11718	null
2025-06-13	Generalised Rate Control Approach For Stream Processing Applications	Ziren Xiao et.al.	2506.11710	null
2025-06-13	Growing with Experience: Growing Neural Networks in Deep Reinforcement Learning	Lukas Fehring et.al.	2506.11706	null
2025-06-17	A Hybrid Multi-Agent Prompting Approach for Simplifying Complex Sentences	Pratibha Zunjare et.al.	2506.11681	null
2025-06-13	Robot Context Protocol (RCP): A Runtime-Agnostic Interface for Agent-Aware Robot Control	Lambert Lee et.al.	2506.11650	null
2025-06-13	High Probability Convergence of Distributed Clipped Stochastic Gradient Descent with Heavy-tailed Noise	Yuchen Yang et.al.	2506.11647	null
2025-06-13	LoRA-Gen: Specializing Large Language Model via Online LoRA Generation	Yicheng Xiao et.al.	2506.11638	null
2025-06-13	"If we misunderstand the client, we misspend 100 hours": Exploring conversational AI and response types for information elicitation	Daniel Hove Paludan et.al.	2506.11610	null
2025-06-13	Learn to Preserve Personality: Federated Foundation Models in Recommendations	Zhiwei Li et.al.	2506.11563	null
2025-06-13	AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction	Syeda Kisaa Fatima et.al.	2506.11475	null
2025-06-13	Linear-quadratic stochastic nonzero-sum differential games between graphon teams	De-xuan Xu et.al.	2506.11468	null
2025-06-13	Resolve Highway Conflict in Multi-Autonomous Vehicle Controls with Local State Attention	Xuan Duy Ta et.al.	2506.11445	null
2025-06-13	ReVeal: Self-Evolving Code Agents via Iterative Generation-Verification	Yiyang Jin et.al.	2506.11442	null
2025-06-13	Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards	Jeff Da et.al.	2506.11425	null
2025-06-13	FocalAD: Local Motion Planning for End-to-End Autonomous Driving	Bin Sun et.al.	2506.11419	null
2025-06-13	Complexity guarantees for risk-neutral generalized Nash equilibrium problems	Haochen Tao et.al.	2506.11409	null
2025-06-13	Large Language Model-Powered Conversational Agent Delivering Problem-Solving Therapy (PST) for Family Caregivers: Enhancing Empathy and Therapeutic Alliance Using In-Context Learning	Liying Wang et.al.	2506.11376	null
2025-06-12	From Replication to Redesign: Exploring Pairwise Comparisons for LLM-Based Peer Review	Yaohui Zhang et.al.	2506.11343	null
2025-06-12	A Hybrid Adaptive Nash Equilibrium Solver for Distributed Multi-Agent Systems with Game-Theoretic Jump Triggering	Qiuyu Miao et.al.	2506.11304	null
2025-06-12	TARDIS STRIDE: A Spatio-Temporal Road Image Dataset for Exploration and Autonomy	Héctor Carrión et.al.	2506.11302	link
2025-06-12	Shapley Machine: A Game-Theoretic Framework for N-Agent Ad Hoc Teamwork	Jianhong Wang et.al.	2506.11285	link
2025-06-12	Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation	Benjamin Elder et.al.	2506.11266	null
2025-06-12	Sensor Model Identification via Simultaneous Model Selection and State Variable Determination	Christian Brommer et.al.	2506.11263	null
2025-06-12	LLM-as-a-Judge for Reference-less Automatic Code Validation and Refinement for Natural Language to Bash in IT Automation	Ngoc Phuoc An Vo et.al.	2506.11237	null
2025-06-12	Beyond Formal Semantics for Capabilities and Skills: Model Context Protocol in Manufacturing	Luis Miguel Vieira da Silva et.al.	2506.11180	null
2025-06-12	Collapsing Sequence-Level Data-Policy Coverage via Poisoning Attack in Offline Reinforcement Learning	Xue Zhou et.al.	2506.11172	null
2025-06-11	ADAgent: LLM Agent for Alzheimer's Disease Analysis with Collaborative Coordinator	Wenlong Hou et.al.	2506.11150	null
2025-06-11	Autonomous Computer Vision Development with Agentic AI	Jin Kim et.al.	2506.11140	link
2025-06-10	GUIRoboTron-Speech: Towards Automated GUI Agents Based on Speech Instructions	Wenkang Han et.al.	2506.11127	null
2025-06-12	AutoMind: Adaptive Knowledgeable Agent for Automated Data Science	Yixin Ou et.al.	2506.10974	link
2025-06-12	Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop	Justin Kerr et.al.	2506.10968	null
2025-06-12	SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks	Lianghong Guo et.al.	2506.10954	link
2025-06-12	Build the web for agents, not agents for the web	Xing Han Lù et.al.	2506.10953	null
2025-06-14	Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors	Chen Yueh-Han et.al.	2506.10949	link
2025-06-12	Execution Guided Line-by-Line Code Generation	Boaz Lavon et.al.	2506.10948	link
2025-06-12	Dynamic Epistemic Friction in Dialogue	Timothy Obiso et.al.	2506.10934	null
2025-06-12	Agentic Semantic Control for Autonomous Wireless Space Networks: Extending Space-O-RAN with MCP-Driven Distributed Intelligence	Eduardo Baena et.al.	2506.10925	null
2025-06-12	Prediction and control of geometry-induced nematic order in growing multicellular systems	Lukas Hupe et.al.	2506.10867	null
2025-06-12	CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training	Alireza Salemi et.al.	2506.10844	link
2025-06-12	Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches	Andrea Moglia et.al.	2506.10825	null
2025-06-15	VideoDeepResearch: Long Video Understanding With Agentic Tool Using	Huaying Yuan et.al.	2506.10821	link
2025-06-13	Joint Beamforming with Extremely Large Scale RIS: A Sequential Multi-Agent A2C Approach	Zhi Chai et.al.	2506.10815	null
2025-06-12	OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems	Xiaozhe Li et.al.	2506.10764	link
2025-06-12	Integrating Large Language Models into Text Animation: An Intelligent Editing System with Inline and Chat Interaction	Bao Zhang et.al.	2506.10762	null
2025-06-12	Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding	Yuhang Zhang et.al.	2506.10756	null
2025-06-12	Neural at ArchEHR-QA 2025: Agentic Prompt Optimization for Evidence-Grounded Clinical Question Answering	Sai Prasanna Teja Reddy Bogireddy et.al.	2506.10751	null
2025-06-12	Cursed Equilibria and Knightian Uncertainty in a Trading Game	Jurek Preker et.al.	2506.10663	null
2025-06-12	SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis	Sergio Burdisso et.al.	2506.10622	link
2025-06-12	AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation	Haoyuan Shi et.al.	2506.10540	null
2025-06-12	Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models	Sangmin Song et.al.	2506.10504	null
2025-06-12	BugGen: A Self-Correcting Multi-Agent LLM Pipeline for Realistic RTL Bug Synthesis	Surya Jasper et.al.	2506.10501	null
2025-06-16	Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity Applications	Felix Härer et.al.	2506.10467	link
2025-06-12	Are We Generalizing from the Exception? An In-the-Wild Study on Group-Sensitive Conversation Design in Human-Agent Interactions	Ana Müller et.al.	2506.10462	null
2025-06-12	Equitable Mechanism Design for Facility Location	Toby Walsh et.al.	2506.10460	null
2025-06-12	Multi-dimensional Autoscaling of Processing Services: A Comparison of Agent-based Methods	Boris Sedlak et.al.	2506.10420	null
2025-06-12	Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges	Jintao Liang et.al.	2506.10408	null
2025-06-12	EQA-RM: A Generative Embodied Reward Model with Test-time Scaling	Yuhang Chen et.al.	2506.10389	null
2025-06-12	Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills	Yuquan Xie et.al.	2506.10387	null
2025-06-12	NeuroPAL: Punctuated Anytime Learning with Neuroevolution for Macromanagement in Starcraft: Brood War	Jim O'Connor et.al.	2506.10384	null
2025-06-12	Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts	Zaijing Li et.al.	2506.10357	null
2025-06-12	Provably Learning from Language Feedback	Wanqiao Xu et.al.	2506.10341	null
2025-06-12	Seeding an Uncertain Technology	Eric Gao et.al.	2506.10340	null
2025-06-13	A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon	Cameron Angliss et.al.	2506.10326	link
2025-06-12	Minimizing False Positives in Static Bug Detection via LLM-Enhanced Path Feasibility Analysis	Xueying Du et.al.	2506.10322	null
2025-06-12	WGSR-Bench: Wargame-based Game-theoretic Strategic Reasoning Benchmark for Large Language Models	Qiyue Yin et.al.	2506.10264	null
2025-06-12	Enhancing Ultrasound Molecular Imaging: Toward Real-Time RPCA-Based Filtering to Differentiate Bound and Free Microbubbles	Hoda S. Hashemi et.al.	2506.10257	null
2025-06-15	Extended Creativity: A Conceptual Framework for Understanding Human-AI Creative Relations	Andrea Gaggioli et.al.	2506.10249	null
2025-06-11	Towards Responsible AI: Advances in Safety, Fairness, and Accountability of Autonomous Systems	Filip Cano et.al.	2506.10192	null
2025-06-11	AURA: A Multi-Agent Intelligence Framework for Knowledge-Enhanced Cyber Threat Attribution	Nanda Rani et.al.	2506.10175	null
2025-06-11	A Navigation Framework Utilizing Vision-Language Models	Yicheng Duan et.al.	2506.10172	link
2025-06-14	Disclosure Audits for LLM Agents	Saswat Das et.al.	2506.10171	null
2025-06-11	Exploring EEG Responses during Observation of Actions Performed by Human Actor and Humanoid Robot	Anh T. Nguyen et.al.	2506.10170	null
2025-06-11	Rethinking Brain Tumor Segmentation from the Frequency Domain Perspective	Minye Shao et.al.	2506.10142	link
2025-06-11	Provable Sim-to-Real Transfer via Offline Domain Randomization	Arnaud Fickinger et.al.	2506.10133	null
2025-06-11	Chat-of-Thought: Collaborative Multi-Agent System for Generating Domain Specific Information	Christodoulos Constantinides et.al.	2506.10086	null
2025-06-11	Cybernetic Marionette: Channeling Collective Agency Through a Wearable Robot in a Live Dancer-Robot Duet	Anup Sathya et.al.	2506.10079	null
2025-06-11	A quantum semantic framework for natural language processing	Christopher J. Agostino et.al.	2506.10077	null
2025-06-11	Patient-Specific Deep Reinforcement Learning for Automatic Replanning in Head-and-Neck Cancer Proton Therapy	Malvern Madondo et.al.	2506.10073	null
2025-06-11	Cooling a Qubit using n Others	Jake Xuereb et.al.	2506.10059	link
2025-06-17	TaskCraft: Automated Generation of Agentic Tasks	Dingfeng Shi et.al.	2506.10055	link
2025-06-11	Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling	Tim Z. Xiao et.al.	2506.09998	null
2025-06-11	SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance	Wentao Ge et.al.	2506.09968	null
2025-06-11	The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability	Jiachen Hu et.al.	2506.09940	null
2025-06-11	On the Linear Programming Model for Dynamic Stochastic Matching and Its Application on Pricing	Junlin Chen et.al.	2506.09924	null
2025-06-11	PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants	Zheng Zhao et.al.	2506.09902	link
2025-06-11	"What are my options?": Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended)	Noel Brindise et.al.	2506.09901	null
2025-06-11	OctoNav: Towards Generalist Embodied Navigation	Chen Gao et.al.	2506.09839	null
2025-06-11	Automatic Treatment Planning using Reinforcement Learning for High-dose-rate Prostate Brachytherapy	Tonghe Wang et.al.	2506.09805	null
2025-06-11	Delegations as Adaptive Representation Patterns: Rethinking Influence in Liquid Democracy	Davide Grossi et.al.	2506.09789	null
2025-06-11	Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era	Shuo Jiang et.al.	2506.09755	null
2025-06-11	Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints	Xiangkai Zhang et.al.	2506.09748	null
2025-06-11	Feature Engineering for Agents: An Adaptive Cognitive Architecture for Interpretable ML Monitoring	Gusseppe Bravo-Rocca et.al.	2506.09742	null
2025-06-11	Patterns of Patterns III	Joseph Corneli et.al.	2506.09696	null
2025-06-11	Intent Factored Generation: Unleashing the Diversity in Your Language Model	Eltayeb Ahmed et.al.	2506.09659	null
2025-06-11	Application-Driven Value Alignment in Agentic AI Systems: Survey and Perspectives	Wei Zeng et.al.	2506.09656	null
2025-06-11	DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy	Kaixuan Xu et.al.	2506.09655	null
2025-06-11	Effective Red-Teaming of Policy-Adherent Agents	Itay Nakash et.al.	2506.09600	null
2025-06-11	VAULT: A Mobile Mapping System for ROS 2-based Autonomous Robots	Miguel Á. González-Santamarta et.al.	2506.09583	null
2025-06-11	MOORL: A Framework for Integrating Offline-Online Reinforcement Learning	Gaurav Chaudhary et.al.	2506.09574	null
2025-06-11	ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning	Yu Sun et.al.	2506.09513	link
2025-06-11	Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design	Andreas Schlaginhaufen et.al.	2506.09508	null
2025-06-11	A Unified Theory of Compositionality, Modularity, and Interpretability in Markov Decision Processes	Thomas J. Ringstrom et.al.	2506.09499	null
2025-06-11	Adv-BMT: Bidirectional Motion Transformer for Safety-Critical Traffic Scenario Generation	Yuxin Liu et.al.	2506.09485	null
2025-06-11	Optimizing Cooperative Multi-Object Tracking using Graph Signal Processing	Maria Damanaki et.al.	2506.09469	null
2025-06-11	Generalization Error Analysis for Attack-Free and Byzantine-Resilient Decentralized Learning with Data Heterogeneity	Haoxiang Ye et.al.	2506.09438	null
2025-06-11	When Is Diversity Rewarded in Cooperative Multi-Agent Learning?	Michael Amir et.al.	2506.09434	null
2025-06-11	A Call for Collaborative Intelligence: Why Human-Agent Systems Should Precede AI Autonomy	Henry Peng Zou et.al.	2506.09420	link
2025-06-11	Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models	Zongjie Li et.al.	2506.09396	null
2025-06-15	LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization	Jiaqi Tang et.al.	2506.09373	null
2025-06-11	ContextBuddy: AI-Enhanced Contextual Insights for Security Alert Investigation (Applied to Intrusion Detection)	Ronal Singh et.al.	2506.09365	null
2025-06-11	Intelligent System of Emergent Knowledge: A Coordination Fabric for Billions of Minds	Moshi Wei et.al.	2506.09335	null
2025-06-11	Multi-Agent Language Models: Advancing Cooperation, Coordination, and Adaptation	Arjun Vaithilingam Sudhakar et.al.	2506.09331	null
2025-06-10	UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench	Boxi Yu et.al.	2506.09289	link
2025-06-10	Improved Approximate EFX Guarantees for Multigraphs	Alireza Kaviani et.al.	2506.09288	null
2025-06-10	Learning The Minimum Action Distance	Lorenzo Steccanella et.al.	2506.09276	null
2025-06-10	Uncertainty Prioritized Experience Replay	Rodrigo Carrasco-Davis et.al.	2506.09270	null
2025-06-10	Agent-based Condition Monitoring Assistance with Multimodal Industrial Database Retrieval Augmented Generation	Karl Löwenmark et.al.	2506.09247	null
2025-06-10	Robust Noise Attenuation via Adaptive Pooling of Transformer Outputs	Greyson Brothers et.al.	2506.09215	null
2025-06-10	Optimal Task Offloading with Firm Deadlines for Mobile Edge Computing Systems	Khai Doan et.al.	2506.09180	null
2025-06-10	Robot-Gated Interactive Imitation Learning with Adaptive Intervention Mechanism	Haoyuan Cai et.al.	2506.09176	link
2025-06-10	MultiNet: An Open-Source Software Toolkit & Benchmark Suite for the Evaluation and Adaptation of Multimodal Action Models	Pranav Guruprasad et.al.	2506.09172	null
2025-06-10	Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search	Samuel Holt et.al.	2506.09171	null
2025-06-10	FAIRTOPIA: Envisioning Multi-Agent Guardianship for Disrupting Unfair AI Pipelines	Athena Vakali et.al.	2506.09107	null
2025-06-10	FinHEAR: Human Expertise and Adaptive Risk-Aware Temporal Reasoning for Financial Decision-Making	Jiaxiang Chen et.al.	2506.09080	null
2025-06-08	BG-HOP: A Bimanual Generative Hand-Object Prior	Sriram Krishna et.al.	2506.09068	link
2025-06-10	ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering	Yuki Imajuku et.al.	2506.09050	link
2025-06-10	VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning	Li Kang et.al.	2506.09049	null
2025-06-10	Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation	Xiaowen Ma et.al.	2506.09046	null
2025-06-10	The Decoupled Risk Landscape in Performative Prediction	Javier Sanguino et.al.	2506.09044	null
2025-06-10	Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Scheduling System	Yuan Guo et.al.	2506.08972	null
2025-06-10	Towards Robust Deep Reinforcement Learning against Environmental State Perturbation	Chenxu Wang et.al.	2506.08961	null
2025-06-10	What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities	Wendong Bu et.al.	2506.08933	null
2025-06-10	Enhancing generalizability of model discovery across parameter space with multi-experiment equation learning (ME-EQL)	Maria-Veronica Ciocanel et.al.	2506.08916	link
2025-06-10	Intention-Conditioned Flow Occupancy Models	Chongyi Zheng et.al.	2506.08902	link
2025-06-10	Pairwise similarity method for majority domination problem	N. I. Shushko et.al.	2506.08886	null
2025-06-10	Deploying SICNav in the Field: Safe and Interactive Crowd Navigation using MPC and Bilevel Optimization	Sepehr Samavi et.al.	2506.08851	null
2025-06-10	Agile Reinforcement Learning for Real-Time Task Scheduling in Edge Computing	Amin Avan et.al.	2506.08850	link
2025-06-11	Design Patterns for Securing LLM Agents against Prompt Injections	Luca Beurer-Kellner et.al.	2506.08837	null
2025-06-10	Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents	Irene Testini et.al.	2506.08800	null
2025-06-10	Improved LLM Agents for Financial Document Question Answering	Nelvin Tan et.al.	2506.08726	null
2025-06-10	PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly	Liang Ma et.al.	2506.08708	null
2025-06-10	Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs	Šimon Sedláček et.al.	2506.08633	null
2025-06-10	Modular Recurrence in Contextual MDPs for Universal Morphology Control	Laurens Engwegen et.al.	2506.08630	null
2025-06-10	Geometric Hyperscanning under Active Inference	Nicolas Hinrichs et.al.	2506.08599	null
2025-06-10	HGFormer: A Hierarchical Graph Transformer Framework for Two-Stage Colonel Blotto Games via Reinforcement Learning	Yang Lv et.al.	2506.08580	null
2025-06-10	Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations	Yibo Cui et.al.	2506.08566	null
2025-06-10	FEDTAIL: Federated Long-Tailed Domain Generalization with Sharpness-Guided Gradient Matching	Sunny Gupta et.al.	2506.08518	null
2025-06-12	MasHost Builds It All: Autonomous Multi-Agent System Directed by Reinforcement Learning	Kuo Yang et.al.	2506.08507	null
2025-06-10	Learning to Lead: Incentivizing Strategic Agents in the Dark	Yuchen Wu et.al.	2506.08438	null
2025-06-10	Attention-based Learning for 3D Informative Path Planning	Rui Zhao et.al.	2506.08434	null
2025-06-12	CAF-I: A Collaborative Multi-Agent Framework for Enhanced Irony Detection with Large Language Models	Ziqi. Liu et.al.	2506.08430	null
2025-06-10	Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy	Utkarsh Pratiush et.al.	2506.08423	link
2025-06-11	TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration	Weiya Li et.al.	2506.08403	link
2025-06-10	Reinforce LLM Reasoning through Multi-Agent Reflection	Yurun Yuan et.al.	2506.08379	null
2025-06-10	Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models	Wentao Shi et.al.	2506.08352	link
2025-06-11	Your Agent Can Defend Itself against Backdoor Attacks	Li Changjiang et.al.	2506.08336	null
2025-06-10	ORFS-agent: Tool-Using Agents for Chip Design Optimization	Amur Ghose et.al.	2506.08332	null
2025-06-10	Understanding Software Engineering Agents Through the Lens of Traceability: An Empirical Study	Ira Ceka et.al.	2506.08311	null
2025-06-11	HiBerNAC: Hierarchical Brain-emulated Robotic Neural Agent Collective for Disentangling Complex Manipulation	Hongjun Wu et.al.	2506.08296	null
2025-06-09	From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?	Zhanke Zhou et.al.	2506.08295	link
2025-06-09	From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium	Xie Yi et.al.	2506.08292	link
2025-06-09	Scaling Laws of Motion Forecasting and Planning -- A Technical Report	Mustafa Baniodeh et.al.	2506.08228	null
2025-06-09	Interpreting Agent Behaviors in Reinforcement-Learning-Based Cyber-Battle Simulation Platforms	Jared Claypoole et.al.	2506.08192	null
2025-06-09	Anomaly, Class Division, and Decoupling in Wealth Dynamics	Jaeseok Hur et.al.	2506.08175	null
2025-06-09	Ego-centric Learning of Communicative World Models for Autonomous Driving	Hang Wang et.al.	2506.08149	null
2025-06-09	EconWebArena: Benchmarking Autonomous Agents on Economic Tasks in Realistic Web Environments	Zefang Liu et.al.	2506.08136	null
2025-06-09	SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents	Subhrangshu Nandi et.al.	2506.08119	null
2025-06-09	Cognitive Weave: Synthesizing Abstracted Knowledge with a Spatio-Temporal Resonance Graph	Akash Vishwakarma et.al.	2506.08098	link
2025-06-09	Towards AI-assisted Neutrino Flavor Theory Design	Jason Benjamin Baretz et.al.	2506.08080	link
2025-06-08	UAVs Meet Agentic AI: A Multidomain Survey of Autonomous Aerial Intelligence and Agentic UAVs	Ranjan Sapkota et.al.	2506.08045	null
2025-06-09	GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior	Penghao Wu et.al.	2506.08012	null
2025-06-09	Dreamland: Controllable World Creation with Simulator and Generative Models	Sicheng Mo et.al.	2506.08006	null
2025-06-09	Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System	Fan Yang et.al.	2506.07997	null
2025-06-09	$τ^2$ -Bench: Evaluating Conversational Agents in a Dual-Control Environment	Victor Barres et.al.	2506.07982	link
2025-06-09	Realistic Urban Traffic Generator using Decentralized Federated Learning for the SUMO simulator	Alberto Bazán-Guillén et.al.	2506.07980	null
2025-06-10	Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction	Junhong Shen et.al.	2506.07976	link
2025-06-09	HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization	Hongzheng Chen et.al.	2506.07972	link
2025-06-09	Diffusion of Responsibility in Collective Decision Making	Pavel Naumov et.al.	2506.07935	null
2025-06-09	LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement	Dimitris Panagopoulos et.al.	2506.07915	null
2025-06-09	A distributed motion planning approach to cooperative underwater acoustic source tracking and pursuit	Andrea Tiranti et.al.	2506.07877	null
2025-06-09	Simulating nationwide coupled disease and fear spread in an agent-based model	Joy Kitson et.al.	2506.07842	null
2025-06-09	Control strategies and trends to equilibrium for kinetic models of opinion dynamics driven by social activity	Andrea Bondesan et.al.	2506.07840	null
2025-06-09	Decentralizing Multi-Agent Reinforcement Learning with Temporal Causal Information	Jan Corazza et.al.	2506.07829	null
2025-06-11	A Proposal to Extend the Common Model of Cognition with Metacognition	John Laird et.al.	2506.07807	null
2025-06-13	Agent Semantics, Semantic Spacetime, and Graphical Reasoning	Mark Burgess et.al.	2506.07756	null
2025-06-09	Deep Equivariant Multi-Agent Control Barrier Functions	Nikolaos Bousias et.al.	2506.07755	null
2025-06-09	Delay Optimization in Remote ID-Based UAV Communication via BLE and Wi-Fi Switching	Yian Zhu et.al.	2506.07715	null
2025-06-09	QUITE: A Query Rewrite System Beyond Rules with LLM Agents	Yuyang Song et.al.	2506.07675	null
2025-06-09	MCPWorld: A Unified Benchmarking Testbed for API, GUI, and Hybrid Computer Use Agents	Yunhe Yan et.al.	2506.07672	null
2025-06-09	SWE-Dev: Building Software Engineering Agents with Training and Inference Scaling	Haoran Wang et.al.	2506.07636	null
2025-06-09	Blending Participatory Design and Artificial Awareness for Trustworthy Autonomous Vehicles	Ana Tanevska et.al.	2506.07633	null
2025-06-09	MalGEN: A Generative Agent Framework for Modeling Malicious Software in Cybersecurity	Bikash Saha et.al.	2506.07586	null
2025-06-09	Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models	Ramakrishna Appicharla et.al.	2506.07583	null
2025-06-11	SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems	Peiran Li et.al.	2506.07564	null
2025-06-12	CheMatAgent: Enhancing LLMs for Chemistry and Materials Science through Tree-Search Based Tool Learning	Mengsong Wu et.al.	2506.07551	link
2025-06-09	Curriculum Learning With Counterfactual Group Relative Policy Advantage For Multi-Agent Reinforcement Learning	Weiqiang Jin et.al.	2506.07548	link
2025-06-09	Fractional Collisions: A Framework for Risk Estimation of Counterfactual Conflicts using Autonomous Driving Behavior Simulations	Sreeja Roy-Singh et.al.	2506.07540	null
2025-06-09	Coordinating Search-Informed Reasoning and Reasoning-Guided Search in Claim Verification	Qisheng Hu et.al.	2506.07528	null
2025-06-09	IntenTest: Stress Testing for Intent Integrity in API-Calling LLM Agents	Shiwei Feng et.al.	2506.07524	null
2025-06-09	Taking Flight with Dialogue: Enabling Natural Language Control for PX4-based Drone Agent	Shoon Kit Lim et.al.	2506.07509	link
2025-06-09	Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models	Mickel Liu et.al.	2506.07468	link
2025-06-09	Efficient Generation of Diverse Cooperative Agents with World Models	Yi Loo et.al.	2506.07450	null
2025-06-09	Generate Realistic Test Scenes for V2X Communication Systems	An Guo et.al.	2506.07419	null
2025-06-11	MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models	Philip R. Liu et.al.	2506.07400	link
2025-06-09	G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems	Guibin Zhang et.al.	2506.07398	link
2025-06-09	From Static to Adaptive Defense: Federated Multi-Agent Deep Reinforcement Learning-Driven Moving Target Defense Against DoS Attacks in UAV Swarm Networks	Yuyang Zhou et.al.	2506.07392	link
2025-06-09	Shapley-Coop: Credit Assignment for Emergent Cooperation in Self-Interested LLM Agents	Yun Hua et.al.	2506.07388	null
2025-06-09	Extended Version of "Distributed Adaptive Resilient Consensus Control for Uncertain Nonlinear Multiagent Systems Against Deception Attacks"	Mengze Yu et.al.	2506.07374	null
2025-06-09	Decentralized Optimization on Compact Submanifolds by Quantized Riemannian Gradient Tracking	Jun Chen et.al.	2506.07351	null
2025-06-09	MapBERT: Bitwise Masked Modeling for Real-Time Semantic Mapping Generation	Yijie Deng et.al.	2506.07350	null
2025-06-09	Distributed Risk-Sensitive Safety Filters for Uncertain Discrete-Time Systems	Armin Lederer et.al.	2506.07347	null
2025-06-09	Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation	Yijie Deng et.al.	2506.07338	null
2025-06-09	Digital Twin-based Smart Manufacturing: Dynamic Line Reconfiguration for Disturbance Handling	Bo Fu et.al.	2506.07332	null
2025-06-08	SCGAgent: Recreating the Benefits of Reasoning Models for Secure Code Generation with Agentic Workflows	Rebecca Saul et.al.	2506.07313	null
2025-06-08	Multi-Step Guided Diffusion for Image Restoration on Edge Devices: Toward Lightweight Perception in Embodied AI	Aditya Chakravarty et.al.	2506.07286	null
2025-06-08	Secondary Stakeholders in AI: Fighting for, Brokering, and Navigating Agency	Leah Hope Ajmani et.al.	2506.07281	null
2025-06-08	A Cramér-von Mises Approach to Incentivizing Truthful Data Sharing	Alex Clinton et.al.	2506.07272	null
2025-06-08	Question Answering under Temporal Conflict: Evaluating and Organizing Evolving Knowledge with LLMs	Atahan Özer et.al.	2506.07270	null
2025-06-08	Learn as Individuals, Evolve as a Team: Multi-agent LLMs Adaptation in Embodied Environments	Xinran Li et.al.	2506.07232	null
2025-06-08	LLM-Enhanced Rapid-Reflex Async-Reflect Embodied Agent for Real-Time Decision-Making in Dynamically Changing Environments	Yangqing Zheng et.al.	2506.07223	null
2025-06-08	BIMgent: Towards Autonomous Building Modeling via Computer-use Agents	Zihan Deng et.al.	2506.07217	null
2025-06-08	Adaptive Consensus with Exponential Decay	Woocheol Choi et.al.	2506.07203	null
2025-06-08	Efficient RL-based Cache Vulnerability Exploration by Penalizing Useless Agent Actions	Kanato Nakanishi et.al.	2506.07200	null
2025-06-08	Exploring Effective Strategies for Building a Customised GPT Agent for Coding Classroom Dialogues	Luwei Bai et.al.	2506.07194	null
2025-06-08	Value-Set Iteration: Computing Optimal Correlated Equilibria in Infinite-Horizon Multi-Player Stochastic Games	Jiarui Gan et.al.	2506.07186	null
2025-06-12	Delegation with Costly Inspection	Mohammad T. Hajiaghayi et.al.	2506.07162	null
2025-06-08	Mind the Web: The Security of Web Use Agents	Avishag Shapira et.al.	2506.07153	null
2025-06-08	BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite	Liyang Chen et.al.	2506.07116	null
2025-06-08	Theorem-of-Thought: A Multi-Agent Framework for Abductive, Deductive, and Inductive Reasoning in Language Models	Samir Abdaljalil et.al.	2506.07106	null
2025-06-08	Decentralized Optimization with Amplified Privacy via Efficient Communication	Wei Huo et.al.	2506.07102	null
2025-06-08	On the Generalization of Data-Assisted Control in port-Hamiltonian Systems (DAC-pH)	Mostafa Eslami et.al.	2506.07079	null
2025-06-08	A Layered Self-Supervised Knowledge Distillation Framework for Efficient Multimodal Learning on the Edge	Tarique Dahri et.al.	2506.07055	null
2025-06-08	QForce-RL: Quantized FPGA-Optimized Reinforcement Learning Compute Engine	Anushka Jha et.al.	2506.07046	null
2025-06-08	Accelerating Two-Dimensional Materials Research via a Universal Interatomic Potential and Large Language Model Agent	Haidi Wang et.al.	2506.07043	null
2025-06-08	MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks	Sanjoy Chowdhury et.al.	2506.07016	null
2025-06-08	Deep RL Needs Deep Behavior Analysis: Exploring Implicit Planning by Model-Free Agents in Open-Ended Environments	Riley Simmons-Edler et.al.	2506.06981	null
2025-06-08	Near Optimal Non-asymptotic Sample Complexity of 1-Identification	Zitian Li et.al.	2506.06978	null
2025-06-08	Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning	Subhojyoti Mukherjee et.al.	2506.06964	null
2025-06-08	Deontically Constrained Policy Improvement in Reinforcement Learning Agents	Alena Makarova et.al.	2506.06959	null
2025-06-08	Position: Simulating Society Requires Simulating Thought	Chance Jiajie Li et.al.	2506.06958	null
2025-06-07	An Agentic Framework for Autonomous Metamaterial Modeling and Inverse Design	Darui Lu et.al.	2506.06935	null
2025-06-07	Boosting LLM Reasoning via Spontaneous Self-Correction	Xutong Zhao et.al.	2506.06923	null
2025-06-07	Multimodal Spatial Language Maps for Robot Navigation and Manipulation	Chenguang Huang et.al.	2506.06862	null
2025-06-07	DONUT: A Decoder-Only Model for Trajectory Prediction	Markus Knoche et.al.	2506.06854	null
2025-06-07	United Minds or Isolated Agents? Exploring Coordination of LLMs under Cognitive Load Theory	HaoYang Shang et.al.	2506.06843	null
2025-06-07	AI-Generated Compromises for Coalition Formation	Eyal Briman et.al.	2506.06837	null
2025-06-07	Is Optimal Transport Necessary for Inverse Reinforcement Learning?	Zixuan Dong et.al.	2506.06793	null
2025-06-07	Learning What Matters Now: A Dual-Critic Context-Aware RL Framework for Priority-Driven Information Gain	Dimitris Panagopoulos et.al.	2506.06786	null
2025-06-07	AI PsyRoom: Artificial Intelligence Platform for Segmented Yearning and Reactive Outcome Optimization Method	Yigui Feng et.al.	2506.06740	null
2025-06-07	WorldLLM: Improving LLMs' world modeling using curiosity-driven theory-making	Guillaume Levy et.al.	2506.06725	null
2025-06-07	Contextual Experience Replay for Self-Improvement of Language Agents	Yitao Liu et.al.	2506.06698	null
2025-06-07	Self-Adapting Improvement Loops for Robotic Learning	Calvin Luo et.al.	2506.06658	null
2025-06-07	Active Test-time Vision-Language Navigation	Heeju Ko et.al.	2506.06630	null
2025-06-06	AI Simulation by Digital Twins: Systematic Survey, Reference Framework, and Mapping to a Standardized Architecture	Xiaoran Liu et.al.	2506.06580	null
2025-06-11	Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce	Yijia Shao et.al.	2506.06576	null
2025-06-12	The Optimization Paradox in Clinical AI Multi-Agent Systems	Suhana Bedi et.al.	2506.06574	link
2025-06-06	Enhancing Robot Safety via MLLM-Based Semantic Interpretation of Failure Data	Aryaman Gupta et.al.	2506.06570	null
2025-06-06	Adapting Under Fire: Multi-Agent Reinforcement Learning for Adversarial Drift in Network Security	Emilia Rivas et.al.	2506.06565	null
2025-06-06	KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes	Eugenie Lai et.al.	2506.06541	link
2025-06-06	ScriptDoctor: Automatic Generation of PuzzleScript Games via Large Language Models and Tree Search	Sam Earle et.al.	2506.06524	null
2025-06-06	Improving LLM-Powered EDA Assistants with RAFT	Luyao Shi et.al.	2506.06500	null
2025-06-06	Fake Friends and Sponsored Ads: The Risks of Advertising in Conversational Search	Jacob Erickson et.al.	2506.06447	null
2025-06-06	Improving choice model specification using reinforcement learning	Gabriel Nova et.al.	2506.06410	null
2025-06-04	CPS-Guard: Framework for Dependability Assurance of AI- and LLM-Based Cyber-Physical Systems	Trisanth Srinivasan et.al.	2506.06381	null
2025-06-06	PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time	Weizhi Zhang et.al.	2506.06254	null
2025-06-06	Longer Lists Yield Better Matchings	Yuri Faenza et.al.	2506.06217	null
2025-06-06	Can Theoretical Physics Research Benefit from Language Agents?	Sirui Lu et.al.	2506.06214	null
2025-06-06	A Theoretical Study of (Hyper) Self-Attention through the Lens of Interactions: Representation, Training, Generalization	Muhammed Ustaomeroglu et.al.	2506.06179	null
2025-06-06	Does It Run and Is That Enough? Revisiting Text-to-Chart Generation with a Multi-Agent Approach	James Ford et.al.	2506.06175	null
2025-06-06	The Lock-in Hypothesis: Stagnation by Algorithm	Tianyi Alex Qiu et.al.	2506.06166	null
2025-06-06	(AI peers) are people learning from the same standpoint: Perception of AI characters in a Collaborative Science Investigation	Eunhye Grace Ko et.al.	2506.06165	null
2025-06-06	Personalized Large Language Models Can Increase the Belief Accuracy of Social Networks	Adiba Mahbub Proma et.al.	2506.06153	null
2025-06-06	CCLSTM: Coupled Convolutional Long-Short Term Memory Network for Occupancy Flow Forecasting	Peter Lengyel et.al.	2506.06128	null
2025-06-06	Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library	Weixun Wang et.al.	2506.06122	null
2025-06-06	VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning	Zikang Wang et.al.	2506.06097	null
2025-06-06	On-board Mission Replanning for Adaptive Cooperative Multi-Robot Systems	Elim Kwan et.al.	2506.06094	null
2025-06-06	Self driving algorithm for an active four wheel drive racecar	Gergely Bari et.al.	2506.06077	null
2025-06-06	Conversational Interfaces for Parametric Conceptual Architectural Design: Integrating Mixed Reality with LLM-driven Interaction	Ruochen Ji et.al.	2506.06066	null
2025-06-06	Modeling human reputation-seeking behavior in a spatio-temporally complex public good provision game	Edward Hughes et.al.	2506.06032	null
2025-06-06	When to Trust Context: Self-Reflective Debates for Context Reliability	Zeqi Zhou et.al.	2506.06020	null
2025-06-06	AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search	Yu Li et.al.	2506.06017	null
2025-06-06	Propose or Vote: A simple Democratic Procedure	Hans Gersbach et.al.	2506.05998	null
2025-06-06	Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning	Yuheng Lei et.al.	2506.05985	link
2025-06-06	MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks	Zonglin Wu et.al.	2506.05982	link
2025-06-10	CrimeMind: Simulating Urban Crime with Multi-Modal LLM Agents	Qingbin Zeng et.al.	2506.05981	null
2025-06-06	Quantum Checkers: The Development and Analysis of a Quantum Combinatorial Game	Marien Raat et.al.	2506.05962	null
2025-06-06	Learning Deterministic Policies with Policy Gradients in Constrained Markov Decision Processes	Alessandro Montenegro et.al.	2506.05953	null
2025-06-06	Policy Optimization for Continuous-time Linear-Quadratic Graphon Mean Field Games	Philipp Plank et.al.	2506.05894	null
2025-06-06	CodeContests+: High-Quality Test Case Generation for Competitive Programming	Zihan Wang et.al.	2506.05817	null
2025-06-06	MAPLE: Multi-Agent Adaptive Planning with Long-Term Memory for Table Reasoning	Ye Bai et.al.	2506.05813	null
2025-06-06	Trajectory Entropy: Modeling Game State Stability from Multimodality Trajectory Prediction	Yesheng Zhang et.al.	2506.05810	null
2025-06-06	To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt	Zhilong Wang et.al.	2506.05739	null
2025-06-06	Hybrid Stabilization Protocol for Cross-Chain Digital Assets Using Adaptor Signatures and AI-Driven Arbitrage	Shengwei You et.al.	2506.05708	null
2025-06-06	Multi-Project Contracts	Tal Alon et.al.	2506.05705	null
2025-06-06	Action-Adaptive Continual Learning: Enabling Policy Generalization under Dynamic Action Spaces	Chaofan Pan et.al.	2506.05702	null
2025-06-06	Ordering-disordering dynamics of the voter model under random external bias	Roni Muslim et.al.	2506.05669	null
2025-06-06	A Modular Haptic Display with Reconfigurable Signals for Personalized Information Transfer	Antonio Alvarez Valdivia et.al.	2506.05648	null
2025-06-06	Diffusive Spreading Across Dynamic Mitochondrial Network Architectures	Keaton B. Holt et.al.	2506.05643	null
2025-06-09	Toward Greater Autonomy in Materials Discovery Agents: Unifying Planning, Physics, and Scientists	Lianhao Zhou et.al.	2506.05616	null
2025-06-05	Beating the Logarithmic Barrier for the Subadditive Maximin Share Problem	Masoud Seddighin et.al.	2506.05613	null
2025-06-05	OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation	Ziyi Wang et.al.	2506.05606	null
2025-06-05	Stochastic maximum principle for optimal control problem of non exchangeable mean field systems	Idris Kharroubi et.al.	2506.05595	null
2025-06-05	Collaborative Learning in Agentic Systems: A Collective AI is Greater Than the Sum of Its Parts	Saptarshi Nath et.al.	2506.05577	link
2025-06-05	Applying Informer for Option Pricing: A Transformer-Based Approach	Feliks Bańka et.al.	2506.05565	null
2025-06-05	Improving LLMs with a knowledge from databases	Petr Máša et.al.	2506.05560	null
2025-06-05	Agentomics-ML: Autonomous Machine Learning Experimentation Agent for Genomic and Transcriptomic Data	Vlastimil Martinek et.al.	2506.05542	link
2025-06-05	SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms	Arnesh Batra et.al.	2506.05538	link
2025-06-05	Quantum circuits as a game: A reinforcement learning agent for quantum compilation and its application to reconfigurable neutral atom arrays	Kouhei Nakaji et.al.	2506.05536	null
2025-06-05	Avoiding Death through Fear Intrinsic Conditioning	Rodney Sanchez et.al.	2506.05529	null
2025-06-05	Sequence Modeling for N-Agent Ad Hoc Teamwork	Caroline Wang et.al.	2506.05527	null
2025-06-05	Towards Data Systems That Are Business Semantic-Centric and AI Agents-Assisted	Cecil Pang et.al.	2506.05520	null
2025-06-05	Speech Neurophysiology in Realistic Contexts: Big Hype or Big Leap?	Giovanni M. Di Liberto et.al.	2506.05494	null
2025-06-05	A MARL-based Approach for Easing MAS Organization Engineering	Julien Soulé et.al.	2506.05437	null
2025-06-05	Robustness Evaluation for Video Models with Reinforcement Learning	Ashwin Ramesh Babu et.al.	2506.05431	null
2025-06-05	Mixture-of-Experts Meets In-Context Reinforcement Learning	Wenhao Wu et.al.	2506.05426	null
2025-06-05	Constructive Symbolic Reinforcement Learning via Intuitionistic Logic and Goal-Chaining Inference	Andrei T. Patrascu et.al.	2506.05422	null
2025-06-03	Rational Superautotrophic Diplomacy (SupraAD); A Conceptual Framework for Alignment Based on Interdisciplinary Findings on the Fundamentals of Cognition	Andrea Morris et.al.	2506.05389	null
2025-06-05	Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games	Niv Eckhaus et.al.	2506.05309	link
2025-06-05	ProRefine: Inference-time Prompt Refinement with Textual Feedback	Deepak Pandita et.al.	2506.05305	null
2025-06-05	Control Tax: The Price of Keeping AI in Check	Mikhail Terekhov et.al.	2506.05296	null
2025-06-05	A Smooth Sea Never Made a Skilled $\texttt{SAILOR}$ : Robust Imitation via Learning to Search	Arnav Kumar Jain et.al.	2506.05294	link
2025-06-05	Tight analyses of first-order methods with error feedback	Daniel Berg Thomsen et.al.	2506.05271	link
2025-06-06	Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams	Mohammed Almutairi et.al.	2506.05265	null
2025-06-05	Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning	Dravyansh Sharma et.al.	2506.05252	null
2025-06-05	Towards Language-Augmented Multi-Agent Deep Reinforcement Learning	Maxime Toquebiau et.al.	2506.05236	null
2025-06-05	A Framework for Ethical Judgment of Smart City Applications	Weichen Shi et.al.	2506.05172	null
2025-06-05	An emergence-oriented approach to cyclic pursuit	Zhaozhan Yao et.al.	2506.05157	null
2025-06-05	Truly Self-Improving Agents Require Intrinsic Metacognitive Learning	Tennison Liu et.al.	2506.05109	null
2025-06-05	LLM-Guided Scenario-based GUI Testing	Shengcheng Yu et.al.	2506.05079	null
2025-06-05	Hierarchical Language Models for Semantic Navigation and Manipulation in an Aerial-Ground Robotic System	Haokun Liu et.al.	2506.05020	null
2025-06-05	ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development	Zhenran Xu et.al.	2506.05010	link
2025-06-05	QiMeng: Fully Automated Hardware and Software Design for Processor Chip	Rui Zhang et.al.	2506.05007	null
2025-06-05	Agentic AI for Intent-Based Industrial Automation	Marcos Lima Romero et.al.	2506.04980	link
2025-06-05	Optimization for Semantic-Aware Resource Allocation under CPT-based Utilities	Symeon Vaidanis et.al.	2506.04952	null
2025-06-05	Goal-Oriented Semantic Resource Allocation with Cumulative Prospect Theoretic Agents	Symeon Vaidanis et.al.	2506.04947	null
2025-06-05	No Trade Under Verifiable Information	Spyros Galanis et.al.	2506.04944	null
2025-06-05	Energentic Intelligence: From Self-Sustaining Systems to Enduring Artificial Life	Atahan Karagoz et.al.	2506.04916	null
2025-06-05	Efficient Path Planning and Task Allocation Algorithm for Boolean Specifications	Ioana Hustiu et.al.	2506.04881	link
2025-06-05	LLMs for sensory-motor control: Combining in-context and iterative learning	Jônata Tyska Carvalho et.al.	2506.04867	link
2025-06-05	Towards a Multi-Agent Simulation of Cyber-attackers and Cyber-defenders Battles	Julien Soulé et.al.	2506.04849	null
2025-06-05	Oversight Structures for Agentic AI in Public-Sector Organizations	Chris Schmitz et.al.	2506.04836	null
2025-06-05	Safe Planning and Policy Optimization via World Model Learning	Artem Latyshev et.al.	2506.04828	null
2025-06-05	Distributionally Robust Auction Design with Deferred Inspection	Halil I. Bayrak et.al.	2506.04767	null
2025-06-05	SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMs	Shuhan Xu et.al.	2506.04743	null
2025-06-05	Empowering Economic Simulation for Massively Multiplayer Online Games through Generative Agent-Based Modeling	Bihan Xu et.al.	2506.04699	null
2025-06-05	Gen-n-Val: Agentic Image Data Generation and Validation	Jing-En Huang et.al.	2506.04676	null
2025-06-05	E-bike agents: Large Language Model-Driven E-Bike Accident Analysis and Severity Prediction	Zhichao Yang et.al.	2506.04654	null
2025-06-05	Agents of Change: Self-Evolving LLM Agents for Strategic Planning	Nikolas Belle et.al.	2506.04651	null
2025-06-05	Flex-TravelPlanner: A Benchmark for Flexible Planning with Language Agents	Juhyun Oh et.al.	2506.04649	link
2025-06-05	CHANCERY: Evaluating corporate governance reasoning capabilities in language models	Lucas Irwin et.al.	2506.04636	null
2025-06-05	Composing Agents to Minimize Worst-case Risk	Guruprerana Shabadi et.al.	2506.04632	null
2025-06-05	Enhancing Efficiency and Propulsion in Bio-mimetic Robotic Fish through End-to-End Deep Reinforcement Learning	Xinyu Cui et.al.	2506.04627	null
2025-06-05	Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning	Haochen Zhang et.al.	2506.04626	null
2025-06-05	Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning	Zhiyuan Ma et.al.	2506.04625	null
2025-06-05	Subjective Perspectives within Learned Representations Predict High-Impact Innovation	Likun Cao et.al.	2506.04616	null
2025-06-05	SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents	Alexander Huang-Menders et.al.	2506.04606	null
2025-06-05	Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning	Ziqi Jia et.al.	2506.04595	null
2025-06-05	Demonstrations of Integrity Attacks in Multi-Agent Systems	Can Zheng et.al.	2506.04572	null
2025-06-05	OpenAg: Democratizing Agricultural Intelligence	Srikanth Thudumu et.al.	2506.04571	null
2025-06-05	From Standalone LLMs to Integrated Intelligence: A Survey of Compound Al Systems	Jiayi Chen et.al.	2506.04565	null
2025-06-04	SGN-CIRL: Scene Graph-based Navigation with Curriculum, Imitation, and Reinforcement Learning	Nikita Oskolkov et.al.	2506.04505	null
2025-06-04	CogMath: Assessing LLMs' Authentic Mathematical Ability from a Human Cognitive Perspective	Jiayu Liu et.al.	2506.04481	null
2025-06-04	MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale	Ran Xu et.al.	2506.04405	null
2025-06-04	Unsupervised Meta-Testing with Conditional Neural Processes for Hybrid Meta-Reinforcement Learning	Suzan Ece Ada et.al.	2506.04399	null
2025-06-04	Building a Few-Shot Cross-Domain Multilingual NLU Model for Customer Care	Saurabh Kumar et.al.	2506.04389	null
2025-06-04	Replay Can Provably Increase Forgetting	Yasaman Mahdaviyeh et.al.	2506.04377	null
2025-06-04	WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning	Delong Chen et.al.	2506.04363	null
2025-06-04	The Cost of Dynamic Reasoning: Demystifying AI Agents and Test-Time Scaling from an AI Infrastructure Perspective	Jiin Kim et.al.	2506.04301	null
2025-06-04	AUTOCT: Automating Interpretable Clinical Trial Prediction with LLM Agents	Fengze Liu et.al.	2506.04293	null
2025-06-04	Automated Skill Discovery for Language Agents through Exploration and Iterative Feedback	Yongjin Yang et.al.	2506.04287	null
2025-06-04	Autonomous Collaborative Scheduling of Time-dependent UAVs, Workers and Vehicles for Crowdsensing in Disaster Response	Lei Han et.al.	2506.04276	null
2025-06-03	CORA: Coalitional Rational Advantage Decomposition for Multi-Agent Policy Gradients	Mengda Ji et.al.	2506.04265	null
2025-06-04	OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis	Junting Chen et.al.	2506.04217	link
2025-06-04	Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs	Alex DeWeese et.al.	2506.04215	null
2025-06-06	TracLLM: A Generic Framework for Attributing Long Context LLMs	Yanting Wang et.al.	2506.04202	link
2025-06-04	MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures	Elena Zamaraeva et.al.	2506.04195	null
2025-06-04	SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models	Yuhao Wu et.al.	2506.04180	null
2025-06-04	A primal-dual price-optimization method for computing equilibrium prices in mean-field games models	Xu Wang et.al.	2506.04169	link
2025-06-04	Image Editing As Programs with Diffusion Models	Yujia Hu et.al.	2506.04158	null
2025-06-05	macOSWorld: A Multilingual Interactive Benchmark for GUI Agents	Pei Yang et.al.	2506.04135	link
2025-06-04	TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems	Shaina Raza et.al.	2506.04133	null
2025-06-04	CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues	Disha Sheshanarayana et.al.	2506.04131	null
2025-06-04	TextAtari: 100K Frames Game Playing with Language Agents	Wenhao Li et.al.	2506.04098	link
2025-06-04	AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment	Anastasiia Ivanova et.al.	2506.04089	link
2025-06-04	Optimal Transport-based Domain Alignment as a Preprocessing Step for Federated Learning	Luiz Manella Pereira et.al.	2506.04071	null
2025-06-04	AI Agents for Conversational Patient Triage: Preliminary Simulation-Based Evaluation with Real-World EHR Data	Sina Rashidian et.al.	2506.04032	null
2025-06-04	AgentMisalignment: Measuring the Propensity for Misaligned Behaviour in LLM-Based Agents	Akshat Naik et.al.	2506.04018	null
2025-06-04	Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning	Junqi Gao et.al.	2506.03939	link
2025-06-04	HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models	Zhaolu Kang et.al.	2506.03922	link
2025-06-04	Causal Explanations Over Time: Articulated Reasoning for Interactive Environments	Sebastian Rödling et.al.	2506.03915	null
2025-06-04	Jet-Feedback on kpc scales: a review	Dipanjan Mukherjee et.al.	2506.03888	null
2025-06-04	PulseReddit: A Novel Reddit Dataset for Benchmarking MAS in High-Frequency Cryptocurrency Trading	Qiuhan Han et.al.	2506.03861	null
2025-06-04	AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance	Dhaval Patel et.al.	2506.03828	link
2025-06-04	Learning Equilibria in Matching Games with Bandit Feedback	Andreas Athanasopoulos et.al.	2506.03802	null
2025-06-04	From Theory to Practice: Real-World Use Cases on Trustworthy LLM-Driven Process Modeling, Prediction and Automation	Peter Pfeiffer et.al.	2506.03801	null
2025-06-04	Misalignment or misuse? The AGI alignment tradeoff	Max Hellrigel-Holderbaum et.al.	2506.03755	null
2025-06-04	A Retrieval-Augmented Multi-Agent Framework for Psychiatry Diagnosis	Mengxi Xiao et.al.	2506.03750	link
2025-06-04	AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives	Aniruddh Sikdar et.al.	2506.03709	null
2025-06-04	Stability Notions for Hospital Residents with Sizes	Haricharan Balasundaram et.al.	2506.03638	null
2025-06-04	Training Cross-Morphology Embodied AI Agents: From Practical Challenges to Theoretical Foundations	Shaoshan Liu et.al.	2506.03613	link
2025-06-04	Orak: A Foundational Benchmark for Training and Evaluating LLM Agents on Diverse Video Games	Dongmin Park et.al.	2506.03610	null
2025-06-08	Beamforming and Resource Allocation for Delay Optimization in RIS-Assisted OFDM Systems	Yu Ma et.al.	2506.03586	null
2025-06-05	Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving	Li Zeqiao et.al.	2506.03568	link
2025-06-04	From Virtual Agents to Robot Teams: A Multi-Robot Framework Evaluation in High-Stakes Healthcare Context	Yuanchen Bai et.al.	2506.03546	null
2025-06-04	CogniPair: From LLM Chatbots to Conscious AI Agents -- GNWT-Based Multi-Agent Digital Twins for Social Pairing -- Dating & Hiring Applications	Wanghao Ye et.al.	2506.03543	null
2025-06-04	Debate, Reflect, and Distill: Multi-Agent Feedback with Tree-Structured Preference Optimization for Efficient Language Model Enhancement	Xiaofeng Zhou et.al.	2506.03541	null
2025-06-04	Go-Browse: Training Web Agents with Structured Exploration	Apurva Gandhi et.al.	2506.03533	null
2025-06-04	GA-S $^3$ : Comprehensive Social Network Simulation with Group Agents	Yunyao Zhang et.al.	2506.03532	link
2025-06-04	How Far Are We from Predicting Missing Modalities with Foundation Models?	Guanzhou Ke et.al.	2506.03530	link
2025-06-04	Correlated equilibrium implementation: Navigating toward social optima with learning dynamics	Soumen Banerjee et.al.	2506.03528	null
2025-06-04	Path Generation and Evaluation in Video Games: A Nonparametric Statistical Approach	Daniel Campa et.al.	2506.03522	null
2025-06-04	VChatter: Exploring Generative Conversational Agents for Simulating Exposure Therapy to Reduce Social Anxiety	Han Zhang et.al.	2506.03520	null
2025-06-04	SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models	Arnab Debnath et.al.	2506.03516	null
2025-06-04	Computational Architects of Society: Quantum Machine Learning for Social Rule Genesis	Shan Shan et.al.	2506.03503	null
2025-06-04	CORE: Constraint-Aware One-Step Reinforcement Learning for Simulation-Guided Neural Network Accelerator Design	Yifeng Xiao et.al.	2506.03474	null
2025-06-03	The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks	Walter Mayor et.al.	2506.03404	null
2025-06-03	Impact of Rankings and Personalized Recommendations in Marketplaces	Omar Besbes et.al.	2506.03369	null
2025-06-03	A Differential Perspective on Distributional Reinforcement Learning	Juan Sebastian Rojas et.al.	2506.03333	null
2025-06-03	Helpful Agent Meets Deceptive Judge: Understanding Vulnerabilities in Agentic Workflows	Yifei Ming et.al.	2506.03332	null
2025-06-03	The Future of Continual Learning in the Era of Foundation Models: Three Key Directions	Jack Bell et.al.	2506.03320	null
2025-06-03	FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes	Christodoulos Constantinides et.al.	2506.03278	link
2025-06-03	NetPress: Dynamically Generated LLM Benchmarks for Network Applications	Yajie Zhou et.al.	2506.03231	link
2025-06-03	Multiple-Frequencies Population-Based Training	Waël Doulazmi et.al.	2506.03225	null
2025-06-02	Q-ARDNS-Multi: A Multi-Agent Quantum Reinforcement Learning Framework with Meta-Cognitive Adaptation for Complex 3D Environments	Umberto Gonçalves de Sousa et.al.	2506.03205	null
2025-06-03	GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents	Qianhui Wu et.al.	2506.03143	null
2025-06-03	Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning	Yinjie Wang et.al.	2506.03136	link
2025-06-03	Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff	Sophie Greenwood et.al.	2506.03102	null
2025-06-03	EgoVLM: Policy Optimization for Egocentric Video Understanding	Ashwin Vinod et.al.	2506.03097	link
2025-06-03	DPO Learning with LLMs-Judge Signal for Computer Use Agents	Man Luo et.al.	2506.03095	null
2025-06-03	Provable Reinforcement Learning from Human Feedback with an Unknown Link Function	Qining Zhang et.al.	2506.03066	null
2025-06-03	MAEBE: Multi-Agent Emergent Behavior Framework	Sinem Erisken et.al.	2506.03053	null
2025-06-03	EDEN: Entorhinal Driven Egocentric Navigation Toward Robotic Deployment	Mikolaj Walczak et.al.	2506.03046	null
2025-06-06	Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective	Jintian Shao et.al.	2506.03038	null
2025-06-03	TestAgent: An Adaptive and Intelligent Expert for Human Assessment	Junhao Yu et.al.	2506.03032	null
2025-06-03	Coding Agents with Multimodal Browsing are Generalist Problem Solvers	Aditya Bharat Soni et.al.	2506.03011	null
2025-06-03	DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models	Jiarui Wang et.al.	2506.03007	null
2025-06-03	A Multi-Agent Framework for Mitigating Dialect Biases in Privacy Policy Question-Answering Systems	Đorđe Klisura et.al.	2506.02998	null
2025-06-03	Mapping Student-AI Interaction Dynamics in Multi-Agent Learning Environments: Supporting Personalised Learning and Reducing Performance Gaps	Zhanxin Hao et.al.	2506.02993	null
2025-06-03	Mitigating Manipulation and Enhancing Persuasion: A Reflective Multi-Agent Approach for Legal Argument Generation	Li Zhang et.al.	2506.02992	null
2025-06-03	Adaptive Graph Pruning for Multi-Agent Communication	Boyi Li et.al.	2506.02951	null
2025-06-03	Abstract Counterfactuals for Language Model Agents	Edoardo Pona et.al.	2506.02946	null
2025-06-08	Hallucination to Consensus: Multi-Agent LLMs for End-to-End Test Generation with Accurate Oracles	Qinghua Xu et.al.	2506.02943	null
2025-06-03	ThinkTank: A Framework for Generalizing Domain-Specific AI Agent Systems into Universal Collaborative Intelligence Platforms	Praneet Sai Madhu Surabhi et.al.	2506.02931	link
2025-06-03	Large Processor Chip Model	Kaiyan Chang et.al.	2506.02929	null
2025-06-03	The Limits of Predicting Agents from Behaviour	Alexis Bellot et.al.	2506.02923	null
2025-06-03	Text-guided Generation of Efficient Personalized Inspection Plans	Xingpeng Sun et.al.	2506.02917	null
2025-06-03	A Continual Offline Reinforcement Learning Benchmark for Navigation Tasks	Anthony Kobanda et.al.	2506.02883	null
2025-06-03	It's the Thought that Counts: Evaluating the Attempts of Frontier LLMs to Persuade on Harmful Topics	Matthew Kowal et.al.	2506.02873	null
2025-06-03	Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights	Mathieu Andreux et.al.	2506.02865	null
2025-06-03	CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech	Helin Wang et.al.	2506.02863	null
2025-06-03	ATAG: AI-Agent Application Threat Assessment with Attack Graphs	Parth Atulbhai Gandhi et.al.	2506.02859	null
2025-06-03	Ensemble-MIX: Enhancing Sample Efficiency in Multi-Agent RL Using Ensemble Methods	Tom Danino et.al.	2506.02841	null
2025-06-03	On dual-rate consensus under transmission delays	David Umsonst et.al.	2506.02840	null
2025-06-03	DeepShop: A Benchmark for Deep Research Shopping Agents	Yougang Lyu et.al.	2506.02839	null
2025-06-03	TaxAgent: How Large Language Model Designs Fiscal Policy	Jizhou Wang et.al.	2506.02838	null
2025-06-03	Solving the Pod Repositioning Problem with Deep Reinforced Adaptive Large Neighborhood Search	Lin Xie et.al.	2506.02746	null
2025-06-03	Why do AI agents communicate in human language?	Pengcheng Zhou et.al.	2506.02739	null
2025-06-03	Benchmarking and Advancing Large Language Models for Local Life Services	Xiaochong Lan et.al.	2506.02720	null
2025-06-03	Heterogeneous Group-Based Reinforcement Learning for LLM-based Multi-Agent Systems	Guanzhong Chen et.al.	2506.02718	null
2025-06-04	MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching	Liang Yue et.al.	2506.02689	null
2025-06-03	Decompose, Plan in Parallel, and Merge: A Novel Paradigm for Large Language Models based Planning with Multiple Constraints	Zhengdong Lu et.al.	2506.02683	null
2025-06-03	Bounded confidence dynamics generates opinion cascades on a growing scale-free network	David Hernandez et.al.	2506.02669	null
2025-06-03	FAuNO: Semi-Asynchronous Federated Reinforcement Learning Framework for Task Offloading in Edge Systems	Frederico Metelo et.al.	2506.02668	null
2025-06-04	Non-exchangeable evolutionary and mean field games and their applications	H. Yoshioka et.al.	2506.02644	null
2025-06-03	Compositional Learning for Modular Multi-Agent Self-Organizing Networks	Qi Liao et.al.	2506.02616	null
2025-06-04	Multi Layered Autonomy and AI Ecologies in Robotic Art Installations	Baoyang Chen et.al.	2506.02606	null
2025-06-03	Computational adversarial risk analysis for general security games	Jose Manuel Camacho et.al.	2506.02603	null
2025-06-03	A Hybrid Approach to Indoor Social Navigation: Integrating Reactive Local Planning and Proactive Global Planning	Arnab Debnath et.al.	2506.02593	null
2025-06-03	CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at Scale	Zhun Wang et.al.	2506.02548	link
2025-06-03	Attention Knows Whom to Trust: Attention-based Trust Management for LLM Multi-Agent Systems	Pengfei He et.al.	2506.02546	null
2025-06-03	VerificAgent: Integrating Expert Knowledge and Fact-Checked Memory for Robust Domain-Specific Task Planning	Thong Q. Nguyen et.al.	2506.02539	null
2025-06-03	Think Twice, Act Once: A Co-Evolution Framework of LLM and RL for Large-Scale Decision Making	Xu Wan et.al.	2506.02522	null
2025-06-03	To Embody or Not: The Effect Of Embodiment On User Perception Of LLM-based Conversational Agents	Kyra Wang et.al.	2506.02514	link
2025-06-03	AURA: Agentic Upskilling via Reinforced Abstractions	Alvin Zhu et.al.	2506.02507	null
2025-06-03	VPI-Bench: Visual Prompt Injection Attacks for Computer-Use Agents	Tri Cao et.al.	2506.02456	link
2025-06-03	Multimodal DeepResearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework	Zhaorui Yang et.al.	2506.02454	null
2025-06-03	From Anger to Joy: How Nationality Personas Shape Emotion Attribution in Large Language Models	Mahammed Kamruzzaman et.al.	2506.02431	null
2025-06-04	Comparative Analysis of AI Agent Architectures for Entity Relationship Classification	Maryam Berijanian et.al.	2506.02426	link
2025-06-03	VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments	Zelai Xu et.al.	2506.02387	null
2025-06-03	Multi-agent Markov Entanglement	Shuze Chen et.al.	2506.02385	null
2025-06-03	Evaluating LLM Agent Adherence to Hierarchical Safety Principles: A Lightweight Benchmark for Probing Foundational Controllability Components	Ram Potham et.al.	2506.02357	null
2025-06-03	DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization	Jeonghun Kang et.al.	2506.02351	null
2025-06-02	LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback	Thai Hoang et.al.	2506.02298	null
2025-06-02	Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction	Samuel Li et.al.	2506.02265	null
2025-06-02	Composable Building Blocks for Controllable and Transparent Interactive AI Systems	Sebe Vanbrabant et.al.	2506.02262	null
2025-06-02	Stochastically Dominant Peer Prediction	Yichi Zhang et.al.	2506.02259	null
2025-06-02	Optimal Coordination of Flexible DERs in Local Energy and Flexibility Markets to Ensure Social Equity	Niloofar Pourghaderi et.al.	2506.02179	null
2025-06-02	Reflection-Based Memory For Web navigation Agents	Ruhana Azam et.al.	2506.02158	null
2025-06-02	Small Language Models are the Future of Agentic AI	Peter Belcak et.al.	2506.02153	null
2025-06-04	The Unified Cognitive Consciousness Theory for Language Models: Anchoring Semantics, Thresholds of Activation, and Emergent Reasoning	Edward Y. Chang et.al.	2506.02139	null
2025-06-02	Descriptive History Representations: Learning Representations by Answering Questions	Guy Tennenholtz et.al.	2506.02125	null
2025-06-02	Enhancing Interpretability of Quantum-Assisted Blockchain Clustering via AI Agent-Based Qualitative Analysis	Yun-Cheng Tsai et.al.	2506.02068	null
2025-06-01	The Measurement Imbalance in Agentic AI Evaluation Undermines Industry Productivity Claims	Kiana Jafari Meimandi et.al.	2506.02064	null
2025-06-01	Will Agents Replace Us? Perceptions of Autonomous Multi-Agent AI	Nikola Balic et.al.	2506.02055	link
2025-06-01	Phenotypic Profile-Informed Generation of Drug-Like Molecules via Dual-Channel Variational Autoencoders	Hui Liu et.al.	2506.02051	null
2025-06-01	Decoupled Hierarchical Reinforcement Learning with State Abstraction for Discrete Grids	Qingyu Xiao et.al.	2506.02050	null
2025-06-01	EvoGit: Decentralized Code Evolution via Git-Based Multi-Agent Collaboration	Beichen Huang et.al.	2506.02049	link
2025-06-01	Improving LLM Agents with Reinforcement Learning on Cryptographic CTF Challenges	Lajos Muzsai et.al.	2506.02048	null
2025-05-31	Beyond the Protocol: Unveiling Attack Vectors in the Model Context Protocol Ecosystem	Hao Song et.al.	2506.02040	link
2025-06-02	WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks	Atsuyuki Miyai et.al.	2506.01952	null
2025-06-02	Should Decision-Makers Reveal Classifiers in Online Strategic Classification?	Han Shao et.al.	2506.01936	null
2025-06-02	Online Competitive Information Gathering for Partially Observable Trajectory Games	Mel Krusniak et.al.	2506.01927	null
2025-06-02	COALESCE: Economic and Security Dynamics of Skill-Based Task Outsourcing Among Team of Autonomous LLM Agents	Manish Bhatt et.al.	2506.01900	null
2025-06-02	WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue	Yaoyao Qian et.al.	2506.01881	link
2025-06-02	Pearl: Automatic Code Optimization Using Deep Reinforcement Learning	Djamel Rassem Lamouri et.al.	2506.01880	null
2025-06-02	CONFETTI: Conversational Function-Calling Evaluation Through Turn-Level Interactions	Tamer Alkhouli et.al.	2506.01859	null
2025-06-02	Beyond Static Responses: Multi-Agent LLM Systems as a New Paradigm for Social Science Research	Jennifer Haase et.al.	2506.01839	null
2025-06-02	The Ultimate Test of Superintelligent AI Agents: Can an AI Balance Care and Control in Asymmetric Relationships?	Djallel Bouneffouf et.al.	2506.01813	null
2025-06-02	A Study on the MCP x A2A Framework for Enhancing Interoperability of LLM-based Autonomous Agents	Cheonsu Jeong et.al.	2506.01804	null
2025-06-02	Enhancing Customer Service Chatbots with Context-Aware NLU through Selective Attention and Multi-task Learning	Subhadip Nandi et.al.	2506.01781	null
2025-06-02	Thinking in Character: Advancing Role-Playing Agents with Role-Aware Reasoning	Yihong Tang et.al.	2506.01748	null
2025-06-02	Self-Challenging Language Model Agents	Yifei Zhou et.al.	2506.01716	null
2025-06-02	A Descriptive and Normative Theory of Human Beliefs in RLHF	Sylee Dandekar et.al.	2506.01692	null
2025-06-02	Geometry Meets Incentives: Sample-Efficient Incentivized Exploration with Linear Contexts	Benjamin Schiffer et.al.	2506.01685	null
2025-06-02	A Hierarchical Bin Packing Framework with Dual Manipulators via Heuristic Search and Deep Reinforcement Learning	Beomjoon Lee et.al.	2506.01628	null
2025-06-02	Social Cooperation in Conversational AI Agents	Mustafa Mert Çelikok et.al.	2506.01624	null
2025-06-02	MAGIK: Mapping to Analogous Goals via Imagination-enabled Knowledge Transfer	Ajsal Shereef Palattuparambil et.al.	2506.01623	null
2025-06-02	General agents need world models	Jonathan Richens et.al.	2506.01622	null
2025-06-02	MLA-Trust: Benchmarking Trustworthiness of Multimodal LLM Agents in GUI Environments	Xiao Yang et.al.	2506.01616	null
2025-06-02	Trajectory First: A Curriculum for Discovering Diverse Policies	Cornelius V. Braun et.al.	2506.01568	null
2025-06-02	EvolveNav: Self-Improving Embodied Reasoning for LLM-Based Vision-Language Navigation	Bingqian Lin et.al.	2506.01551	null
2025-06-03	LAMARL: LLM-Aided Multi-Agent Reinforcement Learning for Cooperative Policy Generation	Guobin Zhu et.al.	2506.01538	null
2025-06-03	Quantum Agents	Eldar Sultanow et.al.	2506.01536	null
2025-06-03	STORM-BORN: A Challenging Mathematical Derivations Dataset Curated via a Human-in-the-Loop Multi-Agent Framework	Wenhao Liu et.al.	2506.01531	link
2025-06-02	FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents	Bobo Li et.al.	2506.01520	null
2025-06-02	PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization	Zouying Cao et.al.	2506.01475	null
2025-06-02	Agentic AI and Multiagentic: Are We Reinventing the Wheel?	V. Botti et.al.	2506.01463	null
2025-06-02	Agentic Episodic Control	Xidong Yang et.al.	2506.01442	null
2025-06-02	Distinguishing Autonomous AI Agents from Collaborative Agentic Systems: A Comprehensive Framework for Understanding Modern Intelligent Architectures	Prashik Buddhaghosh Bansod et.al.	2506.01438	null
2025-06-02	FinRobot: Generative Business Process AI Agents for Enterprise Resource Planning in Finance	Hongyang Yang et.al.	2506.01423	null
2025-06-02	SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation	Rafael Flor-Rodríguez et.al.	2506.01418	link
2025-06-02	Sparse Imagination for Efficient Visual World Model Planning	Junha Chun et.al.	2506.01392	null
2025-06-02	AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning	Zhong Zhang et.al.	2506.01391	link
2025-06-02	Follow the Flow: Fine-grained Flowchart Attribution with Neurosymbolic Agents	Manan Suri et.al.	2506.01344	null
2025-06-02	Enhancing Interpretable Image Classification Through LLM Agents and Conditional Concept Bottleneck Models	Yiwen Jiang et.al.	2506.01334	null
2025-06-02	An Empirical Study of Group Conformity in Multi-Agent Systems	Min Choi et.al.	2506.01332	null
2025-06-02	ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding	Yiyang Zhou et.al.	2506.01300	null
2025-06-02	RAISE: Reasoning Agent for Interactive SQL Exploration	Fernando Granado et.al.	2506.01273	null
2025-06-02	CleanS2S: Single-file Framework for Proactive Speech-to-Speech Interaction	Yudong Lu et.al.	2506.01268	null
2025-06-02	Comprehensive Vulnerability Analysis is Necessary for Trustworthy LLM-MAS	Pengfei He et.al.	2506.01245	null
2025-06-01	A mean field game model with non-local spatial interactions and resources accumulation	Daria Ghilli et.al.	2506.01200	null
2025-06-04	Test Automation for Interactive Scenarios via Promptable Traffic Simulation	Augusto Mondelli et.al.	2506.01199	null
2025-06-01	Near-feasible Fair Allocations in Two-sided Markets	Javier Cembrano et.al.	2506.01178	null
2025-06-01	GraphPad: Inference-Time 3D Scene Graph Updates for Embodied Question Answering	Muhammad Qasim Ali et.al.	2506.01174	null
2025-06-01	Towards Fusion of Neural Audio Codec-based Representations with Spectral for Heart Murmur Classification via Bandit-based Cross-Attention Mechanism	Orchid Chetia Phukan et.al.	2506.01148	null
2025-06-01	DeepVerse: 4D Autoregressive Video Generation as a World Model	Junyi Chen et.al.	2506.01103	null
2025-06-01	Modular Speaker Architecture: A Framework for Sustaining Responsibility and Contextual Integrity in Multi-Agent AI Communication	Khe-Han Toh et.al.	2506.01095	null
2025-06-01	The Coming Crisis of Multi-Agent Misalignment: AI Alignment Must Be a Dynamic and Social Process	Florian Carichon et.al.	2506.01080	null
2025-06-01	SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models	Thinh Pham et.al.	2506.01062	null
2025-06-04	MCP-Zero: Proactive Toolchain Construction for LLM Agents from Scratch	Xiang Fei et.al.	2506.01056	null
2025-06-01	Simple Prompt Injection Attacks Can Leak Personal Data Observed by LLM Agents During Task Execution	Meysam Alizadeh et.al.	2506.01055	null
2025-06-01	Robust and Safe Multi-Agent Reinforcement Learning Framework with Communication for Autonomous Vehicles	Keshawn Smith et.al.	2506.00982	null
2025-06-01	HMPC-assisted Adversarial Inverse Reinforcement Learning for Smart Home Energy Management	Jiadong He et.al.	2506.00898	null
2025-06-01	Toward a Theory of Agents as Tool-Use Decision-Makers	Hongru Wang et.al.	2506.00886	null
2025-06-01	CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching	Leying Zhang et.al.	2506.00885	null
2025-06-01	Can AI Master Econometrics? Evidence from Econometrics AI Agent on Expert-Level Tasks	Qiang Chen et.al.	2506.00856	null
2025-06-01	Federated Deep Reinforcement Learning-Driven O-RAN for Automatic Multirobot Reconfiguration	Faisal Ahmed et.al.	2506.00822	null
2025-06-01	Action Dependency Graphs for Globally Optimal Coordinated Reinforcement Learning	Jianglin Ding et.al.	2506.00797	null
2025-06-01	Predicting Empirical AI Research Outcomes with Language Models	Jiaxin Wen et.al.	2506.00794	null
2025-06-01	CO-OPERA: A Human-AI Collaborative Playwriting Tool to Support Creative Storytelling for Interdisciplinary Drama Education	Xuejiao Ma et.al.	2506.00791	link
2025-06-01	CoP: Agentic Red-teaming for Large Language Models using Composition of Principles	Chen Xiong et.al.	2506.00781	null
2025-05-31	Alignment Revisited: Are Large Language Models Consistent in Stated and Revealed Preferences?	Zhuojun Gu et.al.	2506.00751	null
2025-05-31	DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments	Chiyu Zhang et.al.	2506.00739	link
2025-05-31	Adaptive Plane Reformatting for 4D Flow MRI using Deep Reinforcement Learning	Javier Bisbal et.al.	2506.00727	null
2025-05-31	Browser Fingerprinting Using WebAssembly	Mordechai Guri et.al.	2506.00719	null
2025-05-31	An LLM Agent for Functional Bug Detection in Network Protocols	Mingwei Zheng et.al.	2506.00714	link
2025-05-31	Adaptive Traffic-Following Scheme for Orderly Distributed Control of Multi-Vehicle Systems	Anahita Jain et.al.	2506.00703	null
2025-06-04	Optimizing Sensory Neurons: Nonlinear Attention Mechanisms for Accelerated Convergence in Permutation-Invariant Neural Networks for Reinforcement Learning	Junaid Muzaffar et.al.	2506.00691	null
2025-05-31	AgentAuditor: Human-Level Safety and Security Evaluation for LLM Agents	Hanjun Luo et.al.	2506.00641	null
2025-05-31	Social Construction of Urban Space: Understanding Neighborhood Boundaries Using Rental Listings	Adam Visokay et.al.	2506.00634	null
2025-05-31	The Disparate Effects of Partial Information in Bayesian Strategic Learning	Srikanth Avasarala et.al.	2506.00627	null
2025-06-04	RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents	Jingyi Yang et.al.	2506.00618	null
2025-05-31	PAKTON: A Multi-Agent Framework for Question Answering in Long Legal Agreements	Petros Raptopoulos et.al.	2506.00608	link
2025-05-31	Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn	Hongyao Tang et.al.	2506.00592	null
2025-05-31	Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs	Yufa Zhou et.al.	2506.00577	link
2025-05-31	ORAN-GUIDE: RAG-Driven Prompt Learning for LLM-Augmented Reinforcement Learning in O-RAN Network Slicing	Fatemeh Lotfi et.al.	2506.00576	null
2025-05-31	Prompt-Tuned LLM-Augmented DRL for Dynamic O-RAN Network Slicing	Fatemeh Lotfi et.al.	2506.00574	null
2025-05-31	MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning	Peng Xia et.al.	2506.00555	null
2025-05-31	Two-Sided Manipulation Games in Stable Matching Markets	Hadi Hosseini et.al.	2506.00554	null
2025-05-31	AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation	Ming Wang et.al.	2506.00551	link
2025-05-31	Towards Multi-dimensional Evaluation of LLM Summarization across Domains and Languages	Hyangsuk Min et.al.	2506.00549	null
2025-05-31	Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline	Zhaoying Wang et.al.	2506.00546	null
2025-06-04	ARIA: Training Language Agents with Intention-Driven Reward Aggregation	Ruihan Yang et.al.	2506.00539	null
2025-05-31	Temac: Multi-Agent Collaboration for Automated Web GUI Testing	Chenxu Liu et.al.	2506.00520	null
2025-05-31	Goal-Aware Identification and Rectification of Misinformation in Multi-Agent Systems	Zherui Li et.al.	2506.00509	null
2025-05-31	Reinforcement Learning for Hanabi	Nina Cohen et.al.	2506.00458	null
2025-05-31	RLAE: Reinforcement Learning-Assisted Ensemble for LLMs	Yuqian Fu et.al.	2506.00439	null
2025-05-31	Enabling Chatbots with Eyes and Ears: An Immersive Multimodal Conversation System for Dynamic Interactions	Jihyoung Jang et.al.	2506.00421	null
2025-05-31	World Models for Cognitive Agents: Transforming Edge Intelligence in Future Networks	Changyuan Zhao et.al.	2506.00417	null
2025-05-31	LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks	Yi Yang et.al.	2506.00411	null
2025-05-31	Sensor Fusion Methods for Gaussian Mixture Models	Ishan Paranjape et.al.	2506.00383	null
2025-05-31	Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents	Xiao Yu et.al.	2506.00320	null
2025-05-30	Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model	Oliver Mortensen et.al.	2506.00286	null
2025-05-30	MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility	Yexiao He et.al.	2506.00235	null
2025-05-30	Sorrel: A simple and flexible framework for multi-agent reinforcement learning	Rebekah A. Gelpí et.al.	2506.00228	link
2025-05-30	REIC: RAG-Enhanced Intent Classification at Scale	Ziji Zhang et.al.	2506.00210	null
2025-05-30	When GPT Spills the Tea: Comprehensive Assessment of Knowledge File Leakage in GPTs	Xinyue Shen et.al.	2506.00197	null
2025-05-30	Breakpoint: Scalable evaluation of system-level reasoning in LLM code agents	Kaivalya Hariharan et.al.	2506.00172	null
2025-06-03	A novel sensitivity analysis method for agent-based models stratifies in-silico tumor spheroid simulations	Edward H. Rohr et.al.	2506.00168	null
2025-05-30	Werewolf: A Straightforward Game Framework with TTS for Improved User Engagement	Qihui Fan et.al.	2506.00160	null
2025-05-30	MRDust: Wireless Implant Data Uplink & Localization via Magnetic Resonance Image Modulation	Biqi Rebekah Zhao et.al.	2506.00143	null
2025-05-30	Autonomous Behavior and Whole-Brain Dynamics Emerge in Embodied Zebrafish Agents with Model-based Intrinsic Motivation	Reece Keller et.al.	2506.00138	null
2025-05-30	A Reinforcement Learning-Based Telematic Routing Protocol for the Internet of Underwater Things	Mohammadhossein Homaei et.al.	2506.00133	null
2025-05-30	Adapting Offline Reinforcement Learning with Online Delays	Simon Sinong Zhan et.al.	2506.00131	null
2025-05-30	Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents	Yaxin Luo et.al.	2505.24878	link
2025-05-30	Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks	Tajamul Ashraf et.al.	2505.24876	link
2025-05-30	VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software	Brandon Man et.al.	2505.24838	link
2025-05-30	Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation	Yucheng Zhou et.al.	2505.24787	link
2025-06-02	EXP-Bench: Can AI Conduct AI Research Experiments?	Patrick Tser Jern Kon et.al.	2505.24785	link
2025-05-30	Emergent Dynamics of Active Systems on Curved Environments	Euan D. Mackay et.al.	2505.24730	null
2025-05-30	CoRet: Improved Retriever for Code Editing	Fabio Fehr et.al.	2505.24715	null
2025-05-30	Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting	Wei Chen et.al.	2505.24710	link
2025-05-30	Towards a unified user modeling language for engineering human centered AI systems	Aaron Conrardy et.al.	2505.24697	null
2025-05-30	Multiple LLM Agents Debate for Equitable Cultural Alignment	Dayeon Ki et.al.	2505.24671	link
2025-05-30	Black-box Adversarial Attacks on CNN-based SLAM Algorithms	Maria Rafaela Gkeka et.al.	2505.24654	null
2025-05-30	Online Budget-Feasible Mechanism Design with Predictions	Georgios Amanatidis et.al.	2505.24624	null
2025-05-30	Distributed Intelligence in the Computing Continuum with Active Inference	Victor Casamayor Pujol et.al.	2505.24618	null
2025-05-30	When Harry Meets Superman: The Role of The Interlocutor in Persona-Based Dialogue Generation	Daniela Occhipinti et.al.	2505.24613	null
2025-06-02	AutoChemSchematic AI: A Closed-Loop, Physics-Aware Agentic Framework for Auto-Generating Chemical Process and Instrumentation Diagrams	Sakhinana Sagar Srinivas et.al.	2505.24584	null
2025-05-30	NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization	Hyuntak Kim et.al.	2505.24575	null
2025-05-30	CREFT: Sequential Multi-Agent LLM for Character Relation Extraction	Ye Eun Chun et.al.	2505.24553	null
2025-05-30	Melding the Serverless Control Plane with the Conventional Cluster Manager for Speed and Compatibility	Leonid Kondrashov et.al.	2505.24551	null
2025-05-30	Online Fair Division with Additional Information	Tzeh Yuan Neoh et.al.	2505.24503	null
2025-05-30	RMoA: Optimizing Mixture-of-Agents through Diversity Maximization and Residual Compensation	Zhentao Xie et.al.	2505.24442	link
2025-05-30	P: A Universal Measure of Predictive Intelligence	David Gamez et.al.	2505.24426	null
2025-05-30	Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer	Yilun Kong et.al.	2505.24378	link
2025-05-30	Unifying Language Agent Algorithms with Graph-based Orchestration Engine for Reproducible Agent Research	Qianqian Zhang et.al.	2505.24354	null
2025-05-30	Context-Aware Sentiment Forecasting via LLM-based Multi-Perspective Role-Playing Agents	Fanhang Man et.al.	2505.24331	link
2025-05-30	Online Fair Allocations with Binary Valuations and Beyond	Yuanyuan Wang et.al.	2505.24321	null
2025-05-30	ROAD: Responsibility-Oriented Reward Design for Reinforcement Learning in Autonomous Driving	Yongming Chen et.al.	2505.24317	null
2025-05-30	R3DM: Enabling Role Discovery and Diversity Through Dynamics Models in Multi-agent Reinforcement Learning	Harsh Goel et.al.	2505.24265	link
2025-05-30	Effects of Theory of Mind and Prosocial Beliefs on Steering Human-Aligned Behaviors of LLMs in Ultimatum Games	Neemesh Yadav et.al.	2505.24255	link
2025-05-30	Rethinking Continual Learning with Progressive Neural Collapse	Zheng Wang et.al.	2505.24254	null
2025-05-30	Proactive Guidance of Multi-Turn Conversation in Industrial Search	Xiaoyu Li et.al.	2505.24251	null
2025-05-30	An Adversary-Resistant Multi-Agent LLM System via Credibility Scoring	Sana Ebrahimi et.al.	2505.24239	null
2025-05-30	SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems	Xu He et.al.	2505.24201	null
2025-05-30	Learning Gentle Humanoid Locomotion and End-Effector Stabilization Control	Yitang Li et.al.	2505.24198	link
2025-05-30	Learning API Functionality from Demonstrations for Tool-based Agents	Bhrij Patel et.al.	2505.24197	null
2025-05-30	Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control	Zijie Xu et.al.	2505.24161	null
2025-05-30	Don't Just Follow MLLM Plans: Robust and Efficient Planning for Open-world Agents	Seungjoon Lee et.al.	2505.24157	null
2025-05-30	Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction	Chenyou Fan et.al.	2505.24156	null
2025-05-30	Biological Pathway Guided Gene Selection Through Collaborative Reinforcement Learning	Ehtesamul Azim et.al.	2505.24155	link
2025-05-30	Distributed Neural Policy Gradient Algorithm for Global Convergence of Networked Multi-Agent Reinforcement Learning	Pengcheng Dai et.al.	2505.24113	null
2025-05-30	Deception in Oligopoly Games via Adaptive Nash Seeking Systems	Michael Tang et.al.	2505.24112	null
2025-05-29	mRAG: Elucidating the Design Space of Multi-modal Retrieval-Augmented Generation	Chan-Wei Hu et.al.	2505.24073	null
2025-05-29	Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning	Jiashun Liu et.al.	2505.24061	null
2025-05-29	LLM Agents Should Employ Security Principles	Kaiyuan Zhang et.al.	2505.24019	null
2025-05-29	ConversAR: Exploring Embodied LLM-Powered Group Conversations in Augmented Reality for Second Language Learners	Jad Bendarkawi et.al.	2505.24000	null
2025-05-29	Multi-RAG: A Multimodal Retrieval-Augmented Generation System for Adaptive Video Understanding	Mingyang Mao et.al.	2505.23990	null
2025-05-29	Rules, agents and order	Amalia Puente et.al.	2505.23985	null
2025-05-29	Information Structure in Mappings: An Approach to Learning, Representation, and Generalisation	Henry Conklin et.al.	2505.23960	null
2025-05-29	Estimating Misreporting in the Presence of Genuine Modification: A Causal Perspective	Dylan Zapzalka et.al.	2505.23954	null
2025-05-29	Enhancing LLM-Based Code Generation with Complexity Metrics: A Feedback-Driven Approach	Melika Sepidband et.al.	2505.23953	null
2025-05-29	InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback	Boyuan Chen et.al.	2505.23950	null
2025-05-29	Lessons Learned: A Multi-Agent Framework for Code LLMs to Learn and Improve	Yuanzhe Liu et.al.	2505.23946	null
2025-05-29	ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents	Feiteng Fang et.al.	2505.23923	null
2025-05-29	OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation	Mengkang Hu et.al.	2505.23885	link
2025-05-29	Combining Deep Architectures for Information Gain estimation and Reinforcement Learning for multiagent field exploration	Emanuele Masiero et.al.	2505.23865	null
2025-05-29	DATD3: Depthwise Attention Twin Delayed Deep Deterministic Policy Gradient For Model Free Reinforcement Learning Under Output Feedback Control	Wuhao Wang et.al.	2505.23857	null
2025-05-29	Large Language Model-Based Agents for Automated Research Reproducibility: An Exploratory Study in Alzheimer's Disease	Nic Dobbins et.al.	2505.23852	null
2025-05-28	Seven Security Challenges That Must be Solved in Cross-domain Multi-agent LLM Systems	Ronny Ko et.al.	2505.23847	null
2025-05-28	Scalable, Symbiotic, AI and Non-AI Agent Based Parallel Discrete Event Simulations	Atanu Barai et.al.	2505.23846	null
2025-05-28	GeneBreaker: Jailbreak Attacks against DNA Language Models with Pathogenicity Guidance	Zaixi Zhang et.al.	2505.23839	link
2025-05-28	CoMaPOI: A Collaborative Multi-Agent Framework for Next POI Prediction Bridging the Gap Between Trajectory and Language	Lin Zhong et.al.	2505.23837	null
2025-05-28	Large Language Models Often Know When They Are Being Evaluated	Joe Needham et.al.	2505.23836	null
2025-05-28	Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective	Qingchuan Ma et.al.	2505.23833	link
2025-05-28	Privacy-Preserving Inconsistency Measurement	Carl Corea et.al.	2505.23825	null
2025-05-27	Aligning LLMs by Predicting Preferences from User Writing Samples	Stéphane Aroca-Ouellette et.al.	2505.23815	null
2025-05-29	From Chat Logs to Collective Insights: Aggregative Question Answering	Wentao Zhang et.al.	2505.23765	null
2025-05-29	ZeroGUI: Automating Online GUI Learning at Zero Human Cost	Chenyu Yang et.al.	2505.23762	link
2025-05-29	ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks	Akashah Shabbir et.al.	2505.23752	link
2025-05-29	ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering	Zexi Liu et.al.	2505.23723	link
2025-05-29	COBRA: Contextual Bandit Algorithm for Ensuring Truthful Strategic Agents	Arun Verma et.al.	2505.23720	null
2025-05-29	From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems	Zeinab Nezami et.al.	2505.23710	null
2025-05-29	Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics	Ran Zhang et.al.	2505.23695	link
2025-05-29	ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork	Caroline Wang et.al.	2505.23686	link
2025-05-31	GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents	Manish Shetty et.al.	2505.23671	link
2025-05-29	Initial Luminally Deposited FGF4 Critically Influences Blastocyst Patterning	Michael A. Ramirez-Sierra et.al.	2505.23650	null
2025-05-29	Securing AI Agents with Information-Flow Control	Manuel Costa et.al.	2505.23643	link
2025-05-29	MCP Safety Training: Learning to Refuse Falsely Benign MCP Exploits using Improved Preference Alignment	John Halloran et.al.	2505.23634	null
2025-06-02	MAPLE: A Mobile Agent with Persistent Finite State Machines for Structured Task Reasoning	Linqiang Guo et.al.	2505.23596	null
2025-05-29	SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents	Kunlun Zhu et.al.	2505.23559	link
2025-05-29	Going from a Representative Agent to Counterfactuals in Combinatorial Choice	Yanqiu Ruan et.al.	2505.23546	null
2025-05-29	TRAP: Targeted Redirecting of Agentic Preferences	Hangoo Kang et.al.	2505.23518	null
2025-05-29	PhysicsNeRF: Physics-Guided 3D Reconstruction from Sparse Views	Mohamed Rayan Barhdadi et.al.	2505.23481	link
2025-05-29	Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns	Xiang Li et.al.	2505.23474	null
2025-05-29	On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment	Safwan Labbi et.al.	2505.23459	null
2025-05-29	Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents	Zhejian Yang et.al.	2505.23450	null
2025-06-01	Emergent Risk Awareness in Rational Agents under Resource Constraints	Daniel Jarne Ornia et.al.	2505.23436	null
2025-05-29	From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents	Tobias Lindenbauer et.al.	2505.23422	link
2025-06-01	SWE-bench Goes Live!	Linghao Zhang et.al.	2505.23419	link
2025-05-29	Agent Interpolation for Knowledge	Marta Bílková et.al.	2505.23401	null
2025-05-29	GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning	Jusheng Zhang et.al.	2505.23399	null
2025-05-29	Grower-in-the-Loop Interactive Reinforcement Learning for Greenhouse Climate Control	Maxiu Xiao et.al.	2505.23355	null
2025-05-29	Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems	Xu Shen et.al.	2505.23352	link
2025-06-02	ScEdit: Script-based Assessment of Knowledge Editing	Xinye Li et.al.	2505.23291	link
2025-05-29	Wireless Agentic AI with Retrieval-Augmented Multimodal Semantic Perception	Guangyuan Liu et.al.	2505.23275	null
2025-05-29	Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion	Chunlong Xie et.al.	2505.23266	null
2025-05-29	Achieving Equitability with Subsidy	Yuanyuan Wang et.al.	2505.23251	null
2025-05-29	Context-Aware Semantic Communication for the Wireless Networks	Guangyuan Liu et.al.	2505.23249	null
2025-05-29	OSS-UAgent: An Agent-based Usability Evaluation Framework for Open Source Software	Lingkai Meng et.al.	2505.23239	link
2025-05-29	TrackVLA: Embodied Visual Tracking in the Wild	Shaoan Wang et.al.	2505.23189	null
2025-05-29	Cross-Task Experiential Learning on LLM-based Multi-Agent Collaboration	Yilong Li et.al.	2505.23187	null
2025-05-29	Conceptual Framework Toward Embodied Collective Adaptive Intelligence	Fan Wang et.al.	2505.23153	null
2025-05-29	Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners	Michal Nauman et.al.	2505.23150	null
2025-05-29	PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents	Haoyu Chen et.al.	2505.23130	null
2025-05-29	Learning to Incentivize in Repeated Principal-Agent Problems with Adversarial Agent Arrivals	Junyan Liu et.al.	2505.23124	null
2025-05-29	A Constructed Response: Designing and Choreographing Robot Arm Movements in Collaborative Dance Improvisation	Xiaoyu Chang et.al.	2505.23090	null
2025-05-29	Second Opinion Matters: Towards Adaptive Clinical AI via the Consensus of Expert Model Ensemble	Amit Kumthekar et.al.	2505.23075	null
2025-05-29	CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents	Zhen Xiang et.al.	2505.23055	link
2025-05-29	AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models	Jinchuan Zhang et.al.	2505.23020	link
2025-06-01	Stairway to Success: Zero-Shot Floor-Aware Object-Goal Navigation via LLM-Driven Coarse-to-Fine Exploration	Zeying Gong et.al.	2505.23019	null
2025-05-29	A Practical Approach for Building Production-Grade Conversational Agents with Workflow Graphs	Chiwan Park et.al.	2505.23006	null
2025-05-29	LLM Agents for Bargaining with Utility-based Feedback	Jihwan Oh et.al.	2505.22998	null
2025-05-29	Verify-in-the-Graph: Entity Disambiguation Enhancement for Complex Claim Verification with Interactive Graph Representation	Hoang Pham et.al.	2505.22993	null
2025-05-29	MenTeR: A fully-automated Multi-agenT workflow for end-to-end RF/Analog Circuits Netlist Design	Pin-Han Chen et.al.	2505.22990	null
2025-05-29	Free Lunch for User Experience: Crowdsourcing Agents for Scalable User Studies	Siyang Liu et.al.	2505.22981	null
2025-05-29	Learning Recommender Mechanisms for Bayesian Stochastic Games	Bengisu Guresti et.al.	2505.22979	null
2025-05-29	MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary Programming	Chengqi Zheng et.al.	2505.22967	null
2025-05-29	ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind	Peixuan Han et.al.	2505.22961	link
2025-05-29	Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness	Yongjin Yang et.al.	2505.22960	null
2025-05-29	Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents	Jenny Zhang et.al.	2505.22954	link
2025-05-28	WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning	Yuchen Zhuang et.al.	2505.22942	null
2025-05-28	A Smart-Contract to Resolve Multiple Equilibrium in Intermediated Trade	Daniel Aronoff et.al.	2505.22940	null
2025-05-28	On the Resolution of Stochastic MPECs over Networks: Distributed Implicit Zeroth-Order Gradient Tracking Methods	Mohammadjavad Ebrahimi et.al.	2505.22916	null
2025-05-28	Learning to Charge More: A Theoretical Study of Collusion by Q-Learning Agents	Cristian Chica et.al.	2505.22909	null
2025-05-28	Conversational Alignment with Artificial Intelligence in Context	Rachel Katharine Sterken et.al.	2505.22907	null
2025-05-30	Causal-PIK: Causality-based Physical Reasoning with a Physics-Informed Kernel	Carlota Parés-Morlans et.al.	2505.22861	null
2025-05-28	Operationalizing CaMeL: Strengthening LLM Defenses for Enterprise Deployment	Krti Tallam et.al.	2505.22852	null
2025-05-28	RocqStar: Leveraging Similarity-driven Retrieval and Agentic Systems for Rocq generation	Nikita Khramov et.al.	2505.22846	null
2025-05-28	A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems	Jonghan Lim et.al.	2505.22814	null
2025-05-28	First Steps Towards Overhearing LLM Agents: A Case Study With Dungeons & Dragons Gameplay	Andrew Zhu et.al.	2505.22809	link
2025-05-28	Dynamic Task Adaptation for Multi-Robot Manufacturing Systems with Large Language Models	Jonghan Lim et.al.	2505.22804	null
2025-05-28	Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games	Antonio Ocello et.al.	2505.22781	null
2025-05-28	MEDAL: A Framework for Benchmarking LLMs as Multilingual Open-Domain Chatbots and Dialogue Evaluators	John Mendonça et.al.	2505.22777	link
2025-05-28	Calibrated Value-Aware Model Learning with Stochastic Environment Models	Claas Voelcker et.al.	2505.22772	null
2025-05-28	Enhancing Lifelong Multi-Agent Path-finding by Using Artificial Potential Fields	Arseniy Pertzovsky et.al.	2505.22753	null
2025-05-28	HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer	Qi Cai et.al.	2505.22705	link
2025-05-28	Design and testing of an agent chatbot supporting decision making with public transport data	Luca Fantin et.al.	2505.22698	null
2025-05-28	When Does Neuroevolution Outcompete Reinforcement Learning in Transfer Learning Tasks?	Eleni Nisioti et.al.	2505.22696	link
2025-05-28	LLM-ODDR: A Large Language Model Framework for Joint Order Dispatching and Driver Repositioning	Tengfei Lyu et.al.	2505.22695	null
2025-05-28	3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model	Wenbo Hu et.al.	2505.22657	null
2025-05-28	Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents	Michael Kirchhof et.al.	2505.22655	null
2025-05-28	WebDancer: Towards Autonomous Information Seeking Agency	Jialong Wu et.al.	2505.22648	link
2025-06-01	FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control	Younggyo Seo et.al.	2505.22642	null
2025-05-28	LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents	Rui Li et.al.	2505.22634	null
2025-05-28	HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined in HDDL with OpenAI Gym	Ngoc La et.al.	2505.22597	link
2025-05-28	GitGoodBench: A Novel Benchmark For Evaluating Agentic Performance On Git	Tobias Lindenbauer et.al.	2505.22583	link
2025-05-30	Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems	Hoang Pham et.al.	2505.22571	null
2025-05-28	Universal Visuo-Tactile Video Understanding for Embodied Interaction	Yifan Xie et.al.	2505.22566	null
2025-05-28	Training RL Agents for Multi-Objective Network Defense Tasks	Andres Molina-Markham et.al.	2505.22531	null
2025-05-28	AI instructional agent improves student's perceived learner control and learning outcome: empirical evidence from a randomized controlled trial	Fei Qin et.al.	2505.22526	null
2025-05-28	From Strangers to Assistants: Fast Desire Alignment for Embodied Agent-User Adaptation	Yuanfei Wang et.al.	2505.22503	null
2025-05-28	EvolveSearch: An Iterative Self-Evolving Search Agent	Dingchu Zhang et.al.	2505.22501	null
2025-05-28	Human-Centered Human-AI Collaboration (HCHAC)	Qi Gao et.al.	2505.22477	null
2025-05-29	Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems	Jiaxi Yang et.al.	2505.22467	null
2025-05-28	AI Mathematician: Towards Fully Automated Frontier Mathematical Research	Yuanhang Liu et.al.	2505.22451	null
2025-05-28	COSMOS: A Data-Driven Probabilistic Time Series simulator for Chemical Plumes across Spatial Scales	Arunava Nag et.al.	2505.22436	link
2025-05-28	Exact Algorithms and Lower Bounds for Forming Coalitions of Constrained Maximum Size	Foivos Fioravantes et.al.	2505.22384	null
2025-05-28	AgentDNS: A Root Domain Naming System for LLM Agents	Enfang Cui et.al.	2505.22368	null
2025-05-28	From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications	Feibo Jiang et.al.	2505.22311	null
2025-05-28	Voice CMS: updating the knowledge base of a digital assistant through conversation	Grzegorz Wolny et.al.	2505.22303	null
2025-05-29	YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction	Mingzhuang Wang et.al.	2505.22250	null
2025-05-28	Efficient Leave-one-out Approximation in LLM Multi-agent Debate Based on Introspection	Yue Cui et.al.	2505.22192	null
2025-05-28	MONSTR: Model-Oriented Neutron Strain Tomographic Reconstruction	Mohammad Samin Nur Chowdhury et.al.	2505.22187	null
2025-05-28	Online Fair Division for Personalized $2$ -Value Instances	Georgios Amanatidis et.al.	2505.22174	null
2025-05-28	Oryx: a Performant and Scalable Algorithm for Many-Agent Coordination in Offline MARL	Claude Formanek et.al.	2505.22151	null
2025-05-28	Lifted Forward Planning in Relational Factored Markov Decision Processes with Concurrent Actions	Florian Andreas Marwitz et.al.	2505.22147	null
2025-05-28	Sentiment Simulation using Generative AI Agents	Melrose Tia et.al.	2505.22125	null
2025-05-30	VIRAL: Vision-grounded Integration for Reward design And Learning	Valentin Cuzin-Rambaud et.al.	2505.22092	link
2025-05-28	AudioGenie: A Training-Free Multi-Agent Framework for Diverse Multimodality-to-Multiaudio Generation	Yan Rong et.al.	2505.22053	null
2025-05-28	Reinforced Reasoning for Embodied Planning	Di Wu et.al.	2505.22050	null
2025-05-28	VulBinLLM: LLM-powered Vulnerability Detection for Stripped Binaries	Nasir Hussain et.al.	2505.22010	null
2025-05-28	Efficiently Enhancing General Agents With Hierarchical-categorical Memory	Changze Qiao et.al.	2505.22006	null
2025-05-28	Reward-Independent Messaging for Decentralized Multi-Agent Reinforcement Learning	Naoto Yoshida et.al.	2505.21985	null
2025-05-28	Pearl: A Multimodal Culturally-Aware Arabic Instruction Dataset	Fakhraddin Alwajih et.al.	2505.21979	null
2025-05-29	DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation	Tianjun Gu et.al.	2505.21969	link
2025-05-28	MapStory: LLM-Powered Text-Driven Map Animation Prototyping with Human-in-the-Loop Editing	Aditya Gunturu et.al.	2505.21966	null
2025-05-28	UI-Evol: Automatic Knowledge Evolving for Computer Use Agents	Ziyun Zhang et.al.	2505.21964	null
2025-05-28	LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents	Taro Yano et.al.	2505.21963	null
2025-05-28	Properties of zero-determinant strategies in multichannel games	Masahiko Ueda et.al.	2505.21952	null
2025-06-01	RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments	Zeyi Liao et.al.	2505.21936	link
2025-05-28	Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference	Yue Zhu et.al.	2505.21919	null
2025-05-31	Modeling and Optimizing User Preferences in AI Copilots: A Comprehensive Survey and Taxonomy	Saleh Afzoon et.al.	2505.21907	null
2025-05-28	Co-Saving: Resource Aware Multi-Agent Collaboration for Software Development	Rennai Qiu et.al.	2505.21898	null
2025-05-28	Incorporating LLMs for Large-Scale Urban Complex Mobility Simulation	Yu-Lun Song et.al.	2505.21880	null
2025-06-02	GETReason: Enhancing Image Context Extraction through Hierarchical Multi-Agent Reasoning	Shikhhar Siingh et.al.	2505.21863	null
2025-05-27	AI Agent Governance: A Field Guide	Jam Kraprayoon et.al.	2505.21808	null
2025-05-27	Events and their Localisation are Relative to a Lab	V. Vilasini et.al.	2505.21797	null
2025-05-27	Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation	Tharindu Kumarage et.al.	2505.21784	null
2025-05-27	BehaviorSFT: Behavioral Token Conditioning for Clinical Agents Across the Proactivity Spectrum	Yubin Kim et.al.	2505.21757	null
2025-05-27	AI-Supported Platform for System Monitoring and Decision-Making in Nuclear Waste Management with Large Language Models	Dongjune Chang et.al.	2505.21741	null
2025-05-27	Deep Reinforcement Learning Agents are not even close to Human Intelligence	Quentin Delfosse et.al.	2505.21731	null
2025-05-27	On Reconfigurable Bisimulation, with an Application to the Distributed Synthesis Problem	Yehia Abd Alrahman et.al.	2505.21672	null
2025-05-27	Classifying and Clustering Trading Agents	Mateusz Wilinski et.al.	2505.21662	link
2025-05-27	PreGenie: An Agentic Framework for High-quality Visual Presentation Generation	Xiaojie Xu et.al.	2505.21660	null
2025-05-27	Herd Behavior: Investigating Peer Influence in LLM-based Multi-Agent Systems	Young-Min Cho et.al.	2505.21588	null
2025-05-27	AITEE -- Agentic Tutor for Electrical Engineering	Christopher Knievel et.al.	2505.21582	link
2025-05-27	RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving	Huacan Wang et.al.	2505.21577	link
2025-05-27	ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools	Zhucong Li et.al.	2505.21569	null
2025-05-26	Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design Framework	Julien Soulé et.al.	2505.21559	null
2025-05-26	Fermionic operatorial model of a system with competitive and cooperative interactions	M. Gorgone et.al.	2505.21554	null
2025-05-27	Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making	Yihan Wang et.al.	2505.21503	null
2025-05-27	AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery	Haowei Wang et.al.	2505.21499	link
2025-05-27	Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers	Wei Pang et.al.	2505.21497	link
2025-05-27	UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents	Han Xiao et.al.	2505.21496	link
2025-05-27	Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming	Yang Yang et.al.	2505.21486	null
2025-05-27	Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration	Zijun Liu et.al.	2505.21471	link
2025-05-27	Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO	Muzhi Zhu et.al.	2505.21457	null
2025-05-27	Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks	Francesco Cozzi et.al.	2505.21426	link
2025-05-27	GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation	Naizhu Jin et.al.	2505.21425	null
2025-05-27	Autonomous Multi-Modal LLM Agents for Treatment Planning in Focused Ultrasound Ablation Surgery	Lina Zhao et.al.	2505.21418	null
2025-05-27	A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment	Brett Bissey et.al.	2505.21414	null
2025-05-27	MRSD: Multi-Resolution Skill Discovery for HRL Agents	Shashank Sharma et.al.	2505.21410	null
2025-05-27	Breaking co-existence: zealotry vs. nonlinear social impact	Christopher R. Kitching et.al.	2505.21407	null
2025-05-27	AutoJudger: An Agent-Driven Framework for Efficient Benchmarking of MLLMs	Xuanwen Ding et.al.	2505.21389	link
2025-05-27	Distributed equilibrium seeking in aggregative games: linear convergence under singular perturbations lens	Guido Carnevale et.al.	2505.21386	null
2025-05-27	Evaluating LLM Adaptation to Sociodemographic Factors: User Profile vs. Dialogue History	Qishuai Zhong et.al.	2505.21362	link
2025-05-28	PEDANTIC: A Dataset for the Automatic Examination of Definiteness in Patent Claims	Valentin Knappich et.al.	2505.21342	null
2025-05-27	Large Language Models Miss the Multi-Agent Mark	Emanuele La Malfa et.al.	2505.21298	null
2025-05-27	Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework	Saman Marandi et.al.	2505.21291	null
2025-05-27	PACT: A Contract-Theoretic Framework for Pricing Agentic AI Services Powered by Large Language Models	Ya-Ting Yang et.al.	2505.21286	null
2025-05-27	XBOUND: Exploring the Capability Boundaries of Device-Control Agents through Trajectory Tree Exploration	Shaoqing Zhang et.al.	2505.21279	null
2025-05-27	Data-Driven Cellular Mobility Management via Bayesian Optimization and Reinforcement Learning	Mohamed Benzaghta et.al.	2505.21249	null
2025-05-27	Breaking the Performance Ceiling in Complex Reinforcement Learning requires Inference Strategies	Felix Chalumeau et.al.	2505.21236	null
2025-05-27	Quantum AIXI: Universal Intelligence via Quantum Information	Elija Perrier et.al.	2505.21170	null
2025-05-27	GGBond: Growing Graph-Based AI-Agent Society for Socially-Aware Recommender Simulation	Hailin Zhong et.al.	2505.21154	null
2025-05-27	IKMo: Image-Keyframed Motion Generation with Trajectory-Pose Conditioned Motion Diffusion Model	Yang Zhao et.al.	2505.21146	null
2025-05-27	Creativity in LLM-based Multi-Agent Systems: A Survey	Yi-Cheng Lin et.al.	2505.21116	null
2025-05-27	Simulating Ethics: Using LLM Debate Panels to Model Deliberation on Medical Dilemmas	Hazem Zohny et.al.	2505.21112	null
2025-05-27	CXXCrafter: An LLM-Based Agent for Automated C/C++ Open Source Software Building	Zhengmin Yu et.al.	2505.21069	null
2025-05-27	Agent-Environment Alignment via Automated Interface Generation	Kaiming Liu et.al.	2505.21055	null
2025-05-27	RefAV: Towards Planning-Centric Scenario Mining	Cainan Davidson et.al.	2505.20981	link
2025-05-27	Identifying Super Spreaders in Multilayer Networks	Michał Czuba et.al.	2505.20980	null
2025-05-28	Towards Conversational Development Environments: Using Theory-of-Mind and Multi-Agent Architectures for Requirements Refinement	Keheliya Gallaba et.al.	2505.20973	null
2025-05-27	Semantic Communication meets System 2 ML: How Abstraction, Compositionality and Emergent Languages Shape Intelligence	Mehdi Bennis et.al.	2505.20964	null
2025-05-27	Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective	Yang Zhang et.al.	2505.20922	link
2025-05-27	Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation	Pingrui Zhang et.al.	2505.20897	link
2025-05-27	Reinforcement Learning-based Sequential Route Recommendation for System-Optimal Traffic Assignment	Leizhen Wang et.al.	2505.20889	null
2025-05-27	MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems	Kai Chen et.al.	2505.20824	link
2025-05-27	MT-Mol:Multi Agent System with Tool-based Reasoning for Molecular Optimization	Hyomin Kim et.al.	2505.20820	null
2025-05-27	Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective	Krishna Singh Rajput et.al.	2505.20816	null
2025-05-27	Can Agents Fix Agent Issues?	Alfin Wijaya Rahardja et.al.	2505.20749	null
2025-05-27	RRO: LLM Agent Optimization Through Rising Reward Trajectories	Zilong Wang et.al.	2505.20737	null
2025-05-27	E2E Process Automation Leveraging Generative AI and IDP-Based Automation Agent: A Case Study on Corporate Expense Processing	Cheonsu Jeong et.al.	2505.20733	null
2025-05-27	SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution	Hanlin Wang et.al.	2505.20732	link
2025-05-27	ManiTaskGen: A Comprehensive Task Generator for Benchmarking and Improving Vision-Language Agents on Embodied Decision-Making	Liu Dai et.al.	2505.20726	null
2025-05-27	A reinforcement learning agent for maintenance of deteriorating systems with increasingly imperfect repairs	Alberto Pliego Marugán et.al.	2505.20725	null
2025-05-28	VLM Can Be a Good Assistant: Enhancing Embodied Visual Tracking with Self-Improving Vision-Language Models	Kui Wu et.al.	2505.20718	null
2025-05-27	Hierarchical Instruction-aware Embodied Visual Tracking	Kui Wu et.al.	2505.20710	null
2025-05-27	Berk-Nash Rationalizability	Ignacio Esponda et.al.	2505.20708	null
2025-05-27	GIFARC: Synthetic Dataset for Leveraging Human-Intuitive Analogies to Elevate AI Reasoning	Woochang Sim et.al.	2505.20672	null
2025-05-27	LLM-Guided Reinforcement Learning: Addressing Training Bottlenecks through Policy Modulation	Heng Tan et.al.	2505.20671	null
2025-05-27	MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning	Zikang Guo et.al.	2505.20670	null
2025-05-30	AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage	Xuanle Zhao et.al.	2505.20662	link
2025-05-27	BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism	Qinzhuo Wu et.al.	2505.20660	null
2025-05-27	An Optimisation Framework for Unsupervised Environment Design	Nathan Monette et.al.	2505.20659	null
2025-05-27	CoderAgent: Simulating Student Behavior for Personalized Programming Learning with Large Language Models	Yi Zhan et.al.	2505.20642	null
2025-05-27	IndustryEQA: Pushing the Frontiers of Embodied Question Answering in Industrial Scenarios	Yifan Li et.al.	2505.20640	null
2025-05-27	Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration	Sibo Xiao et.al.	2505.20625	null
2025-05-29	The challenge of hidden gifts in multi-agent reinforcement learning	Dane Malenfant et.al.	2505.20579	null
2025-05-26	Synergising Hierarchical Data Centers and Power Networks: A Privacy-Preserving Approach	Junhong Liu et.al.	2505.20575	null
2025-05-26	xChemAgents: Agentic AI for Explainable Quantum Chemistry	Can Polat et.al.	2505.20574	link
2025-05-26	Byzantine-Resilient Distributed P2P Energy Trading via Spatial-Temporal Anomaly Detection	Junhong Liu et.al.	2505.20567	null
2025-05-26	Learning a Pessimistic Reward Model in RLHF	Yinglun Xu et.al.	2505.20556	null
2025-05-28	Trade among moral agents with information asymmetries	José Ignacio Rivero-Wildemauwe et.al.	2505.20551	null
2025-05-26	Project Riley: Multimodal Multi-Agent LLM Collaboration with Emotional Reasoning and Voting	Ana Rita Ortigoso et.al.	2505.20521	null
2025-05-26	CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic	Yuxuan Sun et.al.	2505.20510	null
2025-05-26	Reconceptualizing Smart Microscopy: From Data Collection to Knowledge Creation by Multi-Agent Integration	P. S. Kesavan et.al.	2505.20466	null
2025-05-26	OSVI-WM: One-Shot Visual Imitation for Unseen Tasks using World-Model-Guided Trajectory Generation	Raktim Gautam Goswami et.al.	2505.20425	null
2025-05-26	RetroMotion: Retrocausal Motion Forecasting Models are Instructable	Royden Wagner et.al.	2505.20414	link
2025-05-26	SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents	Ibragim Badertdinov et.al.	2505.20411	link
2025-05-26	Algorithmic Control Improves Residential Building Energy and EV Management when PV Capacity is High but Battery Capacity is Low	Lennart Ullner et.al.	2505.20377	null
2025-05-26	VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection	Zeyi Huang et.al.	2505.20289	null
2025-05-26	Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution	Jiahao Qiu et.al.	2505.20286	link
2025-05-27	MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability	Weiqi Wu et.al.	2505.20285	link
2025-05-26	OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction	Haonan Zhang et.al.	2505.20277	link
2025-05-26	Ten Principles of AI Agent Economics	Ke Yang et.al.	2505.20273	null
2025-05-26	syftr: Pareto-Optimal Generative AI	Alexander Conway et.al.	2505.20266	link
2025-05-26	On Path to Multimodal Historical Reasoning: HistBench and HistAgent	Jiahao Qiu et.al.	2505.20246	link
2025-05-26	Shutdownable Agents through POST-Agency	Elliott Thornley et.al.	2505.20203	null
2025-05-26	THiNK: Can Large Language Models Think-aloud?	Yongan Yu et.al.	2505.20184	link
2025-05-26	The Problem of Algorithmic Collisions: Mitigating Unforeseen Risks in a Connected World	Maurice Chiodo et.al.	2505.20181	null
2025-05-27	MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents	Ziming Wei et.al.	2505.20148	link
2025-05-26	Agentic 3D Scene Generation with Spatially Contextualized VLMs	Xinhang Liu et.al.	2505.20129	null
2025-05-26	Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers	Zhengliang Shi et.al.	2505.20128	link
2025-05-26	Agentic AI Process Observability: Discovering Behavioral Variability	Fabiana Fournier et.al.	2505.20127	null
2025-05-26	Agents Require Metacognitive and Strategic Reasoning to Succeed in the Coming Labor Markets	Simpson Zhang et.al.	2505.20120	null
2025-05-27	TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent	Dominik Meier et.al.	2505.20118	link
2025-05-26	MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning	Thang Nguyen et.al.	2505.20096	null
2025-05-26	SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale	Qi Li et.al.	2505.20094	null
2025-05-26	REARANK: Reasoning Re-ranking Agent via Reinforcement Learning	Le Zhang et.al.	2505.20046	link
2025-05-26	Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking	Yihan Chen et.al.	2505.20023	null
2025-05-26	WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback	Minda Hu et.al.	2505.20013	null
2025-05-26	The Many Challenges of Human-Like Agents in Virtual Game Environments	Maciej Świechowski et.al.	2505.20011	null
2025-05-26	Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents	Tao Wu et.al.	2505.19997	null
2025-05-26	The residual maximin share	Uriel Feige et.al.	2505.19961	null
2025-05-26	MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research	Hui Chen et.al.	2505.19955	link
2025-05-26	Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval	Rong-Cheng Tu et.al.	2505.19952	null
2025-05-26	Signed Angle Rigid Graphs for Network Localization and Formation Control	Jinpeng Huang et.al.	2505.19945	null
2025-05-26	Subtle Risks, Critical Failures: A Framework for Diagnosing Physical Safety of LLMs for Embodied Decision Making	Yejin Son et.al.	2505.19933	null
2025-05-27	Evaluating AI cyber capabilities with crowdsourced elicitation	Artem Petrov et.al.	2505.19915	null
2025-05-26	EMAC+: Embodied Multimodal Agent for Collaborative Planning with VLM+LLM	Shuang Ao et.al.	2505.19905	null
2025-05-26	ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows	Qiushi Sun et.al.	2505.19897	null
2025-05-26	Large Language Models as Autonomous Spacecraft Operators in Kerbal Space Program	Alejandro Carrasco et.al.	2505.19896	link
2025-05-26	Deep Active Inference Agents for Delayed and Long-Horizon Environments	Yavar Taheri Yeganeh et.al.	2505.19867	link
2025-05-26	DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning	Leander Diaz-Bone et.al.	2505.19850	link
2025-05-26	Multi-Agent Reinforcement Learning in Cybersecurity: From Fundamentals to Applications	Christoph R. Landolt et.al.	2505.19837	null
2025-05-26	SecVulEval: Benchmarking LLMs for Real-World C/C++ Vulnerability Detection	Md Basim Uddin Ahmed et.al.	2505.19828	link
2025-05-26	Integrating emotional intelligence, memory architecture, and gestures to achieve empathetic humanoid robot interaction in an educational setting	Fuze Sun et.al.	2505.19803	null
2025-05-26	Opinion dynamics for an increasing population of agents. A symmetric continuous agent model	Ioannis Markou et.al.	2505.19791	null
2025-05-26	TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning	Yuhui Chen et.al.	2505.19769	null
2025-05-26	T^2Agent A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search	Xing Cui et.al.	2505.19768	null
2025-05-26	RFTF: Reinforcement Fine-tuning for Embodied Agents with Temporal Feedback	Junyang Shu et.al.	2505.19767	null
2025-05-26	Agentic Predictor: Performance Prediction for Agentic Workflows via Multi-View Encoding	Patara Trirat et.al.	2505.19764	link
2025-05-26	Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning	Zican Hu et.al.	2505.19761	link
2025-05-26	NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering	Ruisheng Cao et.al.	2505.19754	null
2025-05-26	ReChisel: Effective Automatic Chisel Code Generation by LLM with Reflection	Juxin Niu et.al.	2505.19734	link
2025-05-26	Extremum Flow Matching for Offline Goal Conditioned Reinforcement Learning	Quentin Rouxel et.al.	2505.19717	null
2025-05-28	JEDI: Latent End-to-end Diffusion Mitigates Agent-Human Performance Asymmetry in Model-Based Reinforcement Learning	Jing Yu Lim et.al.	2505.19698	null
2025-05-26	Large Language Models for Planning: A Comprehensive and Systematic Survey	Pengfei Cao et.al.	2505.19683	link
2025-05-26	FieldWorkArena: Agentic AI Benchmark for Real Field Work Tasks	Atsunori Moteki et.al.	2505.19662	null
2025-05-26	Select, Read, and Write: A Multi-Agent Framework of Full-Text-based Related Work Generation	Xiaochuan Liu et.al.	2505.19647	link
2025-05-26	Adaptive Episode Length Adjustment for Multi-agent Reinforcement Learning	Byunghyun Yoo et.al.	2505.19637	null
2025-05-26	DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue	Yichun Feng et.al.	2505.19630	link
2025-05-28	AgentRecBench: Benchmarking LLM Agent-based Personalized Recommender Systems	Yu Shang et.al.	2505.19623	null
2025-05-26	Multi-Agent Collaboration via Evolving Orchestration	Yufan Dang et.al.	2505.19591	null
2025-05-26	LLM-Agent-Controller: A Universal Multi-Agent Large Language Model System as a Control Engineer	Rasoul Zahedifar et.al.	2505.19567	null
2025-05-26	AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and Healthcare	Ying Xiao et.al.	2505.19562	link
2025-05-26	Towards Multi-Granularity Memory Association and Selection for Long-Term Conversational Agents	Derong Xu et.al.	2505.19549	link
2025-05-26	DoctorRAG: Medical RAG Fusing Knowledge with Patient Analogy through Textual Gradients	Yuxing Lu et.al.	2505.19538	null
2025-05-26	Fox in the Henhouse: Supply-Chain Backdoor Attacks Against Reinforcement Learning	Shijie Liu et.al.	2505.19532	null
2025-05-26	Benchmarking and Enhancing LLM Agents in Localizing Linux Kernel Bugs	Zhenhao Zhou et.al.	2505.19489	link
2025-05-26	VLMLight: Traffic Signal Control via Vision-Language Meta-Control and Dual-Branch Reasoning	Maonan Wang et.al.	2505.19486	null
2025-05-26	Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs	Hao Kang et.al.	2505.19481	link
2025-05-26	Judging with Many Minds: Do More Perspectives Mean Less Prejudice?	Chiyu Ma et.al.	2505.19477	link
2025-05-26	Improving Recommendation Fairness without Sensitive Attributes Using Multi-Persona LLMs	Haoran Xin et.al.	2505.19473	null
2025-05-26	Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI	Ranjan Sapkota et.al.	2505.19443	null
2025-05-26	Task Memory Engine: Spatial Memory for Robust Multi-Step LLM Agents	Ye Ye et.al.	2505.19436	link
2025-05-26	Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression	Peijie Dong et.al.	2505.19433	link
2025-05-26	Frictional Agent Alignment Framework: Slow Down and Don't Break Things	Abhijnan Nath et.al.	2505.19428	link
2025-05-26	Fusion Intelligence for Digital Twinning AI Data Centers: A Synergistic GenAI-PhyAI Approach	Ruihang Wang et.al.	2505.19409	null
2025-05-26	CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems	Yan Wen et.al.	2505.19405	null
2025-05-27	DiffVLA: Vision-Language Guided Diffusion Planning for Autonomous Driving	Anqing Jiang et.al.	2505.19381	null
2025-05-26	Belief Attribution as Mental Explanation: The Role of Accuracy, Informativity, and Causality	Lance Ying et.al.	2505.19376	null
2025-05-27	Prompting Decision Transformers for Zero-Shot Reach-Avoid Policies	Kevin Li et.al.	2505.19337	null
2025-05-25	What do Blind and Low-Vision People Really Want from Assistive Smart Devices? Comparison of the Literature with a Focus Study	Bhanuka Gamage et.al.	2505.19325	null
2025-05-25	Making Teams and Influencing Agents: Efficiently Coordinating Decision Trees for Interpretable Multi-Agent Reinforcement Learning	Rex Chen et.al.	2505.19316	null
2025-05-25	Retrieval-Augmented Generation for Service Discovery: Chunking Strategies and Benchmarking	Robin D. Pesl et.al.	2505.19310	null
2025-05-25	A Novel Zero-Trust Identity Framework for Agentic AI: Decentralized Authentication and Fine-Grained Access Control	Ken Huang et.al.	2505.19301	null
2025-05-25	A likelihood-based Bayesian inference framework for the calibration of and selection between stochastic velocity-jump models	Arianna Ceccarelli et.al.	2505.19292	null
2025-05-25	A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning	Yuzheng Hu et.al.	2505.19281	link
2025-05-25	A General Theory of Risk Sharing	Vasily Melnikov et.al.	2505.19276	null
2025-05-25	Agentic Information Theory: Ergodicity and Intrinsic Semantics of Information Processes	James P. Crutchfield et.al.	2505.19275	null
2025-05-25	ALRPHFS: Adversarially Learned Risk Patterns with Hierarchical Fast & Slow Reasoning for Robust Agent Defense	Shiyu Xiang et.al.	2505.19260	null
2025-05-25	DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research	João Coelho et.al.	2505.19253	null
2025-05-25	Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees	Sourav Ganguly et.al.	2505.19238	null
2025-05-25	Sensorimotor features of self-awareness in multimodal large language models	Iñaki Dellibarda Varela et.al.	2505.19237	null
2025-05-25	GUARDIAN: Safeguarding LLM Multi-Agent Collaborations with Temporal Graph Modeling	Jialong Zhou et.al.	2505.19234	null
2025-05-25	Numerical Analysis of Damage Evolution in Open Hole CFRP Laminates Modified with Electrospun Self Healing Diels Alder Interleaves	Marianna Chantzi et.al.	2505.19232	null
2025-05-25	Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding	Shiyue Wang et.al.	2505.19219	null
2025-05-25	Omni-Perception: Omnidirectional Collision Avoidance for Legged Locomotion in Dynamic Environments	Zifan Wang et.al.	2505.19214	null
2025-05-25	When Ethics and Payoffs Diverge: LLM Agents in Morally Charged Social Dilemmas	Steffen Backmann et.al.	2505.19212	link
2025-05-25	SpeakStream: Streaming Text-to-Speech with Interleaved Data	Richard He Bai et.al.	2505.19206	null
2025-05-25	OptiMindTune: A Multi-Agent Framework for Intelligent Hyperparameter Optimization	Meher Bhaskar Madiraju et.al.	2505.19205	link
2025-05-27	Structuring the Unstructured: A Multi-Agent System for Extracting and Querying Financial KPIs and Guidance	Chanyeol Choi et.al.	2505.19197	null
2025-05-27	When Two LLMs Debate, Both Think They'll Win	Pradyumna Shyama Prasad et.al.	2505.19184	null
2025-05-25	Investigating Pedagogical Teacher and Student LLM Agents: Genetic Adaptation Meets Retrieval Augmented Generation Across Learning Style	Debdeep Sanyal et.al.	2505.19173	null
2025-05-25	Amplifying Human Creativity and Problem Solving with AI Through Generative Collective Intelligence	Thomas P. Kehler et.al.	2505.19167	null
2025-05-25	The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework	Feiran Liu et.al.	2505.19139	null
2025-05-25	Incentivizing High-Quality Human Annotations with Golden Questions	Shang Liu et.al.	2505.19134	null
2025-05-25	Agentic Visualization: Extracting Agent-based Design Patterns from Visualization Systems	Vaishali Dhanoa et.al.	2505.19101	null
2025-05-25	ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World	Runliang Niu et.al.	2505.19095	link
2025-05-25	A Systematic Classification of Vulnerabilities in MoveEVM Smart Contracts (MWC)	Selçuk Topal et.al.	2505.19047	null
2025-05-25	SANNet: A Semantic-Aware Agentic AI Networking Framework for Multi-Agent Cross-Layer Coordination	Yong Xiao et.al.	2505.18946	null
2025-05-25	MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems	Xuanming Zhang et.al.	2505.18943	link
2025-05-24	Beyond Domain Randomization: Event-Inspired Perception for Visually Robust Adversarial Imitation from Videos	Andrea Ramazzina et.al.	2505.18899	link
2025-05-24	Security Concerns for Large Language Models: A Survey	Miles Q. Li et.al.	2505.18889	null
2025-05-24	Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach	Yuchen Wu et.al.	2505.18882	null
2025-05-24	SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes	Dicong Qiu et.al.	2505.18881	null
2025-05-24	CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions	Kung-Hsiang Huang et.al.	2505.18878	link
2025-05-24	Guided by Guardrails: Control Barrier Functions as Safety Instructors for Robotic Learning	Maeva Guerrier et.al.	2505.18858	null
2025-05-24	Multi-Party Conversational Agents: A Survey	Sagar Sapkota et.al.	2505.18845	null
2025-05-24	Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning	Jinzheng Li et.al.	2505.18831	null
2025-05-24	LiteCUA: Computer as MCP Server for Computer-Use Agent on AIOS	Kai Mei et.al.	2505.18829	link
2025-05-24	Agent-Based Decentralized Energy Management of EV Charging Station with Solar Photovoltaics via Multi-Agent Reinforcement Learning	Jiarong Fan et.al.	2505.18750	null
2025-05-27	$C^3$ -Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking	Peijie Yu et.al.	2505.18746	link
2025-05-24	Reward-Driven Interaction: Enhancing Proactive Dialogue Agents through User Satisfaction Prediction	Wei Shen et.al.	2505.18731	null
2025-05-24	AI-Researcher: Autonomous Scientific Innovation	Jiabin Tang et.al.	2505.18705	link
2025-05-24	LLM-QFL: Distilling Large Language Model for Quantum Federated Learning	Dev Gurung et.al.	2505.18656	link
2025-05-24	SEW: Self-Evolving Agentic Workflows for Automated Code Generation	Siwei Liu et.al.	2505.18646	link
2025-05-24	DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation	Zhihao Jia et.al.	2505.18630	null
2025-05-24	A representation theorem for events within lattice structures of state-spaces	Alex A. T. Rathke et.al.	2505.18615	null
2025-05-27	Debate-to-Detect: Reformulating Misinformation Detection as a Real-World Debate with Large Language Models	Chen Han et.al.	2505.18596	null
2025-05-24	MisoDICE: Multi-Agent Imitation from Unlabeled Mixed-Quality Demonstrations	The Viet Bui et.al.	2505.18595	null
2025-05-24	Bayesian Meta-Reinforcement Learning with Laplace Variational Recurrent Networks	Joery A. de Vries et.al.	2505.18591	link
2025-05-24	Removal of Hallucination on Hallucination: Debate-Augmented RAG	Wentao Hu et.al.	2505.18581	link
2025-05-24	MASTER: Multi-Agent Security Through Exploration of Roles and Topological Structures -- A Comprehensive Framework	Yifan Zhu et.al.	2505.18572	null
2025-05-24	Benchmarking Poisoning Attacks against Retrieval-Augmented Generation	Baolei Zhang et.al.	2505.18543	null
2025-05-24	MRGAgents: A Multi-Agent Framework for Improved Medical Report Generation with Med-LVLMs	Pengyu Wang et.al.	2505.18530	null
2025-05-24	Grounding Bodily Awareness in Visual Representations for Efficient Policy Learning	Junlin Wang et.al.	2505.18487	link
2025-05-24	Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services	Guoheng Sun et.al.	2505.18471	null
2025-05-27	A Survey of LLM $\times$ DATA	Xuanhe Zhou et.al.	2505.18458	link
2025-05-24	EdgeAgentX: A Novel Framework for Agentic AI at the Edge in Military Communication Networks	Abir Ray et.al.	2505.18457	null
2025-05-24	A numerical demonstration of dynamic stall control	Sarasija Sudharsan et.al.	2505.18449	null
2025-05-24	Finite-Time Global Optimality Convergence in Deep Neural Actor-Critic Methods for Decentralized Multi-Agent Reinforcement Learning	Zhiyao Zhang et.al.	2505.18433	null
2025-05-23	Reinforcement Learning for Ballbot Navigation in Uneven Terrain	Achkan Salehi et.al.	2505.18417	link
2025-05-23	DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding	Yue Jiang et.al.	2505.18411	link
2025-05-23	An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems	Fangqiao Tian et.al.	2505.18397	null
2025-05-23	Dynamic Risk Assessments for Offensive Cybersecurity Agents	Boyi Wei et.al.	2505.18384	link
2025-05-23	Hard Negative Mining for Domain-Specific Retrieval in Enterprise Systems	Hansa Meghwani et.al.	2505.18366	null
2025-05-23	Persona Alchemy: Designing, Evaluating, and Implementing Psychologically-Grounded LLM Agents for Diverse Stakeholder Representation	Sola Kim et.al.	2505.18351	null
2025-05-23	The Cell Must Go On: Agar.io for Continual Reinforcement Learning	Mohamed A. Mohamed et.al.	2505.18347	link
2025-05-23	Diffusion Self-Weighted Guidance for Offline Reinforcement Learning	Augusto Tagle et.al.	2505.18345	null
2025-05-23	CrashAgent: Crash Scenario Generation via Multi-modal Reasoning	Miao Li et.al.	2505.18341	null
2025-05-23	Towards Natural Language Communication for Cooperative Autonomous Driving via Self-Play	Jiaxun Cui et.al.	2505.18334	null
2025-05-23	Single-agent or Multi-agent Systems? Why Not Both?	Mingyan Gao et.al.	2505.18286	null
2025-05-23	Collaborative Memory: Multi-User Memory Sharing in LLM Agents with Dynamic Access Control	Alireza Rezazadeh et.al.	2505.18279	null
2025-05-23	BEDI: A Comprehensive Benchmark for Evaluating Embodied Agents on UAVs	Mingning Guo et.al.	2505.18229	link
2025-05-23	Implementing Agents in JavaScript	Timotheus Kampik et.al.	2505.18228	null
2025-05-23	IDA-Bench: Evaluating LLMs on Interactive Guided Data Analysis	Hanyu Li et.al.	2505.18223	link
2025-05-23	CoMet: Metaphor-Driven Covert Communication for Multi-Agent Language Games	Shuhang Xu et.al.	2505.18218	link
2025-05-23	LA-RCS: LLM-Agent-Based Robot Control System	TaekHyun Park et.al.	2505.18214	null
2025-05-23	Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find	Owen Bianchi et.al.	2505.18148	null
2025-05-23	Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading	Mohamed Swailem et.al.	2505.18145	null
2025-05-23	Gaming Tool Preferences in Agentic LLMs	Kazem Faghih et.al.	2505.18135	link
2025-05-23	ProgRM: Build Better GUI Agents with Progress Rewards	Danyang Zhang et.al.	2505.18121	null
2025-05-23	Facility Location with Public Locations and Private Doubly-Peaked Costs	Richard Cole et.al.	2505.18114	null
2025-05-23	ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework	Lisheng Huang et.al.	2505.18105	link
2025-05-23	Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL	Joey Hong et.al.	2505.18098	null
2025-05-23	Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding	Xiaoyi Zhang et.al.	2505.18079	null
2025-05-23	Linear Mixture Distributionally Robust Markov Decision Processes	Zhishuai Liu et.al.	2505.18044	null
2025-05-27	Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective	Jintian Shao et.al.	2505.17997	null
2025-05-23	Survival Games: Human-LLM Strategic Showdowns under Severe Resource Scarcity	Zhihong Chen et.al.	2505.17937	link
2025-05-23	Formalizing Embeddedness Failures in Universal Artificial Intelligence	Cole Wyeth et.al.	2505.17882	null
2025-05-23	Best Group Identification in Multi-Objective Bandits	Mohammad Shahverdikondori et.al.	2505.17869	null
2025-05-23	DesignX: Human-Competitive Algorithm Designer for Black-Box Optimization	Hongshu Guo et.al.	2505.17866	null
2025-05-23	Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities	Ziwei Zhou et.al.	2505.17862	link
2025-05-23	Superplatforms Have to Attack AI Agents	Jianghao Lin et.al.	2505.17861	null
2025-05-23	Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning	Nicolas Castanet et.al.	2505.17830	null
2025-05-23	Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models	Xuchen Pan et.al.	2505.17826	link
2025-05-23	Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour	Bálint Gyevnár et.al.	2505.17801	null
2025-05-23	DialogXpert: Driving Intelligent and Emotion-Aware Conversations through Online Value-Based Reinforcement Learning with LLM Priors	Tazeek Bin Abdur Rakib et.al.	2505.17795	null
2025-05-23	The Real Barrier to LLM Agent Usability is Agentic ROI	Weiwen Liu et.al.	2505.17767	null
2025-05-23	HRSim: An agent-based simulation platform for high-capacity ride-sharing services	Wang Chen et.al.	2505.17758	link
2025-05-23	Feasible Action Space Reduction for Quantifying Causal Responsibility in Continuous Spatial Interactions	Ashwin George et.al.	2505.17739	link
2025-05-23	Automating Safety Enhancement for LLM-based Agents with Synthetic Risk Scenarios	Xueyang Zhou et.al.	2505.17735	null
2025-05-23	URB -- Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles	Ahmet Onur Akman et.al.	2505.17734	null
2025-05-23	Get Experience from Practice: LLM Agents with Record & Replay	Erhu Feng et.al.	2505.17716	null
2025-05-23	Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek	Xueyang Li et.al.	2505.17702	null
2025-05-23	Star-like thermoresponsive microgels: a new class of soft nanocolloids	Elisa Ballin et.al.	2505.17700	null
2025-05-23	Rethinking Agent Design: From Top-Down Workflows to Bottom-Up Skill Evolution	Jiawei Du et.al.	2505.17673	null
2025-05-23	Simulating Macroeconomic Expectations using LLM Agents	Jianhao Lin et.al.	2505.17648	null
2025-05-23	HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning	Chuhao Zhou et.al.	2505.17645	null
2025-05-27	TransBench: Breaking Barriers for Transferable Graphical User Interface Agents in Dynamic Digital Environments	Yuheng Lu et.al.	2505.17629	link
2025-05-23	CAS-IQA: Teaching Vision-Language Models for Synthetic Angiography Quality Assessment	Bo Wang et.al.	2505.17619	null
2025-05-23	Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments	Qingyu Lu et.al.	2505.17616	link
2025-05-23	Distilling LLM Agent into Small Models with Retrieval and Code Tools	Minki Kang et.al.	2505.17612	link
2025-05-23	Learning Equilibria from Data: Provably Efficient Multi-Agent Imitation Learning	Till Freihaut et.al.	2505.17610	null
2025-05-23	Controlled Agentic Planning & Reasoning for Mechanism Synthesis	João Pedro Gandarela et.al.	2505.17607	null
2025-05-23	AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model	Tijmen de Haan et.al.	2505.17592	null
2025-05-23	USTBench: Benchmarking and Dissecting Spatiotemporal Reasoning of LLMs as Urban Agents	Siqi Lai et.al.	2505.17572	null
2025-05-26	Novobo: Supporting Teachers' Peer Learning of Instructional Gestures by Teaching a Mentee AI-Agent Together	Jiaqi Jiang et.al.	2505.17557	null
2025-05-23	Probe by Gaming: A Game-based Benchmark for Assessing Conceptual Knowledge in LLMs	Shuhang Xu et.al.	2505.17512	null
2025-05-23	Multi-agent Systems for Misinformation Lifecycle : Detection, Correction And Source Identification	Aditya Gautam et.al.	2505.17511	null
2025-05-23	The Discovery Engine: A Framework for AI-Driven Synthesis and Navigation of Scientific Knowledge Landscapes	Vladimir Baulin et.al.	2505.17500	null
2025-05-23	PD $^3$ : A Project Duplication Detection Framework via Adapted Multi-Agent Debate	Dezheng Bao et.al.	2505.17492	null
2025-05-23	MARCO: Meta-Reflection with Cross-Referencing for Code Reasoning	Yusheng Zhao et.al.	2505.17481	null
2025-05-23	Hydra: Structured Cross-Source Enhanced Large Language Model Reasoning	Xingyu Tan et.al.	2505.17464	null
2025-05-23	LLM-BSCVM: An LLM-Based Blockchain Smart Contract Vulnerability Management Framework	Yanli Jin et.al.	2505.17416	link
2025-05-23	Emergence of Anti-chemotactic Flocking in Active Biomimetic Colloids	Joseph D. Lopes et.al.	2505.17394	null
2025-05-23	Curriculum Guided Reinforcement Learning for Efficient Multi Hop Retrieval Augmented Generation	Yuelyu Ji et.al.	2505.17391	null
2025-05-23	Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition	Zichen Wang et.al.	2505.17379	null
2025-05-22	A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety	Ankita Kushwaha et.al.	2505.17342	null
2025-05-22	Partner Modelling Emerges in Recurrent Agents (But Only When It Matters)	Ruaridh Mon-Williams et.al.	2505.17323	null
2025-05-22	Control of Renewable Energy Communities using AI and Real-World Data	Tiago Fonseca et.al.	2505.17321	null
2025-05-22	Search Wisely: Mitigating Sub-optimal Agentic Searches By Reducing Uncertainty	Peilin Wu et.al.	2505.17281	null
2025-05-22	ConvoyNext: A Scalable Testbed Platform for Cooperative Autonomous Vehicle Systems	Hossein Maghsoumi et.al.	2505.17275	link
2025-05-22	Navigating Polytopes with Safety: A Control Barrier Function Approach	Tamas G. Molnar et.al.	2505.17270	link
2025-05-22	Backdoors in DRL: Four Environments Focusing on In-distribution Triggers	Chace Ashcraft et.al.	2505.17248	null
2025-05-22	Personalizing Student-Agent Interactions Using Log-Contextualized Retrieval Augmented Generation (RAG)	Clayton Cohn et.al.	2505.17238	null
2025-05-22	ExeSQL: Self-Taught Text-to-SQL Models with Execution-Driven Bootstrapping for SQL Dialects	Jipeng Zhang et.al.	2505.17231	null
2025-05-22	RetroChat: Designing for the Preservation of Past Digital Experiences	Suifang Zhou et.al.	2505.17208	null
2025-05-22	LengthLogD: A Length-Stratified Ensemble Framework for Enhanced Peptide Lipophilicity Prediction via Multi-Scale Feature Integration	Shuang Wu et.al.	2505.17198	null
2025-05-22	Can Large Language Models Design Biological Weapons? Evaluating Moremi Bio	Gertrude Hattoh et.al.	2505.17154	null
2025-05-22	LLM-Powered Agents for Navigating Venice's Historical Cadastre	Tristan Karch et.al.	2505.17148	null
2025-05-22	RAP: Runtime-Adaptive Pruning for LLM Inference	Huanrong Liu et.al.	2505.17138	null
2025-05-21	Swarm Intelligence Enhanced Reasoning: A Density-Driven Framework for LLM-Based Multi-Agent Optimization	Ying Zhu et.al.	2505.17115	null
2025-05-21	CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution	Minghao Shao et.al.	2505.17107	link
2025-05-21	P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark	Tao Sun et.al.	2505.17104	link
2025-05-20	Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization	Yihong Wu et.al.	2505.17086	null
2025-05-22	SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding	Haoning Wu et.al.	2505.17012	link
2025-05-22	X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs	Rui Ye et.al.	2505.16997	link
2025-05-22	MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems	Rui Ye et.al.	2505.16988	link
2025-05-22	T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning	Amartya Chakraborty et.al.	2505.16986	null
2025-05-22	Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine	Adib Bazgir et.al.	2505.16982	null
2025-05-22	Know the Ropes: A Heuristic Strategy for LLM-based Multi-Agent System Design	Zhenkun Li et.al.	2505.16979	null
2025-05-22	SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development	Yaxin Du et.al.	2505.16975	link
2025-05-22	Modeling Inequality in Complex Networks of Strategic Agents using Iterative Game-Theoretic Transactions	Mayank Kejriwal et.al.	2505.16966	null
2025-05-22	Cracking Aegis: An Adversarial LLM-based Game for Raising Awareness of Vulnerabilities in Privacy Protection	Jiaying Fu et.al.	2505.16954	null
2025-05-22	A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization	Shengyu Feng et.al.	2505.16952	null
2025-05-22	AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios	Yunjia Qi et.al.	2505.16944	link
2025-05-25	NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification	NovelSeek Team et.al.	2505.16938	link
2025-05-22	Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning	Bosung Kim et.al.	2505.16928	null
2025-05-22	Risk-Averse Reinforcement Learning with Itakura-Saito Loss	Igor Udovichenko et.al.	2505.16925	null
2025-05-22	RealEngine: Simulating Autonomous Driving in Realistic Context	Junzhe Jiang et.al.	2505.16902	link
2025-05-22	Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks	Hongyuan Tao et.al.	2505.16901	null
2025-05-22	Identifying, Evaluating, and Mitigating Risks of AI Thought Partnerships	Kerem Oktar et.al.	2505.16899	null
2025-05-22	Hydrogen peroxide electrogeneration from O2 electroreduction: a review focusing on carbon electrocatalysts and environmental applications	Aline B. Trench et.al.	2505.16887	null
2025-05-22	Strategically Linked Decisions in Long-Term Planning and Reinforcement Learning	Alihan Hüyük et.al.	2505.16833	null
2025-05-22	From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization	Haonian Ji et.al.	2505.16832	link
2025-05-22	GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent	Bin Xie et.al.	2505.16827	link
2025-05-22	LLM-Based Emulation of the Radio Resource Control Layer: Towards AI-Native RAN Protocols	Ziming liu et.al.	2505.16821	null
2025-05-22	A modular framework for automated evaluation of procedural content generation in serious games with deep reinforcement learning agents	Eleftherios Kalafatis et.al.	2505.16801	null
2025-05-22	Fuzzy Information Evolution with Three-Way Decision in Social Network Group Decision-Making	Qianlei Jia et.al.	2505.16781	null
2025-05-22	Sequential Monte Carlo for Policy Optimization in Continuous POMDPs	Hany Abdulsamad et.al.	2505.16732	null
2025-05-22	MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models	Xuanqi Gao et.al.	2505.16700	null
2025-05-22	CoNav: Collaborative Cross-Modal Reasoning for Embodied Navigation	Haihong Hao et.al.	2505.16663	link
2025-05-22	O $^2$ -Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering	Jianbiao Mei et.al.	2505.16582	link
2025-05-22	How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning	Max Weltevrede et.al.	2505.16581	null
2025-05-22	Large Language Model-Empowered Interactive Load Forecasting	Yu Zuo et.al.	2505.16577	null
2025-05-22	EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions	Spencer Hong et.al.	2505.16576	link
2025-05-22	Is Your LLM-Based Multi-Agent a Reliable Real-World Planner? Exploring Fraud Detection in Travel Planning	Junchi Yao et.al.	2505.16557	null
2025-05-22	Psychology-driven LLM Agents for Explainable Panic Prediction on Social Media during Sudden Disaster Events	Mengzhu Liu et.al.	2505.16455	null
2025-05-22	Beyond Static Testbeds: An Interaction-Centric Agent Simulation Platform for Dynamic Recommender Systems	Song Jin et.al.	2505.16429	null
2025-05-22	Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach	Xiaoran Yin et.al.	2505.16422	null
2025-05-22	WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning	Zhepei Wei et.al.	2505.16421	link
2025-05-22	VL-SAFE: Vision-Language Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving	Yansong Qu et.al.	2505.16377	null
2025-05-22	Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance	Taeyoon Kwon et.al.	2505.16348	null
2025-05-22	Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions	Marc Brooks et.al.	2505.16311	null
2025-05-22	No Black Boxes: Interpretable and Interactable Predictive Healthcare with Knowledge-Enhanced Agentic Causal Discovery	Xiaoxue Han et.al.	2505.16288	null
2025-05-22	ARPO:End-to-End Policy Optimization for GUI Agents with Experience Replay	Fanbin Lu et.al.	2505.16282	link
2025-05-22	HiMATE: A Hierarchical Multi-Agent Framework for Machine Translation Evaluation	Shijie Zhang et.al.	2505.16281	null
2025-05-22	Spatio-temporal agent-based modelling of malaria	Camelia R. Walker et.al.	2505.16240	link
2025-05-22	CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering	Yuren Mao et.al.	2505.16229	null
2025-05-22	Velocity Completion Task and Method for Event-based Player Positional Data in Soccer	Rikuhei Umemoto et.al.	2505.16199	null
2025-05-22	Fairness and Efficiency in Human-Agent Teams: An Iterative Algorithm Design Approach	Mai Lee Chang et.al.	2505.16171	null
2025-05-22	LLM-Powered AI Agent Systems and Their Applications in Industry	Guannan Liang et.al.	2505.16120	null
2025-05-22	BioDSA-1K: Benchmarking Data Science Agents for Biomedical Research	Zifeng Wang et.al.	2505.16100	null
2025-05-24	Reinforcement Learning for Stock Transactions	Ziyi Zhou et.al.	2505.16099	null
2025-05-22	Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development	Ming Shen et.al.	2505.16086	null
2025-05-21	A Distributed Local Energy Market Clearing Framework Using a Two-Loop ADMM Method	Milad Kabirifar et.al.	2505.16070	null
2025-05-21	How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior	Zidi Xiong et.al.	2505.16067	link
2025-05-21	Bayesian adaptive randomization in the I-SPY2.2 sequential multiple assignment randomized trial	Peter Norwood et.al.	2505.16047	null
2025-05-21	Towards improved pest management of the soybean aphid	Urvashi Verma et.al.	2505.16013	null
2025-05-21	Position: Agentic Systems Constitute a Key Component of Next-Generation Intelligent Image Processing	Jinjin Gu et.al.	2505.16007	null
2025-05-21	MAPS: A Multilingual Benchmark for Global Agent Performance and Security	Omer Hofman et.al.	2505.15935	null
2025-05-21	ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation	Tony Montes et.al.	2505.15928	link
2025-05-21	Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition	Dong Won Lee et.al.	2505.15922	null
2025-05-21	Text-to-Pipeline: Bridging Natural Language and Data Preparation Pipelines	Yuhang Ge et.al.	2505.15874	null
2025-05-23	InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation	Yunjia Xi et.al.	2505.15872	null
2025-05-21	AutoData: A Multi-Agent System for Open Web Data Collection	Tianyi Ma et.al.	2505.15859	link
2025-05-21	Large Language Model-Powered Agent for C to Rust Code Translation	HoHyun Sim et.al.	2505.15858	null
2025-05-21	Simulating Prosocial Behavior and Social Contagion in LLM Agents under Institutional Interventions	Yujia Zhou et.al.	2505.15857	link
2025-05-22	GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents	Yuqi Zhou et.al.	2505.15810	link
2025-05-21	The Agentic Economy	David M. Rothschild et.al.	2505.15799	null
2025-05-22	HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving	Zhiwen Chen et.al.	2505.15793	null
2025-05-21	Solving General-Utility Markov Decision Processes in the Single-Trial Regime with Online Planning	Pedro P. Santos et.al.	2505.15782	null
2025-05-21	Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses	Xiaoxue Yang et.al.	2505.15738	link
2025-05-21	DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning	Gaurav Srivastava et.al.	2505.15734	null
2025-05-21	Quantum Dots as Functional Nanosystems for Enhanced Biomedical Applications	Pronama Biswas et.al.	2505.15705	null
2025-05-21	HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning	Xiaodong Mei et.al.	2505.15703	null
2025-05-21	Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives	Milad Kazemi et.al.	2505.15693	null
2025-05-21	From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems	Xiuchao Sui et.al.	2505.15685	link
2025-05-21	Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model	Ke Hu et.al.	2505.15670	null
2025-05-21	Improved power methods for computing eigenvalues of dual quaternion Hermitian matrices	Yongjun Chen et.al.	2505.15584	null
2025-05-21	The equilibrium price of bubble assets	Charles Bertucci et.al.	2505.15578	null
2025-05-21	Temporal Spectrum Cartography in Low-Altitude Economy Networks: A Generative AI Framework with Multi-Agent Learning	Changyuan Zhao et.al.	2505.15571	null
2025-05-21	Riemannian EXTRA: Communication-efficient decentralized optimization over compact submanifolds with data heterogeneity	Jiayuan Wu et.al.	2505.15537	null
2025-05-21	Collaborative Problem-Solving in an Optimization Game	Isidora Jeknic et.al.	2505.15490	link
2025-05-21	Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL	Xintong Zhang et.al.	2505.15436	null
2025-05-21	X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic System	Peng Wang et.al.	2505.15372	link
2025-05-21	Multiple Weaks Win Single Strong: Large Language Models Ensemble Weak Reinforcement Learning Agents into a Supreme One	Yiwen Song et.al.	2505.15306	null
2025-05-22	AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving	Kangan Qian et.al.	2505.15298	null
2025-05-21	Agent-based Liquidity Risk Modelling for Financial Markets	Perukrishnen Vytelingum et.al.	2505.15296	null
2025-05-21	LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models	Qianyue Hao et.al.	2505.15293	null
2025-05-21	Web-Shepherd: Advancing PRMs for Reinforcing Web Agents	Hyungjoo Chae et.al.	2505.15277	link
2025-05-21	AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection	Jiatao Li et.al.	2505.15261	null
2025-05-24	ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search	Hyunseok Lee et.al.	2505.15259	null
2025-05-21	Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets	Idriss Malek et.al.	2505.15251	null
2025-05-21	BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems	Andy K. Zhang et.al.	2505.15216	null
2025-05-21	ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection	Jeonghye Kim et.al.	2505.15182	null
2025-05-21	R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint Optimization	Yuante Li et.al.	2505.15155	link
2025-05-21	lmgame-Bench: How Good are LLMs at Playing Games?	Lanxiang Hu et.al.	2505.15146	link
2025-05-21	Multicrossmodal Automated Agent for Integrating Diverse Materials Science Data	Adib Bazgir et.al.	2505.15132	null
2025-05-21	On Discounted Infinite-Time Mean Field Games	Zeyu Yang et.al.	2505.15131	null
2025-05-21	An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents	Bowen Jin et.al.	2505.15117	link
2025-05-21	A Risk Taxonomy for Evaluating AI-Powered Psychotherapy Agents	Ian Steenstra et.al.	2505.15108	null
2025-05-21	StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization	Ziliang Wang et.al.	2505.15107	null
2025-05-21	Nek Minit: Harnessing Pragmatic Metacognitive Prompting for Explainable Sarcasm Detection of Australian and Indian English	Ishmanbir Singh et.al.	2505.15095	null
2025-05-21	Agentic Feature Augmentation: Unifying Selection and Generation with Teaming, Planning, and Memories	Nanxu Gong et.al.	2505.15076	null
2025-05-21	ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges	Cheng Qian et.al.	2505.15068	link
2025-05-21	UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking	Sarfraz Ahmad et.al.	2505.15063	link
2025-05-21	AsynFusion: Towards Asynchronous Latent Consistency Models for Decoupled Whole-Body Audio-Driven Avatars	Tianbao Zhang et.al.	2505.15058	null
2025-05-21	PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration	Yingming Pu et.al.	2505.15047	link
2025-05-21	Toward Task Capable Active Matter: Learning to Avoid Clogging in Confined Collectives via Collisions	Kehinde O. Aina et.al.	2505.15033	null
2025-05-21	COSMIC: Enabling Full-Stack Co-Design and Optimization of Distributed Machine Learning Systems	Aditi Raju et.al.	2505.15020	null
2025-05-21	HAVA: Hybrid Approach to Value-Alignment through Reward Weighing for Reinforcement Learning	Kryspin Varys et.al.	2505.15011	link
2025-05-21	Meta-Design Matters: A Self-Design Multi-Agent System	Zixuan Ke et.al.	2505.14996	null
2025-05-20	JARVIS: A Multi-Agent Code Assistant for High-Quality EDA Script Generation	Ghasem Pasandi et.al.	2505.14978	null
2025-05-20	MedBrowseComp: Benchmarking Medical Deep Research and Computer Use	Shan Chen et.al.	2505.14963	null
2025-05-20	Characteristic scales and adaptation in higher-order contagions	Giulio Burgio et.al.	2505.14930	link
2025-05-20	Think, Reflect, Create: Metacognitive Learning for Zero-Shot Robotic Planning with LLMs	Wenjie Lin et.al.	2505.14899	null
2025-05-20	On the Day They Experience: Awakening Self-Sovereign Experiential AI Agents	Botao Amber Hu et.al.	2505.14893	null
2025-05-20	Strategic Planning and Rationalizing on Trees Make LLMs Better Debaters	Danqing Wang et.al.	2505.14886	null
2025-05-20	Unremarkable to Remarkable AI Agent: Exploring Boundaries of Agent Intervention for Adults With and Without Cognitive Impairment	Mai Lee Chang et.al.	2505.14872	null
2025-05-20	MAATS: A Multi-Agent Automated Translation System Based on MQM Evaluation	Xi Wang et.al.	2505.14848	link
2025-05-20	Beyond Symmetry in Repeated Games with Restarts	Henry Fleischmann et.al.	2505.14847	null
2025-05-20	Cooperative Bargaining Games Without Utilities: Mediated Solutions from Direction Oracles	Kushagra Gupta et.al.	2505.14817	link
2025-05-20	Integrating Field of View in Human-Aware Collaborative Planning	Ya-Chuan Hsu et.al.	2505.14805	null
2025-05-20	$\texttt{LLINBO}$ : Trustworthy LLM-in-the-Loop Bayesian Optimization	Chih-Yu Chang et.al.	2505.14756	link
2025-05-20	R&D-Agent: Automating Data-Driven AI Solution Building Through LLM-Powered Automated Research, Development, and Evolution	Xu Yang et.al.	2505.14738	link
2025-05-20	The Evolution of Alpha in Finance Harnessing Human Insight and LLM Agents	Mohammad Rubyet Islam et.al.	2505.14727	null
2025-05-20	NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search	Sunhao Dai et.al.	2505.14680	null
2025-05-20	ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions	Bufang Yang et.al.	2505.14668	null
2025-05-20	AI Agents in the Electricity Market Game with Cryptocurrency Transactions: A Post-Terminator Analysis	Microsoft Copilot et.al.	2505.14612	null
2025-05-20	Agent Context Protocols Enhance Collective Inference	Devansh Bhardwaj et.al.	2505.14569	null
2025-05-20	Multi-agent Reinforcement Learning vs. Fixed-Time Control for Traffic Signal Optimization: A Simulation Study	Saahil Mahato et.al.	2505.14544	link
2025-05-20	A Logic of General Attention Using Edge-Conditioned Event Models (Extended Version)	Gaia Belardinelli et.al.	2505.14539	null
2025-05-20	Energy-Efficient Deep Reinforcement Learning with Spiking Transformers	Mohammad Irfan Uddin et.al.	2505.14533	null
2025-05-22	BACON: A fully explainable AI model with graded logic for decision making problems	Haishi Bai et.al.	2505.14510	null
2025-05-20	Design and Evaluation of a Microservices Cloud Framework for Online Travel Platforms	Biman Barua et.al.	2505.14508	null
2025-05-20	Security of Distributed Gradient Descent Against Byzantine Agents	Sribalaji C. Anand et.al.	2505.14473	null
2025-05-20	Interpretable Reinforcement Learning for Load Balancing using Kolmogorov-Arnold Networks	Kamal Singh et.al.	2505.14459	null
2025-05-21	Robustness Evaluation of Graph-based News Detection Using Network Structural Information	Xianghua Zeng et.al.	2505.14453	null
2025-05-23	Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-Powered Mobile GUI Agents	Pengzhou Cheng et.al.	2505.14418	null
2025-05-20	Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation	Peter Baile Chen et.al.	2505.14398	null
2025-05-20	Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds	Gaël Gendron et.al.	2505.14396	link
2025-05-20	Information-optimal measurement: From fixed sampling protocols to adaptive spectroscopy	J. Schroeder et.al.	2505.14364	null
2025-05-20	DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning	Ziwei Zheng et.al.	2505.14362	link
2025-05-20	PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech Dialogs	Sho Inoue et.al.	2505.14356	link
2025-05-20	Empowering LLMs in Task-Oriented Dialogues: A Domain-Independent Multi-Agent Framework and Fine-Tuning Strategy	Zihao Feng et.al.	2505.14299	null
2025-05-20	EVA: Red-Teaming GUI Agents via Evolving Indirect Prompt Injection	Yijie Lu et.al.	2505.14289	null
2025-05-20	Visual Agentic Reinforcement Fine-Tuning	Ziyu Liu et.al.	2505.14246	link
2025-05-20	Safety Devolution in AI Agents	Cheng Yu et.al.	2505.14215	null
2025-05-20	Embedded Mean Field Reinforcement Learning for Perimeter-defense Game	Li Wang et.al.	2505.14209	null
2025-05-20	DSMentor: Enhancing Data Science Agents with Curriculum Learning and Online Knowledge Accumulation	He Wang et.al.	2505.14163	null
2025-05-20	MM-Agent: LLM as Agents for Real-world Mathematical Modeling Problem	Fan Liu et.al.	2505.14148	link
2025-05-20	s3: You Don't Need That Much Data to Train a Search Agent via RL	Pengcheng Jiang et.al.	2505.14146	link
2025-05-20	Building a Stable Planner: An Extended Finite State Machine Based Planning Module for Mobile GUI Agent	Fanglin Mo et.al.	2505.14141	null
2025-05-20	MAS-KCL: Knowledge component graph structure learning with large language model-based agentic workflow	Yuan-Hao Jiang et.al.	2505.14126	null
2025-05-20	A novel approach to process TRISO nuclear fuel using plasma-aided chemistry	Tobias Chemnitz et.al.	2505.14108	null
2025-05-20	Beyond Chains: Bridging Large Language Models and Knowledge Bases in Complex Question Answering	Yihua Zhu et.al.	2505.14099	null
2025-05-20	Personalized and Resilient Distributed Learning Through Opinion Dynamics	Luca Ballotta et.al.	2505.14081	null
2025-05-22	BAR: A Backward Reasoning based Agent for Complex Minecraft Tasks	Weihong Du et.al.	2505.14079	link
2025-05-22	Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning	Wenlin Zhang et.al.	2505.14069	link
2025-05-20	Exploring Temporal Graphs with Frequent and Regular Edges	Duncan Adamson et.al.	2505.14046	null
2025-05-20	Divide by Question, Conquer by Agent: SPLIT-RAG with Question-Driven Graph Partitioning	Ruiyi Yang et.al.	2505.13994	null
2025-05-20	CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring	Jiamin Su et.al.	2505.13965	null
2025-05-20	MultiDrive: A Co-Simulation Framework Bridging 2D and 3D Driving Simulation for AV Software Validation	Marc Kaufeld et.al.	2505.13959	link
2025-05-20	Memory-Centric Embodied Question Answer	Mingliang Zhai et.al.	2505.13948	null
2025-05-20	MLZero: A Multi-Agent System for End-to-end Machine Learning Automation	Haoyang Fang et.al.	2505.13941	link
2025-05-20	DrugPilot: LLM-based Parameterized Reasoning Agent for Drug Discovery	Kun Li et.al.	2505.13940	link
2025-05-21	CLEVER: A Curated Benchmark for Formally Verified Code Generation	Amitayush Thakur et.al.	2505.13938	link
2025-05-20	Efficient Agent Training for Computer Use	Yanheng He et.al.	2505.13909	link
2025-05-21	Mobile-Agent-V: A Video-Guided Approach for Effortless and Efficient Operational Knowledge Injection in Mobile Automation	Junyang Wang et.al.	2505.13887	null
2025-05-22	PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks	Guobin Shen et.al.	2505.13862	link
2025-05-20	A Challenge to Build Neuro-Symbolic Video Agents	Sahil Shah et.al.	2505.13851	link
2025-05-20	Toward Real-World Cooperative and Competitive Soccer with Quadrupedal Robot Teams	Zhi Su et.al.	2505.13834	null
2025-05-20	Online Resource Sharing: Better Robust Guarantees via Randomized Strategies	David X. Lin et.al.	2505.13824	link
2025-05-20	Structured Agent Distillation for Large Language Model	Jun Liu et.al.	2505.13820	null
2025-05-20	RAG/LLM Augmented Switching Driven Polymorphic Metaheuristic Framework	Faramarz Safi Esfahani et.al.	2505.13808	null
2025-05-19	Model Cards for AI Teammates: Comparing Human-AI Team Familiarization Methods for High-Stakes Environments	Ryan Bowers et.al.	2505.13773	link
2025-05-19	Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis	Ruiquan Huang et.al.	2505.13768	null
2025-05-21	Simulation Agent: A Framework for Integrating Simulation and Large Language Models for Enhanced Decision-Making	Jacob Kleiman et.al.	2505.13761	null
2025-05-19	Benchmarking MOEAs for solving continuous multi-objective RL problems	Carlos Hernández et.al.	2505.13726	link
2025-05-19	Revenue-Optimal Efficient Mechanism Design with General Type Spaces	Siddharth Prasad et.al.	2505.13687	null
2025-05-19	MAFA: A multi-agent framework for annotation	Mahmood Hegazy et.al.	2505.13668	null
2025-05-19	Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents	Karina Zainullina et.al.	2505.13652	null
2025-05-19	Non-Obvious Manipulability in Additively Separable and Fractional Hedonic Games	Diodato Ferraioli et.al.	2505.13642	null
2025-05-19	Incentivizing Truthful Language Models via Peer Elicitation Games	Baiting Chen et.al.	2505.13636	link
2025-05-19	Q ${}^2$ Forge: Minting Competency Questions and SPARQL Queries for Question-Answering Over Knowledge Graphs	Yousouf Taghzouti et.al.	2505.13572	null
2025-05-19	Learning Dynamics of RNNs in Closed-Loop Environments	Yoav Ger et.al.	2505.13567	link
2025-05-19	Counter-Inferential Behavior in Natural and Artificial Cognitive Systems	Serge Dolgikh et.al.	2505.13551	null
2025-05-19	Prompt Stability Matters: Evaluating and Optimizing Auto-Generated Prompt in General-Purpose Systems	Ke Chen et.al.	2505.13546	null
2025-05-19	Origin-Destination Pattern Effects on Large-Scale Mixed Traffic Control via Multi-Agent Reinforcement Learning	Muyang Fan et.al.	2505.13543	link
2025-05-18	LLM-Based User Simulation for Low-Knowledge Shilling Attacks on Recommender Systems	Shengkang Gu et.al.	2505.13528	null
2025-05-18	ACPs: Agent Collaboration Protocols for the Internet of Agents	Jun Liu et.al.	2505.13523	null
2025-05-17	HALO: Hierarchical Autonomous Logic-Oriented Orchestration for Multi-Agent LLM Systems	Zhipeng Hou et.al.	2505.13516	link
2025-05-16	Can AI Freelancers Compete? Benchmarking Earnings, Reliability, and Task Success at Scale	David Noever et.al.	2505.13511	null
2025-05-16	An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents	Ayesha Amjad et.al.	2505.13504	null
2025-05-19	G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning	Liang Chen et.al.	2505.13426	link
2025-05-20	A Dataless Reinforcement Learning Approach to Rounding Hyperplane Optimization for Max-Cut	Gabriel Malikal et.al.	2505.13405	null
2025-05-19	Robin: A multi-agent system for automating scientific discovery	Ali Essam Ghareeb et.al.	2505.13400	null
2025-05-19	Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges	Hongru Wang et.al.	2505.13328	null
2025-05-19	Synthesis of Communication Policies for Multi-Agent Systems Robust to Communication Restrictions	Saleh Soudijani et.al.	2505.13311	null
2025-05-19	TimeSeriesGym: A Scalable Benchmark for (Time Series) Machine Learning Engineering Agents	Yifu Cai et.al.	2505.13291	link
2025-05-19	Hybrid Voting-Based Task Assignment in Modular Construction Scenarios	Daniel Weiner et.al.	2505.13278	null
2025-05-19	From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery	Tianshi Zheng et.al.	2505.13259	link
2025-05-19	Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability	Jingyi Ren et.al.	2505.13258	link
2025-05-19	Composing Dextrous Grasping and In-hand Manipulation via Scoring with a Reinforcement Learning Critic	Lennart Röstel et.al.	2505.13253	null
2025-05-19	Agentic Publications: An LLM-Driven Framework for Interactive Scientific Publishing, Supplementing Traditional Papers with AI-Powered Knowledge Systems	Roberto Pugliese et.al.	2505.13246	null
2025-05-19	Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis	Tianbao Xie et.al.	2505.13227	null
2025-05-19	Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities	Lili Zhang et.al.	2505.13195	null
2025-05-19	When a Reinforcement Learning Agent Encounters Unknown Unknowns	Juntian Zhu et.al.	2505.13188	null
2025-05-19	Information Science Principles of Machine Learning: A Causal Chain Meta-Framework Based on Formalized Information Mapping	Jianfeng Xu et.al.	2505.13182	null
2025-05-19	Fixing 7,400 Bugs for 1$: Cheap Crash-Site Program Repair	Han Zheng et.al.	2505.13103	null
2025-05-19	The Hidden Dangers of Browsing AI Agents	Mykyta Mudryi et.al.	2505.13076	null
2025-05-19	CAIM: Development and Evaluation of a Cognitive AI Memory Framework for Long-Term Interaction with Intelligent Agents	Rebecca Westhäußer et.al.	2505.13044	null
2025-05-19	Adversarial Reasoning for Repair Based on Inferred Program Intent	He Ye et.al.	2505.13008	null
2025-05-20	From Assistants to Adversaries: Exploring the Security Risks of Mobile LLM Agents	Liangxuan Wu et.al.	2505.12981	null
2025-05-19	Improved Approximation Ratio for Strategyproof Facility Location on a Cycle	Krzysztof Rogowski et.al.	2505.12943	null
2025-05-20	Leveraging LLM Inconsistency to Boost Pass@k Performance	Uri Dalal et.al.	2505.12938	null
2025-05-19	The Traitors: Deception and Trust in Multi-Agent Language Model Simulations	Pedro M. P. Curvo et.al.	2505.12923	link
2025-05-19	PyFCG: Fluid Construction Grammar in Python	Paul Van Eecke et.al.	2505.12920	null
2025-05-19	Power Allocation for Delay Optimization in Device-to-Device Networks: A Graph Reinforcement Learning Approach	Hao Fang et.al.	2505.12902	null
2025-05-19	From Grunts to Grammar: Emergent Language from Cooperative Foraging	Maytus Piriyajitakonkij et.al.	2505.12872	null
2025-05-19	GEM: Gaussian Embedding Modeling for Out-of-Distribution Detection in GUI Agents	Zheng Wu et.al.	2505.12842	link
2025-05-19	Reasoning BO: Enhancing Bayesian Optimization with Long-Context Reasoning Power of LLMs	Zhuo Yang et.al.	2505.12833	null
2025-05-19	Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning	Wei-Chen Liao et.al.	2505.12811	null
2025-05-19	Mixture Policy based Multi-Hop Reasoning over N-tuple Temporal Knowledge Graphs	Zhongni Hou et.al.	2505.12788	null
2025-05-19	Forewarned is Forearmed: A Survey on Large Language Model-based Agents in Autonomous Cyberattacks	Minrui Xu et.al.	2505.12786	null
2025-05-19	Your Offline Policy is Not Trustworthy: Bilevel Reinforcement Learning for Sequential Portfolio Optimization	Haochen Yuan et.al.	2505.12759	null
2025-05-19	Confidence-Regulated Generative Diffusion Models for Reliable AI Agent Migration in Vehicular Metaverses	Yingkai Kang et.al.	2505.12710	null
2025-05-19	PLAICraft: Large-Scale Time-Aligned Vision-Speech-Action Dataset for Embodied AI	Yingchen He et.al.	2505.12707	null
2025-05-19	AutoMat: Enabling Automated Crystal Structure Reconstruction from Microscopy via Agentic Tool Use	Yaotian Yang et.al.	2505.12650	link
2025-05-19	Two out of Three (ToT): using self-consistency to make robust predictions	Jung Hoon Lee et.al.	2505.12642	null
2025-05-19	Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents	Yunseok Jang et.al.	2505.12632	null
2025-05-19	Dual-Agent Reinforcement Learning for Automated Feature Generation	Wanfu Gao et.al.	2505.12628	link
2025-05-19	Lightweight and Effective Preference Construction in PIBT for Large-Scale Multi-Agent Pathfinding	Keisuke Okumura et.al.	2505.12623	null
2025-05-19	HIL: Hybrid Imitation Learning of Diverse Parkour Skills from Videos	Jiashun Wang et.al.	2505.12619	null
2025-05-19	Action-Dependent Optimality-Preserving Reward Shaping	Grant C. Forbes et.al.	2505.12611	null
2025-05-19	The Hamiltonian of Poly-matrix Zero-sum Games	Toshihiro Ota et.al.	2505.12609	link
2025-05-19	Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis	Yifan Hu et.al.	2505.12597	link
2025-05-19	AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection	Tiankai Yang et.al.	2505.12594	link
2025-05-18	A Survey of Attacks on Large Language Models	Wenrui Xu et.al.	2505.12567	null
2025-05-18	ESC-Judge: A Framework for Comparing Emotional Support Conversational Agents	Navid Madani et.al.	2505.12531	null
2025-05-18	InnateCoder: Learning Programmatic Options with Foundation Models	Rubens O. Moraes et.al.	2505.12508	link
2025-05-18	Optimal Task and Motion Planning for Autonomous Systems Using Petri Nets	Zhou He et.al.	2505.12503	null
2025-05-18	ALAS: A Stateful Multi-LLM Agent Framework for Disruption-Aware Planning	Edward Y. Chang et.al.	2505.12501	null
2025-05-18	UIShift: Enhancing VLM-based GUI Agents through Self-supervised Reinforcement Learning	Longxi Gao et.al.	2505.12493	null
2025-05-18	Proposal for Improving Google A2A Protocol: Safeguarding Sensitive Data in Multi-Agent Systems	Yedidel Louck et.al.	2505.12490	null
2025-05-18	Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems	Haochun Wang et.al.	2505.12467	null
2025-05-18	Resolving Latency and Inventory Risk in Market Making with Reinforcement Learning	Junzhe Jiang et.al.	2505.12465	null
2025-05-18	BadNAVer: Exploring Jailbreak Attacks On Vision-and-Language Navigation	Wenqi Lyu et.al.	2505.12443	null
2025-05-20	IP Leakage Attacks Targeting LLM-Based Multi-Agent Systems	Liwen Wang et.al.	2505.12442	null
2025-05-18	Learning to Play Like Humans: A Framework for LLM Adaptation in Interactive Fiction Games	Jinming Zhang et.al.	2505.12439	null
2025-05-20	Steady-State Strategy Synthesis for Swarms of Autonomous Agents	Martin Jonáš et.al.	2505.12406	null
2025-05-18	Automated Profile Inference with Language Model Agents	Yuntao Du et.al.	2505.12402	link
2025-05-18	MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks	Yinghao Zhu et.al.	2505.12371	link
2025-05-18	Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning	Xinbin Yuan et.al.	2505.12370	link
2025-05-18	A universal policy wrapper with guarantees	Anton Bolychev et.al.	2505.12354	null
2025-05-18	Enhancing User-Oriented Proactivity in Open-Domain Dialogues with Critic Guidance	Yufeng Wang et.al.	2505.12334	null
2025-05-18	Robust Planning for Autonomous Driving via Mixed Adversarial Diffusion Predictions	Albert Zhao et.al.	2505.12327	null
2025-05-18	BeliefNest: A Joint Action Simulator for Embodied Agents with Theory of Mind	Rikunari Sagara et.al.	2505.12321	link
2025-05-18	Scene-Adaptive Motion Planning with Explicit Mixture of Experts and Interaction-Oriented Optimization	Hongbiao Zhu et.al.	2505.12311	null
2025-05-18	Enhance Mobile Agents Thinking Process Via Iterative Preference Learning	Kun Huang et.al.	2505.12299	null
2025-05-18	LAMeTA: Intent-Aware Agentic Network Optimization via a Large AI Model-Empowered Two-Stage Approach	Yinqiu Liu et.al.	2505.12247	null
2025-05-18	Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents	Shuo Han et.al.	2505.12204	null
2025-05-20	LLM-DSE: Searching Accelerator Parameters with LLM Agents	Hanyu Wang et.al.	2505.12188	link
2025-05-17	LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs	Omar Choukrani et.al.	2505.12135	link
2025-05-17	Towards Sustainability in 6G Network Slicing with Energy-Saving and Optimization Methods	Rodrigo Moreira et.al.	2505.12132	null
2025-05-17	Scalable Time-Tagged Data Acquisition for Entanglement Distribution in Quantum Networks	Abderrahim Amlou et.al.	2505.12102	null
2025-05-17	Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents	Tiannuo Yang et.al.	2505.12065	link
2025-05-17	AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research	Renqi Chen et.al.	2505.12039	null
2025-05-17	Incentivize Contribution and Learn Parameters Too: Federated Learning with Strategic Data Owners	Drashthi Doshi et.al.	2505.12010	null
2025-05-17	SOCIA: An End-to-End Agentic Framework for Automated Cyber-Physical-Social Simulator Generation	Yuncheng Hua et.al.	2505.12006	null
2025-05-17	Interactional Fairness in LLM Multi-Agent Systems: An Evaluation Framework	Ruta Binkyte et.al.	2505.12001	null
2025-05-17	Task Scheduling in Space-Air-Ground Uniformly Integrated Networks with Ripple Effects	Chuan Huang et.al.	2505.11974	null
2025-05-17	MARVEL: Multi-Agent RTL Vulnerability Extraction using Large Language Models	Luca Collini et.al.	2505.11963	null
2025-05-17	CrafText Benchmark: Advancing Instruction Following in Complex Multimodal Open-Ended World	Zoya Volovikova et.al.	2505.11962	null
2025-05-17	LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners	Junhao Zheng et.al.	2505.11942	link
2025-05-17	Modèles de Substitution pour les Modèles à base d'Agents : Enjeux, Méthodes et Applications	Paul Saves et.al.	2505.11912	link
2025-05-17	Benchmarking LLMs in an Embodied Environment for Blue Team Threat Hunting	Xiaoqun Liu et.al.	2505.11901	null
2025-05-17	Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents	Weikai Xu et.al.	2505.11891	null
2025-05-17	AR Secretary Agent: Real-time Memory Augmentation via LLM-powered Augmented Reality Glasses	Raphaël A. El Haddad et.al.	2505.11888	null
2025-05-20	Aux-Think: Exploring Reasoning Strategies for Data-Efficient Vision-Language Navigation	Shuo Wang et.al.	2505.11886	null
2025-05-17	Position Paper: Bounded Alignment: What (Not) To Expect From AGI Agents	Ali A. Minai et.al.	2505.11866	null
2025-05-17	Learning Pareto-Optimal Rewards from Noisy Preferences: A Framework for Multi-Objective Inverse Reinforcement Learning	Kalyan Cherukuri et.al.	2505.11864	null
2025-05-17	RVTBench: A Benchmark for Visual Reasoning Tasks	Yiqing Shen et.al.	2505.11838	link
2025-05-17	Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment	Siliang Zeng et.al.	2505.11821	null
2025-05-17	BELLE: A Bi-Level Multi-Agent Reasoning Framework for Multi-Hop Question Answering	Taolin Zhang et.al.	2505.11811	null
2025-05-17	Retrospex: Language Agent Meets Offline Reinforcement Learning Critic	Yufei Xiang et.al.	2505.11807	link
2025-05-17	Robustness of Incentive Mechanisms Against System Misspecification in Congestion Games	Chih-Yuan Chiu et.al.	2505.11791	null
2025-05-17	OMAC: A Broad Optimization Framework for LLM-Based Multi-Agent Collaboration	Shijun Li et.al.	2505.11765	link
2025-05-16	REMOR: Automated Peer Review Generation with LLM Reasoning and Multi-Objective Reinforcement Learning	Pawin Taechoyotin et.al.	2505.11718	null
2025-05-16	EnvInjection: Environmental Prompt Injection Attack to Multi-modal Web Agents	Xilong Wang et.al.	2505.11717	null
2025-05-16	Unveiling the Black Box: A Multi-Layer Framework for Explaining Reinforcement Learning-Based Cyber Agents	Diksha Goel et.al.	2505.11708	null
2025-05-16	Forensics of Error Rates of Quantum Hardware	Rupshali Roy et.al.	2505.11706	null
2025-05-16	Ambiguity Resolution in Text-to-Structured Data Mapping	Zhibo Hu et.al.	2505.11679	null
2025-05-16	Terminators: Terms of Service Parsing and Auditing Agents	Maruf Ahmed Mridul et.al.	2505.11672	null
2025-05-16	Learning from Less: Guiding Deep Reinforcement Learning with Differentiable Symbolic Planning	Zihan Ye et.al.	2505.11661	null
2025-05-16	PeerGuard: Defending Multi-Agent Systems Against Backdoor Attacks Through Mutual Reasoning	Falong Fan et.al.	2505.11642	link
2025-05-20	Talk to Your Slides: Language-Driven Agents for Efficient Slide Editing	Kyudan Jung et.al.	2505.11604	link
2025-05-16	Continuous Optimization for Feature Selection with Permutation-Invariant Embedding and Policy-Guided Search	Rui Liu et.al.	2505.11601	null
2025-05-16	LLM Agents Are Hypersensitive to Nudges	Manuel Cherep et.al.	2505.11584	null
2025-05-16	Toward Adaptive Categories: Dimensional Governance for Agentic AI	Zeynep Engin et.al.	2505.11579	null
2025-05-15	Assessing Collective Reasoning in Multi-Agent LLMs via Hidden Profile Tasks	Yuxuan Li et.al.	2505.11556	null
2025-05-14	TARGET: Benchmarking Table Retrieval for Generative Tasks	Xingyu Ji et.al.	2505.11545	null
2025-05-16	Automatic Reward Shaping from Confounded Offline Data	Mingxuan Li et.al.	2505.11478	null
2025-05-16	Signal attenuation enables scalable decentralized multi-agent reinforcement learning over networks	Wesley A Suttle et.al.	2505.11461	null
2025-05-16	Robust Equilibria in Shared Resource Allocation via Strengthening Border's Theorem	David X. Lin et.al.	2505.11431	null
2025-05-16	Can AI automatically analyze public opinion? A LLM agents-based agentic pipeline for timely public opinion analysis	Jing Liu et.al.	2505.11401	null
2025-05-16	Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation	Zihan Wang et.al.	2505.11383	link
2025-05-16	GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents	Lingxiao Diao et.al.	2505.11368	null
2025-05-16	Long-Term Average Impulse Control with Mean Field Interactions	K. L. Helmes et.al.	2505.11345	null
2025-05-16	Explaining Strategic Decisions in Multi-Agent Reinforcement Learning for Aerial Combat Tactics	Ardian Selmonaj et.al.	2505.11311	null
2025-05-16	Diffusion Learning with Partial Agent Participation and Local Updates	Elsa Rizk et.al.	2505.11307	null
2025-05-16	Meta-World+: An Improved, Standardized, RL Benchmark	Reginald McLean et.al.	2505.11289	link
2025-05-16	TAIJI: MCP-based Multi-Modal Data Analytics on Data Lakes	Chao Zhang et.al.	2505.11270	null
2025-05-19	Massive-STEPS: Massive Semantic Trajectories for Understanding POI Check-ins -- Dataset and Benchmarks	Wilson Wongso et.al.	2505.11239	link
2025-05-16	Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation	Donghoon Lee et.al.	2505.11221	link
2025-05-16	From Intent Discovery to Recognition with Topic Modeling and Synthetic Data	Aaron Rodrigues et.al.	2505.11176	null
2025-05-19	Real-Time Verification of Embodied Reasoning for Generative Skill Acquisition	Bo Yue et.al.	2505.11175	null
2025-05-16	MPMA: Preference Manipulation Attack Against Model Context Protocol	Zihan Wang et.al.	2505.11154	null
2025-05-16	Bi-directional Recurrence Improves Transformer in Partially Observable Markov Decision Processes	Ashok Arora et.al.	2505.11153	null
2025-05-16	Reinforcement Learning for AMR Charging Decisions: The Impact of Reward and Action Space Design	Janik Bischoff et.al.	2505.11136	null
2025-05-16	Scalability of Reinforcement Learning Methods for Dispatching in Semiconductor Frontend Fabs: A Comparison of Open-Source Models with Real Industry Datasets	Patrick Stöckermann et.al.	2505.11135	null
2025-05-16	Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity	Chan-Jan Hsu et.al.	2505.11107	null
2025-05-16	Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors	Lang Feng et.al.	2505.11100	null
2025-05-16	LLM-Enhanced Symbolic Control for Safety-Critical Applications	Amir

Name		Name	Last commit message	Last commit date
Latest commit History 879 Commits
.github		.github
assets		assets
docs		docs
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
daily_arxiv.py		daily_arxiv.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Updated on 2026.04.13

Agents

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

Updated on 2026.04.13

Agents

About

Topics

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors 1

Languages

Packages