Skip to content

Lyz103/LLM-Agent-Paper-daily

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

879 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]

Updated on 2026.04.13

Usage instructions: here

Table of Contents
  1. Agents

Agents

Publish Date Title Authors PDF Code
2025-07-23 DataWink: Reusing and Adapting SVG-based Visualization Examples with Large Multimodal Models Liwenhan Xie et.al. 2507.17734 null
2025-07-23 BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems Malsha Ashani Mahawatta Dona et.al. 2507.17722 null
2025-07-23 Symbiotic Agents: A Novel Paradigm for Trustworthy AGI-driven Networks Ilias Chatzistefanidis et.al. 2507.17695 null
2025-07-23 Simulating multiple human perspectives in socio-ecological systems using large language models Yongchao Zeng et.al. 2507.17680 null
2025-07-23 LTLZinc: a Benchmarking Framework for Continual Learning and Neuro-Symbolic Temporal Reasoning Luca Salvatore Lorello et.al. 2507.17482 null
2025-07-23 ERMV: Editing 4D Robotic Multi-view images to enhance embodied agents Chang Nie et.al. 2507.17462 null
2025-07-23 IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird's-Eye View Perception Haichuan Li et.al. 2507.17445 null
2025-07-23 Fair Compromises in Participatory Budgeting: a Multi-Agent Deep Reinforcement Learning Approach Hugh Adams et.al. 2507.17433 null
2025-07-23 CAPRI-CT: Causal Analysis and Predictive Reasoning for Image Quality Optimization in Computed Tomography Sneha George Gnanakalavathy et.al. 2507.17420 null
2025-07-23 Residual Prophet Inequalities Jose Correa et.al. 2507.17391 null
2025-07-23 DynaSearcher: Dynamic Knowledge Graph Augmented Search Agent via Multi-Reward Reinforcement Learning Chuzhan Hao et.al. 2507.17365 null
2025-07-23 DeMo++: Motion Decoupling for Autonomous Driving Bozhou Zhang et.al. 2507.17342 null
2025-07-23 HuNavSim 2.0 Miguel Escudero-Jiménez et.al. 2507.17317 null
2025-07-23 EarthLink: Interpreting Climate Signals with Self-Evolving AI Agents Zijie Guo et.al. 2507.17311 null
2025-07-23 Compliance Brain Assistant: Conversational Agentic AI for Assisting Compliance Tasks in Enterprise Environments Shitong Zhu et.al. 2507.17289 null
2025-07-23 Leveraging Knowledge Graphs and LLM Reasoning to Identify Operational Bottlenecks for Warehouse Planning Assistance Rishi Parekh et.al. 2507.17273 null
2025-07-23 Agent Identity Evals: Measuring Agentic Identity Elija Perrier et.al. 2507.17257 null
2025-07-23 LLM Meets the Sky: Heuristic Multi-Agent Reinforcement Learning for Secure Heterogeneous UAV Networks Lijie Zheng et.al. 2507.17188 null
2025-07-23 Optimal Calibrated Signaling in Digital Auctions Zhicheng Du et.al. 2507.17187 null
2025-07-23 FinGAIA: An End-to-End Benchmark for Evaluating AI Agents in Finance Lingfeng Zeng et.al. 2507.17186 null
2025-07-23 Regret Minimization in Population Network Games: Vanishing Heterogeneity and Convergence to Equilibria Die Hu et.al. 2507.17183 null
2025-07-23 JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction Fangze Lin et.al. 2507.17152 null
2025-07-23 CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards Cheng Liu et.al. 2507.17147 null
2025-07-23 Resilient Multi-Agent Negotiation for Medical Supply Chains:Integrating LLMs and Blockchain for Transparent Coordination Mariam ALMutairi et.al. 2507.17134 null
2025-07-23 Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance Yufei He et.al. 2507.17131 null
2025-07-23 Stochastically Structured Reservoir Computers for Financial and Economic System Identification Lendy Banegas et.al. 2507.17115 null
2025-07-22 Deformable Cluster Manipulation via Whole-Arm Policy Learning Jayadeep Jacob et.al. 2507.17085 null
2025-07-22 VL-CLIP: Enhancing Multimodal Recommendations via Visual Grounding and LLM-Augmented CLIP Embeddings Ramin Giahi et.al. 2507.17080 null
2025-07-22 Approximation Techniques for the Reconstruction of the Probability Measure and the Coupling Parameters in a Curie-Weiss Model for Large Populations Miguel Ballesteros et.al. 2507.17073 null
2025-07-22 Risk In Context: Benchmarking Privacy Leakage of Foundation Models in Synthetic Tabular Data Generation Jessup Byun et.al. 2507.17066 null
2025-07-22 Parallelism Meets Adaptiveness: Scalable Documents Understanding in Multi-Agent LLM Systems Chengxuan Xia et.al. 2507.17061 null
2025-07-22 Shared Control of Holonomic Wheelchairs through Reinforcement Learning Jannis Bähler et.al. 2507.17055 null
2025-07-22 New Mechanisms in Flex Distribution for Bounded Suboptimal Multi-Agent Path Finding Shao-Hung Chan et.al. 2507.17054 null
2025-07-22 Evaluating Uncertainty and Quality of Visual Language Action-enabled Robots Pablo Valle et.al. 2507.17049 null
2025-07-22 Modeling for the Growth of Unorganized Retailing in the Presence of Organized and E-Retailing in Indian Pharmaceutical Industry Koushik Mondal et.al. 2507.17023 null
2025-07-22 Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge? Arduin Findeis et.al. 2507.17015 null
2025-07-22 Quantitative convergence for displacement monotone Mean Field Games of control Joe Jackson et.al. 2507.17014 null
2025-07-22 Towards Autonomous Sustainability Assessment via Multimodal AI Agents Zhihan Zhang et.al. 2507.17012 null
2025-07-22 On-chip stencil lithography for superconducting qubits Roudy Hanna et.al. 2507.17005 null
2025-07-22 Hierarchical Reinforcement Learning Framework for Adaptive Walking Control Using General Value Functions of Lower-Limb Sensor Signals Sonny T. Jones et.al. 2507.16983 null
2025-07-22 Text-to-SPARQL Goes Beyond English: Multilingual Question Answering Over Knowledge Graphs through Human-Inspired Reasoning Aleksandr Perevalov et.al. 2507.16971 null
2025-07-22 Fundamental limits of distributed covariance matrix estimation via a conditional strong data processing inequality Mohammad Reza Rahmani et.al. 2507.16953 null
2025-07-22 Multi-agent Reinforcement Learning for Robotized Coral Reef Sample Collection Daniel Correa et.al. 2507.16941 null
2025-07-22 AURA: A Multi-Modal Medical Agent for Understanding, Reasoning & Annotation Nima Fathi et.al. 2507.16940 null
2025-07-22 Budget Allocation Policies for Real-Time Multi-Agent Path Finding Raz Beck et.al. 2507.16874 null
2025-07-21 Reinforcement Learning in hyperbolic space for multi-step reasoning Tao Xu et.al. 2507.16864 null
2025-07-21 MobileUse: A GUI Agent with Hierarchical Reflection for Autonomous Mobile Operation Ning Li et.al. 2507.16853 null
2025-07-21 Dynamic Simulation Framework for Disinformation Dissemination and Correction With Social Bots Boyu Qiao et.al. 2507.16848 null
2025-07-22 ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning Chi-Pin Huang et.al. 2507.16815 null
2025-07-22 LingBench++: A Linguistically-Informed Benchmark and Reasoning Framework for Multi-Step and Cross-Cultural Inference with LLMs Da-Chen Lian et.al. 2507.16809 null
2025-07-23 Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning Yanjun Zheng et.al. 2507.16802 null
2025-07-23 Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent Xiaoyu Zhan et.al. 2507.16799 null
2025-07-22 Uncertainty-Aware Knowledge Transformers for Peer-to-Peer Energy Trading with Multi-Agent Reinforcement Learning Mian Ibad Ali Shah et.al. 2507.16796 null
2025-07-22 Generalized non-reciprocal phase transitions in multipopulation systems Cheyne Weis et.al. 2507.16763 null
2025-07-22 AI-enhanced conversational agents for personalized asthma support Factors for engagement, value and efficacy Laura Moradbakhti et.al. 2507.16735 null
2025-07-23 Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with constraints Zhenyun Yin et.al. 2507.16727 null
2025-07-22 RAVine: Reality-Aligned Evaluation for Agentic Search Yilong Xu et.al. 2507.16725 null
2025-07-22 Screen2AX: Vision-Based Approach for Automatic macOS Accessibility Generation Viktor Muryn et.al. 2507.16704 null
2025-07-22 FOGNITE: Federated Learning-Enhanced Fog-Cloud Architecture Somayeh Sobati-M et.al. 2507.16668 null
2025-07-22 Hybrid Reward-Driven Reinforcement Learning for Efficient Quantum Circuit Synthesis Sara Giordano et.al. 2507.16641 null
2025-07-22 Novel Multi-Agent Action Masked Deep Reinforcement Learning for General Industrial Assembly Lines Balancing Problems Ali Mohamed Ali et.al. 2507.16635 null
2025-07-22 Augmenting Von Neumann's Architecture for an Intelligent Future Rajpreet Singh et.al. 2507.16628 null
2025-07-22 CTSL: Codebook-based Temporal-Spatial Learning for Accurate Non-Contrast Cardiac Risk Prediction Using Cine MRIs Haoyang Su et.al. 2507.16612 null
2025-07-22 Smooth Games of Configuration in the Linear-Quadratic Setting Jesse Milzman et.al. 2507.16611 null
2025-07-22 Pyramid Hierarchical Masked Diffusion Model for Imaging Synthesis Xiaojiao Xiao et.al. 2507.16579 null
2025-07-22 Evaluating Social Acceptance of eXtended Reality (XR) Agent Technology: A User Study (Extended Version) Megha Quamara et.al. 2507.16562 null
2025-07-22 A Distributed Actor-Critic Algorithm for Fixed-Time Consensus in Nonlinear Multi-Agent Systems Aria Delshad et.al. 2507.16520 null
2025-07-22 Analogy making as amortised model construction David G. Nagy et.al. 2507.16511 null
2025-07-22 Agentic RAG with Knowledge Graphs for Complex Multi-Hop Reasoning in Real-World Applications Jean Lelong et.al. 2507.16507 null
2025-07-22 Arbitrage Tactics in the Local Markets via Hierarchical Multi-agent Reinforcement Learning Haoyang Zhang et.al. 2507.16479 null
2025-07-22 Adaptive Bayesian Single-Shot Quantum Sensing Ivana Nikoloska et.al. 2507.16477 null
2025-07-22 Towards Enforcing Company Policy Adherence in Agentic Workflows Naama Zwerdling et.al. 2507.16459 null
2025-07-22 Distributed Oscillatory Guidance for Formation Flight of Fixed-Wing Drones Yang Xu et.al. 2507.16458 null
2025-07-23 RIS-aided Latent Space Alignment for Semantic Channel Equalization Tomás Hüttebräucker et.al. 2507.16450 null
2025-07-22 From model-based learning to model-free behaviour with Meta-Interpretive Learning Stassa Patsantzis et.al. 2507.16434 null
2025-07-22 LLM-Driven Collaborative Model for Untangling Commits via Explicit and Implicit Dependency Reasoning Bo Hou et.al. 2507.16395 null
2025-07-22 Application of LLM Guided Reinforcement Learning in Formation Control with Collision Avoidance Chenhao Yao et.al. 2507.16382 null
2025-07-22 COMPASS: Cooperative Multi-Agent Persistent Monitoring using Spatio-Temporal Attention Network Xingjian Zhang et.al. 2507.16306 null
2025-07-22 ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry Tianze Xu et.al. 2507.16280 null
2025-07-22 Multi-Agent Reinforcement Learning for Sample-Efficient Deep Neural Network Mapping Srivatsan Krishnan et.al. 2507.16249 null
2025-07-22 FinResearchBench: A Logic Tree based Agent-as-a-Judge Evaluation Framework for Financial Research Agents Run Sun et.al. 2507.16248 null
2025-07-22 Voice-based AI Agents: Filling the Economic Gaps in Digital Health Delivery Bo Wen et.al. 2507.16229 null
2025-07-22 Unbeatable imitation of a friend Masahiko Ueda et.al. 2507.16221 null
2025-07-22 Best-of-Both-Worlds Guarantees with Fairer Endings Telikepalli Kavitha et.al. 2507.16209 null
2025-07-22 CHIMERA: Compressed Hybrid Intelligence for Twin-Model Enhanced Multi-Agent Deep Reinforcement Learning for Multi-Functional RIS-Assisted Space-Air-Ground Integrated Networks Li-Hsiang Shen et.al. 2507.16204 null
2025-07-22 SVAgent: AI Agent for Hardware Security Verification Assertion Rui Guo et.al. 2507.16203 null
2025-07-22 RealBench: Benchmarking Verilog Generation Models with Real-World IP Designs Pengwei Jin et.al. 2507.16200 null
2025-07-22 Do Large Language Models Have a Planning Theory of Mind? Evidence from MindGames: a Multi-Step Persuasion Task Jared Moore et.al. 2507.16196 null
2025-07-22 Emergent Cognitive Convergence via Implementation: A Structured Loop Reflecting Four Theories of Mind (A Position Paper) Myung Ho Kim et.al. 2507.16184 null
2025-07-22 Benchmarking LLM Privacy Recognition for Social Robot Decision Making Dakota Sullivan et.al. 2507.16124 null
2025-07-21 Expert-Guided LLM Reasoning for Battery Discovery: From AI-Driven Hypothesis to Synthesis and Characterization Shengchao Liu et.al. 2507.16110 null
2025-07-21 Deep Researcher with Test-Time Diffusion Rujun Han et.al. 2507.16075 null
2025-07-21 Asymptotic consensus with transmission and reaction delay: an overview Jan Haskovec et.al. 2507.16072 null
2025-07-21 Is memory all you need? Data-driven Mori-Zwanzig modeling of Lagrangian particle dynamics in turbulent flows Xander de Wit et.al. 2507.16058 null
2025-07-23 Making REST APIs Agent-Ready: From OpenAPI to Model Context Protocol Servers for Tool-Augmented LLMs Meriem Mastouri et.al. 2507.16044 null
2025-07-21 A Pilot Study on LLM-Based Agentic Translation from Android to iOS: Pitfalls and Insights Zhili Zeng et.al. 2507.16037 null
2025-07-21 Minor Embedding for Quantum Annealing with Reinforcement Learning Riccardo Nembrini et.al. 2507.16004 null
2025-07-21 Automated Design of Structured Variational Quantum Circuits with Reinforcement Learning Gloria Turati et.al. 2507.16001 null
2025-07-21 Red Supergiant Mass Loss and Mass-Loss Rates Jacco Th. van Loon et.al. 2507.15971 null
2025-07-23 HyDRA: A Hybrid-Driven Reasoning Architecture for Verifiable Knowledge Graphs Adrian Kaiser et.al. 2507.15917 null
2025-07-21 Towards Mitigation of Hallucination for LLM-empowered Agents: Progressive Generalization Bound Exploration and Watchdog Monitor Siyuan Liu et.al. 2507.15903 null
2025-07-21 Advancing Responsible Innovation in Agentic AI: A study of Ethical Frameworks for Household Automation Joydeep Chandra et.al. 2507.15901 null
2025-07-20 Integrating Reason-Based Moral Decision-Making in the Reinforcement Learning Architecture Lisa Dargasz et.al. 2507.15895 null
2025-07-20 StaAgent: An Agentic Framework for Testing Static Analyzers Elijah Nnorom et.al. 2507.15892 null
2025-07-19 AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs? Ori Press et.al. 2507.15887 null
2025-07-18 ADEPTS: A Capability Framework for Human-Centered Agent Design Pierluca D'Oro et.al. 2507.15885 null
2025-07-21 LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra Seth Karten et.al. 2507.15815 null
2025-07-21 Density control of multi-agent swarms via bio-inspired leader-follower plasticity Gian Carlo Maffettone et.al. 2507.15781 null
2025-07-21 A Framework for Analyzing Abnormal Emergence in Service Ecosystems Through LLM-based Agent Intention Mining Yifan Shen et.al. 2507.15770 null
2025-07-21 GasAgent: A Multi-Agent Framework for Automated Gas Optimization in Smart Contracts Jingyi Zheng et.al. 2507.15761 null
2025-07-21 Towards physician-centered oversight of conversational diagnostic AI Elahe Vedadi et.al. 2507.15743 null
2025-07-21 General Matching Games Felipe Garrido-Lucero et.al. 2507.15737 null
2025-07-21 Competitive Algorithms for Cooperative Multi-Agent Ski-Rental Problems Xuchuang Wang et.al. 2507.15727 null
2025-07-21 Agentic AI for autonomous anomaly management in complex systems Reza Vatankhah Barenji et.al. 2507.15676 null
2025-07-21 BugScope: Learn to Find Bugs Like Human Jinyao Guo et.al. 2507.15671 null
2025-07-21 Asynchronous Collective Tree Exploration: a Distributed Algorithm, and a new Lower Bound Romain Cosson et.al. 2507.15658 null
2025-07-21 Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training Kailai Yang et.al. 2507.15640 null
2025-07-21 TacticCraft: Natural Language-Driven Tactical Adaptation for StarCraft II Weiyu Ma et.al. 2507.15618 null
2025-07-21 Why can't Epidemiology be automated (yet)? David Bann et.al. 2507.15617 null
2025-07-21 DHEvo: Data-Algorithm Based Heuristic Evolution for Generalizable MILP Solving Zhihao Zhang et.al. 2507.15615 null
2025-07-21 Red-Team Multi-Agent Reinforcement Learning for Emergency Braking Scenario Yinsong Chen et.al. 2507.15587 null
2025-07-21 FlowForge: Guiding the Creation of Multi-agent Workflows with Design Space Visualization as a Thinking Scaffold Pan Hao et.al. 2507.15559 null
2025-07-21 PhysGym: Benchmarking LLMs in Interactive Physics Discovery with Controlled Priors Yimeng Chen et.al. 2507.15550 null
2025-07-21 HAMLET: Hyperadaptive Agent-based Modeling for Live Embodied Theatrics Sizhou Chen et.al. 2507.15518 null
2025-07-21 The Constitutional Controller: Doubt-Calibrated Steering of Compliant Agents Simon Kohaut et.al. 2507.15478 null
2025-07-21 The Emergence of Deep Reinforcement Learning for Path Planning Thanh Thi Nguyen et.al. 2507.15469 null
2025-07-23 Solving nonconvex Hamilton--Jacobi--Isaacs equations with PINN-based policy iteration Hee Jun Yang et.al. 2507.15455 null
2025-07-21 EgoPrune: Efficient Token Pruning for Egomotion Video Reasoning in Embodied Agent Jiaao Li et.al. 2507.15428 null
2025-07-21 PhishIntentionLLM: Uncovering Phishing Website Intentions through Multi-Agent Retrieval-Augmented Generation Wenhao Li et.al. 2507.15419 null
2025-07-21 RAD: Retrieval High-quality Demonstrations to Enhance Decision-making Lu Guo et.al. 2507.15356 null
2025-07-21 One Step is Enough: Multi-Agent Reinforcement Learning based on One-Step Policy Optimization for Order Dispatch on Ride-Sharing Platforms Zijian Zhao et.al. 2507.15351 null
2025-07-21 QSAF: A Novel Mitigation Framework for Cognitive Degradation in Agentic AI Hammad Atta et.al. 2507.15330 null
2025-07-21 Strategically Robust Game Theory via Optimal Transport Nicolas Lanzetti et.al. 2507.15325 null
2025-07-21 Butterfly Effects in Toolchains: A Comprehensive Analysis of Failed Parameter Filling in LLM Tool-Agent Systems Qian Xiong et.al. 2507.15296 null
2025-07-21 Mixture of Autoencoder Experts Guidance using Unlabeled and Incomplete Data for Exploration in Reinforcement Learning Elias Malomgré et.al. 2507.15287 null
2025-07-21 Event-Triggered Resilient Consensus of Networked Euler-Lagrange Systems Under Byzantine Attacks Yuliang Fu et.al. 2507.15283 null
2025-07-21 IM-Chat: A Multi-agent LLM-based Framework for Knowledge Transfer in Injection Molding Industry Junhyeong Lee et.al. 2507.15268 null
2025-07-21 SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search Xiaofeng Shi et.al. 2507.15245 null
2025-07-21 FaultLine: Automated Proof-of-Vulnerability Generation Using LLM Agents Vikram Nitin et.al. 2507.15241 null
2025-07-21 Solving Formal Math Problems by Decomposition and Iterative Reflection Yichi Zhou et.al. 2507.15225 null
2025-07-21 EchoVoices: Preserving Generational Voices and Memories for Seniors and Children Haiying Xu et.al. 2507.15221 null
2025-07-21 PromptArmor: Simple yet Effective Prompt Injection Defenses Tianneng Shi et.al. 2507.15219 null
2025-07-21 1H Polarization above 60% at room temperature by triplet dynamic nuclear polarization Kenichiro Tateishi et.al. 2507.15217 null
2025-07-21 Personalized 3D Myocardial Infarct Geometry Reconstruction from Cine MRI with Explicit Cardiac Motion Modeling Yilin Lyu et.al. 2507.15194 null
2025-07-21 Joint-Local Grounded Action Transformation for Sim-to-Real Transfer in Multi-Agent Traffic Control Justin Turnau et.al. 2507.15174 null
2025-07-20 STL-GO: Spatio-Temporal Logic with Graph Operators for Distributed Systems with Multiple Network Topologies Yiqi Zhao et.al. 2507.15147 null
2025-07-20 Can We Move Freely in NEOM's The Line? An Agent-Based Simulation of Human Mobility in a Futuristic Smart City Abderaouf Bahi et.al. 2507.15143 null
2025-07-20 Statistical state dynamics based study of turbulent Eady fronts. Part 2. Finite amplitude equilibria Eojin Kim et.al. 2507.15134 null
2025-07-20 Initialization-driven neural generation and training for high-dimensional optimal control and first-order mean field games Mouhcine Assouli et.al. 2507.15126 null
2025-07-20 From Kicking to Causality: Simulating Infant Agency Detection with a Robust Intrinsic Reward Xia Xu et.al. 2507.15106 null
2025-07-20 Search-Based Autonomous Vehicle Motion Planning Using Game Theory Pouya Panahandeh et.al. 2507.15088 null
2025-07-20 WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization Zhengwei Tao et.al. 2507.15061 null
2025-07-20 LibLMFuzz: LLM-Augmented Fuzz Target Generation for Black-box Libraries Ian Hardgrove et.al. 2507.15058 null
2025-07-20 EduThink4AI: Translating Educational Critical Thinking into Multi-Agent LLM Systems Xinmeng Hou et.al. 2507.15015 null
2025-07-20 The Rise of AI Teammates in Software Engineering (SE) 3.0: How Autonomous Coding Agents Are Reshaping Software Engineering Hao Li et.al. 2507.15003 null
2025-07-20 LLM-Enhanced Multi-Agent Reinforcement Learning with Expert Workflow for Real-Time P2P Energy Trading Chengwei Lou et.al. 2507.14995 null
2025-07-20 Think Like an Engineer: A Neuro-Symbolic Collaboration Agent for Generative Software Requirements Elicitation and Self-Review Sai Zhang et.al. 2507.14969 null
2025-07-20 STEPC: A Pixel-wise Nonuniformity Correction Framework for Photon-Counting CT in Multi-material Imaging Scenarios Enze Zhou et.al. 2507.14963 null
2025-07-20 Probing EFX via PMMS: (Non-)Existence Results in Discrete Fair Division Jarosław Byrka et.al. 2507.14957 null
2025-07-20 Echoes of the Land: An Interactive Installation Based on Physical Model of Earthquake Ivan C. H. Liu et.al. 2507.14947 null
2025-07-20 Byzantine-Robust Decentralized Coordination of LLM Agents Yongrae Jo et.al. 2507.14928 null
2025-07-20 Redefining Elderly Care with Agentic AI: Challenges and Opportunities Ruhul Amin Khalil et.al. 2507.14912 null
2025-07-20 TriCLIP-3D: A Unified Parameter-Efficient Framework for Tri-Modal 3D Visual Grounding based on CLIP Fan Li et.al. 2507.14904 null
2025-07-20 Learning Nonlinear Causal Reductions to Explain Reinforcement Learning Policies Armin Kekić et.al. 2507.14901 null
2025-07-20 InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis Jiale Liu et.al. 2507.14899 null
2025-07-20 AgentFly: Extensible and Scalable Reinforcement Learning for LM Agents Renxi Wang et.al. 2507.14897 null
2025-07-20 Hierarchical Multi-Agent Reinforcement Learning with Control Barrier Functions for Safety-Critical Autonomous Systems H. M. Sabbir Ahmad et.al. 2507.14850 null
2025-07-20 Manipulating LLM Web Agents with Indirect Prompt Injection Attack via HTML Accessibility Tree Sam Johnson et.al. 2507.14799 null
2025-07-19 Towards AI Urban Planner in the Age of GenAI, LLMs, and Agentic AI Yanjie Fu et.al. 2507.14730 null
2025-07-19 Simulating Chirality: Solving Distance- $k$ -Dispersion on an 1-Interval Connected Ring Brati Mondal et.al. 2507.14723 null
2025-07-19 Configurable multi-agent framework for scalable and realistic testing of llm-based agents Sai Wang et.al. 2507.14705 null
2025-07-19 WSI-Agents: A Collaborative Multi-Agent System for Multi-Modal Whole Slide Image Analysis Xinheng Lyu et.al. 2507.14680 null
2025-07-19 When Autonomy Goes Rogue: Preparing for Risks of Multi-Agent Collusion in Social Systems Qibing Ren et.al. 2507.14660 null
2025-07-19 Learning to Communicate in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence Faizan Contractor et.al. 2507.14658 null
2025-07-19 Agentic Satellite-Augmented Low-Altitude Economy and Terrestrial Networks: A Survey on Generative Approaches Xiaozheng Gao et.al. 2507.14633 null
2025-07-19 Towards a Proactive Autoscaling Framework for Data Stream Processing at the Edge using GRU and Transfer Learning Eugene Armah et.al. 2507.14597 null
2025-07-19 Amico: An Event-Driven Modular Framework for Persistent and Embedded Autonomy Hongyi Yang et.al. 2507.14513 null
2025-07-19 Federated Reinforcement Learning in Heterogeneous Environments Ukjo Hwang et.al. 2507.14487 null
2025-07-22 Routine: A Structural Planning Framework for LLM Agent System in Enterprise Guancheng Zeng et.al. 2507.14447 null
2025-07-18 NetIntent: Leveraging Large Language Models for End-to-End Intent-Based SDN Automation Md. Kamrul Hossain et.al. 2507.14398 null
2025-07-18 Adaptive Multi-Agent Reasoning via Automated Workflow Generation Humza Sami et.al. 2507.14393 null
2025-07-18 Text-to-SQL for Enterprise Data Analytics Albert Chen et.al. 2507.14372 null
2025-07-18 Stable matchings with switching costs Boris Pittel et.al. 2507.14362 null
2025-07-18 FedStrategist: A Meta-Learning Framework for Adaptive and Robust Aggregation in Federated Learning Md Rafid Haque et.al. 2507.14322 null
2025-07-18 Semantic Segmentation based Scene Understanding in Autonomous Vehicles Ehsan Rassekh et.al. 2507.14303 null
2025-07-18 Distributed consensus-based observer design for target state estimation with bearing measurements Marcelo Jacinto et.al. 2507.14300 null
2025-07-18 Age of Information Minimization in UAV-Enabled Integrated Sensing and Communication Systems Yu Bai et.al. 2507.14299 null
2025-07-18 WebGuard: Building a Generalizable Guardrail for Web Agents Boyuan Zheng et.al. 2507.14293 null
2025-07-18 DREAMS: Density Functional Theory Based Research Engine for Agentic Materials Simulation Ziqi Wang et.al. 2507.14267 null
2025-07-18 Beyond DNS: Unlocking the Internet of AI Agents via the NANDA Index and Verified AgentFacts Ramesh Raskar et.al. 2507.14263 null
2025-07-17 Towards an ABM on Proactive Community Adaptation for Climate Change Önder Gürcan et.al. 2507.14233 null
2025-07-17 Intent-Based Network for RAN Management with Large Language Models Fransiscus Asisi Bimo et.al. 2507.14230 null
2025-07-18 DPMT: Dual Process Multi-scale Theory of Mind Framework for Real-time Human-AI Collaboration Xiyun Li et.al. 2507.14088 null
2025-07-18 Collaborative Rational Speech Act: Pragmatic Reasoning for Multi-Turn Dialog Lautaro Estienne et.al. 2507.14063 null
2025-07-23 Well-posedness and propagation of chaos for multi-agent models with strategies and diffusive effects Alessandro Baldi et.al. 2507.14058 null
2025-07-18 Online MMS Allocation for Chores Jiaxin Song et.al. 2507.14039 null
2025-07-18 Architecting Human-AI Cocreation for Technical Services -- Interaction Modes and Contingency Factors Jochen Wulf et.al. 2507.14034 null
2025-07-18 Byzantine-resilient federated online learning for Gaussian process regression Xu Zhang et.al. 2507.14021 null
2025-07-18 DreamScene: 3D Gaussian-based End-to-end Text-to-3D Scene Generation Haoran Li et.al. 2507.13985 null
2025-07-18 A Multi-Objective Optimization framework for Decentralized Learning with coordination constraints Roberto Morales et.al. 2507.13983 null
2025-07-18 Bottom-up Domain-specific Superintelligence: A Reliable Knowledge Graph is What We Need Bhishma Dedhia et.al. 2507.13966 null
2025-07-18 NeHMO: Neural Hamilton-Jacobi Reachability Learning for Decentralized Safe Multi-Agent Motion Planning Qingyi Chen et.al. 2507.13940 null
2025-07-18 Marcel: A Lightweight and Open-Source Conversational Agent for University Student Support Jan Trienes et.al. 2507.13937 null
2025-07-18 Reframing attention as a reinforcement learning problem for causal discovery Turan Orujlu et.al. 2507.13920 null
2025-07-18 Advanced X-rays techniques for research-oriented high-resolution imaging of articular cartilage: a scoping review Simone Fantoni et.al. 2507.13854 null
2025-07-18 Impact of homophily in adherence to anti-epidemic measures on the spread of infectious diseases in social networks Piotr Bentkowski et.al. 2507.13848 null
2025-07-18 Causal Knowledge Transfer for Multi-Agent Reinforcement Learning in Dynamic Environments Kathrin Korte et.al. 2507.13846 null
2025-07-18 Principles and Reasons Behind Automated Vehicle Decisions in Ethically Ambiguous Everyday Scenarios Lucas Elbert Suryana et.al. 2507.13837 null
2025-07-18 Conformal Data Contamination Tests for Trading or Sharing of Data Martin V. Vejling et.al. 2507.13835 null
2025-07-18 Scalable Submodular Policy Optimization via Pruned Submodularity Graph Aditi Anand et.al. 2507.13834 null
2025-07-18 CodeEdu: A Multi-Agent Collaborative Platform for Personalized Coding Education Jianing Zhao et.al. 2507.13814 null
2025-07-18 From Extraction to Synthesis: Entangled Heuristics for Agent-Augmented Strategic Reasoning Renato Ghisellini et.al. 2507.13768 null
2025-07-21 Navigating the Lobbying Landscape: Insights from Opinion Dynamics Models Daniele Giachini et.al. 2507.13767 null
2025-07-18 AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework Yu Yao et.al. 2507.13729 null
2025-07-18 CogniQ-H: A Soft Hierarchical Reinforcement Learning Paradigm for Automated Data Preparation Jing Chang et.al. 2507.13710 null
2025-07-18 Minimum Clustering of Matrices Based on Phase Alignment Honghao Wu et.al. 2507.13678 null
2025-07-18 Improved particle swarm optimization algorithm: multi-target trajectory optimization for swarm drones Minze Li et.al. 2507.13647 null
2025-07-18 Differential Privacy in Kernelized Contextual Bandits via Random Projections Nikola Pavlovic et.al. 2507.13639 null
2025-07-17 Evolving Neural Controllers for Xpilot-AI Racing Using Neuroevolution of Augmenting Topologies Jim O'Connor et.al. 2507.13549 null
2025-07-17 Human-Like Trajectories Generation via Receding Horizon Tracking Applied to the TickTacking Interface Daniele Masti et.al. 2507.13528 null
2025-07-17 Humans learn to prefer trustworthy AI over human partners Yaomin Jiang et.al. 2507.13524 null
2025-07-17 GraphTrafficGPT: Enhancing Traffic Management Through Graph-Based AI Agent Coordination Nabil Abdelaziz Ferhat Taleb et.al. 2507.13511 null
2025-07-17 Model-free Reinforcement Learning for Model-based Control: Towards Safe, Interpretable and Sample-efficient Agents Thomas Banker et.al. 2507.13491 null
2025-07-17 LightAutoDS-Tab: Multi-AutoML Agentic System for Tabular Data Aleksey Lapin et.al. 2507.13413 null
2025-07-21 A Survey of Context Engineering for Large Language Models Lingrui Mei et.al. 2507.13334 null
2025-07-17 N Bugs on a Circle Josh Briley et.al. 2507.13333 null
2025-07-17 Multi-Agent Synergy-Driven Iterative Visual Narrative Synthesis Wang Xi et.al. 2507.13285 null
2025-07-20 Analysis Theory of Data Economy: Dataization, Technological Progress and Dynamic General Equilibrium Yongheng Hu et.al. 2507.13274 null
2025-07-17 RemVerse: Supporting Reminiscence Activities for Older Adults through AI-Assisted Virtual Reality Ruohao Li et.al. 2507.13247 null
2025-07-17 GEMMAS: Graph-based Evaluation Metrics for Multi Agent Systems Jisoo Lee et.al. 2507.13190 null
2025-07-17 Black Box Deployed -- Functional Criteria for Artificial Moral Agents in the LLM Era Matthew E. Brophy et.al. 2507.13175 null
2025-07-17 Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback Suzie Kim et.al. 2507.13171 null
2025-07-17 Prompt Injection 2.0: Hybrid AI Threats Jeremy McHugh et.al. 2507.13169 null
2025-07-17 SE-VLN: A Self-Evolving Vision-Language Navigation Framework Based on Multimodal Large Language Models Xiangyu Dong et.al. 2507.13152 null
2025-07-17 RIDAS: A Multi-Agent Framework for AI-RAN with Representation- and Intention-Driven Agents Kuiyuan Ding et.al. 2507.13140 null
2025-07-17 Governance, productivity and economic development Cuong Le Van et.al. 2507.13099 null
2025-07-17 iReDev: A Knowledge-Driven Multi-Agent Framework for Intelligent Requirements Development Dongming Jin et.al. 2507.13081 null
2025-07-17 Intelligent Virtual Sonographer (IVS): Enhancing Physician-Robot-Patient Communication Tianyu Song et.al. 2507.13052 null
2025-07-17 What Can Robots Teach Us About Trust and Reliance? An interdisciplinary dialogue between Social Sciences and Social Robotics Julien Wacquez et.al. 2507.13041 null
2025-07-17 MAD-Spear: A Conformity-Driven Prompt Injection Attack on Multi-Agent Debate Systems Yu Cui et.al. 2507.13038 null
2025-07-17 Lower Bound for Online MMS Assignment of Indivisible Chores Masoud Seddighin et.al. 2507.12984 null
2025-07-17 Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning Giwon Lee et.al. 2507.12977 null
2025-07-21 LaViPlan : Language-Guided Visual Path Planning with RLVR Hayeon Oh et.al. 2507.12911 null
2025-07-17 Autonomous Resource Management in Microservice Systems via Reinforcement Learning Yujun Zou et.al. 2507.12879 null
2025-07-20 Information-Theoretic Aggregation of Ethical Attributes in Simulated-Command Taylan Akay et.al. 2507.12862 null
2025-07-17 Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering Muhammad Fadhil Ginting et.al. 2507.12846 null
2025-07-17 Machine-Readable Ads: Accessibility and Trust Patterns for AI Web Agents interacting with Online Advertisements Joel Nitu et.al. 2507.12844 null
2025-07-22 Assessing Adaptive World Models in Machines with Novel Games Lance Ying et.al. 2507.12821 null
2025-07-17 From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning Gaurav Chaudhary et.al. 2507.12815 null
2025-07-17 MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models Zhiwei Liu et.al. 2507.12806 null
2025-07-17 Imitating Mistakes in a Learning Companion AI Agent for Online Peer Learning Sosui Moribe et.al. 2507.12801 null
2025-07-17 City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning Penglei Sun et.al. 2507.12795 null
2025-07-17 A Comprehensive Survey of Electronic Health Record Modeling: From Deep Learning Approaches to Large Language Models Weijieying Ren et.al. 2507.12774 null
2025-07-17 Autonomy for Older Adult-Agent Interaction Jiaxin An et.al. 2507.12767 null
2025-07-17 Public Evaluation on Potential Social Impacts of Fully Autonomous Cybernetic Avatars for Physical Support in Daily-Life Environments: Large-Scale Demonstration and Survey at Avatar Land Lotfi El Hafi et.al. 2507.12741 null
2025-07-17 Competition Erases Simplicity: Tight Regret Bounds for Uniform Pricing with Multiple Buyers Houshuang Chen et.al. 2507.12733 null
2025-07-17 Strategy Adaptation in Large Language Model Werewolf Agents Fuya Nakamori et.al. 2507.12732 null
2025-07-17 Identification of Authoritative Nodes and Dismantling of Illicit Networks Using a Novel Metric for Measuring Strength of a Graph Kartikeya Kansal et.al. 2507.12711 null
2025-07-16 Fly, Fail, Fix: Iterative Game Repair with Reinforcement Learning and Large Multimodal Models Alex Zook et.al. 2507.12666 null
2025-07-16 NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting Kuangshi Ai et.al. 2507.12621 null
2025-07-16 A Survey of Explainable Reinforcement Learning: Targets, Methods and Needs Léo Saulières et.al. 2507.12599 null
2025-07-16 The Impact of Social Attractiveness on Casual Group Formation: Power-Law Group Sizes and Suppressed Percolation Matheus S. Mariano et.al. 2507.12585 null
2025-07-20 Can Mental Imagery Improve the Thinking Capabilities of AI Systems? Slimane Larabi et.al. 2507.12555 null
2025-07-15 FOUNDER: Grounding Foundation Models in World Models for Open-Ended Embodied Decision Making Yucen Wang et.al. 2507.12496 null
2025-07-15 MR-LDM -- The Merge-Reactive Longitudinal Decision Model: Game Theoretic Human Decision Modeling for Interactive Sim Agents Dustin Holley et.al. 2507.12494 null
2025-07-15 On multiagent online problems with predictions Gabriel Istrate et.al. 2507.12486 null
2025-07-14 AI-Powered Math Tutoring: Platform for Personalized and Adaptive Education Jarosław A. Chudziak et.al. 2507.12484 null
2025-07-16 Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data Chandana Cheerla et.al. 2507.12425 null
2025-07-16 Modeling Feasible Locomotion of Nanobots for Cancer Detection and Treatment Noble Harasha et.al. 2507.12400 null
2025-07-16 Beyond Single Models: Enhancing LLM Detection of Ambiguity in Requests through Debate Ana Davila et.al. 2507.12370 null
2025-07-21 GitChameleon 2.0: Evaluating AI Code Generation Against Python Library Version Incompatibilities Diganta Misra et.al. 2507.12367 null
2025-07-16 Social polarization promoted by sparse higher-order interactions Hugo Pérez-Martínez et.al. 2507.12325 null
2025-07-17 Next-Gen Museum Guides: Autonomous Navigation and Visitor Interaction with an Agentic Robot Luca Garello et.al. 2507.12273 null
2025-07-16 Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes Johann Frei et.al. 2507.12261 null
2025-07-16 Toward a Behavioural Translation Style Space: Simulating the Temporal Dynamics of Affect, Behaviour, and Cognition in Human Translation Production Michael Carl et.al. 2507.12208 null
2025-07-16 BenchRL-QAS: Benchmarking reinforcement learning algorithms for quantum architecture search Azhar Ikhtiarudin et.al. 2507.12189 null
2025-07-16 Fast and Scalable Game-Theoretic Trajectory Planning with Intentional Uncertainties Zhenmin Huang et.al. 2507.12174 null
2025-07-16 Convergence Rate of Generalized Nash Equilibrium Learning in Strongly Monotone Games with Linear Constraints Tatiana Tatarenko et.al. 2507.12112 null
2025-07-16 Topology Enhanced MARL for Multi-Vehicle Cooperative Decision-Making of CAVs Ye Han et.al. 2507.12110 null
2025-07-16 Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics Muleilan Pei et.al. 2507.12083 null
2025-07-16 Evaluating the Ability of Large Language Models to Reason about Cardinal Directions, Revisited Anthony G Cohn et.al. 2507.12059 null
2025-07-16 Contracting with a Mechanism Designer Tian Bai et.al. 2507.12054 null
2025-07-16 ARRC: Explainable, Workflow-Integrated Recommender for Sustainable Resource Optimization Across the Edge-Cloud Continuum Brian-Frederik Jahnke et.al. 2507.12032 null
2025-07-16 QAS-QTNs: Curriculum Reinforcement Learning-Driven Quantum Architecture Search for Quantum Tensor Networks Siddhant Dutta et.al. 2507.12013 null
2025-07-16 Understanding visual attention beehind bee-inspired UAV navigation Pranav Rajbhandari et.al. 2507.11992 null
2025-07-17 Aime: Towards Fully-Autonomous Multi-Agent Framework Yexuan Shi et.al. 2507.11988 null
2025-07-16 Value-Based Large Language Model Agent Simulation for Mutual Evaluation of Trust and Interpersonal Closeness Yuki Sakamoto et.al. 2507.11979 null
2025-07-16 Online Training and Pruning of Deep Reinforcement Learning Networks Valentin Frank Ingmar Guenter et.al. 2507.11975 null
2025-07-16 Graph Representations for Reading Comprehension Analysis using Large Language Model and Eye-Tracking Biomarker Yuhong Zhang et.al. 2507.11972 null
2025-07-16 IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving Kanghyun Ryu et.al. 2507.11940 null
2025-07-16 From Generative to Episodic: Sample-Efficient Replicable Reinforcement Learning Max Hopkins et.al. 2507.11926 null
2025-07-16 Hybrid Conformal Prediction-based Risk-Aware Model Predictive Planning in Dense, Uncertain Environments Jeongyong Yang et.al. 2507.11920 null
2025-07-16 CoCre-Sam (Kokkuri-san): Modeling Ouija Board as Collective Langevin Dynamics Sampling from Fused Language Models Tadahiro Taniguchi et.al. 2507.11906 null
2025-07-16 Extremal Testing for Network Software using LLMs Rathin Singha et.al. 2507.11898 null
2025-07-16 Generative Intelligence Systems in the Flow of Group Emotions Fernando Koch et.al. 2507.11831 null
2025-07-16 The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and Scientist Haoxuan Zhang et.al. 2507.11810 null
2025-07-16 New allocation rule based on graph structures and their application to economic phenomena Taiki Yamada et.al. 2507.11808 null
2025-07-15 Large-scale distributed synchronization systems, using a cancel-on-completion redundancy mechanism Alexander Stolyar et.al. 2507.11779 null
2025-07-15 A Cellular Automata Approach to Donation Game Marcin Kowalik et.al. 2507.11744 null
2025-07-15 Let's Think in Two Steps: Mitigating Agreement Bias in MLLMs with Self-Grounded Verification Moises Andrade et.al. 2507.11662 null
2025-07-15 STAGED: A Multi-Agent Neural Network for Learning Cellular Interaction Dynamics Joao F. Rocha et.al. 2507.11660 null
2025-07-15 VISTA: Monocular Segmentation-Based Mapping for Appearance and View-Invariant Global Localization Hannah Shafferman et.al. 2507.11653 null
2025-07-15 General Modular Harness for LLM Agents in Multi-Turn Gaming Environments Yuxuan Zhang et.al. 2507.11633 null
2025-07-15 AI, Humans, and Data Science: Optimizing Roles Across Workflows and the Workforce Richard Timpone et.al. 2507.11597 null
2025-07-14 Consumer Law for AI Agents Christoph Busch et.al. 2507.11567 null
2025-07-14 Emergent Heterogeneous Swarm Control Through Hebbian Learning Fuda van Diggelen et.al. 2507.11566 null
2025-07-14 A Model Aware AIGC Task Offloading Algorithm in IIoT Edge Computing Xin Wang et.al. 2507.11560 null
2025-07-15 DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering Yinsheng Li et.al. 2507.11527 null
2025-07-15 Opinion dynamics: Statistical physics and beyond Michele Starnini et.al. 2507.11521 null
2025-07-15 AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air Shiyi Yang et.al. 2507.11515 null
2025-07-15 On the Complexity of the Optimal Correlated Equilibria in Extensive-Form Games Vincent Cheval et.al. 2507.11509 null
2025-07-15 LF: Online Multi-Robot Path Planning Meets Optimal Trajectory Control Ajay Shankar et.al. 2507.11464 null
2025-07-15 EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes LG AI Research et.al. 2507.11407 null
2025-07-15 From Production Logistics to Smart Manufacturing: The Vision for a New RoboCup Industrial League Supun Dissanayaka et.al. 2507.11402 null
2025-07-20 Dr.Copilot: A Multi-Agent Prompt Optimized Assistant for Improving Patient-Doctor Communication in Romanian Andrei Niculae et.al. 2507.11299 null
2025-07-15 Taming Uncertainty via Automation: Observing, Analyzing, and Optimizing Agentic AI Systems Dany Moshkovich et.al. 2507.11277 null
2025-07-15 An Empirical Study of Multi-Agent RAG for Real-World University Admissions Counseling Anh Nguyen-Duc et.al. 2507.11272 null
2025-07-15 Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound Tal Fiskus et.al. 2507.11269 null
2025-07-15 An Agentic Flow for Finite State Machine Extraction using Prompt Chaining Fares Wael et.al. 2507.11222 null
2025-07-15 Fair Contracts Matteo Castiglioni et.al. 2507.11214 null
2025-07-15 Role-Playing LLM-Based Multi-Agent Support Framework for Detecting and Addressing Family Communication Bias Rushia Harada et.al. 2507.11210 null
2025-07-15 Temperature and Persona Shape LLM Agent Consensus With Minimal Accuracy Gains in Qualitative Coding Conrad Borchers et.al. 2507.11198 null
2025-07-15 Quantized Rank Reduction: A Communications-Efficient Federated Learning Scheme for Network-Critical Applications Dimitrios Kritsiolis et.al. 2507.11183 null
2025-07-15 AI Agent Architecture for Decentralized Trading of Alternative Assets Ailiya Borjigin et.al. 2507.11117 null
2025-07-15 Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander Li Wang et.al. 2507.11079 null
2025-07-17 SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks Pavel Adamenko et.al. 2507.11059 null
2025-07-16 Journalism-Guided Agentic In-Context Learning for News Stance Detection Dahyun Lee et.al. 2507.11049 null
2025-07-15 Value of History in Social Learning: Applications to Markets for History Hiroto Sato et.al. 2507.11029 null
2025-07-15 DS@GT at eRisk 2025: From prompts to predictions, benchmarking early depression detection with conversational agent based assessments and temporal attention models Anthony Miyaguchi et.al. 2507.10958 null
2025-07-15 A Learning Framework For Cooperative Collision Avoidance of UAV Swarms Leveraging Domain Knowledge Shuangyao Huang et.al. 2507.10913 null
2025-07-15 Lessons Learned from Evaluation of LLM based Multi-agents in Safer Therapy Recommendation Yicong Wu et.al. 2507.10911 null
2025-07-15 NavComposer: Composing Language Instructions for Navigation Trajectories through Action-Scene-Object Modularization Zongtao He et.al. 2507.10894 null
2025-07-15 Start from the End: A Framework for Computational Policy Exploration to Inform Effective and Geospatially Consistent Interventions applied to COVID-19 in St. Louis David O'Gara et.al. 2507.10870 null
2025-07-14 LLM-Guided Agentic Object Detection for Open-World Understanding Furkan Mumcu et.al. 2507.10844 null
2025-07-14 Past, Present and Future: Exploring Adaptive AI in Software Development Bots Omar Elsisi et.al. 2507.10822 null
2025-07-14 Semantic Context for Tool Orchestration Robert Müller et.al. 2507.10820 null
2025-07-14 Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection Huiyi Wang et.al. 2507.10814 null
2025-07-14 React to This (RTT): A Nonverbal Turing Test for Embodied AI Chuxuan Zhang et.al. 2507.10812 null
2025-07-14 Warehouse Spatial Question Answering with LLM Agent Hsiang-Wei Huang et.al. 2507.10778 null
2025-07-14 RCG: Safety-Critical Scenario Generation for Robust Autonomous Driving via Real-World Crash Grounding Benjamin Stoler et.al. 2507.10749 null
2025-07-14 Ground-Compose-Reinforce: Tasking Reinforcement Learning Agents through Formal Language Andrew C. Li et.al. 2507.10741 null
2025-07-14 Bridging Brains and Machines: A Unified Frontier in Neuroscience, Artificial Intelligence, and Neuromorphic Systems Sohan Shankar et.al. 2507.10722 null
2025-07-14 Exploring User Security and Privacy Attitudes and Concerns Toward the Use of General-Purpose LLM Chatbots for Mental Health Jabari Kwesi et.al. 2507.10695 null
2025-07-14 Vision Language Action Models in Robotic Manipulation: A Systematic Review Muhayy Ud Din et.al. 2507.10672 null
2025-07-16 From Semantic Web and MAS to Agentic AI: A Unified Narrative of the Web of Agents Tatiana Petrova et.al. 2507.10644 null
2025-07-14 Enhancing the Capabilities of Large Language Models for API calls through Knowledge Graphs Ye Yang et.al. 2507.10630 null
2025-07-14 Game Theory Meets LLM and Agentic AI: Reimagining Cybersecurity for the Age of Intelligent Threats Quanyan Zhu et.al. 2507.10621 null
2025-07-13 Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks Oluwaseyi Giwa et.al. 2507.10619 null
2025-07-13 LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents Zihe Yan et.al. 2507.10610 null
2025-07-12 Emergence of Hierarchical Emotion Organization in Large Language Models Bo Zhao et.al. 2507.10599 null
2025-07-11 ARPaCCino: An Agentic-RAG for Policy as Code Compliance Francesco Romeo et.al. 2507.10584 null
2025-07-11 An Offline Mobile Conversational Agent for Mental Health Support: Learning from Emotional Dialogues and Psychological Texts with Student-Centered Evaluation Vimaleswar A et.al. 2507.10580 null
2025-07-16 Truth Sleuth and Trend Bender: AI Agents to fact-check YouTube videos and influence opinions Cécile Logé et.al. 2507.10577 null
2025-07-14 EmbRACE-3K: Embodied Reasoning and Action in Complex Environments Mingxian Lin et.al. 2507.10548 null
2025-07-14 Graph World Model Tao Feng et.al. 2507.10539 null
2025-07-14 DeepResearch $^{\text{Eco}}$ : A Recursive Agentic Workflow for Complex Scientific Question Answering in Ecology Jennifer D'Souza et.al. 2507.10522 null
2025-07-14 An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments Mikko Korkiakoski et.al. 2507.10469 null
2025-07-14 Logic layer Prompt Control Injection (LPCI): A Novel Security Vulnerability Class in Agentic Systems Hammad Atta et.al. 2507.10457 null
2025-07-14 Negative entropy and non-equilibrium Euclidean shell Yang An et.al. 2507.10450 null
2025-07-14 Am I on the Right Track? What Can Predicted Query Performance Tell Us about the Search Behaviour of Agentic RAG Fangzheng Tian et.al. 2507.10411 null
2025-07-14 Machine-Learning to Trust Ran Spiegler et.al. 2507.10363 null
2025-07-14 Toolsuite for Implementing Multiagent Systems Based on Communication Protocols Amit K. Chopra et.al. 2507.10324 null
2025-07-14 Prompt Informed Reinforcement Learning for Visual Coverage Path Planning Venkat Margapuri et.al. 2507.10284 null
2025-07-14 Toward Real-World Table Agents: Capabilities, Workflows, and Design Principles for LLM-based Table Intelligence Jiaming Tian et.al. 2507.10281 null
2025-07-14 ToMacVF : Temporal Macro-action Value Factorization for Asynchronous Multi-Agent Reinforcement Learning Wenjing Zhang et.al. 2507.10251 null
2025-07-14 Should We Ever Prefer Decision Transformer for Offline Reinforcement Learning? Yumi Omori et.al. 2507.10174 null
2025-07-14 Play Style Identification Using Low-Level Representations of Play Traces in MicroRTS Ruizhe Yu Xia et.al. 2507.10172 null
2025-07-14 Simulating Biases for Interpretable Fairness in Offline and Online Classifiers Ricardo Inácio et.al. 2507.10154 null
2025-07-14 Adaptability in Multi-Agent Reinforcement Learning: A Framework and Unified Review Siyi Hu et.al. 2507.10142 null
2025-07-16 A PBN-RL-XAI Framework for Discovering a "Hit-and-Run" Therapeutic Strategy in Melanoma Zhonglin Liu et.al. 2507.10136 null
2025-07-14 Towards High Supervised Learning Utility Training Data Generation: Data Pruning and Column Reordering Tung Sum Thomas Kwok et.al. 2507.10088 null
2025-07-14 Cultural Bias in Large Language Models: Evaluating AI Agents through Moral Questionnaires Simon Münker et.al. 2507.10073 null
2025-07-14 Finetuning Deep Reinforcement Learning Policies with Evolutionary Strategies for Control of Underactuated Robots Marco Calì et.al. 2507.10030 null
2025-07-14 The Man Behind the Sound: Demystifying Audio Private Attribute Profiling via Multimodal Large Language Model Agents Lixu Wang et.al. 2507.10016 null
2025-07-14 On The Role of Intentionality in Knowledge Representation: Analyzing Scene Context for Cognitive Agents with a Tiny Language Model Mark Burgess et.al. 2507.10000 null
2025-07-17 Predictive & Trust-based Multi-Agent Coordination Venkatraman Renganathan et.al. 2507.09997 null
2025-07-14 Evolution of Fear and Social Rewards in Prey-Predator Relationship Yuji Kanagawa et.al. 2507.09992 null
2025-07-14 Improving monotonic optimization in heterogeneous multi-agent reinforcement learning with optimal marginal deterministic policy gradient Xiaoyang Yu et.al. 2507.09989 null
2025-07-14 Quantum measurement of work in mesoscopic systems Anant Vijay Varma et.al. 2507.09977 null
2025-07-14 Generalized Quantal Response Equilibrium: Existence and Efficient Learning Apurv Shukla et.al. 2507.09928 null
2025-07-14 Intelligent Task Management via Dynamic Multi-region Division in LEO Satellite Networks Zixuan Song et.al. 2507.09926 null
2025-07-14 Energy-Stable Swarm-Based Inertial Algorithms for Optimization Xuelong Gu et.al. 2507.09909 null
2025-07-14 Large Population Models Ayush Chopra et.al. 2507.09901 null
2025-07-14 Towards Realistic and Interpretable Market Simulations: Factorizing Financial Power Law using Optimal Transport Ryuji Hashimoto et.al. 2507.09863 null
2025-07-14 Multi-residual Mixture of Experts Learning for Cooperative Control in Multi-vehicle Systems Vindula Jayawardana et.al. 2507.09836 null
2025-07-20 Active Probing with Multimodal Predictions for Motion Planning Darshan Gadginmath et.al. 2507.09822 null
2025-07-13 An infinitesimal generator approach on weak convergence of regulated multi-class matching systems Bowen Xie et.al. 2507.09789 null
2025-07-13 TinyTroupe: An LLM-powered Multiagent Persona Simulation Toolkit Paulo Salem et.al. 2507.09788 null
2025-07-13 Toward accurate RUL and SOH estimation using reinforced graph-based PINNs enhanced with dynamic weights Mohamadreza Akbari Pour et.al. 2507.09766 null
2025-07-13 IteraOptiRacing: A Unified Planning-Control Framework for Real-time Autonomous Racing for Iterative Optimal Performance Yifan Zeng et.al. 2507.09714 null
2025-07-13 Token Compression Meets Compact Vision Transformers: A Survey and Comparative Evaluation for Edge AI Phat Nguyen et.al. 2507.09702 null
2025-07-13 Networked Information Aggregation via Machine Learning Michael Kearns et.al. 2507.09683 null
2025-07-13 Negotiating Comfort: Simulating Personality-Driven LLM Agents in Shared Residential Social Networks Ann Nedime Nese Rende et.al. 2507.09657 null
2025-07-13 humancompatible.interconnect: Testing Properties of Repeated Uses of Interconnections of AI Systems Rodion Nazarov et.al. 2507.09626 null
2025-07-13 On the existence of EFX allocations for goods Ujjwal Kumar et.al. 2507.09600 null
2025-07-17 THOR: Transformer Heuristics for On-Demand Retrieval Isaac Shi et.al. 2507.09592 null
2025-07-13 eSapiens: A Platform for Secure and Auditable Retrieval-Augmented Generation Isaac Shi et.al. 2507.09588 null
2025-07-13 AICrypto: A Comprehensive Benchmark For Evaluating Cryptography Capabilities of Large Language Models Yu Wang et.al. 2507.09580 null
2025-07-13 On Probabilistic Assignment Rules Sreedurga Gogulapati et.al. 2507.09550 null
2025-07-13 Existence of Fair and Efficient Allocation of Indivisible Chores Ryoga Mahara et.al. 2507.09544 null
2025-07-13 Learning to Control Dynamical Agents via Spiking Neural Networks and Metropolis-Hastings Sampling Ali Safa et.al. 2507.09540 null
2025-07-13 Self-supervised Pretraining for Integrated Prediction and Planning of Automated Vehicles Yangang Ren et.al. 2507.09537 null
2025-07-13 TruckV2X: A Truck-Centered Perception Dataset Tenghui Xie et.al. 2507.09505 null
2025-07-13 GoalfyMax: A Protocol-Driven Multi-Agent System for Intelligent Experience Entities Siyi Wu et.al. 2507.09497 null
2025-07-13 GenAI-based Multi-Agent Reinforcement Learning towards Distributed Agent Intelligence: A Generative-RL Agent Perspective Hang Wang et.al. 2507.09495 null
2025-07-13 Evaluating LLMs on Sequential API Call Through Automated Test Generation Yuheng Huang et.al. 2507.09481 null
2025-07-16 Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs Yangning Li et.al. 2507.09477 null
2025-07-13 Incentive-Aware Dynamic Resource Allocation under Long-Term Cost Constraints Yan Dai et.al. 2507.09473 null
2025-07-13 MobiWorld: World Models for Mobile Wireless Network Haoye Chai et.al. 2507.09462 null
2025-07-13 Intermediate Interaction Strategies for Collective Behavior Y. Kikuchi et.al. 2507.09457 null
2025-07-13 Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions Yuanhong Zheng et.al. 2507.09446 null
2025-07-12 Contracting a crowd of heterogeneous agents Guillermo Alonso Alvarez et.al. 2507.09415 null
2025-07-12 Adaptive Social Learning using Theory of Mind Lance Ying et.al. 2507.09409 null
2025-07-12 LLM-Stackelberg Games: Conjectural Reasoning Equilibria and Their Applications to Spearphishing Quanyan Zhu et.al. 2507.09407 null
2025-07-12 Knowledge Conceptualization Impacts RAG Efficacy Chris Davis Jaldi et.al. 2507.09389 null
2025-07-12 Constrained Style Learning from Imperfect Demonstrations under Task Optimality Kehan Wen et.al. 2507.09371 null
2025-07-15 Simulation for All: A Step-by-Step Cookbook for Developing Human-Centered Multi-Agent Transportation Simulators Shiva Azimi et.al. 2507.09367 null
2025-07-12 When Developer Aid Becomes Security Debt: A Systematic Analysis of Insecure Behaviors in LLM Coding Agents Matous Kozak et.al. 2507.09329 null
2025-07-12 StockSim: A Dual-Mode Order-Level Simulator for Evaluating Multi-Agent LLMs in Financial Markets Charidimos Papadakis et.al. 2507.09255 null
2025-07-12 Hide-and-Shill: A Reinforcement Learning Framework for Market Manipulation Detection in Symphony-a Decentralized Multi-Agent System Ronghua Shi et.al. 2507.09179 null
2025-07-12 Continual Reinforcement Learning by Planning with Online World Models Zichen Liu et.al. 2507.09177 null
2025-07-12 RAMA: Retrieval-Augmented Multi-Agent Framework for Misinformation Detection in Multimodal Fact-Checking Shuo Yang et.al. 2507.09174 null
2025-07-12 Tactile-VLA: Unlocking Vision-Language-Action Model's Physical Knowledge for Tactile Generalization Jialei Huang et.al. 2507.09160 null
2025-07-12 Egalitarian-equivalent and strategy-proof mechanisms in homogeneous multi-object allocation problems Hinata Kurashita et.al. 2507.09152 null
2025-07-12 A Study of Value-Aware Eigenoptions Harshil Kotamreddy et.al. 2507.09127 null
2025-07-12 Proactive AI-and-RAN Workload Orchestration in O-RAN Architectures for 6G Networks Syed Danial Ali Shah et.al. 2507.09124 null
2025-07-12 AInsight: Augmenting Expert Decision-Making with On-the-Fly Insights Grounded in Historical Data Mohammad Abolnejadian et.al. 2507.09100 null
2025-07-12 Transformer based Collaborative Reinforcement Learning for Fluid Antenna System (FAS)-enabled 3D UAV Positioning Xiaoren Xu et.al. 2507.09094 null
2025-07-12 Learning from Synthetic Labs: Language Models as Auction Participants Anand Shah et.al. 2507.09083 null
2025-07-11 Infinite Video Understanding Dell Zhang et.al. 2507.09068 null
2025-07-11 SetupBench: Assessing Software Engineering Agents' Ability to Bootstrap Development Environments Avi Arora et.al. 2507.09063 null
2025-07-11 Behavioral Exploration: Learning to Explore via In-Context Adaptation Andrew Wagenmaker et.al. 2507.09041 null
2025-07-11 Accelerating Drug Discovery Through Agentic AI: A Multi-Agent Approach to Laboratory Automation in the DMTA Cycle Yao Fehlis et.al. 2507.09023 null
2025-07-11 How to Train a Leader: Hierarchical Reasoning in Multi-Agent LLMs Andrew Estornell et.al. 2507.08960 null
2025-07-15 Bridging Literature and the Universe Via A Multi-Agent Large Language Model System Xiaowen Zhang et.al. 2507.08958 null
2025-07-11 Optimizing Sequential Multi-Step Tasks with Parallel LLM Agents Enhao Zhang et.al. 2507.08944 null
2025-07-10 AirScape: An Aerial Generative World Model with Motion Controllability Baining Zhao et.al. 2507.08885 null
2025-07-10 Agent-based visualization of streaming text Jordan Riley Benson et.al. 2507.08884 null
2025-07-11 NeuralOS: Towards Simulating Operating Systems via Neural Generative Models Luke Rivard et.al. 2507.08800 null
2025-07-11 SPLASH! Sample-efficient Preference-based inverse reinforcement learning for Long-horizon Adversarial tasks from Suboptimal Hierarchical demonstrations Peter Crowley et.al. 2507.08707 null
2025-07-11 elsciRL: Integrating Language Solutions into Reinforcement Learning Problem Settings Philip Osborne et.al. 2507.08705 null
2025-07-11 Introspection of Thought Helps AI Agents Haoran Sun et.al. 2507.08664 null
2025-07-11 Safe Deep Reinforcement Learning for Resource Allocation with Peak Age of Information Violation Guarantees Berire Gunes Reyhan et.al. 2507.08653 null
2025-07-11 DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images Haoran Sun et.al. 2507.08648 null
2025-07-11 OnlineBEV: Recurrent Temporal Fusion in Bird's Eye View Representations for Multi-Camera 3D Perception Junho Koh et.al. 2507.08644 null
2025-07-11 Agentic Large Language Models for Conceptual Systems Engineering and Design Soheyl Massoudi et.al. 2507.08619 null
2025-07-11 AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs Florian Grötschla et.al. 2507.08616 null
2025-07-11 Emergent Natural Language with Communication Games for Improving Image Captioning Capabilities without Additional Data Parag Dutta et.al. 2507.08610 null
2025-07-11 Unlocking Speech Instruction Data Potential with Query Rewriting Yonghua Hei et.al. 2507.08603 null
2025-07-11 To Trade or Not to Trade: An Agentic Approach to Estimating Market Risk Improves Trading Decisions Dimitrios Emmanoulopoulos et.al. 2507.08584 null
2025-07-11 SAM2RL: Towards Reinforcement Learning Memory Control in Segment Anything Model 2 Alen Adamyan et.al. 2507.08548 null
2025-07-11 Recursive Reward Aggregation Yuting Tang et.al. 2507.08537 null
2025-07-11 Occlusion-Guided Feature Purification Learning via Reinforced Knowledge Distillation for Occluded Person Re-Identification Yufei Zheng et.al. 2507.08520 null
2025-07-11 The stability of bi-polarization on dynamical directed graphs: an emergent game perspective Yakun Wang et.al. 2507.08449 null
2025-07-11 Finding Common Ground: Using Large Language Models to Detect Agreement in Multi-Agent Decision Conferences Selina Heller et.al. 2507.08440 null
2025-07-11 Age of Information Optimization in Laser-charged UAV-assisted IoT Networks: A Multi-agent Deep Reinforcement Learning Method Geng Sun et.al. 2507.08429 null
2025-07-11 A Survey of Large Language Models in Discipline-specific Research: Challenges, Methods and Opportunities Lu Xiang et.al. 2507.08425 null
2025-07-11 Temperature Measurement in Agent Systems Christoph J. Börner et.al. 2507.08394 null
2025-07-11 Multi-Agent LLMs as Ethics Advocates in AI-Based Systems Asma Yamani et.al. 2507.08392 null
2025-07-11 Online Pre-Training for Offline-to-Online Reinforcement Learning Yongjae Shin et.al. 2507.08387 null
2025-07-11 Exploring Design of Multi-Agent LLM Dialogues for Research Ideation Keisuke Ueda et.al. 2507.08350 null
2025-07-11 What Factors Affect LLMs and RLLMs in Financial Question Answering? Peng Wang et.al. 2507.08339 null
2025-07-11 MK2 at PBIG Competition: A Prompt Generation Solution Yuzheng Xu et.al. 2507.08335 null
2025-07-11 CRMAgent: A Multi-Agent LLM System for E-Commerce CRM Message Template Generation Yinzhu Quan et.al. 2507.08325 null
2025-07-15 KAT-V1: Kwai-AutoThink Technical Report Zizheng Zhan et.al. 2507.08297 null
2025-07-11 Agent Safety Alignment via Reinforcement Learning Zeyang Sha et.al. 2507.08270 null
2025-07-11 Giving AI Agents Access to Cryptocurrency and Smart Contracts Creates New Vectors of AI Harm Bill Marino et.al. 2507.08249 null
2025-07-11 Advancing AI Capabilities and Evolving Labor Outcomes Jacob Dominski et.al. 2507.08244 null
2025-07-10 Effect of Static vs. Conversational AI-Generated Messages on Colorectal Cancer Screening Intent: a Randomized Controlled Trial Neil K. R. Sehgal et.al. 2507.08211 null
2025-07-10 From Curiosity to Competence: How World Models Interact with the Dynamics of Exploration Fryderyk Mantiuk et.al. 2507.08210 null
2025-07-10 Reasoning and Behavioral Equilibria in LLM-Nash Games: From Mindsets to Actions Quanyan Zhu et.al. 2507.08208 null
2025-07-10 A Dynamic Stackelberg Game Framework for Agentic AI Defense Against LLM Jailbreaking Zhengye Han et.al. 2507.08207 null
2025-07-10 KP-A: A Unified Network Knowledge Plane for Catalyzing Agentic Network Intelligence Yun Tang et.al. 2507.08164 null
2025-07-10 Code with Me or for Me? How Increasing AI Automation Transforms Developer Workflows Valerie Chen et.al. 2507.08149 null
2025-07-10 AI for NONMEM Coding in Pharmacometrics Research and Education: Shortcut or Pitfall? Wenhao Zheng et.al. 2507.08144 null
2025-07-10 Noise-Enabled Goal Attainment in Crowded Collectives Lucy Liu et.al. 2507.08100 null
2025-07-10 Multi-Scale Network Dynamics and Systemic Risk: A Model Context Protocol Approach to Financial Markets Avishek Bhandari et.al. 2507.08065 null
2025-07-10 MCPmed: A Call for MCP-Enabled Bioinformatics Web Services for LLM-Driven Discovery Matthias Flotho et.al. 2507.08055 null
2025-07-09 AblationBench: Evaluating Automated Planning of Ablations in Empirical AI Research Talor Abramovich et.al. 2507.08038 null
2025-07-14 PyVision: Agentic Vision with Dynamic Tooling Shitian Zhao et.al. 2507.07998 null
2025-07-10 OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding JingLi Lin et.al. 2507.07984 null
2025-07-15 Reinforcement Learning with Action Chunking Qiyang Li et.al. 2507.07969 null
2025-07-10 MIRIX: Multi-Agent Memory System for LLM-Based Agents Yu Wang et.al. 2507.07957 null
2025-07-10 Agentic Retrieval of Topics and Insights from Earnings Calls Anant Gupta et.al. 2507.07906 null
2025-07-11 The Trust Fabric: Decentralized Interoperability and Economic Coordination for the Agentic Web Sree Bhargavi Balija et.al. 2507.07901 null
2025-07-10 Automating MD simulations for Proteins using Large language Models: NAMD-Agent Achuth Chandrasekhar et.al. 2507.07887 null
2025-07-10 DocCHA: Towards LLM-Augmented Interactive Online diagnosis System Xinyi Liu et.al. 2507.07870 null
2025-07-10 "So, Tell Me About Your Policy...": Distillation of interpretable policies from Deep Reinforcement Learning agents Giovanni Dispoto et.al. 2507.07848 null
2025-07-10 Perceptual Distortions and Autonomous Representation Learning in a Minimal Robotic System David Warutumo et.al. 2507.07845 null
2025-07-10 BEAVER: Building Environments with Assessable Variation for Evaluating Multi-Objective Reinforcement Learning Ruohong Liu et.al. 2507.07769 null
2025-07-10 Beyond Connectivity: Higher-Order Network Framework for Capturing Memory-Driven Mobility Dynamics Chen Zhang et.al. 2507.07727 null
2025-07-10 Multi-agent Reinforcement Learning-based In-place Scaling Engine for Edge-cloud Systems Jovan Prodanov et.al. 2507.07671 null
2025-07-10 Upper Expected Meeting Times for Interdependent Stochastic Agents Marco Sangalli et.al. 2507.07626 null
2025-07-10 Position: We Need An Algorithmic Understanding of Generative AI Oliver Eberle et.al. 2507.07544 null
2025-07-10 Toward Real-World Chinese Psychological Support Dialogues: CPsDD Dataset and a Co-Evolving Multi-Agent System Yuanchen Shi et.al. 2507.07509 null
2025-07-10 The Pandora's Box Problem with Sequential Inspections Ali Aouad et.al. 2507.07508 null
2025-07-15 Hallucination Stations: On Some Basic Limitations of Transformer-Based Language Models Varin Sikka et.al. 2507.07505 null
2025-07-11 StarDojo: Benchmarking Open-Ended Behaviors of Agentic Multimodal LLMs in Production-Living Simulations with Stardew Valley Weihao Tan et.al. 2507.07445 null
2025-07-10 SAND: Boosting LLM Agents with Self-Taught Action Deliberation Yu Xia et.al. 2507.07441 null
2025-07-12 DrugMCTS: a drug repurposing framework combining multi-agent, RAG and Monte Carlo Tree Search Zerui Yang et.al. 2507.07426 null
2025-07-10 KVFlow: Efficient Prefix Caching for Accelerating LLM-Based Multi-Agent Workflows Zaifeng Pan et.al. 2507.07400 null
2025-07-10 PILOC: A Pheromone Inverse Guidance Mechanism and Local-Communication Framework for Dynamic Target Search of Multi-Agent in Unknown Environments Hengrui Liu et.al. 2507.07376 null
2025-07-11 FLoRA: An Advanced AI-Powered Engine to Facilitate Hybrid Human-AI Regulated Learning Xinyu Li et.al. 2507.07362 null
2025-07-09 Optimizing Model Splitting and Device Task Assignment for Deceptive Signal Assisted Private Multi-hop Split Learning Dongyu Wei et.al. 2507.07323 null
2025-07-09 Optimizing Communication and Device Clustering for Clustered Federated Learning with Differential Privacy Dongyu Wei et.al. 2507.07320 null
2025-07-09 Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation Anirban Saha Anik et.al. 2507.07307 null
2025-07-09 ViDove: A Translation Agent System with Multimodal Context and Memory-Augmented Reasoning Yichen Lu et.al. 2507.07306 null
2025-07-09 Application of LLMs to Multi-Robot Path Planning and Task Allocation Ashish Kumar et.al. 2507.07302 null
2025-07-09 LangNavBench: Evaluation of Natural Language Understanding in Semantic Navigation Sonia Raychaudhuri et.al. 2507.07299 null
2025-07-09 The Impact of Background Speech on Interruption Detection in Collaborative Groups Mariah Bradford et.al. 2507.07280 null
2025-07-09 Convergence and Robustness Bounds for Distributed Asynchronous Shortest-Path Jared Miller et.al. 2507.07263 null
2025-07-11 Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery Licong Xu et.al. 2507.07257 null
2025-07-09 Combining Pre-Trained Models for Enhanced Feature Representation in Reinforcement Learning Elia Piccoli et.al. 2507.07197 null
2025-07-09 Evaluating Retrieval-Augmented Generation Agents for Autonomous Scientific Discovery in Astrophysics Xueqing Xu et.al. 2507.07155 null
2025-07-09 4KAgent: Agentic Any Image to 4K Super-Resolution Yushen Zuo et.al. 2507.07105 null
2025-07-09 Graph-Based Complexity Metrics for Multi-Agent Curriculum Learning: A Validated Approach to Task Ordering in Cooperative Coordination Environments Farhaan Ebadulla et.al. 2507.07074 null
2025-07-09 Robust signal decompositions on the circle Aral Kose et.al. 2507.07007 null
2025-07-09 Federated Learning-based MARL for Strengthening Physical-Layer Security in B5G Networks Deemah H. Tashman et.al. 2507.06997 null
2025-07-09 The User-Centric Geo-Experience: An LLM-Powered Framework for Enhanced Planning, Navigation, and Dynamic Adaptation Jieren Deng et.al. 2507.06993 null
2025-07-09 Optimizing Cognitive Networks: Reinforcement Learning Meets Energy Harvesting Over Cascaded Channels Deemah H. Tashman et.al. 2507.06981 null
2025-07-09 Exploring LLMs for Predicting Tutor Strategy and Student Outcomes in Dialogues Fareya Ikram et.al. 2507.06910 null
2025-07-09 MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection Ziyan Liu et.al. 2507.06908 null
2025-07-09 SemRaFiner: Panoptic Segmentation in Sparse and Noisy Radar Point Clouds Matthias Zeller et.al. 2507.06906 null
2025-07-09 Designing Adaptive Algorithms Based on Reinforcement Learning for Dynamic Optimization of Sliding Window Size in Multi-Dimensional Data Streams Abolfazl Zarghani et.al. 2507.06901 null
2025-07-09 VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation Ziang Ye et.al. 2507.06899 null
2025-07-09 Toward Neurodivergent-Aware Productivity: A Systems and AI-Based Human-in-the-Loop Framework for ADHD-Affected Professionals Raghavendra Deshmukh et.al. 2507.06864 null
2025-07-11 The Dark Side of LLMs Agent-based Attacks for Complete Computer Takeover Matteo Lupinacci et.al. 2507.06850 null
2025-07-10 Artificial Generals Intelligence: Mastering Generals.io with Reinforcement Learning Matej Straka et.al. 2507.06825 null
2025-07-09 Comparing Dialectical Systems: Contradiction and Counterexample in Belief Change (Extended Version) Uri Andrews et.al. 2507.06798 null
2025-07-09 Multi-Task Multi-Agent Reinforcement Learning via Skill Graphs Guobin Zhu et.al. 2507.06690 null
2025-07-09 Peer influence breaks ergodicity in an opinion dynamics model with external information Federica De Domenico et.al. 2507.06661 null
2025-07-09 Growing Trees with an Agent: Accelerating RRTs with Learned, Multi-Step Episodic Exploration Xinyu Wu et.al. 2507.06605 null
2025-07-09 Generalization in Reinforcement Learning for Radio Access Networks Burak Demirel et.al. 2507.06602 null
2025-07-15 A Mathematical Theory of Discursive Networks Juan B. Gutiérrez et.al. 2507.06565 null
2025-07-09 SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments Tianshun Li et.al. 2507.06564 null
2025-07-09 On the Hardness of Unsupervised Domain Adaptation: Optimal Learners and Information-Theoretic Perspective Zhiyi Dong et.al. 2507.06552 null
2025-07-09 ILNet: Trajectory Prediction with Inverse Learning Attention for Enhancing Intention Capture Mingjin Zeng et.al. 2507.06531 null
2025-07-09 InvestAlign: Overcoming Data Scarcity in Aligning Large Language Models with Investor Decision-Making Processes under Herd Behavior Huisheng Wang et.al. 2507.06528 null
2025-07-09 Gradientsys: A Multi-Agent LLM Scheduler with ReAct Orchestration Xinyuan Song et.al. 2507.06520 null
2025-07-13 Prediction-Augmented Mechanism Design for Weighted Facility Location Yangguang Shi et.al. 2507.06509 null
2025-07-09 Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings Russell Taylor et.al. 2507.06506 null
2025-07-09 Learning To Communicate Over An Unknown Shared Network Shivangi Agarwal et.al. 2507.06499 null
2025-07-09 Learning Japanese with Jouzu: Interaction Outcomes with Stylized Dialogue Fictional Agents Zackary Rackauckas et.al. 2507.06483 null
2025-07-09 Foundation Model Self-Play: Open-Ended Strategy Innovation via Foundation Models Aaron Dharna et.al. 2507.06466 null
2025-07-08 Eyes on the Road, Mind Beyond Vision: Context-Aware Multi-modal Enhanced Risk Anticipation Jiaxun Zhang et.al. 2507.06444 null
2025-07-08 Distributed Optimization of Finite Condition Number for Laplacian Matrix in Multi-Agent Systems Yicheng Xu et.al. 2507.06440 null
2025-07-08 Experience-Centric Resource Management in ISAC Networks: A Digital Agent-Assisted Approach Xinyu Huang et.al. 2507.06436 null
2025-07-08 Representing Prompting Patterns with PDL: Compliance Agent Case Study Mandana Vaziri et.al. 2507.06396 null
2025-07-08 VoI-aware Scheduling Schemes for Multi-Agent Formation Control Federico Chiariotti et.al. 2507.06392 null
2025-07-08 Solving the Constrained Random Disambiguation Path Problem via Lagrangian Relaxation and Graph Reduction Li Zhou et.al. 2507.06346 null
2025-07-08 Bridging AI and Software Security: A Comparative Vulnerability Assessment of LLM Agent Deployment Paradigms Tarek Gasmi et.al. 2507.06323 null
2025-07-08 Too Human to Model:The Uncanny Valley of LLMs in Social Simulation -- When Generative Language Agents Misalign with Modelling Principles Yongchao Zeng et.al. 2507.06310 null
2025-07-08 A Survey of Multi Agent Reinforcement Learning: Federated Learning and Cooperative and Noncooperative Decentralized Regimes Kemboi Cheruiyot et.al. 2507.06278 null
2025-07-11 Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities Gheorghe Comanici et.al. 2507.06261 null
2025-07-10 Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving Xiangru Tang et.al. 2507.06229 null
2025-07-08 Aligned Textual Scoring Rules Yuxuan Lu et.al. 2507.06221 null
2025-07-08 Evaluation of Habitat Robotics using Large Language Models William Li et.al. 2507.06157 null
2025-07-08 OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety Sanidhya Vijayvargiya et.al. 2507.06134 null
2025-07-08 A Directed Lazy Random Walk Model to Three-Way Dynamic Matching Problem Souvik Roy et.al. 2507.06126 null
2025-07-08 On Lockean beliefs that are deductively closed and minimal change Tommaso Flaminio et.al. 2507.06042 null
2025-07-08 Conditional Multi-Stage Failure Recovery for Embodied Agents Youmna Farag et.al. 2507.06016 null
2025-07-08 From General Relation Patterns to Task-Specific Decision-Making in Continual Multi-Agent Coordination Chang Yao et.al. 2507.06004 null
2025-07-08 Multi-Agent Debate Strategies to Enhance Requirements Engineering with Large Language Models Marc Oriol et.al. 2507.05981 null
2025-07-08 CogniPlay: a work-in-progress Human-like model for General Game Playing Aloïs Rautureau et.al. 2507.05868 null
2025-07-08 Constella: Supporting Storywriters' Interconnected Character Creation through LLM-based Multi-Agents Syemin Park et.al. 2507.05820 null
2025-07-08 Just Say Better or Worse: A Human-AI Collaborative Framework for Medical Image Segmentation Without Manual Annotations Yizhe Zhang et.al. 2507.05815 null
2025-07-10 GTA1: GUI Test-time Scaling Agent Yan Yang et.al. 2507.05791 null
2025-07-08 On the detection of medium inhomogeneity by contrast agent: wave scattering models and numerical implementations Zhe Wang et.al. 2507.05773 null
2025-07-08 An autonomous agent for auditing and improving the reliability of clinical AI models Lukas Kuhn et.al. 2507.05755 null
2025-07-08 An efficiency ordering of k-price auctions under complete information Sumit Goel et.al. 2507.05738 null
2025-07-08 Large Language Models for Agent-Based Modelling: Current and possible uses across the modelling cycle Loïs Vanhée et.al. 2507.05723 null
2025-07-08 MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment Yucheng Shi et.al. 2507.05720 null
2025-07-08 Agentic-R1: Distilled Dual-Strategy Reasoning Weihua Du et.al. 2507.05707 null
2025-07-08 R-VLM: Region-Aware Vision Language Model for Precise GUI Grounding Joonhyung Park et.al. 2507.05673 null
2025-07-08 ECom-Bench: Can LLM Agent Resolve Real-World E-commerce Customer Support Issues? Haoxin Wang et.al. 2507.05639 null
2025-07-08 LLMs are Introvert Litian Zhang et.al. 2507.05638 null
2025-07-08 How Not to Detect Prompt Injections with an LLM Sarthak Choudhary et.al. 2507.05630 null
2025-07-08 Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study Ibne Farabi Shihab et.al. 2507.05619 null
2025-07-08 Density Discontinuity Regression Surya T Tokdar et.al. 2507.05581 null
2025-07-08 Preemptive Solving of Future Problems: Multitask Preplay in Humans and Machines Wilka Carvalho et.al. 2507.05561 null
2025-07-09 AI Agent Smart Contract Exploit Generation Arthur Gervais et.al. 2507.05558 null
2025-07-07 Evolutionary and Coevolutionary Multi-Agent Design Choices and Dynamics Erik Hemberg et.al. 2507.05534 null
2025-07-07 Conversational Education at Scale: A Multi-LLM Agent Workflow for Procedural Learning and Pedagogic Quality Assessment Jiahuan Pei et.al. 2507.05528 null
2025-07-07 Cultivating Multimodal Intelligence: Interpretive Reasoning and Agentic RAG Approaches to Dermatological Diagnosis Karishma Thakrar et.al. 2507.05520 null
2025-07-09 Empowering Healthcare Practitioners with Language Models: Structuring Speech Transcripts in Two Real-World Clinical Applications Jean-Philippe Corbeil et.al. 2507.05517 null
2025-07-07 Deep Research Comparator: A Platform For Fine-grained Human Annotations of Deep Research Agents Prahaladh Chandrahasan et.al. 2507.05495 null
2025-07-07 Constraint Hypergraphs as a Unifying Framework for Digital Twins John Morris et.al. 2507.05494 null
2025-07-07 Inaugural MOASEI Competition at AAMAS'2025: A Technical Report Ceferino Patino et.al. 2507.05469 null
2025-07-07 2048: Reinforcement Learning in a Delayed Reward Environment Prady Saligram et.al. 2507.05465 null
2025-07-07 A Systematization of Security Vulnerabilities in Computer Use Agents Daniel Jones et.al. 2507.05445 null
2025-07-07 Motion Generation: A Survey of Generative Approaches and Benchmarks Aliasghar Khani et.al. 2507.05419 null
2025-07-07 MindFlow: Revolutionizing E-commerce Customer Support with Multimodal LLM Agents Ming Gong et.al. 2507.05330 null
2025-07-07 AGACCI : Affiliated Grading Agents for Criteria-Centric Interface in Educational Coding Contexts Kwangsuk Park et.al. 2507.05321 null
2025-07-07 OASBuilder: Generating OpenAPI Specifications from Online API Documentation with Large Language Models Koren Lazar et.al. 2507.05316 null
2025-07-10 Fuzzy Classification Aggregation for a Continuum of Agents Zijun Meng et.al. 2507.05297 null
2025-07-05 A LLM-Driven Multi-Agent Systems for Professional Development of Mathematics Teachers Kaiqi Yang et.al. 2507.05292 null
2025-07-03 A Fuzzy Supervisor Agent Design for Clinical Reasoning Assistance in a Multi-Agent Educational Clinical Scenario Simulation Weibing Zheng et.al. 2507.05275 null
2025-07-07 Spatio-Temporal LLM: Reasoning about Environments and Actions Haozhen Zheng et.al. 2507.05258 null
2025-07-07 Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions Yuanzhe Hu et.al. 2507.05257 null
2025-07-07 From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving Fabian Konstantinidis et.al. 2507.05254 null
2025-07-07 Action Space Reduction Strategies for Reinforcement Learning in Autonomous Driving Elahe Delavari et.al. 2507.05251 null
2025-07-07 Modeling Latent Partner Strategies for Adaptive Zero-Shot Human-Agent Collaboration Benjamin Li et.al. 2507.05244 null
2025-07-08 SciMaster: Towards General-Purpose Scientific AI Agents, Part I. X-Master as Foundation: Can We Lead on Humanity's Last Exam? Jingyi Chai et.al. 2507.05241 null
2025-07-07 StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling Meng Wei et.al. 2507.05240 null
2025-07-12 MedGemma Technical Report Andrew Sellergren et.al. 2507.05201 null
2025-07-07 CREW-WILDFIRE: Benchmarking Agentic Multi-Agent Collaborations at Scale Jonathan Hyun et.al. 2507.05178 null
2025-07-07 Vector Cost Bimatrix Games with Applications to Autonomous Racing Benjamin R. Toaz et.al. 2507.05171 null
2025-07-07 Critiques of World Models Eric Xing et.al. 2507.05169 null
2025-07-07 Macroscopic Structural Light Absorbers Jan M. Kaster et.al. 2507.05152 null
2025-07-07 Effects of Unplanned Incoming Flights on Airport Relief Processes after a Major Natural Disaster Luka Van de Sype et.al. 2507.05150 null
2025-07-07 LERa: Replanning with Visual Feedback in Instruction Following Svyatoslav Pchelintsev et.al. 2507.05135 null
2025-07-07 Optimal Consumption-Investment for General Utility with a Drawdown Constraint over a Finite-Time Horizon Chonghu Guan et.al. 2507.05115 null
2025-07-07 Beyond Features: How Dataset Design Influences Multi-Agent Trajectory Prediction Performance Tobias Demmler et.al. 2507.05098 null
2025-07-07 Perspectives on How Sociology Can Advance Theorizing about Human-Chatbot Interaction and Developing Chatbots for Social Good Celeste Campos-Castillo et.al. 2507.05030 null
2025-07-07 Linking Homeostasis to Reinforcement Learning: Internal State Control of Motivated Behavior Naoto Yoshida et.al. 2507.04998 null
2025-07-07 From Autonomy to Agency: Agentic Vehicles for Human-Centered Mobility Systems Jiangbo Yu et.al. 2507.04996 null
2025-07-07 Leadership Detection via Time-Lagged Correlation-Based Network Inference Thayanne França da Silva et.al. 2507.04917 null
2025-07-07 MARBLE: A Multi-Agent Rule-Based LLM Reasoning Engine for Accident Severity Prediction Kaleem Ullah Qasim et.al. 2507.04893 null
2025-07-07 Fine-tuning on simulated data outperforms prompting for agent tone of voice Ingo Marquardt et.al. 2507.04889 null
2025-07-07 Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning Giwon Lee et.al. 2507.04790 null
2025-07-07 Training-free Generation of Temporally Consistent Rewards from VLMs Yinuo Zhao et.al. 2507.04789 null
2025-07-07 FurniMAS: Language-Guided Furniture Decoration using Multi-Agent System Toan Nguyen et.al. 2507.04770 null
2025-07-07 Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking Maria Damanaki et.al. 2507.04762 null
2025-07-07 LLM-based Question-Answer Framework for Sensor-driven HVAC System Interaction Sungmin Lee et.al. 2507.04748 null
2025-07-07 Who's the Mole? Modeling and Detecting Intention-Hiding Malicious Agents in LLM-Based Multi-Agent Systems Yizhe Xie et.al. 2507.04724 null
2025-07-07 UrbanMind: Towards Urban General Intelligence via Tool-Enhanced Retrieval-Augmented Generation and Multilevel Optimization Kai Yang et.al. 2507.04706 null
2025-07-07 Interpretable Reward Modeling with Active Concept Bottlenecks Sonia Laguna et.al. 2507.04695 null
2025-07-07 Quantitative Single-particle Profiling of Extracellular Vesicles via Fluorescent Nanoparticle Tracking Analysis Yiting Liu et.al. 2507.04655 null
2025-07-07 LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction Yixin Yan et.al. 2507.04634 null
2025-07-07 Equilibrium Strategies for the N-agent Mean-Variance Investment Problem over a Random Horizon Xiaoqing Liang et.al. 2507.04611 null
2025-07-07 VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents Rui Meng et.al. 2507.04590 null
2025-07-08 Greedy Dynamic Matching Nick Arnosti et.al. 2507.04551 null
2025-07-06 Grounded Gesture Generation: Language, Motion, and Space Anna Deichler et.al. 2507.04522 null
2025-07-09 Constant-Approximate and Constant-Strategyproof Two-Facility Location Elijah Journey Fullerton et.al. 2507.04485 null
2025-07-06 Agentic Distributed Computing Ajay D. Kshemkalyani et.al. 2507.04459 null
2025-07-06 "Hi AirStar, Guide Me to the Badminton Court." Ziqin Wang et.al. 2507.04430 null
2025-07-06 MOMENTS: A Comprehensive Multimodal Benchmark for Theory of Mind Emilio Villa-Cueva et.al. 2507.04415 null
2025-07-06 Multimedia Verification Through Multi-Agent Deep Research Multimodal Large Language Models Huy Hoan Le et.al. 2507.04410 null
2025-07-06 Inverse Reinforcement Learning using Revealed Preferences and Passive Stochastic Optimization Vikram Krishnamurthy et.al. 2507.04396 null
2025-07-08 MOD-X: A Modular Open Decentralized eXchange Framework proposal for Heterogeneous Interoperable Artificial Intelligence Agents Georgios Ioannides et.al. 2507.04376 null
2025-07-06 Adaptive Malware Detection using Sequential Feature Selection: A Dueling Double Deep Q-Network (D3QN) Framework for Intelligent Classification Naseem Khan et.al. 2507.04372 null
2025-07-06 WebSynthesis: World-Model-Guided MCTS for Efficient WebUI-Trajectory Synthesis Yifei Gao et.al. 2507.04370 null
2025-07-06 Mission-Aligned Learning-Informed Control of Autonomous Systems: Formulation and Foundations Vyacheslav Kungurtsev et.al. 2507.04356 null
2025-07-06 Wavelet Policy: Lifting Scheme for Policy Learning in Long-Horizon Tasks Hao Huang et.al. 2507.04331 null
2025-07-06 Covalently Integrated CNT@rGO for Superior Conductivity and Cycling Stability in Lithium-Ion Batterie Junwen Tang et.al. 2507.04296 null
2025-07-06 SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement Liwen Xiao et.al. 2507.04263 null
2025-07-06 Hijacking JARVIS: Benchmarking Mobile GUI Agents against Unprivileged Third Parties Guohong Liu et.al. 2507.04227 null
2025-07-05 Gathering Teams of Bounded Memory Agents on a Line Younan Gao et.al. 2507.04172 null
2025-07-05 Comparative Evaluation of VR-Enabled Robots and Human Operators for Targeted Disease Management in Vineyards Hasan Seyyedhasani et.al. 2507.04167 null
2025-07-08 Adaptive Two-sided Assortment Optimization: Revenue Maximization Mohammadreza Ahmadnejadsaein et.al. 2507.04156 null
2025-07-05 Learning Humanoid Arm Motion via Centroidal Momentum Regularized Multi-Agent Reinforcement Learning Ho Jae Lee et.al. 2507.04140 null
2025-07-05 BYOKG-RAG: Multi-Strategy Graph Retrieval for Knowledge Graph Question Answering Costas Mavromatis et.al. 2507.04127 null
2025-07-05 Enhancing Robustness of LLM-Driven Multi-Agent Systems through Randomized Smoothing Jinwei Hu et.al. 2507.04105 null
2025-07-05 How to Train Your LLM Web Agent: A Statistical Diagnosis Dheeraj Vattikonda et.al. 2507.04103 null
2025-07-05 Dynamic Asset Pricing with α-MEU Model Jiacheng Fan et.al. 2507.04093 null
2025-07-05 Accurate and Efficient World Modeling with Masked Latent Transformers Maxime Burchi et.al. 2507.04075 null
2025-07-05 Efficiency through Evolution, A Darwinian Approach to Agent-Based Economic Forecast Modeling Martin Jaraiz et.al. 2507.04074 null
2025-07-05 HAWK: A Hierarchical Workflow Framework for Multi-Agent Collaboration Yuyang Cheng et.al. 2507.04067 null
2025-07-05 TopoMAS: Large Language Model Driven Topological Materials Multiagent System Baohua Zhang et.al. 2507.04053 null
2025-07-05 Breaking Imitation Bottlenecks: Reinforced Diffusion Powers Diverse Trajectory Generation Ziying Song et.al. 2507.04049 null
2025-07-05 Move to Understand a 3D Scene: Bridging Visual Grounding and Exploration for Efficient and Versatile Embodied Navigation Ziyu Zhu et.al. 2507.04047 null
2025-07-05 Ready Jurist One: Benchmarking Language Agents for Legal Intelligence in Dynamic Environments Zheng Jia et.al. 2507.04037 null
2025-07-05 PresentAgent: Multimodal Agent for Presentation Video Generation Jingwei Shi et.al. 2507.04036 null
2025-07-05 Exploring a Gamified Personality Assessment Method through Interaction with Multi-Personality LLM Agents Baiqiao Zhang et.al. 2507.04005 null
2025-07-05 MalVol-25: A Diverse, Labelled and Detailed Volatile Memory Dataset for Malware Detection and Response Testing and Validation Dipo Dunsin et.al. 2507.03993 null
2025-07-05 Fair and Efficient Allocation of Indivisible Mixed Manna Siddharth Barman et.al. 2507.03946 null
2025-07-05 CortexDebate: Debating Sparsely and Equally for Multi-Agent Debate Yiliu Sun et.al. 2507.03928 null
2025-07-05 Agent Exchange: Shaping the Future of AI Agent Economics Yingxuan Yang et.al. 2507.03904 null
2025-07-05 Uncovering Systemic and Environment Errors in Autonomous Systems Using Differential Testing Rahil P Mehta et.al. 2507.03870 null
2025-07-04 Participatory Evolution of Artificial Life Systems via Semantic Feedback Shuowen Li et.al. 2507.03839 null
2025-07-04 Leveraging Large Language Models for Tacit Knowledge Discovery in Organizational Contexts Gianlucca Zuin et.al. 2507.03811 null
2025-07-04 Generating Novelty in Open-World Multi-Agent Strategic Board Games Mayank Kejriwal et.al. 2507.03802 null
2025-07-04 Learning Dark Souls Combat Through Pixel Input With Neuroevolution Jim O'Connor et.al. 2507.03793 null
2025-07-04 Less is More: Empowering GUI Agent with Context-Aware Simplification Gongwei Chen et.al. 2507.03730 null
2025-07-04 Agent-Based Detection and Resolution of Incompleteness and Ambiguity in Interactions with Large Language Models Riya Naik et.al. 2507.03726 null
2025-07-09 Can LLMs Play Ô Ăn Quan Game? A Study of Multi-Step Planning and Decision Making Sang Quang Nguyen et.al. 2507.03711 null
2025-07-04 Towards Machine Theory of Mind with Large Language Model-Augmented Inverse Planning Rebekah A. Gelpí et.al. 2507.03682 null
2025-07-04 STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and Benchmarking Tek Raj Chhetri et.al. 2507.03674 null
2025-07-04 Recon, Answer, Verify: Agents in Search of Truth Satyam Shukla et.al. 2507.03671 null
2025-07-04 When Does Diversity Matter? A Unified Framework for Binary-Choice Dynamics Arkadiusz Jędrzejewski et.al. 2507.03665 null
2025-07-04 Is It Time To Treat Prompts As Code? A Multi-Use Case Study For Prompt Optimization Using DSPy Francisca Lemos et.al. 2507.03620 null
2025-07-04 EvoAgentX: An Automated Framework for Evolving Agentic Workflows Yingxu Wang et.al. 2507.03616 null
2025-07-04 On characterization and existence of a constrained correlated equilibria in Markov games Tingting Ni et.al. 2507.03502 null
2025-07-09 Reinforcement Learning-based Feature Generation Algorithm for Scientific Data Meng Xiao et.al. 2507.03498 null
2025-07-04 AI-VaxGuide: An Agentic RAG-Based LLM for Vaccination Decisions Abdellah Zeggai et.al. 2507.03493 null
2025-07-04 Explainable Information Retrieval in the Audit Domain Alexander Frummet et.al. 2507.03479 null
2025-07-04 REAL: Benchmarking Abilities of Large Language Models for Housing Transactions and Services Kexin Zhu et.al. 2507.03477 null
2025-07-04 Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis Weitong Zhang et.al. 2507.03460 null
2025-07-04 ElliottAgents: A Natural Language-Driven Multi-Agent System for Stock Market Analysis and Prediction Jarosław A. Chudziak et.al. 2507.03435 null
2025-07-04 Lessons from a Chimp: AI "Scheming" and the Quest for Ape Language Christopher Summerfield et.al. 2507.03409 null
2025-07-04 Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky Ashutosh Hathidara et.al. 2507.03336 null
2025-07-04 Mirror in the Model: Ad Banner Image Generation via Reflective Multi-LLM and Multi-modal Agents Zhao Wang et.al. 2507.03326 null
2025-07-04 GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation Himanshu Dutta et.al. 2507.03311 null
2025-07-04 Dyn-O: Building Structured World Models with Object-Centric Representations Zizhao Wang et.al. 2507.03298 null
2025-07-04 LTLCrit: A Temporal Logic-based LLM Critic for Safe and Efficient Embodied Agents Anand Gokhale et.al. 2507.03293 null
2025-07-04 Conformal Information Pursuit for Interactively Guiding Large Language Models Kwan Ho Ryan Chan et.al. 2507.03279 null
2025-07-04 GDGB: A Benchmark for Generative Dynamic Text-Attributed Graph Learning Jie Peng et.al. 2507.03267 null
2025-07-04 Coalitional stability under myopic expectations and externalities Agustin G. Bonifacio et.al. 2507.03259 null
2025-07-04 CodeAgents: A Token-Efficient Framework for Codified Multi-Agent Reasoning in LLMs Bruce Yang et.al. 2507.03254 null
2025-07-03 SI-Agent: An Agentic Framework for Feedback-Driven Generation and Tuning of Human-Readable System Instructions for Large Language Models Jeshwanth Challagundla et.al. 2507.03223 null
2025-07-03 In vivo imaging of central nervous system fluid spaces using synchrotron radiation-based micro computed tomography Marta Girona Alarcón et.al. 2507.03186 null
2025-07-03 Last-Iterate Convergence of No-Regret Learning for Equilibria in Bargaining Games Serafina Kamp et.al. 2507.03150 null
2025-07-03 RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents Peisong Wang et.al. 2507.03112 null
2025-07-03 From Turing to Tomorrow: The UK's Approach to AI Regulation Oliver Ritchie et.al. 2507.03050 null
2025-07-02 Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains Abhishek Verma et.al. 2507.03026 null
2025-07-02 OpenTable-R1: A Reinforcement Learning Augmented Tool Agent for Open-Domain Table Question Answering Zipeng Qiu et.al. 2507.03018 null
2025-07-10 Establishing Best Practices for Building Rigorous Agentic Benchmarks Yuxuan Zhu et.al. 2507.02825 null
2025-07-03 Moral Responsibility or Obedience: What Do We Want from AI? Joseph Boland et.al. 2507.02788 null
2025-07-06 KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs Yuzhang Xie et.al. 2507.02773 null
2025-07-03 Knowledge Protocol Engineering: A New Paradigm for AI in Domain-Specific Knowledge Work Guangwei Zhang et.al. 2507.02760 null
2025-07-03 Defining and classifying models of groups: The social ontology of higher-order networks Jonathan St-Onge et.al. 2507.02758 null
2025-07-03 Multi-agent Auditory Scene Analysis Caleb Rascon et.al. 2507.02755 null
2025-07-03 Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks Sizhe Chen et.al. 2507.02735 null
2025-07-03 Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving Matthieu Zimmer et.al. 2507.02726 null
2025-07-03 A Forget-and-Grow Strategy for Deep Reinforcement Learning Scaling in Continuous Control Zilin Kang et.al. 2507.02712 null
2025-07-03 Fluid Democracy in Federated Data Aggregation Aditya Vema Reddy Kesari et.al. 2507.02710 null
2025-07-03 Control at Stake: Evaluating the Security Landscape of LLM-Driven Email Agents Jiangrong Wu et.al. 2507.02699 null
2025-07-03 Multi-Agent Reinforcement Learning for Dynamic Pricing in Supply Chains: Benchmarking Strategic Agent Behaviours under Realistically Simulated Market Conditions Thomas Hazenberg et.al. 2507.02698 null
2025-07-03 On the Convergence of Large Language Model Optimizer for Black-Box Network Management Hoon Lee et.al. 2507.02689 null
2025-07-03 TUC-PPO: Team Utility-Constrained Proximal Policy Optimization for Spatial Public Goods Games Zhaoqilin Yang et.al. 2507.02675 null
2025-07-03 Hey AI, Generate Me a Hardware Code! Agentic AI-based Hardware Design & Verification Deepak Narayan Gadde et.al. 2507.02660 null
2025-07-03 Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search Jiajie Jin et.al. 2507.02652 null
2025-07-03 On Efficient Bayesian Exploration in Model-Based Reinforcement Learning Alberto Caron et.al. 2507.02639 null
2025-07-03 VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement Learning Siran Chen et.al. 2507.02626 null
2025-07-03 Strategic Intelligence in Large Language Models: Evidence from evolutionary Game Theory Kenneth Payne et.al. 2507.02618 null
2025-07-03 DynamiCare: A Dynamic Multi-Agent Framework for Interactive and Open-Ended Medical Decision-Making Tianqi Shang et.al. 2507.02616 null
2025-07-03 WebSailor: Navigating Super-human Reasoning for Web Agent Kuan Li et.al. 2507.02592 null
2025-07-03 AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench Edan Toledo et.al. 2507.02554 null
2025-07-03 Are You Listening to Me? Fine-Tuning Chatbots for Empathetic Dialogue Paulo Ricardo Knob et.al. 2507.02537 null
2025-07-03 A Late Collaborative Perception Framework for 3D Multi-Object and Multi-Source Association and Fusion Maryem Fadili et.al. 2507.02430 null
2025-07-03 CyberRAG: An agentic RAG cyber attack classification and reporting tool Francesco Blefari et.al. 2507.02424 null
2025-07-03 Improving Consistency in Vehicle Trajectory Prediction Through Preference Optimization Caio Azevedo et.al. 2507.02406 null
2025-07-03 Deep Reinforcement Learning-Based DRAM Equalizer Parameter Optimization Using Latent Representations Muhammad Usama et.al. 2507.02365 null
2025-07-03 OMS: On-the-fly, Multi-Objective, Self-Reflective Ad Keyword Generation via LLM Agent Bowen Chen et.al. 2507.02353 null
2025-07-03 CineMyoPS: Segmenting Myocardial Pathologies from Cine Cardiac MR Wangbin Ding et.al. 2507.02289 null
2025-07-03 MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent Hongli Yu et.al. 2507.02259 null
2025-07-03 SurgVisAgent: Multimodal Agentic Model for Versatile Surgical Visual Enhancement Zeyu Lei et.al. 2507.02252 null
2025-07-04 CoInfra: A Large-Scale Cooperative Infrastructure Perception System and Dataset in Adverse Weather Minghao Ning et.al. 2507.02245 null
2025-07-04 Dilution, Diffusion and Symbiosis in Spatial Prisoner's Dilemma with Reinforcement Learning Gustavo C. Mangold et.al. 2507.02211 null
2025-07-08 Average Action Efficiency Rises Monotonically in Self-Organizing Systems via Stochastic Least-Action Dynamics Georgi Yordanov Georgiev et.al. 2507.02209 null
2025-07-02 Operator-Theoretic Methods for Differential Games Craig Bakker et.al. 2507.02203 null
2025-07-02 Do Role-Playing Agents Practice What They Preach? Belief-Behavior Consistency in LLM-Based Simulations of Human Trust Amogh Mannekote et.al. 2507.02197 null
2025-07-02 Enhancing COBOL Code Explanations: A Multi-Agents Approach Using Large Language Models Fangjian Lei et.al. 2507.02182 null
2025-07-02 Towards Bio-Inspired Robotic Trajectory Planning via Self-Supervised RNN Miroslav Cibula et.al. 2507.02171 null
2025-07-02 Synergizing Logical Reasoning, Knowledge Management and Collaboration in Multi-Agent LLM System Adam Kostka et.al. 2507.02170 null
2025-07-02 The optimal degree for maximizing rumor spreading on a ring lattice Ana C. Díaz Bacca et.al. 2507.02141 null
2025-07-02 PAL: Designing Conversational Agents as Scalable, Cooperative Patient Simulators for Palliative-Care Training Neil K. R. Sehgal et.al. 2507.02122 null
2025-07-02 What Neuroscience Can Teach AI About Learning in Continuously Changing Environments Daniel Durstewitz et.al. 2507.02103 null
2025-07-02 The Future is Agentic: Definitions, Perspectives, and Open Challenges of Multi-Agent Recommender Systems Reza Yousefi Maragheh et.al. 2507.02097 null
2025-07-02 Measuring Scientific Capabilities of Language Models with a Systems Biology Dry Lab Haonan Duan et.al. 2507.02083 null
2025-07-02 Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs Mohammad Ali Alomrani et.al. 2507.02076 null
2025-07-05 RoboBrain 2.0 Technical Report BAAI RoboBrain Team et.al. 2507.02029 null
2025-07-01 STELLA: Self-Evolving LLM Agent for Biomedical Research Ruofan Jin et.al. 2507.02004 null
2025-07-01 Dynamic Strategy Adaptation in Multi-Agent Environments with Large Language Models Shaurya Mallampati et.al. 2507.02002 null
2025-07-04 Towards a Playground to Democratize Experimentation and Benchmarking of AI Agents for Network Troubleshooting Zhihao Wang et.al. 2507.01997 null
2025-06-29 Integrating Large Language Models in Financial Investments and Market Analysis: A Survey Sedigheh Mahdavi et.al. 2507.01990 null
2025-07-02 The Thin Line Between Comprehension and Persuasion in LLMs Adrian de Wynter et.al. 2507.01936 null
2025-07-03 Decision-Oriented Text Evaluation Yu-Shiang Huang et.al. 2507.01923 null
2025-07-02 An in-silico lung phantom to assess the performance of pulmonary artery segmentation using angiogram Sunder Neelakantan et.al. 2507.01867 null
2025-07-02 Bridging UI Design and chatbot Interactions: Applying Form-Based Principles to Conversational Agents Sanjay Krishna Anbalagan et.al. 2507.01862 null
2025-07-02 TD-MPC-Opt: Distilling Model-Based Multi-Task Reinforcement Learning Agents Dmytro Kuzmenko et.al. 2507.01823 null
2025-07-06 AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction Bin Rao et.al. 2507.01801 null
2025-07-02 ECCV 2024 W-CODA: 1st Workshop on Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving Kai Chen et.al. 2507.01735 null
2025-07-02 Agent Ideate: A Framework for Product Idea Generation from Patents Using Agentic AI Gopichand Kanumolu et.al. 2507.01717 null
2025-07-02 Using Machine Learning to Compute Constrained Optimal Carbon Tax Rules Felix Kübler et.al. 2507.01704 null
2025-07-02 AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness Zixin Chen et.al. 2507.01702 null
2025-07-02 Exploring Advanced LLM Multi-Agent Systems Based on Blackboard Architecture Bochen Han et.al. 2507.01701 null
2025-07-02 Quantum reinforcement learning in dynamic environments Oliver Sefrin et.al. 2507.01691 null
2025-07-02 What does really matter in image goal navigation? Gianluca Monaci et.al. 2507.01667 null
2025-07-02 Data Agent: A Holistic Architecture for Orchestrating Data+AI Ecosystems Zhaoyan Sun et.al. 2507.01599 null
2025-07-02 Emotionally Intelligent Task-oriented Dialogue Systems: Architecture, Representation, and Optimisation Shutong Feng et.al. 2507.01594 null
2025-07-02 Vision-Aided ISAC in Low-Altitude Economy Networks via De-Diffused Visual Priors Yulan Gao et.al. 2507.01574 null
2025-07-02 Time-Varying Coverage Control: A Distributed Tracker-Planner MPC Framework Patrick Benito Eberhard et.al. 2507.01567 null
2025-07-02 Chargax: A JAX Accelerated EV Charging Simulator Koen Ponse et.al. 2507.01522 null
2025-07-02 Agent-as-Tool: A Study on the Hierarchical Decision Making with Reinforcement Learning Yanfei Zhang et.al. 2507.01489 null
2025-07-02 BioMARS: A Multi-Agent Robotic System for Autonomous Biological Experiments Yibo Qiu et.al. 2507.01485 null
2025-07-02 Using multi-agent architecture to mitigate the risk of LLM hallucinations Abd Elrahman Amer et.al. 2507.01446 null
2025-07-02 Reinforcement Learning for Discrete-time LQG Mean Field Social Control Problems with Unknown Dynamics Hanfang Zhang et.al. 2507.01420 null
2025-07-02 Evaluating LLM Agent Collusion in Double Auctions Kushal Agrawal et.al. 2507.01413 null
2025-07-02 RALLY: Role-Adaptive LLM-Driven Yoked Navigation for Agentic UAV Swarms Ziyao Wang et.al. 2507.01378 null
2025-07-02 AI Agents and Agentic AI-Navigating a Plethora of Concepts for Future Manufacturing Yinwang Ren et.al. 2507.01376 null
2025-07-02 Context-Aware Code Wiring Recommendation with LLM-based Agent Taiming Wang et.al. 2507.01315 null
2025-07-02 LANet: A Lane Boundaries-Aware Approach For Robust Trajectory Prediction Muhammad Atta ur Rahman et.al. 2507.01308 null
2025-07-02 Optimal Dispersion Under Asynchrony Debasish Pattanayak et.al. 2507.01298 null
2025-07-05 Frustratingly Simple Retrieval Improves Challenging, Reasoning-Intensive Benchmarks Xinxi Lyu et.al. 2507.01297 null
2025-07-02 GAIus: Combining Genai with Legal Clauses Retrieval for Knowledge-based Assistant Michał Matak et.al. 2507.01259 null
2025-07-02 AIGVE-MACS: Unified Multi-Aspect Commenting and Scoring Model for AI-Generated Video Evaluation Xiao Liu et.al. 2507.01255 null
2025-07-01 Rethinking the Illusion of Thinking Iñaki Dellibarda Varela et.al. 2507.01231 null
2025-07-01 SonoGym: High Performance Simulation for Challenging Surgical Tasks with Robotic Ultrasound Yunke Ao et.al. 2507.01152 null
2025-07-01 Agentic AI in Product Management: A Co-Evolutionary Model Nishant A. Parikh et.al. 2507.01069 null
2025-06-30 Epitome: Pioneering an Experimental Platform for AI-Social Science Integration Jingjing Qu et.al. 2507.01061 null
2025-06-30 Optimizing Conversational Product Recommendation via Reinforcement Learning Kang Liu et.al. 2507.01060 null
2025-06-29 Automated Vehicles Should be Connected with Natural Language Xiangbo Gao et.al. 2507.01059 null
2025-07-01 Running Quantum Computers in Discovery Mode Benedikt Placke et.al. 2507.01013 null
2025-07-02 GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning GLM-V Team et.al. 2507.01006 null
2025-07-01 RTMap: Real-Time Recursive Mapping with Change Detection and Localization Yuheng Du et.al. 2507.00980 null
2025-07-01 Enhancing LLM Agent Safety via Causal Influence Prompting Dongyoon Hahm et.al. 2507.00979 null
2025-07-01 Decentralised Multi-Manager Fund Framework Arman Abgaryan et.al. 2507.00978 null
2025-07-01 Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact Rizwan Qureshi et.al. 2507.00951 null
2025-07-01 WebArXiv: Evaluating Multimodal Agents on Time-Invariant arXiv Tasks Zihao Sun et.al. 2507.00938 null
2025-07-01 A Survey: Learning Embodied Intelligence from Physical Simulators and World Models Xiaoxiao Long et.al. 2507.00917 null
2025-07-01 Large Language Model Powered Intelligent Urban Agents: Concepts, Capabilities, and Applications Jindong Han et.al. 2507.00914 null
2025-07-01 MemeCMD: An Automatically Generated Chinese Multi-turn Dialogue Dataset with Contextually Retrieved Memes Yuheng Wang et.al. 2507.00891 null
2025-07-01 TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation Xi Xuan et.al. 2507.00875 null
2025-07-01 The Evolution of Altruistic Rationality Provides a Solution to Social Dilemmas via Rational Reciprocity Mohammad Salahshour et.al. 2507.00858 null
2025-07-01 Enhancing Vehicular Platooning with Wireless Federated Learning: A Resource-Aware Control Framework Beining Wu et.al. 2507.00856 null
2025-07-01 Ranking Quantilized Mean-Field Games with an Application to Early-Stage Venture Investments Rinel Foguen Tchuendom et.al. 2507.00853 null
2025-07-01 SafeMobile: Chain-level Jailbreak Detection and Automated Evaluation for Multimodal Mobile Agents Siyuan Liang et.al. 2507.00841 null
2025-07-01 Many LLMs Are More Utilitarian Than One Anita Keshmirian et.al. 2507.00814 null
2025-07-02 Leveraging Genetic Algorithms for Efficient Demonstration Generation in Real-World Reinforcement Learning Environments Tom Maus et.al. 2507.00762 null
2025-07-01 Generative Exaggeration in LLM Social Agents: Consistency, Bias, and Toxicity Jacopo Nudo et.al. 2507.00657 null
2025-07-01 ChatHLS: Towards Systematic Design Automation and Optimization for High-Level Synthesis Runkai Li et.al. 2507.00642 null
2025-07-04 Horus: A Protocol for Trustless Delegation Under Uncertainty David Shi et.al. 2507.00631 null
2025-07-01 Quantum Circuit Structure Optimization for Quantum Reinforcement Learning Seok Bin Son et.al. 2507.00589 null
2025-07-01 Collaborative Multi-Agent Reinforcement Learning Approach for Elastic Cloud Resource Scaling Bruce Fang et.al. 2507.00550 null
2025-07-01 Rethinking Group Recommender Systems in the Era of Generative AI: From One-Shot Recommendations to Agentic Group Decision Support Dietmar Jannach et.al. 2507.00535 null
2025-07-01 PNAct: Crafting Backdoor Attacks in Safe Reinforcement Learning Weiran Guo et.al. 2507.00485 null
2025-07-01 ARIG: Autoregressive Interactive Head Generation for Real-time Conversations Ying Guo et.al. 2507.00472 null
2025-07-01 Best Agent Identification for General Game Playing Matthew Stephenson et.al. 2507.00451 null
2025-07-01 Novel Pigeon-inspired 3D Obstacle Detection and Avoidance Maneuver for Multi-UAV Systems Reza Ahmadvand et.al. 2507.00443 null
2025-07-01 Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Maggie Huan et.al. 2507.00432 null
2025-07-01 Multi-Agent Coordination under Poisson Observations: A Global Game Approach Marcos M. Vasconcelos et.al. 2507.00424 null
2025-07-01 Evolutionary Dynamics with Self-Interaction Learning in Networked Systems Ziyan Zeng et.al. 2507.00422 null
2025-07-01 Minimal Construction of Graphs with Maximum Robustness Haejoon Lee et.al. 2507.00415 null
2025-07-01 iPanda: An Intelligent Protocol Testing and Debugging Agent for Conformance Testing Xikai Sun et.al. 2507.00378 null
2025-07-01 VTS-Guided AI Interaction Workflow for Business Insights Sun Ding et.al. 2507.00347 null
2025-06-30 Control-Optimized Deep Reinforcement Learning for Artificially Intelligent Autonomous Systems Oren Fivel et.al. 2507.00268 null
2025-06-30 Examining Reject Relations in Stimulus Equivalence Simulations Alexis Carrillo et.al. 2507.00265 null
2025-06-30 Endogenous Network Structures with Precision and Dimension Choices Nikhil Kumar et.al. 2507.00249 null
2025-06-30 LineRetriever: Planning-Aware Observation Reduction for Web Agents Imene Kerboua et.al. 2507.00210 null
2025-06-30 BlackBoxToBlueprint: Extracting Interpretable Logic from Legacy Systems using Reinforcement Learning and Counterfactual Analysis Vidhi Rathore et.al. 2507.00180 null
2025-06-30 AI-Governed Agent Architecture for Web-Trustworthy Tokenization of Alternative Assets Ailiya Borjigin et.al. 2507.00096 null
2025-06-30 State and Memory is All You Need for Robust and Reliable AI Agents Matthew Muhoberac et.al. 2507.00081 null
2025-06-29 VoyagerVision: Investigating the Role of Multi-modal Information for Open-ended Learning Systems Ethan Smyth et.al. 2507.00079 null
2025-07-01 SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning Bo Liu et.al. 2506.24119 null
2025-06-30 Protocol insecurity with finitely many sessions and XOR R Ramanujam et.al. 2506.24072 null
2025-06-30 Agent.xpu: Efficient Scheduling of Agentic LLM Workloads on Heterogeneous SoC Xinming Wei et.al. 2506.24045 null
2025-06-30 Ella: Embodied Social Agents with Lifelong Memory Hongxin Zhang et.al. 2506.24019 null
2025-06-30 Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning Seungjun Yi et.al. 2506.23998 null
2025-06-30 Harnessing AI Agents to Advance Research on Refugee Child Mental Health Aditya Shrivastava et.al. 2506.23992 null
2025-06-30 LLM Agents Are the Antidote to Walled Gardens Samuele Marro et.al. 2506.23978 null
2025-06-30 Flexible Moral Hazard Problems with Adverse Selection Siwen Liu et.al. 2506.23954 null
2025-06-30 Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice Akshit Kumar et.al. 2506.23924 null
2025-06-30 A Survey on Autonomy-Induced Security Risks in Large Model-Based Agents Hang Su et.al. 2506.23844 null
2025-06-30 Sociophysics models inspired by the Ising model Pratik Mullick et.al. 2506.23837 null
2025-06-30 Towards the "Digital Me": A vision of authentic Conversational Agents powered by personal Human Digital Twins Lluís C. Coll et.al. 2506.23826 null
2025-06-30 Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning Anton Andreychuk et.al. 2506.23793 null
2025-07-01 Synthetically Expressive: Evaluating gesture and voice for emotion and empathy in VR and 2D scenarios Haoyang Du et.al. 2506.23777 null
2025-06-30 Leveraging a Multi-Agent LLM-Based System to Educate Teachers in Hate Incidents Management Ewelina Gajewska et.al. 2506.23774 null
2025-06-30 A Survey of LLM-based Automated Program Repair: Taxonomies, Design Paradigms, and Applications Boyang Yang et.al. 2506.23749 null
2025-06-30 DABstep: Data Agent Benchmark for Multi-step Reasoning Alex Egg et.al. 2506.23719 null
2025-06-30 Agent4S: The Transformation of Research Paradigms from the Perspective of Large Language Models Boyuan Zheng et.al. 2506.23692 null
2025-06-30 PokéAI: A Goal-Generating, Battle-Optimizing Multi-agent System for Pokemon Red Zihao Liu et.al. 2506.23689 null
2025-06-30 Efficient Interleaved Speech Modeling through Knowledge Distillation Mohammadmahdi Nouriborji et.al. 2506.23670 null
2025-06-30 L0: Reinforcement Learning to Become General Agents Junjie Zhang et.al. 2506.23667 null
2025-06-30 Self-correcting Reward Shaping via Language Models for Reinforcement Learning Agents in Games António Afonso et.al. 2506.23626 null
2025-06-30 Evaluating the Simulation of Human Personality-Driven Susceptibility to Misinformation with LLMs Manuel Pratelli et.al. 2506.23610 null
2025-06-30 Evaluating Multi-Agent Defences Against Jailbreaking Attacks on Large Language Models Maria Carolina Cornelia Wit et.al. 2506.23576 null
2025-06-30 CooT: Learning to Coordinate In-Context with Coordination Transformers Huai-Chih Wang et.al. 2506.23549 null
2025-06-30 Thought-Augmented Planning for LLM-Powered Interactive Recommender Agent Haocheng Yu et.al. 2506.23485 null
2025-06-30 NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments Xuan Yao et.al. 2506.23468 null
2025-06-30 Accessible Data Access and Analysis by People who are Blind or Have Low Vision Samuel Reinders et.al. 2506.23443 null
2025-06-29 Do LLMs Dream of Discrete Algorithms? Claudionor Coelho Jr et.al. 2506.23408 null
2025-06-29 ATGen: A Framework for Active Text Generation Akim Tsvigun et.al. 2506.23342 null
2025-06-29 IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering Parker Liu et.al. 2506.23329 null
2025-06-29 InfGen: Scenario Generation as Next Token Group Prediction Zhenghao Peng et.al. 2506.23316 null
2025-06-29 GATSim: Urban Mobility Simulation with Generative Agents Qi Liu et.al. 2506.23306 null
2025-06-29 Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games David Guzman Piedrahita et.al. 2506.23276 null
2025-06-29 FinStat2SQL: A Text2SQL Pipeline for Financial Statement Analysis Quang Hung Nguyen et.al. 2506.23273 null
2025-06-29 From Prompt Injections to Protocol Exploits: Threats in LLM-Powered AI Agents Workflows Mohamed Amine Ferrag et.al. 2506.23260 null
2025-06-29 Mode Collapse Happens: Evaluating Critical Interactions in Joint Trajectory Prediction Models Maarten Hugenholtz et.al. 2506.23164 null
2025-06-29 Benchmarking Deep Search over Heterogeneous Enterprise Data Prafulla Kumar Choubey et.al. 2506.23139 null
2025-06-29 Learning Motion Skills with Adaptive Assistive Curriculum Force in Humanoid Robots Zhanxiang Cao et.al. 2506.23125 null
2025-06-29 Curious Causality-Seeking Agents Learn Meta Causal World Zhiyu Zhao et.al. 2506.23068 null
2025-06-29 AURA: Agent for Understanding, Reasoning, and Automated Tool Use in Voice-Driven Tasks Leander Melroy Maben et.al. 2506.23049 null
2025-06-29 SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions Xianzhe Fan et.al. 2506.23046 null
2025-06-28 Fragile, Robust, and Antifragile: A Perspective from Parameter Responses in Reinforcement Learning Under Stress Zain ul Abdeen et.al. 2506.23036 null
2025-06-28 A "Good" Regulator May Provide a World Model for Intelligent Systems Bradly Alicea et.al. 2506.23032 null
2025-06-28 Scenario-Based Hierarchical Reinforcement Learning for Automated Driving Decision Making M. Youssef Abdelhamid et.al. 2506.23023 null
2025-06-28 A Reinforcement Learning Approach for Optimal Control in Microgrids Davide Salaorni et.al. 2506.22995 null
2025-06-28 Resilient-Native and Intelligent Next-Generation Wireless Systems: Key Enablers, Foundations, and Applications Mehdi Bennis et.al. 2506.22991 null
2025-06-28 Agent-to-Agent Theory of Mind: Testing Interlocutor Awareness among Large Language Models Younwoo Choi et.al. 2506.22957 null
2025-06-28 GamerAstra: Enhancing Video Game Accessibility for Blind and Low-Vision Players through a Multi-Agent AI Framework Tianrun Qiu et.al. 2506.22937 null
2025-06-28 Safe Reinforcement Learning with a Predictive Safety Filter for Motion Planning and Control: A Drifting Vehicle Example Bei Zhou et.al. 2506.22894 null
2025-06-28 Agentic Enterprise: AI-Centric User to User-Centric AI Arpit Narechania et.al. 2506.22893 null
2025-06-28 CP-Guard: A Unified, Probability-Agnostic, and Adaptive Framework for Malicious Agent Detection and Defense in Multi-Agent Embodied Perception Systems Senkang Hu et.al. 2506.22890 null
2025-06-28 Cooperation as Black Box: Conceptual Fluctuation and Diagnostic Tools for Misalignment in MAS Shayak Nandi et.al. 2506.22876 null
2025-06-28 Momentum-based Accelerated Algorithm for Distributed Optimization under Sector-Bound Nonlinearity Mohammadreza Doostmohammadian et.al. 2506.22855 null
2025-07-02 DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues Kyochul Jang et.al. 2506.22853 null
2025-06-28 Knowledge Augmented Finetuning Matters in both RAG and Agent Based Dialog Systems Yucheng Cai et.al. 2506.22852 null
2025-06-28 Actively induced supercoiling can slow down plasmid solutions by trapping the threading entanglements Roman Staňo et.al. 2506.22842 null
2025-06-28 Memory as a Service (MaaS): Rethinking Contextual Memory as Service-Oriented Modules for Collaborative Agents Haichang Li et.al. 2506.22815 null
2025-06-28 BayesLoRA: Task-Specific Uncertainty in Low-Rank Adapters Cooper Doyle et.al. 2506.22809 null
2025-06-28 Trusted Routing for Blockchain-Enabled Low-Altitude Intelligent Networks Sijie He et.al. 2506.22745 null
2025-06-28 Questions as cognitive filters Willem Conradie et.al. 2506.22735 null
2025-06-28 FairMarket-RL: LLM-Guided Fairness Shaping for Multi-Agent Reinforcement Learning in Peer-to-Peer Markets Shrenik Jadhav et.al. 2506.22708 null
2025-06-28 General Autonomous Cybersecurity Defense: Learning Robust Policies for Dynamic Topologies and Diverse Attackers Arun Ramamurthy et.al. 2506.22706 null
2025-06-27 Knowledge-Guided Multi-Agent Framework for Automated Requirements Development: A Vision Jiangping Huang et.al. 2506.22656 null
2025-06-27 URSA: The Universal Research and Scientific Agent Michael Grosskopf et.al. 2506.22653 null
2025-06-27 QoS-aware State-Augmented Learnable Algorithm for Wireless Coexistence Parameter Management Mohammad Reza Fasihi et.al. 2506.22652 null
2025-06-27 Entropy Regularized Belief Reporting Elchin Suleymanov et.al. 2506.22649 null
2025-06-27 Ludax: A GPU-Accelerated Domain Specific Language for Board Games Graham Todd et.al. 2506.22609 null
2025-06-27 RExBench: Can coding agents autonomously implement AI research extensions? Nicholas Edwards et.al. 2506.22598 null
2025-06-27 Capacity Planning in Stable Matching with Truthful or Strategic Preference Uncertainty Maria Bazotte et.al. 2506.22560 null
2025-07-01 Seamless Interaction: Dyadic Audiovisual Motion Modeling and Large-Scale Dataset Vasu Agrawal et.al. 2506.22554 null
2025-06-26 Integrated Multimodal Sensing and Communication: Challenges, Technologies, and Architectures Yubo Peng et.al. 2506.22507 null
2025-06-30 The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements Bingchen Zhao et.al. 2506.22419 null
2025-06-27 Why Are Parsing Actions for Understanding Message Hierarchies Not Random? Daichi Kato et.al. 2506.22366 null
2025-06-27 Reinforcement Learning with Physics-Informed Symbolic Program Priors for Zero-Shot Wireless Indoor Navigation Tao Li et.al. 2506.22365 null
2025-07-03 Embodied AI Agents: Modeling the World Pascale Fung et.al. 2506.22355 null
2025-06-27 Agent-based modeling and the sociology of money: some suggestions for refining monetary theory using social simulation Eduardo Coltre Ferraciolli et.al. 2506.22318 null
2025-06-27 Artificial Intelligent Disobedience: Rethinking the Agency of Our Artificial Teammates Reuth Mirsky et.al. 2506.22276 null
2025-06-27 Exploring Modularity of Agentic Systems for Drug Discovery Laura van Weesep et.al. 2506.22189 null
2025-06-27 Autonomic Microservice Management via Agentic AI and MAPE-K Integration Matteo Esposito et.al. 2506.22185 null
2025-06-27 A Different Approach to AI Safety: Proceedings from the Columbia Convening on Openness in Artificial Intelligence and AI Safety Camille François et.al. 2506.22183 null
2025-06-27 ASVSim (AirSim for Surface Vehicles): A High-Fidelity Simulation Framework for Autonomous Surface Vehicle Research Bavo Lesy et.al. 2506.22174 null
2025-06-27 Learning Distributed Safe Multi-Agent Navigation via Infinite-Horizon Optimal Graph Control Fenglan Wang et.al. 2506.22117 null
2025-06-27 Flocking with random non-reciprocal interactions Jiwon Choi et.al. 2506.22060 null
2025-06-27 Universal Retrieval for Multimodal Trajectory Modeling Xuan Zhang et.al. 2506.22056 null
2025-06-27 TROFI: Trajectory-Ranked Offline Inverse Reinforcement Learning Alessandro Sestini et.al. 2506.22008 null
2025-06-27 A MILP-Based Solution to Multi-Agent Motion Planning and Collision Avoidance in Constrained Environments Akshay Jaitly et.al. 2506.21982 null
2025-06-27 SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model Shuhan Tan et.al. 2506.21976 null
2025-06-27 Don't Trust Generative Agents to Mimic Communication on Social Networks Unless You Benchmarked their Empirical Realism Simon Münker et.al. 2506.21974 null
2025-06-27 More Vulnerable than You Think: On the Stability of Tool-Integrated LLM Agents Weimin Xiong et.al. 2506.21967 null
2025-06-27 CAL-RAG: Retrieval-Augmented Multi-Agent Generation for Content-Aware Layout Design Najmeh Forouzandehmehr et.al. 2506.21934 null
2025-06-27 ARAG: Agentic Retrieval Augmented Generation for Personalized Recommendation Reza Yousefi Maragheh et.al. 2506.21931 null
2025-06-27 SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding Zhao Jin et.al. 2506.21924 null
2025-06-27 Advancements and Challenges in Continual Reinforcement Learning: A Comprehensive Review Amara Zuffer et.al. 2506.21899 null
2025-06-27 Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation Qiyue Gao et.al. 2506.21876 null
2025-06-27 A Survey of Continual Reinforcement Learning Chaofan Pan et.al. 2506.21872 null
2025-06-27 GenEscape: Hierarchical Multi-Agent Generation of Escape Room Puzzles Mengyi Shan et.al. 2506.21839 null
2025-06-26 When Networks Mislead: How Partisan Communication Undermines Democratic Decision-Making Hsuan-Wei Lee et.al. 2506.21820 null
2025-06-26 CitySim: Modeling Urban Behaviors and City Dynamics with Large-Scale LLM-Driven Agent Simulation Nicolas Bougie et.al. 2506.21805 null
2025-06-26 Adaptive Multipath-Based SLAM for Distributed MIMO Systems Xuhong Li et.al. 2506.21798 null
2025-06-26 MobiVerse: Scaling Urban Mobility Simulation with Hybrid Lightweight Domain-Specific Generator and Large Language Models Yifan Liu et.al. 2506.21784 null
2025-06-26 Simultaneously Fair Allocation of Indivisible Items Across Multiple Dimensions Yasushi Kawase et.al. 2506.21727 null
2025-06-26 SEEA-R1: Tree-Structured Reinforcement Fine-Tuning for Self-Evolving Embodied Agents Wanxin Tian et.al. 2506.21669 null
2025-06-26 Monetary Macro Accounting Theory Renéee Menéndez et.al. 2506.21651 null
2025-06-23 TrajTok: Technical Report for 2025 Waymo Open Sim Agents Challenge Zhiyuan Zhang et.al. 2506.21618 null
2025-06-26 Whole-Body Conditioned Egocentric Video Prediction Yutong Bai et.al. 2506.21552 null
2025-06-26 PsyLite Technical Report Fangjun Ding et.al. 2506.21536 null
2025-07-03 Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge Boyu Gou et.al. 2506.21506 null
2025-06-26 From multi-allocations to allocations, with subadditive valuations Uriel Feige et.al. 2506.21493 null
2025-06-29 Ad-Hoc Human-AI Coordination Challenge Tin Dizdarević et.al. 2506.21490 null
2025-06-26 Reinforcement Learning for Optimal Control of Spin Magnetometers Logan W. Cooke et.al. 2506.21475 null
2025-06-26 Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents Tianyi Men et.al. 2506.21252 null
2025-06-26 Dynamic Risk-Aware MPPI for Mobile Robots in Crowds via Efficient Monte Carlo Approximations Elia Trevisan et.al. 2506.21205 null
2025-06-26 Artificial Delegates Resolve Fairness Issues in Perpetual Voting with Partial Turnout Apurva Shah et.al. 2506.21186 null
2025-06-26 Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4 Jongyeon Park et.al. 2506.21174 null
2025-06-26 Curriculum-Guided Antifragile Reinforcement Learning for Secure UAV Deconfliction under Observation-Space Attacks Deepak Kumar Panda et.al. 2506.21129 null
2025-06-26 GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction Muleilan Pei et.al. 2506.21121 null
2025-06-26 Homogenization of Multi-agent Learning Dynamics in Finite-state Markov Games Yann Kerzreho et.al. 2506.21079 null
2025-06-26 RL-Selector: Reinforcement Learning-Guided Data Selection via Redundancy Assessment Suorong Yang et.al. 2506.21037 null
2025-06-26 Evidence-based diagnostic reasoning with multi-agent copilot for human pathology Chengkuan Chen et.al. 2506.20964 null
2025-06-26 Beyond Reactive Safety: Risk-Aware LLM Alignment via Long-Horizon Simulation Chenkai Sun et.al. 2506.20949 null
2025-06-26 ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks Joshua H. Davis et.al. 2506.20938 null
2025-06-26 Quantum Reinforcement Learning Trading Agent for Sector Rotation in the Taiwan Stock Market Chi-Sheng Chen et.al. 2506.20930 null
2025-06-26 LLM-guided Chemical Process Optimization with a Multi-Agent Approach Tong Zeng et.al. 2506.20921 null
2025-06-26 FaSTA $^*$ : Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing Advait Gupta et.al. 2506.20911 null
2025-06-26 Smoothness Meets Autobidding: Tight Price of Anarchy Bounds for Simultaneous First-Price Auctions Riccardo Colini-Baldeschi et.al. 2506.20908 null
2025-06-25 Complex Model Transformations by Reinforcement Learning with Uncertain Human Guidance Kyanna Dagenais et.al. 2506.20883 null
2025-06-28 Decide less, communicate more: On the construct validity of end-to-end fact-checking in medicine Sebastian Joseph et.al. 2506.20876 null
2025-06-25 GPU Kernel Scientist: An LLM-Driven Framework for Iterative Kernel Optimization Martin Andrews et.al. 2506.20807 null
2025-06-25 Poster: Enhancing GNN Robustness for Network Intrusion Detection via Agent-based Analysis Zhonghao Zhan et.al. 2506.20806 null
2025-06-25 A Survey of AI for Materials Science: Foundation Models, LLM Agents, Datasets, and Tools Minh-Hao Van et.al. 2506.20743 null
2025-06-25 MAGPIE: A dataset for Multi-AGent contextual PrIvacy Evaluation Gurusha Juneja et.al. 2506.20737 null
2025-06-25 MMSearch-R1: Incentivizing LMMs to Search Jinming Wu et.al. 2506.20670 null
2025-06-25 The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind Andrei Lupu et.al. 2506.20664 null
2025-06-25 Memento: Note-Taking for Your Future Self Chao Wan et.al. 2506.20642 null
2025-06-25 Towards Community-Driven Agents for Machine Learning Engineering Sijie Li et.al. 2506.20640 null
2025-06-25 Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm Baixiang Huang et.al. 2506.20606 null
2025-06-25 Fine-Tuning and Prompt Engineering of LLMs, for the Creation of Multi-Agent AI for Addressing Sustainable Protein Production Challenges Alexander D. Kalian et.al. 2506.20598 null
2025-06-25 An Explicit Solution for the Problem of Optimal Investment with Random Endowment Michael Donisch et.al. 2506.20506 null
2025-06-25 Engineering Sentience Konstantin Demin et.al. 2506.20504 null
2025-06-25 Opinion Dynamics with Highly Oscillating Opinions Víctor A. Vargas-Pérez et.al. 2506.20472 null
2025-06-25 An Agentic System for Rare Disease Diagnosis with Traceable Reasoning Weike Zhao et.al. 2506.20430 null
2025-06-25 SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models Dipayan Saha et.al. 2506.20415 null
2025-06-26 TAPS: Tool-Augmented Personalisation via Structured Tagging Ekaterina Taktasheva et.al. 2506.20409 null
2025-06-25 A Visualization Framework for Exploring Multi-Agent-Based Simulations Case Study of an Electric Vehicle Home Charging Ecosystem Kristoffer Christensen et.al. 2506.20400 null
2025-06-27 Mobile-R1: Towards Interactive Reinforcement Learning for VLM-Based Mobile Agent via Task-Level Rewards Jihao Gu et.al. 2506.20332 null
2025-06-26 Finding the Easy Way Through -- the Probabilistic Gap Planner for Social Robot Navigation Malte Probst et.al. 2506.20320 null
2025-06-25 Exact and approximate maximin share allocations in multi-graphs George Christodoulou et.al. 2506.20317 null
2025-06-25 Language Modeling by Language Models Junyan Cheng et.al. 2506.20249 null
2025-06-25 Autonomous Cyber Resilience via a Co-Evolutionary Arms Race within a Fortified Digital Twin Sandbox Malikussaid et.al. 2506.20102 null
2025-06-25 PSALM-V: Automating Symbolic Planning in Interactive Visual Environments with Large Language Models Wang Bill Zhu et.al. 2506.20097 null
2025-06-25 From Conversation to Orchestration: HCI Challenges and Opportunities in Interactive Multi-Agentic Systems Sarah Schömbs et.al. 2506.20091 null
2025-06-24 Beyond Autocomplete: Designing CopilotLens Towards Transparent and Explainable AI Coding Agents Runlong Ye et.al. 2506.20062 null
2025-06-24 Learning Instruction-Following Policies through Open-Ended Instruction Relabeling with Large Language Models Zhicheng Zhang et.al. 2506.20061 null
2025-06-26 Consensus-Driven Uncertainty for Robotic Grasping based on RGB Perception Eric C. Joyce et.al. 2506.20045 null
2025-06-24 Learning Bilateral Team Formation in Cooperative Multi-Agent Reinforcement Learning Koorosh Moslemi et.al. 2506.20039 null
2025-06-24 Automated Generation of Diverse Courses of Actions for Multi-Agent Operations using Binary Optimization and Graph Learning Prithvi Poddar et.al. 2506.20031 null
2025-06-24 Polynomial-Time Approximation Schemes via Utility Alignment: Unit-Demand Pricing and More Robin Bowers et.al. 2506.20030 null
2025-06-24 QHackBench: Benchmarking Large Language Models for Quantum Code Generation Using PennyLane Hackathon Challenges Abdul Basit et.al. 2506.20008 null
2025-06-24 Can One Safety Loop Guard Them All? Agentic Guard Rails for Federated Computing Narasimha Raghavan Veeraragavan et.al. 2506.20000 null
2025-06-24 Doc2Agent: Scalable Generation of Tool-Using Agents from API Documentation Xinyi Ni et.al. 2506.19998 null
2025-07-02 TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design Geonwoo Cho et.al. 2506.19997 null
2025-06-24 Prover Agent: An Agent-based Framework for Formal Mathematical Proofs Kaito Baba et.al. 2506.19923 null
2025-06-24 JoyAgents-R1: Joint Evolution Dynamics for Versatile Multi-LLM Agents with Reinforcement Learning Ai Han et.al. 2506.19846 null
2025-06-24 MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration Yucheng Zhou et.al. 2506.19835 null
2025-06-24 Curating art exhibitions using machine learning Eurico Covas et.al. 2506.19813 null
2025-06-24 LLM-Based Social Simulations Require a Boundary Zengqing Wu et.al. 2506.19806 null
2025-06-24 Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning Menglong Zhang et.al. 2506.19785 null
2025-06-24 SAGE: Strategy-Adaptive Generation Engine for Query Rewriting Teng Wang et.al. 2506.19783 null
2025-06-24 A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects Shulan Ruan et.al. 2506.19769 null
2025-06-24 From Reproduction to Replication: Evaluating Research Agents with Progressive Code Masking Gyeongwon James Kim et.al. 2506.19724 null
2025-07-02 A Survey of LLM-Driven AI Agent Communication: Protocols, Security Risks, and Defense Countermeasures Dezhang Kong et.al. 2506.19676 null
2025-06-24 How trust networks shape students' opinions about the proficiency of artificially intelligent assistants Yutong Bu et.al. 2506.19655 null
2025-06-24 HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions Mrunmai Vivek Phatak et.al. 2506.19639 null
2025-06-24 Mobile oscillators in a mobile multi-cluster network Venceslas Nguefoue Meli et.al. 2506.19617 null
2025-06-24 Position: Intelligent Science Laboratory Requires the Integration of Cognitive and Embodied AI Sha Zhang et.al. 2506.19613 null
2025-06-24 Robotics Under Construction: Challenges on Job Sites Haruki Uchiito et.al. 2506.19597 null
2025-06-30 Adaptive Domain Modeling with Language Models: A Multi-Agent Approach to Task Planning Harisankar Babu et.al. 2506.19592 null
2025-06-24 Fake or Real, Can Robots Tell? Evaluating Embodied Vision-Language Models on Real and 3D-Printed Objects Federico Tavella et.al. 2506.19579 null
2025-06-24 KnowMap: Efficient Knowledge-Driven Task Adaptation for LLMs Kelin Fu et.al. 2506.19527 null
2025-06-24 MATE: LLM-Powered Multi-Agent Translation Environment for Accessibility Applications Aleksandr Algazinov et.al. 2506.19502 null
2025-06-24 NaviAgent: Bilevel Planning on Tool Dependency Graphs for Function Calling Yan Jiang et.al. 2506.19500 null
2025-06-24 SceneCrafter: Controllable Multi-View Driving Scene Editing Zehao Zhu et.al. 2506.19488 null
2025-06-24 Dialogic Pedagogy for Large Language Models: Aligning Conversational AI with Proven Theories of Learning Russell Beale et.al. 2506.19484 null
2025-06-24 LLM-based Multi-Agent System for Intelligent Refactoring of Haskell Code Shahbaz Siddeeq et.al. 2506.19481 null
2025-06-24 Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System Lixuan He et.al. 2506.19433 null
2025-06-24 Commander-GPT: Dividing and Routing for Multimodal Sarcasm Detection Yazhou Zhang et.al. 2506.19420 null
2025-06-24 Center of Gravity-Guided Focusing Influence Mechanism for Multi-Agent Reinforcement Learning Yisak Park et.al. 2506.19417 null
2025-06-24 Is an object-centric representation beneficial for robotic manipulation ? Alexandre Chapin et.al. 2506.19408 null
2025-06-24 Do cell culturing influence the radiosensitizing effect of gold nanoparticles part 1: scrutinizing recent evidence for data consistency Hans Rabus et.al. 2506.19372 null
2025-06-24 Computing Tree Structures in Anonymous Graphs via Mobile Agents Prabhat Kumar Chand et.al. 2506.19365 null
2025-06-24 Distributed Interview Selection for Stable Matching in Large Random Markets Richard Cole et.al. 2506.19345 null
2025-06-26 The Autonomy of the Lightning Network: A Mathematical and Economic Proof of Structural Decoupling from BTC Craig Steven Wright et.al. 2506.19333 null
2025-06-24 Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs Liang Zeng et.al. 2506.19290 null
2025-06-24 Robust Behavior Cloning Via Global Lipschitz Regularization Shili Wu et.al. 2506.19250 null
2025-06-24 Augmenting Multi-Agent Communication with State Delta Trajectory Yichen Tang et.al. 2506.19209 null
2025-06-25 Vertex addition to a ball graph with application to reliability and area coverage in autonomous swarms Calum Buchanan et.al. 2506.19197 null
2025-06-23 Bayesian Evolutionary Swarm Architecture: A Formal Epistemic System Grounded in Truth-Based Competition Craig Steven Wright et.al. 2506.19191 null
2025-06-23 Distilling Tool Knowledge into Language Models via Back-Translated Traces Xingyue Huang et.al. 2506.19171 null
2025-06-23 AgenticControl: An Automated Control Design Framework Using Large Language Models Mohammad Narimani et.al. 2506.19160 null
2025-06-23 Model Reference Adaptive Control of Networked Systems with State and Input Delays Moh Kamalul Wafi et.al. 2506.19138 null
2025-06-23 Emergent collective dynamics from motile photokinetic organisms J. Morales et.al. 2506.19081 null
2025-06-23 How brains build higher order representations of uncertainty Megan A. K. Peters et.al. 2506.19057 null
2025-06-26 From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents Weizhi Zhang et.al. 2506.18959 null
2025-06-23 A Comment On "The Illusion of Thinking": Reframing the Reasoning Cliff as an Agentic Gap Sheraz Khan et.al. 2506.18957 null
2025-06-23 SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications Jinyang Li et.al. 2506.18951 null
2025-06-22 Advanced Applications of Generative AI in Actuarial Science: Case Studies Beyond ChatGPT Simon Hatzesberger et.al. 2506.18942 null
2025-06-23 Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models Kiymet Akdemir et.al. 2506.18900 null
2025-06-23 Steering Conceptual Bias via Transformer Latent-Subspace Activation Vansh Sharma et.al. 2506.18887 null
2025-06-23 GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM Annika Thomas et.al. 2506.18885 null
2025-06-23 Broad Validity of the First-Order Approach in Moral Hazard Eduardo Azevedo et.al. 2506.18873 null
2025-06-25 Offline Goal-Conditioned Reinforcement Learning with Projective Quasimetric Planning Anthony Kobanda et.al. 2506.18847 null
2025-06-23 Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories Islem Bouzenia et.al. 2506.18824 null
2025-06-23 Multi-Agent Online Control with Adversarial Disturbances Anas Barakat et.al. 2506.18814 null
2025-06-23 Fair Allocation with Money: What is Your Objective? Noga Klein Elmalem et.al. 2506.18794 null
2025-06-23 TRIZ Agents: A Multi-Agent LLM Approach for TRIZ-Based Innovation Kamil Szczepanik et.al. 2506.18783 null
2025-06-23 Temporal Neural Cellular Automata: Application to modeling of contrast enhancement in breast MRI Daniel M. Lang et.al. 2506.18720 null
2025-06-23 Safety-Aware Optimal Scheduling for Autonomous Masonry Construction using Collaborative Heterogeneous Aerial Robots Marios-Nektarios Stamatopoulos et.al. 2506.18697 null
2025-06-23 MARL-MambaContour: Unleashing Multi-Agent Deep Reinforcement Learning for Active Contour Optimization in Medical Image Segmentation Ruicheng Zhang et.al. 2506.18679 null
2025-06-23 MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation Tianchen Deng et.al. 2506.18678 null
2025-06-23 Dual-level Behavioral Consistency for Inter-group and Intra-group Coordination in Multi-Agent Systems Shuocun Yang et.al. 2506.18651 null
2025-06-23 Multi-Agent Reinforcement Learning for Inverse Design in Photonic Integrated Circuits Yannik Mahlau et.al. 2506.18627 null
2025-06-23 Reply to "Emergent LLM behaviors are observationally equivalent to data leakage" Ariel Flint Ashery et.al. 2506.18600 null
2025-06-23 Agentic Markets: Game Dynamics and Equilibrium in Markets with Learning Agents Martin Bichler et.al. 2506.18571 null
2025-06-23 Efficient Beam Selection for ISAC in Cell-Free Massive MIMO via Digital Twin-Assisted Deep Reinforcement Learning Jiexin Zhang et.al. 2506.18560 null
2025-06-23 T-CPDL: A Temporal Causal Probabilistic Description Logic for Developing Logic-RAG Agent Hong Qing Yu et.al. 2506.18559 null
2025-06-23 Unilateral determination of causal order in a cyclic process Ilyass Mejdoub et.al. 2506.18540 null
2025-06-23 Transformer World Model for Sample Efficient Multi-Agent Reinforcement Learning Azad Deihim et.al. 2506.18537 null
2025-06-23 Standard Applicability Judgment and Cross-jurisdictional Reasoning: A RAG-based Framework for Medical Device Compliance Yu Han et.al. 2506.18511 null
2025-06-23 Reliability-Adjusted Prioritized Experience Replay Leonard S. Pleiss et.al. 2506.18482 null
2025-06-23 AViLA: Asynchronous Vision-Language Agent for Streaming Multimodal Data Interaction Gengyuan Zhang et.al. 2506.18472 null
2025-06-23 Networked pointing system: Bearing-only target localization and pointing control Shiyao Li et.al. 2506.18460 null
2025-06-23 A Motivational Architecture for Open-Ended Learning Challenges in Robots Alejandro Romero et.al. 2506.18454 null
2025-06-23 GraspMAS: Zero-Shot Language-driven Grasp Detection with Multi-Agent System Quang Nguyen et.al. 2506.18448 null
2025-06-23 A Large Language Model-based Multi-Agent Framework for Analog Circuits' Sizing Relationships Extraction Chengjie Liu et.al. 2506.18424 null
2025-06-23 Robots and Children that Learn Together : Improving Knowledge Retention by Teaching Peer-Like Interactive Robots Imene Tarakli et.al. 2506.18365 null
2025-06-27 Dynamic Knowledge Exchange and Dual-diversity Review: Concisely Unleashing the Potential of a Multi-Agent Research Team Weilun Yu et.al. 2506.18348 null
2025-06-23 Use Property-Based Testing to Bridge LLM Code Generation and Validation Lehan He et.al. 2506.18315 null
2025-06-23 A stochastic model for the diffusion of competing opinions with trend-following, opposition, and indifference Manuel González-Navarrete et.al. 2506.18313 null
2025-06-23 Advanced For-Loop for QML algorithm search FuTe Wong et.al. 2506.18260 null
2025-06-22 Wisdom of Crowds Through Myopic Self-Confidence Adaptation Giacomo Como et.al. 2506.18195 null
2025-06-22 Mapping The Invisible Internet: Framework and Dataset Siddique Abubakr Muntaka et.al. 2506.18159 null
2025-06-22 Chain-of-Memory: Enhancing GUI Agents for Cross-Application Navigation Xinzge Gao et.al. 2506.18158 null
2025-06-22 CoachGPT: A Scaffolding-based Academic Writing Assistant Fumian Chen et.al. 2506.18149 null
2025-06-22 Decentralized Consensus Inference-based Hierarchical Reinforcement Learning for Multi-Constrained UAV Pursuit-Evasion Game Xiang Yuming et.al. 2506.18126 null
2025-06-22 Deep Research Agents: A Systematic Examination And Roadmap Yuxuan Huang et.al. 2506.18096 null
2025-06-27 MUPA: Towards Multi-Path Agentic Reasoning for Grounded Video Question Answering Jisheng Dang et.al. 2506.18071 null
2025-06-26 Graphs Meet AI Agents: Taxonomy, Progress, and Future Opportunities Yuanchen Bei et.al. 2506.18019 null
2025-06-22 Ultra-Efficient Contracts: Breaking the Substitutes Barrier in Combinatorial Contracts Michal Feldman et.al. 2506.18008 null
2025-06-22 An Axiomatization of the Random Priority Rule Christian Basteck et.al. 2506.17997 null
2025-06-22 Non-Euclidean Enriched Contraction Theory for Monotone Operators and Monotone Dynamical Systems Diego Deplano et.al. 2506.17990 null
2025-06-22 GeNIE: A Generalizable Navigation System for In-the-Wild Environments Jiaming Wang et.al. 2506.17960 null
2025-06-22 ASTER: Adaptive Spatio-Temporal Early Decision Model for Dynamic Resource Allocation Shulun Chen et.al. 2506.17929 null
2025-06-22 Learning, Reasoning, Refinement: A Framework for Kahneman's Dual-System Intelligence in GUI Agents Jinjie Wei et.al. 2506.17913 null
2025-06-22 Towards Robust Fact-Checking: A Multi-Agent System with Advanced Evidence Retrieval Tam Trinh et.al. 2506.17878 null
2025-06-21 Out of Control -- Why Alignment Needs Formal Control Theory (and an Alignment Control Stack) Elija Perrier et.al. 2506.17846 null
2025-06-21 Reflective Verbal Reward Design for Pluralistic Alignment Carter Blair et.al. 2506.17834 null
2025-06-21 Is Your Automated Software Engineer Trustworthy? Noble Saji Mathews et.al. 2506.17812 null
2025-06-21 Bayesian Social Deduction with Graph-Informed Language Models Shahab Rahimirad et.al. 2506.17788 null
2025-06-21 AnyMAC: Cascading Flexible Multi-Agent Collaboration via Next-Agent Prediction Song Wang et.al. 2506.17784 null
2025-06-21 Toward Autonomous UI Exploration: The UIExplorer Benchmark Andrei Cristian Nica et.al. 2506.17779 null
2025-06-21 Optimizing Exploration with a New Uncertainty Framework for Active SLAM Systems Sebastian Sansoni et.al. 2506.17775 null
2025-06-21 PAGENT: Learning to Patch Software Engineering Agents Haoran Xue et.al. 2506.17772 null
2025-06-21 CARTS: Collaborative Agents for Recommendation Textual Summarization Jiao Chen et.al. 2506.17765 null
2025-06-21 Experimental Evidence for the Propagation and Preservation of Machine Discoveries in Human Populations Levin Brinkmann et.al. 2506.17741 null
2025-06-21 Distributed Butterfly Analysis using Mobile Agents Prabhat Kumar Chand et.al. 2506.17721 null
2025-06-21 Wealth Thermalization Hypothesis Klaus M. Frahm et.al. 2506.17720 null
2025-06-21 Beyond Syntax: Action Semantics Learning for App Agents Bohan Tang et.al. 2506.17697 null
2025-06-21 Network Heterogeneity and Value of Information Kota Murayama et.al. 2506.17660 null
2025-06-21 Diffusion of Tracer Particles in Early Growing Biofilms. A Computer Simulation Study Fabian A. Garcia Daza et.al. 2506.17653 null
2025-06-21 May the Feedback Be with You! Unlocking the Power of Feedback-Driven Deep Learning Framework Fuzzing via LLMs Shaoyu Yang et.al. 2506.17642 null
2025-06-21 JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent Yunlong Lin et.al. 2506.17612 null
2025-06-26 Taming the Untamed: Graph-Based Knowledge Retrieval and Reasoning for MLLMs to Conquer the Unknown Bowen Wang et.al. 2506.17589 null
2025-06-21 Towards Zero-Shot Coordination between Teams of Agents: The N-XPlay Framework Ava Abderezaei et.al. 2506.17560 null
2025-06-24 Breaking Single-Tester Limits: Multi-Agent LLMs for Multi-User Feature Testing Sidong Feng et.al. 2506.17539 null
2025-06-20 Kaleidoscopic Teaming in Multi Agent Simulations Ninareh Mehrabi et.al. 2506.17514 null
2025-06-20 A Grassroots Network and Community Roadmap for Interconnected Autonomous Science Laboratories for Accelerated Discovery Rafael Ferreira da Silva et.al. 2506.17510 null
2025-06-20 From Unstructured Communication to Intelligent RAG: Multi-Agent Automation for Supply Chain Knowledge Bases Yao Zhang et.al. 2506.17484 null
2025-06-20 General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting Bernard Lange et.al. 2506.17462 null
2025-06-20 OmniReflect: Discovering Transferable Constitutions for LLM agents via Neuro-Symbolic Reflections Manasa Bharadwaj et.al. 2506.17449 null
2025-06-20 Resource Rational Contractualism Should Guide AI Alignment Sydney Levine et.al. 2506.17434 null
2025-06-20 UProp: Investigating the Uncertainty Propagation of LLMs in Multi-Step Agentic Decision-Making Jinhao Duan et.al. 2506.17419 null
2025-06-20 Challenges in Grounding Language in the Real World Peter Lindes et.al. 2506.17375 null
2025-06-20 Cash or Comfort? How LLMs Value Your Inconvenience Mateusz Cedro et.al. 2506.17367 null
2025-06-19 Advanced Game-Theoretic Frameworks for Multi-Agent AI Challenges: A 2025 Outlook Pavel Malinovskiy et.al. 2506.17348 null
2025-06-19 Adaptive Social Metaverse Streaming based on Federated Multi-Agent Deep Reinforcement Learning Zijian Long et.al. 2506.17342 null
2025-06-19 AI is the Strategy: From Agentic AI to Autonomous Business Models onto Strategy in the Age of AI René Bohnsack et.al. 2506.17339 null
2025-06-24 PBFT-Backed Semantic Voting for Multi-Agent Memory Pruning Duong Bach et.al. 2506.17338 null
2025-06-19 Privacy-Preserving LLM Interaction with Socratic Chain-of-Thought Reasoning and Homomorphically Encrypted Vector Databases Yubeen Bae et.al. 2506.17336 link
2025-06-19 LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research Shuo Yan et.al. 2506.17335 null
2025-06-19 Beyond Prediction -- Structuring Epistemic Integrity in Artificial Reasoning Systems Craig Steven Wright et.al. 2506.17331 null
2025-06-18 MAARTA:Multi-Agentic Adaptive Radiology Teaching Assistant Akash Awasthi et.al. 2506.17320 null
2025-06-18 Context manipulation attacks : Web agents are susceptible to corrupted memory Atharv Singh Patlan et.al. 2506.17318 null
2025-06-18 Can Large Language Models Be Trusted Paper Reviewers? A Feasibility Study Chuanlei Li et.al. 2506.17311 null
2025-06-17 SafeRL-Lite: A Lightweight, Explainable, and Constrained Reinforcement Learning Library Satyam Mishra et.al. 2506.17297 null
2025-06-25 VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning Zhangyang Qi et.al. 2506.17221 null
2025-06-20 Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation Xiuyu Yang et.al. 2506.17213 link
2025-06-20 Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems Matias Martinez et.al. 2506.17208 null
2025-06-20 Towards AI Search Paradigm Yuchen Li et.al. 2506.17188 null
2025-06-20 Capturing Misalignment Pierfrancesco Guarino et.al. 2506.17176 null
2025-06-20 A Note on Proper Relational Structures Adam Bjorndahl et.al. 2506.17142 null
2025-06-20 When Can Model-Free Reinforcement Learning be Enough for Thinking? Josiah P. Hanna et.al. 2506.17124 null
2025-06-20 A general multi-stratum model for a nanofunctionalized releasing capsule: a computational study Elia Onofri et.al. 2506.17078 null
2025-06-20 Behavior Driven Development for 3D Games Fernando Pastor Ricós et.al. 2506.17057 null
2025-06-20 Scalable and Reliable Multi-agent Reinforcement Learning for Traffic Assignment Leizhen Wang et.al. 2506.17029 null
2025-06-20 A Synthetic Benchmark for Collaborative 3D Semantic Occupancy Prediction in V2X Autonomous Driving Hanlin Wu et.al. 2506.17004 null
2025-06-20 Elevating Styled Mahjong Agents with Learning from Demonstration Lingfeng Li et.al. 2506.16995 null
2025-06-20 RAGentA: Multi-Agent Retrieval-Augmented Generation for Attributed Question Answering Ines Besrour et.al. 2506.16988 link
2025-06-20 Formal Control for Uncertain Systems via Contract-Based Probabilistic Surrogates (Extended Version) Oliver Schön et.al. 2506.16971 null
2025-06-20 LunarLoc: Segment-Based Global Localization on the Moon Annika Thomas et.al. 2506.16940 link
2025-06-20 Do You Know What I Mean? A Syntactic Representation for Differential Bounded Awareness Ani Guerdjikova et.al. 2506.16901 null
2025-06-20 Engineering Resilience: An Energy-Based Approach to Sustainable Behavioural Interventions Arpitha Srivathsa Malavalli et.al. 2506.16836 null
2025-06-20 Integrating Traditional Technical Analysis with AI: A Multi-Agent LLM-Based Approach to Stock Market Forecasting Michał Wawer et.al. 2506.16813 null
2025-06-20 Distributed Affine Formation Control of Linear Multi-agent Systems with Adaptive Event-triggering Chenjun Liu et.al. 2506.16797 null
2025-06-20 Language-Informed Synthesis of Rational Agent Models for Grounded Theory-of-Mind Reasoning On-The-Fly Lance Ying et.al. 2506.16755 null
2025-06-20 Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation Kosuke Nakanishi et.al. 2506.16753 link
2025-06-20 A Scalable Post-Processing Pipeline for Large-Scale Free-Space Multi-Agent Path Planning with PiBT Arjo Chakravarty et.al. 2506.16748 link
2025-06-20 Incentivizing High-quality Participation From Federated Learning Agents Jinlong Pang et.al. 2506.16731 null
2025-06-20 DRARL: Disengagement-Reason-Augmented Reinforcement Learning for Efficient Improvement of Autonomous Driving Policy Weitao Zhou et.al. 2506.16720 null
2025-06-20 Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation Chenxu Wang et.al. 2506.16718 link
2025-06-20 Mean-field and Monte Carlo Analysis of Multi-Species Dynamics of agents Eduardo Velasco Stock et.al. 2506.16717 null
2025-06-20 Exploring Traffic Simulation and Cybersecurity Strategies Using Large Language Models Lu Gao et.al. 2506.16699 null
2025-06-20 Interpretable Low-Dimensional Modeling of Spatiotemporal Agent States for Decision Making in Football Tactics Kenjiro Ide et.al. 2506.16696 null
2025-06-20 Closed curve covering and multiagent TSP ratios Travis Dillon et.al. 2506.16675 null
2025-06-19 SemAgent: A Semantics Aware Program Repair Agent Anvith Pabba et.al. 2506.16650 null
2025-06-19 Distribution Parameter Actor-Critic: Shifting the Agent-Environment Boundary for Diverse Action Spaces Jiamin He et.al. 2506.16608 null
2025-06-19 AI-Driven Tools in Modern Software Quality Assurance: An Assessment of Benefits, Challenges, and Future Directions Ihor Pysmennyi et.al. 2506.16586 link
2025-06-19 ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning Zexi Liu et.al. 2506.16499 null
2025-06-19 Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support Sophie Chiang et.al. 2506.16473 null
2025-06-19 StoryWriter: A Multi-Agent Framework for Long Story Generation Haotian Xia et.al. 2506.16445 null
2025-06-19 Agentic Personalisation of Cross-Channel Marketing Experiences Sami Abboud et.al. 2506.16429 null
2025-06-19 When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework Zhen Xu et.al. 2506.16411 null
2025-06-19 IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks Xiaoya Lu et.al. 2506.16402 null
2025-06-19 GoalLadder: Incremental Goal Discovery with Vision-Language Models Alexey Zakharov et.al. 2506.16396 null
2025-06-19 AGC-Drive: A Large-Scale Dataset for Real-World Aerial-Ground Collaboration in Driving Scenarios Yunhao Hou et.al. 2506.16371 link
2025-06-19 Data-Driven Policy Mapping for Safe RL-based Energy Management Systems Theo Zangato et.al. 2506.16352 null
2025-06-19 Improved Exploration in GFlownets via Enhanced Epistemic Neural Networks Sajan Muhammad et.al. 2506.16313 null
2025-06-19 M-Predictive Spliner: Enabling Spatiotemporal Multi-Opponent Overtaking for Autonomous Racing Nadine Imholz et.al. 2506.16301 null
2025-06-19 Coordination of Electrical and Heating Resources by Self-Interested Agents Rico Schrage et.al. 2506.16277 null
2025-06-19 VideoGAN-based Trajectory Proposal for Automated Vehicles Annajoyce Mariani et.al. 2506.16209 link
2025-06-19 Solving Zero-Sum Convex Markov Games Fivos Kalogiannis et.al. 2506.16120 null
2025-06-19 Towards AI-Driven RANs for 6G and Beyond: Architectural Advancements and Future Horizons Mathushaharan Rathakrishnan et.al. 2506.16070 null
2025-06-19 Human-Centered Shared Autonomy for Motor Planning, Learning, and Control Applications MH Farhadi et.al. 2506.16044 null
2025-06-19 OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents Reyna Abhyankar et.al. 2506.16042 null
2025-06-19 DualTHOR: A Dual-Arm Humanoid Simulation Platform for Contingency-Aware Planning Boyu Li et.al. 2506.16012 link
2025-06-19 SimuPanel: A Novel Immersive Multi-Agent System to Simulate Interactive Expert Panel Discussion Xiangyang He et.al. 2506.16010 null
2025-06-19 HybridRAG-based LLM Agents for Low-Carbon Optimization in Low-Altitude Economy Networks Jinbo Wen et.al. 2506.15947 null
2025-06-19 On the optimal regret of collaborative personalized linear bandits Bruce Huang et.al. 2506.15943 null
2025-06-19 Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues Myke C. Cohen et.al. 2506.15928 null
2025-06-23 From RAG to Agentic: Validating Islamic-Medicine Responses with LLM Agents Mohammad Amaan Sayeed et.al. 2506.15911 null
2025-06-18 Fair Contracts in Principal-Agent Games with Heterogeneous Types Jakub Tłuczek et.al. 2506.15887 null
2025-06-18 Modeling society with a responsible elite Yana Tsodikova et.al. 2506.15877 null
2025-06-18 CooperRisk: A Driving Risk Quantification Pipeline with Multi-Agent Cooperative Perception and Prediction Mingyue Lei et.al. 2506.15868 null
2025-06-18 Understanding Online Polarization Through Human-Agent Interaction in a Synthetic LLM-Based Social Network Tim Donkers et.al. 2506.15866 null
2025-06-18 Improving Robotic Manipulation: Techniques for Object Pose Estimation, Accommodating Positional Uncertainty, and Disassembly Tasks from Examples Viral Rasik Galaiya et.al. 2506.15865 null
2025-06-18 Learning to Coordinate Under Threshold Rewards: A Cooperative Multi-Agent Bandit Framework Michael Ledford et.al. 2506.15856 null
2025-06-18 MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents Zijian Zhou et.al. 2506.15841 null
2025-06-18 Context Matters! Relaxing Goals with LLMs for Feasible 3D Scene Planning Emanuele Musumeci et.al. 2506.15828 null
2025-06-18 Heterogeneous Federated Reinforcement Learning Using Wasserstein Barycenters Luiz Pereira et.al. 2506.15825 null
2025-06-18 Veracity: An Open-Source AI Fact-Checking System Taylor Lynn Curtis et.al. 2506.15794 null
2025-06-18 Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation Ruoyu Wang et.al. 2506.15757 null
2025-06-18 RecBayes: Recurrent Bayesian Ad Hoc Teamwork in Large Partially Observable Domains João G. Ribeiro et.al. 2506.15756 null
2025-06-23 OAgents: An Empirical Study of Building Effective Agents He Zhu et.al. 2506.15741 null
2025-06-17 SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Agents Jonathan Kutasov et.al. 2506.15740 null
2025-06-20 Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence Yining Hong et.al. 2506.15677 null
2025-06-18 Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers Tommaso Green et.al. 2506.15674 link
2025-06-18 SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence Yao Zhang et.al. 2506.15672 null
2025-06-18 PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection Wenhao Li et.al. 2506.15656 null
2025-06-18 FindingDory: A Benchmark to Evaluate Memory in Embodied Agents Karmesh Yadav et.al. 2506.15635 null
2025-06-18 The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games Lyle Goodyear et.al. 2506.15624 null
2025-06-18 Multi-Agent, Multi-Scale Systems with the Koopman Operator Craig Bakker et.al. 2506.15589 null
2025-06-18 Learning to flock in open space by avoiding collisions and staying together Martino Brambati et.al. 2506.15587 null
2025-06-18 Managing Complex Failure Analysis Workflows with LLM-based Reasoning and Acting Agents Aline Dobrovsky et.al. 2506.15567 null
2025-06-18 Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning Roger Creus Castanyer et.al. 2506.15544 link
2025-06-18 Co-Creative Learning via Metropolis-Hastings Interaction between Humans and AI Ryota Okumura et.al. 2506.15468 null
2025-06-18 AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need Zhouhong Gu et.al. 2506.15451 link
2025-06-18 Understanding GUI Agent Localization Biases through Logit Sharpness Xingjian Tao et.al. 2506.15425 null
2025-06-18 Reward Models in Deep Reinforcement Learning: A Survey Rui Yu et.al. 2506.15421 null
2025-06-18 Multi-Timescale Gradient Sliding for Distributed Optimization Junhui Zhang et.al. 2506.15387 null
2025-06-18 Tractable Graph Structures in EFX Orientation Václav Blažej et.al. 2506.15379 null
2025-06-18 Efficient and Generalizable Environmental Understanding for Visual Navigation Ruoyu Wang et.al. 2506.15377 null
2025-06-18 Learning to Maximize Quantum Neural Network Expressivity via Effective Rank Juan Yao et.al. 2506.15375 null
2025-06-18 Designing Intent: A Multimodal Framework for Human-Robot Cooperation in Industrial Workspaces Francesco Chiossi et.al. 2506.15293 null
2025-06-18 RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments Yuchuan Fu et.al. 2506.15253 link
2025-06-18 Joint Computation Offloading and Resource Allocation for Uncertain Maritime MEC via Cooperation of UAVs and Vessels Jiahao You et.al. 2506.15225 null
2025-06-18 Multi-Agent Reinforcement Learning for Autonomous Multi-Satellite Earth Observation: A Realistic Case Study Mohamad A. Hady et.al. 2506.15207 null
2025-06-18 ImprovDML: Improved Trade-off in Private Byzantine-Resilient Distributed Machine Learning Bing Liu et.al. 2506.15181 null
2025-06-18 From LLMs to MLLMs to Agents: A Survey of Emerging Paradigms in Jailbreak Attacks and Defenses within LLM Ecosystem Yanxu Mao et.al. 2506.15170 null
2025-06-18 Efficient reallocation of indivisible resources: Pair-efficiency versus Pareto-efficiency Pinaki Mandal et.al. 2506.15169 null
2025-06-18 LLM Agent for Hyper-Parameter Optimization Wanzhe Wang et.al. 2506.15167 null
2025-06-18 Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs Jing Yang Lee et.al. 2506.15131 null
2025-06-19 Local Differential Privacy for Distributed Stochastic Aggregative Optimization with Guaranteed Optimality Ziqin Chen et.al. 2506.15106 null
2025-06-18 DyNaVLM: Zero-Shot Vision-Language Navigation System with Dynamic Viewpoints and Self-Refining Graph Memory Zihe Ji et.al. 2506.15096 null
2025-06-18 EmojiVoice: Towards long-term controllable expressivity in robot speech Paige Tuttösí et.al. 2506.15085 null
2025-06-18 HEAL: An Empirical Study on Hallucinations in Embodied Agents Driven by Large Language Models Trishna Chakraborty et.al. 2506.15065 null
2025-06-18 2BSDE with uncertain horizon and application to stochastic control in erratic environments Alberto Gennaro et.al. 2506.15037 null
2025-06-19 Context Matters: Learning Generalizable Rewards via Calibrated Features Alexandra Forsey-Smerek et.al. 2506.15012 null
2025-06-17 MEAL: A Benchmark for Continual Multi-Agent Reinforcement Learning Tristan Tomilin et.al. 2506.14990 link
2025-06-17 Fair Algorithms with Probing for Multi-Agent Multi-Armed Bandits Tianyi Xu et.al. 2506.14988 null
2025-06-17 OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents Thomas Kuntz et.al. 2506.14866 link
2025-06-17 Cost-Efficient Serving of LLM Agents via Test-Time Plan Caching Qizheng Zhang et.al. 2506.14852 null
2025-06-13 Recent Advances in Multi-Agent Human Trajectory Prediction: A Comprehensive Review Céline Finet et.al. 2506.14831 null
2025-06-17 RobotSmith: Generative Robotic Tool Design for Acquisition of Complex Manipulation Skills Chunru Lin et.al. 2506.14763 null
2025-06-17 Swarm-STL: A Framework for Motion Planning in Large-Scale, Multi-Swarm Systems Shiyu Cheng et.al. 2506.14749 null
2025-06-17 AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes Jiahao Qiu et.al. 2506.14728 null
2025-06-17 Linear Planar 3-SAT and Its Applications in Planning Victorien Desbois et.al. 2506.14713 null
2025-06-17 AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions Aishan Liu et.al. 2506.14697 null
2025-06-17 Factor-Graph-Based Passive Acoustic Navigation for Decentralized Cooperative Localization Using Bearing Elevation Depth Difference Kalliyan Velasco et.al. 2506.14690 null
2025-06-17 Unified Software Engineering agent as AI Software Engineer Leonhard Applis et.al. 2506.14683 null
2025-06-17 StreetLens: Enabling Human-Centered AI Agents for Neighborhood Assessment from Street View Imagery Jina Kim et.al. 2506.14670 null
2025-06-17 SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning Hexian Ni et.al. 2506.14648 null
2025-06-17 GenerationPrograms: Fine-grained Attribution with Executable Programs David Wan et.al. 2506.14580 link
2025-06-17 Doppelgänger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack Daewon Kang et.al. 2506.14539 null
2025-06-17 Automated Decision-Making on Networks with LLMs through Knowledge-Guided Evolution Xiaohan Zheng et.al. 2506.14529 null
2025-06-17 SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks Zijian Song et.al. 2506.14512 null
2025-06-17 Toward Safety-First Human-Like Decision Making for Autonomous Vehicles in Time-Varying Traffic Flow Xiao Wang et.al. 2506.14502 null
2025-06-17 LLM-Powered Swarms: A New Frontier or a Conceptual Stretch? Muhammad Atta Ur Rahman et.al. 2506.14496 null
2025-06-17 GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies Jingqi Yang et.al. 2506.14477 link
2025-06-17 SimSpark: Interactive Simulation of Social Media Behaviors Ziyue Lin et.al. 2506.14476 null
2025-06-17 Hamiltonian Formalism for Comparing Quantum and Classical Intelligence Elija Perrier et.al. 2506.14456 null
2025-06-17 Active Digital Twins via Active Inference Matteo Torzoni et.al. 2506.14453 null
2025-06-17 Adaptive Reinforcement Learning for Unobservable Random Delays John Wikman et.al. 2506.14411 null
2025-06-17 System 0: Transforming Artificial Intelligence into a Cognitive Extension Massimo Chiriatti et.al. 2506.14376 null
2025-06-18 ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies Jinyan Yuan et.al. 2506.14315 null
2025-06-17 Expectation Confirmation Preference Optimization for Multi-Turn Conversational Recommendation Agent Xueyang Feng et.al. 2506.14302 null
2025-06-17 ADRD: LLM-Driven Autonomous Driving Based on Rule-based Decision Systems Fanzhi Zeng et.al. 2506.14299 null
2025-06-17 From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents Seongbo Jang et.al. 2506.14285 link
2025-06-17 Mxplainer: Explain and Learn Insights by Imitating Mahjong Agents Lingfeng Li et.al. 2506.14246 link
2025-06-17 A Novel Indicator for Quantifying and Minimizing Information Utility Loss of Robot Teams Xiyu Zhao et.al. 2506.14237 null
2025-06-17 Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team Md Tanzib Hosain et.al. 2506.14234 null
2025-06-17 AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents Jingxu Xie et.al. 2506.14205 link
2025-06-17 MAS-LitEval : Multi-Agent System for Literary Translation Quality Assessment Junghwan Kim et.al. 2506.14199 null
2025-06-17 Hierarchical Multi-Agent Reinforcement Learning-based Coordinated Spatial Reuse for Next Generation WLANs Jiaming Yu et.al. 2506.14187 null
2025-06-17 Affective-CARA: A Knowledge Graph Driven Framework for Culturally Adaptive Emotional Intelligence in HCI Nirodya Pussadeniya et.al. 2506.14166 null
2025-06-17 Light Aircraft Game : Basic Implementation and training results analysis Hanzhong Cao et.al. 2506.14164 link
2025-06-17 Common Benchmarks Undervalue the Generalization Power of Programmatic Policies Amirhossein Rajabpour et.al. 2506.14162 link
2025-06-17 StorySage: Conversational Autobiography Writing Powered by a Multi-Agent Framework Shayan Talaei et.al. 2506.14159 null
2025-06-17 Dividing Conflicting Items Fairly Ayumi Igarashi et.al. 2506.14149 null
2025-06-17 RadFabric: Agentic AI System with Reasoning Capability for Radiology Wenting Chen et.al. 2506.14142 null
2025-06-17 FormGym: Doing Paperwork with Agents Matthew Toles et.al. 2506.14079 null
2025-06-17 Comprehensive Verilog Design Problems: A Next-Generation Benchmark Dataset for Evaluating Large Language Models and Agents on RTL Design and Verification Nathaniel Pinckney et.al. 2506.14074 link
2025-06-16 Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning Martin Klissarov et.al. 2506.14045 null
2025-06-16 SimpleDoc: Multi-Modal Document Understanding with Dual-Cue Page Retrieval and Iterative Refinement Chelsi Jain et.al. 2506.14035 link
2025-06-16 A Cooperative Contactless Object Transport with Acoustic Robots Narsimlu Kemsaram et.al. 2506.13957 link
2025-06-16 ReinDSplit: Reinforced Dynamic Split Learning for Pest Recognition in Precision Agriculture Vishesh Kumar Tanwar et.al. 2506.13935 null
2025-06-16 How Does LLM Reasoning Work for Code? A Survey and a Call to Action Ira Ceka et.al. 2506.13932 null
2025-06-16 Spec2RTL-Agent: Automated Hardware Code Generation from Complex Specifications Using LLM Agent Systems Zhongzhi Yu et.al. 2506.13905 null
2025-06-16 LocationReasoner: Evaluating LLMs on Real-World Site Selection Reasoning Miho Koda et.al. 2506.13841 link
2025-06-16 Recent trends in socio-epidemic modelling: behaviours and their determinants Daniele Proverbio et.al. 2506.13837 null
2025-06-15 The Reflexive Integrated Information Unit: A Differentiable Primitive for Artificial Consciousness Gnankan Landry Regis N'guessan et.al. 2506.13825 link
2025-06-15 The Synthetic Mirror -- Synthetic Data at the Age of Agentic AI Marcelle Momha et.al. 2506.13818 null
2025-06-14 DeepSeq: High-Throughput Single-Cell RNA Sequencing Data Labeling via Web Search-Augmented Agentic Generative AI Foundation Models Saleem A. Al Dajani et.al. 2506.13817 null
2025-06-13 Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study Sompote Youwai et.al. 2506.13811 null
2025-06-13 Causality in the human niche: lessons for machine learning Richard D. Lange et.al. 2506.13803 null
2025-06-13 Enhancing Clinical Decision Support and EHR Insights through LLMs and the Model Context Protocol: An Open-Source MCP-FHIR Framework Abul Ehtesham et.al. 2506.13800 null
2025-06-16 MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering Arya Fayyazi et.al. 2506.13755 null
2025-06-16 PB $^2$ : Preference Space Exploration via Population-Based Methods in Preference-Based Reinforcement Learning Brahim Driss et.al. 2506.13741 null
2025-06-16 The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning Jiashun Liu et.al. 2506.13672 null
2025-06-16 We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems Junfeng Fang et.al. 2506.13666 link
2025-06-16 Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning Shulin Tian et.al. 2506.13654 null
2025-06-16 xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations Kaiyuan Chen et.al. 2506.13651 null
2025-06-16 Deceptive Path Planning: A Bayesian Game Approach Violetta Rostobaya et.al. 2506.13650 null
2025-06-16 CAMS: A CityGPT-Powered Agentic Framework for Urban Human Mobility Simulation Yuwei Du et.al. 2506.13599 null
2025-06-16 Agent Capability Negotiation and Binding Protocol (ACNBP) Ken Huang et.al. 2506.13590 link
2025-06-16 Non-exchangeable mean-field theory for adaptive weights: propagation of chaos and graphon sampling lemma Datong Zhou et.al. 2506.13587 null
2025-06-16 Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes Bernhard Hilpert et.al. 2506.13583 null
2025-06-17 A Production Scheduling Framework for Reinforcement Learning Under Real-World Constraints Jonathan Hoss et.al. 2506.13566 link
2025-06-16 Learning Swing-up Maneuvers for a Suspended Aerial Manipulation Platform in a Hierarchical Control Framework Hemjyoti Das et.al. 2506.13478 null
2025-06-16 Language Agents for Hypothesis-driven Clinical Decision Making with Reinforcement Learning David Bani-Harouni et.al. 2506.13474 null
2025-06-16 A Two-stage Optimization Method for Wide-range Single-electron Quantum Magnetic Sensing Shiqian Guo et.al. 2506.13469 null
2025-06-16 Towards a Formal Specification for Self-organized Shape Formation in Swarm Robotics YR Darr et.al. 2506.13453 null
2025-06-16 Learning to Explore in Diverse Reward Settings via Temporal-Difference-Error Maximization Sebastian Griesbach et.al. 2506.13345 link
2025-06-16 Towards Pervasive Distributed Agentic Generative AI -- A State of The Art Gianni Molinari et.al. 2506.13324 null
2025-06-16 RL-Guided MPC for Autonomous Greenhouse Control Salim Msaad et.al. 2506.13278 null
2025-06-16 Screen Reader Users in the Vibe Coding Era: Adaptation, Empowerment, and New Accessibility Landscape Nan Chen et.al. 2506.13270 null
2025-06-16 Reconstruction-free magnetic control of DIII-D plasma with deep reinforcement learning G. F. Subbotin et.al. 2506.13267 null
2025-06-16 COME: Adding Scene-Centric Forecasting Control to Occupancy World Model Yining Shi et.al. 2506.13260 link
2025-06-16 On Immutable Memory Systems for Artificial Agents: A Blockchain-Indexed Automata-Theoretic Framework Using ECDH-Keyed Merkle Chains Craig Steven Wright et.al. 2506.13246 null
2025-06-16 A Game-Theoretic Negotiation Framework for Cross-Cultural Consensus in LLMs Guoxi Zhang et.al. 2506.13245 null
2025-06-16 Mixed-variable policy-based optimization Jonathan Viquerat et.al. 2506.13240 null
2025-06-16 Research on Optimal Control Problem Based on Reinforcement Learning under Knightian Uncertainty Ziyu Li et.al. 2506.13207 null
2025-06-19 Screen Hijack: Visual Poisoning of VLM Agents in Mobile Environments Xuan Wang et.al. 2506.13205 null
2025-06-16 Querying Large Automotive Software Models: Agentic vs. Direct LLM Approaches Lukasz Mazur et.al. 2506.13171 null
2025-06-16 Efficient Algorithms for Logistic Contextual Slate Bandits with Bandit Feedback Tanmay Goyal et.al. 2506.13163 null
2025-06-16 Dynamic Preference Multi-Objective Reinforcement Learning for Internet Network Management DongNyeong Heo et.al. 2506.13153 null
2025-06-16 AlphaEvolve: A coding agent for scientific and algorithmic discovery Alexander Novikov et.al. 2506.13131 null
2025-06-16 Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning Stella C. Dong et.al. 2506.13113 null
2025-06-16 Leveraging In-Context Learning for Language Model Agents Shivanshu Gupta et.al. 2506.13109 null
2025-06-17 Towards the Autonomous Optimization of Urban Logistics: Training Generative AI with Scientific Tools via Agentic Digital Twins and Model Context Protocol Haowen Xu et.al. 2506.13068 link
2025-06-16 MotiveBench: How Far Are We From Human-Like Motivational Reasoning in Large Language Models? Xixian Yong et.al. 2506.13065 null
2025-06-16 PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue George Shaikovski et.al. 2506.13063 null
2025-06-16 MAGIC: Multi-Agent Argumentation and Grammar Integrated Critiquer Joaquin Jordan et.al. 2506.13037 null
2025-06-15 Discovering Coordinated Processes From Social Online Networks Anna Kalenkova et.al. 2506.12988 link
2025-06-15 On Hierarchies of Fairness Notions in Cake Cutting: From Proportionality to Super Envy-Freeness Arnav Mehra et.al. 2506.12950 null
2025-06-15 Scaling Test-time Compute for LLM Agents King Zhu et.al. 2506.12928 null
2025-06-15 Sectoral Coupling in Linguistic State Space Sebastian Dumbrava et.al. 2506.12927 null
2025-06-15 Distributed Composite Optimization with Sub-Weibull Noises Zhan Yu et.al. 2506.12901 null
2025-06-15 Homeostatic Coupling for Prosocial Behavior Naoto Yoshida et.al. 2506.12894 null
2025-06-15 Exploring the Potential of Metacognitive Support Agents for Human-AI Co-Creation Frederic Gmeiner et.al. 2506.12879 null
2025-06-15 WereWolf-Plus: An Update of Werewolf Game setting Based on DSGBench Xinyuan Xia et.al. 2506.12841 null
2025-06-15 Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models Tung Minh Luu et.al. 2506.12822 null
2025-06-15 PDCNet: a benchmark and general deep learning framework for activity prediction of peptide-drug conjugates Yun Liu et.al. 2506.12821 null
2025-06-15 Mastering Da Vinci Code: A Comparative Study of Transformer, LLM, and PPO-based Agents LeCheng Zhang et.al. 2506.12801 null
2025-06-15 Resilient-native and Intelligent NextG Systems Mehdi Bennis et.al. 2506.12795 null
2025-06-15 Revealing the Challenges of Sim-to-Real Transfer in Model-Based Reinforcement Learning via Latent Space Modeling Zhilin Lin et.al. 2506.12735 null
2025-06-15 Multimodal Large Language Models-Enabled UAV Swarm: Towards Efficient and Intelligent Autonomous Aerial Systems Yuqi Ping et.al. 2506.12710 null
2025-06-15 SoK: The Privacy Paradox of Large Language Models: Advancements, Privacy Risks, and Mitigation Yashothara Shanmugarasa et.al. 2506.12699 null
2025-06-15 SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation Xiaofeng Shi et.al. 2506.12689 null
2025-06-14 LIFELONG SOTOPIA: Evaluating Social Intelligence of Language Agents Over Lifelong Social Interactions Hitesh Goel et.al. 2506.12666 null
2025-06-14 Behavioral Generative Agents for Energy Operations Cong Chen et.al. 2506.12664 null
2025-06-14 Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics Jiarui Liu et.al. 2506.12657 null
2025-06-14 Mapping Neural Signals to Agent Performance, A Step Towards Reinforcement Learning from Neural Feedback Julia Santaniello et.al. 2506.12636 null
2025-06-14 Towards Building General Purpose Embedding Models for Industry 4.0 Agents Christodoulos Constantinides et.al. 2506.12607 null
2025-06-17 The Rise of AI Companions: How Human-Chatbot Relationships Influence Well-Being Yutong Zhang et.al. 2506.12605 null
2025-06-14 Trust-MARL: Trust-Based Multi-Agent Reinforcement Learning Framework for Cooperative On-Ramp Merging Control in Heterogeneous Traffic Flow Jie Pan et.al. 2506.12600 null
2025-06-14 Moment Restrictions for Nonlinear Panel Data Models with Feedback Stéphane Bonhomme et.al. 2506.12569 null
2025-06-17 AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task Solving Wentao Zhang et.al. 2506.12508 link
2025-06-18 Wasserstein-Barycenter Consensus for Cooperative Multi-Agent Reinforcement Learning Ali Baheri et.al. 2506.12497 null
2025-06-14 Tiered Agentic Oversight: A Hierarchical Multi-Agent System for AI Safety in Healthcare Yubin Kim et.al. 2506.12482 null
2025-06-14 Generalizable Trajectory Prediction via Inverse Reinforcement Learning with Mamba-Graph Architecture Wenyun Li et.al. 2506.12474 null
2025-06-14 Levels of Autonomy for AI Agents K. J. Kevin Feng et.al. 2506.12469 null
2025-06-14 Adding links wisely: how an influencer seeks for leadership in opinion dynamics? Lingfei Wang et.al. 2506.12463 null
2025-06-14 Topology-Assisted Spatio-Temporal Pattern Disentangling for Scalable MARL in Large-scale Autonomous Traffic Control Rongpeng Li et.al. 2506.12453 null
2025-06-14 Plan Your Travel and Travel with Your Plan: Wide-Horizon Planning and Evaluation via LLM Dongjie Yang et.al. 2506.12421 null
2025-06-14 Ghost Policies: A New Paradigm for Understanding and Learning from Failure in Deep Reinforcement Learning Xabier Olaz et.al. 2506.12366 null
2025-06-17 Sharp Tools: How Developers Wield Agentic AI in Real Software Engineering Tasks Aayush Kumar et.al. 2506.12347 null
2025-06-14 SheetMind: An End-to-End LLM-Powered Multi-Agent Framework for Spreadsheet Automation Ruiyan Zhu et.al. 2506.12339 link
2025-06-14 Artificial Intelligence in Team Dynamics: Who Gets Replaced and Why? Xienan Cheng et.al. 2506.12337 null
2025-06-14 IndoorWorld: Integrating Physical Task Solving and Social Simulation in A Heterogeneous Multi-Agent Environment Dekun Wu et.al. 2506.12331 null
2025-06-14 Similar Formation Control of Multi-Agent Systems over Directed Acyclic Graphs via Matrix-Weighted Laplacian Zhipeng Fan et.al. 2506.12297 null
2025-06-13 Cloud Infrastructure Management in the Age of AI Agents Zhenning Yang et.al. 2506.12270 null
2025-06-13 The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs Avinash Baidya et.al. 2506.12266 null
2025-06-13 Reversing the Paradigm: Building AI-First Systems with Human Guidance Cosimo Spera et.al. 2506.12245 null
2025-06-13 Privacy Reasoning in Ambiguous Contexts Ren Yi et.al. 2506.12241 null
2025-06-13 A Fast, Reliable, and Secure Programming Language for LLM Agents with Code Actions Stephen Mell et.al. 2506.12202 null
2025-06-13 PRO-V: An Efficient Program Generation Multi-Agent System for Automatic RTL Verification Yujie Zhao et.al. 2506.12200 link
2025-06-13 OSI Stack Redesign for Quantum Networks: Requirements, Technologies, Challenges, and Future Directions Shakil Ahmed et.al. 2506.12195 null
2025-06-13 Because we have LLMs, we Can and Should Pursue Agentic Interpretability Been Kim et.al. 2506.12152 null
2025-06-13 Eliciting Reasoning in Language Models with Cognitive Tools Brown Ebouky et.al. 2506.12115 null
2025-06-13 EconGym: A Scalable AI Testbed with Diverse Economic Tasks Qirui Mi et.al. 2506.12110 null
2025-06-13 DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents Hao Li et.al. 2506.12104 link
2025-06-12 "I Hadn't Thought About That": Creators of Human-like AI Weigh in on Ethics And Neurodivergence Naba Rizvi et.al. 2506.12098 null
2025-06-12 DoublyAware: Dual Planning and Policy Awareness for Temporal Difference Learning in Humanoid Locomotion Khang Nguyen et.al. 2506.12095 null
2025-06-12 Military AI Cyber Agents (MAICAs) Constitute a Global Threat to Critical Infrastructure Timothy Dubber et.al. 2506.12094 null
2025-06-13 Affogato: Learning Open-Vocabulary Affordance Grounding with Automated Data Generation at Scale Junha Lee et.al. 2506.12009 null
2025-06-13 Upgrade or Switch: Do We Need a New Registry Architecture for the Internet of AI Agents? Ramesh Raskar et.al. 2506.12003 null
2025-06-13 Self-Regulating Cars: Automating Traffic Control in Free Flow Road Networks Ankit Bhardwaj et.al. 2506.11973 null
2025-06-13 Visual Pre-Training on Unlabeled Images using Reinforcement Learning Dibya Ghosh et.al. 2506.11967 null
2025-06-13 Automated Treatment Planning for Interstitial HDR Brachytherapy for Locally Advanced Cervical Cancer using Deep Reinforcement Learning Mohammadamin Moradi et.al. 2506.11957 null
2025-06-13 Secure API-Driven Research Automation to Accelerate Scientific Discovery Tyler J. Skluzacek et.al. 2506.11950 null
2025-06-13 Breaking Habits: On the Role of the Advantage Function in Learning Causal State Representations Miguel Suau et.al. 2506.11912 null
2025-06-13 Palpation Alters Auditory Pain Expressions with Gender-Specific Variations in Robopatients Chapa Sirithunge et.al. 2506.11906 null
2025-06-13 An Explainable AI Framework for Dynamic Resource Management in Vehicular Network Slicing Haochen Sun et.al. 2506.11882 null
2025-06-13 Your Ride, Your Rules: Psychology and Cognition Enabled Automated Driving Systems Zhipeng Bao et.al. 2506.11842 null
2025-06-13 Mean Field Games without Rational Expectations Benjamin Moll et.al. 2506.11838 null
2025-06-13 The Space Between Us: A Methodological Framework for Researching Bonding and Proxemics in Situated Group-Agent Interactions Ana Müller et.al. 2506.11829 null
2025-06-13 Revealing Political Bias in LLMs through Structured Multi-Agent Debate Aishwarya Bandaru et.al. 2506.11825 link
2025-06-13 PE-MA: Parameter-Efficient Co-Evolution of Multi-Agent Systems Yingfan Deng et.al. 2506.11803 null
2025-06-13 Solving Inverse Problems in Stochastic Self-Organising Systems through Invariant Representations Elias Najarro et.al. 2506.11796 link
2025-06-13 ALEA IACTA EST: A Declarative Domain-Specific Language for Manually Performable Random Experiments Baltasar Trancón y Widemann et.al. 2506.11794 null
2025-06-13 SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks Hwiwon Lee et.al. 2506.11791 link
2025-06-16 AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home Environments Zikang Leng et.al. 2506.11773 null
2025-06-13 Convergence to equilibrium for a class of exchange economies R. S. MacKay et.al. 2506.11770 null
2025-06-13 DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Mingxuan Du et.al. 2506.11763 null
2025-06-13 Bias and Identifiability in the Bounded Confidence Model Claudio Borile et.al. 2506.11751 null
2025-06-13 Interaction, Process, Infrastructure: A Unified Architecture for Human-Agent Collaboration Yun Wang et.al. 2506.11718 null
2025-06-13 Generalised Rate Control Approach For Stream Processing Applications Ziren Xiao et.al. 2506.11710 null
2025-06-13 Growing with Experience: Growing Neural Networks in Deep Reinforcement Learning Lukas Fehring et.al. 2506.11706 null
2025-06-17 A Hybrid Multi-Agent Prompting Approach for Simplifying Complex Sentences Pratibha Zunjare et.al. 2506.11681 null
2025-06-13 Robot Context Protocol (RCP): A Runtime-Agnostic Interface for Agent-Aware Robot Control Lambert Lee et.al. 2506.11650 null
2025-06-13 High Probability Convergence of Distributed Clipped Stochastic Gradient Descent with Heavy-tailed Noise Yuchen Yang et.al. 2506.11647 null
2025-06-13 LoRA-Gen: Specializing Large Language Model via Online LoRA Generation Yicheng Xiao et.al. 2506.11638 null
2025-06-13 "If we misunderstand the client, we misspend 100 hours": Exploring conversational AI and response types for information elicitation Daniel Hove Paludan et.al. 2506.11610 null
2025-06-13 Learn to Preserve Personality: Federated Foundation Models in Recommendations Zhiwei Li et.al. 2506.11563 null
2025-06-13 AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction Syeda Kisaa Fatima et.al. 2506.11475 null
2025-06-13 Linear-quadratic stochastic nonzero-sum differential games between graphon teams De-xuan Xu et.al. 2506.11468 null
2025-06-13 Resolve Highway Conflict in Multi-Autonomous Vehicle Controls with Local State Attention Xuan Duy Ta et.al. 2506.11445 null
2025-06-13 ReVeal: Self-Evolving Code Agents via Iterative Generation-Verification Yiyang Jin et.al. 2506.11442 null
2025-06-13 Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards Jeff Da et.al. 2506.11425 null
2025-06-13 FocalAD: Local Motion Planning for End-to-End Autonomous Driving Bin Sun et.al. 2506.11419 null
2025-06-13 Complexity guarantees for risk-neutral generalized Nash equilibrium problems Haochen Tao et.al. 2506.11409 null
2025-06-13 Large Language Model-Powered Conversational Agent Delivering Problem-Solving Therapy (PST) for Family Caregivers: Enhancing Empathy and Therapeutic Alliance Using In-Context Learning Liying Wang et.al. 2506.11376 null
2025-06-12 From Replication to Redesign: Exploring Pairwise Comparisons for LLM-Based Peer Review Yaohui Zhang et.al. 2506.11343 null
2025-06-12 A Hybrid Adaptive Nash Equilibrium Solver for Distributed Multi-Agent Systems with Game-Theoretic Jump Triggering Qiuyu Miao et.al. 2506.11304 null
2025-06-12 TARDIS STRIDE: A Spatio-Temporal Road Image Dataset for Exploration and Autonomy Héctor Carrión et.al. 2506.11302 link
2025-06-12 Shapley Machine: A Game-Theoretic Framework for N-Agent Ad Hoc Teamwork Jianhong Wang et.al. 2506.11285 link
2025-06-12 Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation Benjamin Elder et.al. 2506.11266 null
2025-06-12 Sensor Model Identification via Simultaneous Model Selection and State Variable Determination Christian Brommer et.al. 2506.11263 null
2025-06-12 LLM-as-a-Judge for Reference-less Automatic Code Validation and Refinement for Natural Language to Bash in IT Automation Ngoc Phuoc An Vo et.al. 2506.11237 null
2025-06-12 Beyond Formal Semantics for Capabilities and Skills: Model Context Protocol in Manufacturing Luis Miguel Vieira da Silva et.al. 2506.11180 null
2025-06-12 Collapsing Sequence-Level Data-Policy Coverage via Poisoning Attack in Offline Reinforcement Learning Xue Zhou et.al. 2506.11172 null
2025-06-11 ADAgent: LLM Agent for Alzheimer's Disease Analysis with Collaborative Coordinator Wenlong Hou et.al. 2506.11150 null
2025-06-11 Autonomous Computer Vision Development with Agentic AI Jin Kim et.al. 2506.11140 link
2025-06-10 GUIRoboTron-Speech: Towards Automated GUI Agents Based on Speech Instructions Wenkang Han et.al. 2506.11127 null
2025-06-12 AutoMind: Adaptive Knowledgeable Agent for Automated Data Science Yixin Ou et.al. 2506.10974 link
2025-06-12 Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop Justin Kerr et.al. 2506.10968 null
2025-06-12 SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks Lianghong Guo et.al. 2506.10954 link
2025-06-12 Build the web for agents, not agents for the web Xing Han Lù et.al. 2506.10953 null
2025-06-14 Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors Chen Yueh-Han et.al. 2506.10949 link
2025-06-12 Execution Guided Line-by-Line Code Generation Boaz Lavon et.al. 2506.10948 link
2025-06-12 Dynamic Epistemic Friction in Dialogue Timothy Obiso et.al. 2506.10934 null
2025-06-12 Agentic Semantic Control for Autonomous Wireless Space Networks: Extending Space-O-RAN with MCP-Driven Distributed Intelligence Eduardo Baena et.al. 2506.10925 null
2025-06-12 Prediction and control of geometry-induced nematic order in growing multicellular systems Lukas Hupe et.al. 2506.10867 null
2025-06-12 CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training Alireza Salemi et.al. 2506.10844 link
2025-06-12 Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches Andrea Moglia et.al. 2506.10825 null
2025-06-15 VideoDeepResearch: Long Video Understanding With Agentic Tool Using Huaying Yuan et.al. 2506.10821 link
2025-06-13 Joint Beamforming with Extremely Large Scale RIS: A Sequential Multi-Agent A2C Approach Zhi Chai et.al. 2506.10815 null
2025-06-12 OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems Xiaozhe Li et.al. 2506.10764 link
2025-06-12 Integrating Large Language Models into Text Animation: An Intelligent Editing System with Inline and Chat Interaction Bao Zhang et.al. 2506.10762 null
2025-06-12 Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding Yuhang Zhang et.al. 2506.10756 null
2025-06-12 Neural at ArchEHR-QA 2025: Agentic Prompt Optimization for Evidence-Grounded Clinical Question Answering Sai Prasanna Teja Reddy Bogireddy et.al. 2506.10751 null
2025-06-12 Cursed Equilibria and Knightian Uncertainty in a Trading Game Jurek Preker et.al. 2506.10663 null
2025-06-12 SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis Sergio Burdisso et.al. 2506.10622 link
2025-06-12 AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation Haoyuan Shi et.al. 2506.10540 null
2025-06-12 Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models Sangmin Song et.al. 2506.10504 null
2025-06-12 BugGen: A Self-Correcting Multi-Agent LLM Pipeline for Realistic RTL Bug Synthesis Surya Jasper et.al. 2506.10501 null
2025-06-16 Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity Applications Felix Härer et.al. 2506.10467 link
2025-06-12 Are We Generalizing from the Exception? An In-the-Wild Study on Group-Sensitive Conversation Design in Human-Agent Interactions Ana Müller et.al. 2506.10462 null
2025-06-12 Equitable Mechanism Design for Facility Location Toby Walsh et.al. 2506.10460 null
2025-06-12 Multi-dimensional Autoscaling of Processing Services: A Comparison of Agent-based Methods Boris Sedlak et.al. 2506.10420 null
2025-06-12 Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges Jintao Liang et.al. 2506.10408 null
2025-06-12 EQA-RM: A Generative Embodied Reward Model with Test-time Scaling Yuhang Chen et.al. 2506.10389 null
2025-06-12 Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills Yuquan Xie et.al. 2506.10387 null
2025-06-12 NeuroPAL: Punctuated Anytime Learning with Neuroevolution for Macromanagement in Starcraft: Brood War Jim O'Connor et.al. 2506.10384 null
2025-06-12 Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts Zaijing Li et.al. 2506.10357 null
2025-06-12 Provably Learning from Language Feedback Wanqiao Xu et.al. 2506.10341 null
2025-06-12 Seeding an Uncertain Technology Eric Gao et.al. 2506.10340 null
2025-06-13 A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon Cameron Angliss et.al. 2506.10326 link
2025-06-12 Minimizing False Positives in Static Bug Detection via LLM-Enhanced Path Feasibility Analysis Xueying Du et.al. 2506.10322 null
2025-06-12 WGSR-Bench: Wargame-based Game-theoretic Strategic Reasoning Benchmark for Large Language Models Qiyue Yin et.al. 2506.10264 null
2025-06-12 Enhancing Ultrasound Molecular Imaging: Toward Real-Time RPCA-Based Filtering to Differentiate Bound and Free Microbubbles Hoda S. Hashemi et.al. 2506.10257 null
2025-06-15 Extended Creativity: A Conceptual Framework for Understanding Human-AI Creative Relations Andrea Gaggioli et.al. 2506.10249 null
2025-06-11 Towards Responsible AI: Advances in Safety, Fairness, and Accountability of Autonomous Systems Filip Cano et.al. 2506.10192 null
2025-06-11 AURA: A Multi-Agent Intelligence Framework for Knowledge-Enhanced Cyber Threat Attribution Nanda Rani et.al. 2506.10175 null
2025-06-11 A Navigation Framework Utilizing Vision-Language Models Yicheng Duan et.al. 2506.10172 link
2025-06-14 Disclosure Audits for LLM Agents Saswat Das et.al. 2506.10171 null
2025-06-11 Exploring EEG Responses during Observation of Actions Performed by Human Actor and Humanoid Robot Anh T. Nguyen et.al. 2506.10170 null
2025-06-11 Rethinking Brain Tumor Segmentation from the Frequency Domain Perspective Minye Shao et.al. 2506.10142 link
2025-06-11 Provable Sim-to-Real Transfer via Offline Domain Randomization Arnaud Fickinger et.al. 2506.10133 null
2025-06-11 Chat-of-Thought: Collaborative Multi-Agent System for Generating Domain Specific Information Christodoulos Constantinides et.al. 2506.10086 null
2025-06-11 Cybernetic Marionette: Channeling Collective Agency Through a Wearable Robot in a Live Dancer-Robot Duet Anup Sathya et.al. 2506.10079 null
2025-06-11 A quantum semantic framework for natural language processing Christopher J. Agostino et.al. 2506.10077 null
2025-06-11 Patient-Specific Deep Reinforcement Learning for Automatic Replanning in Head-and-Neck Cancer Proton Therapy Malvern Madondo et.al. 2506.10073 null
2025-06-11 Cooling a Qubit using n Others Jake Xuereb et.al. 2506.10059 link
2025-06-17 TaskCraft: Automated Generation of Agentic Tasks Dingfeng Shi et.al. 2506.10055 link
2025-06-11 Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling Tim Z. Xiao et.al. 2506.09998 null
2025-06-11 SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance Wentao Ge et.al. 2506.09968 null
2025-06-11 The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability Jiachen Hu et.al. 2506.09940 null
2025-06-11 On the Linear Programming Model for Dynamic Stochastic Matching and Its Application on Pricing Junlin Chen et.al. 2506.09924 null
2025-06-11 PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants Zheng Zhao et.al. 2506.09902 link
2025-06-11 "What are my options?": Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended) Noel Brindise et.al. 2506.09901 null
2025-06-11 OctoNav: Towards Generalist Embodied Navigation Chen Gao et.al. 2506.09839 null
2025-06-11 Automatic Treatment Planning using Reinforcement Learning for High-dose-rate Prostate Brachytherapy Tonghe Wang et.al. 2506.09805 null
2025-06-11 Delegations as Adaptive Representation Patterns: Rethinking Influence in Liquid Democracy Davide Grossi et.al. 2506.09789 null
2025-06-11 Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era Shuo Jiang et.al. 2506.09755 null
2025-06-11 Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints Xiangkai Zhang et.al. 2506.09748 null
2025-06-11 Feature Engineering for Agents: An Adaptive Cognitive Architecture for Interpretable ML Monitoring Gusseppe Bravo-Rocca et.al. 2506.09742 null
2025-06-11 Patterns of Patterns III Joseph Corneli et.al. 2506.09696 null
2025-06-11 Intent Factored Generation: Unleashing the Diversity in Your Language Model Eltayeb Ahmed et.al. 2506.09659 null
2025-06-11 Application-Driven Value Alignment in Agentic AI Systems: Survey and Perspectives Wei Zeng et.al. 2506.09656 null
2025-06-11 DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy Kaixuan Xu et.al. 2506.09655 null
2025-06-11 Effective Red-Teaming of Policy-Adherent Agents Itay Nakash et.al. 2506.09600 null
2025-06-11 VAULT: A Mobile Mapping System for ROS 2-based Autonomous Robots Miguel Á. González-Santamarta et.al. 2506.09583 null
2025-06-11 MOORL: A Framework for Integrating Offline-Online Reinforcement Learning Gaurav Chaudhary et.al. 2506.09574 null
2025-06-11 ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning Yu Sun et.al. 2506.09513 link
2025-06-11 Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design Andreas Schlaginhaufen et.al. 2506.09508 null
2025-06-11 A Unified Theory of Compositionality, Modularity, and Interpretability in Markov Decision Processes Thomas J. Ringstrom et.al. 2506.09499 null
2025-06-11 Adv-BMT: Bidirectional Motion Transformer for Safety-Critical Traffic Scenario Generation Yuxin Liu et.al. 2506.09485 null
2025-06-11 Optimizing Cooperative Multi-Object Tracking using Graph Signal Processing Maria Damanaki et.al. 2506.09469 null
2025-06-11 Generalization Error Analysis for Attack-Free and Byzantine-Resilient Decentralized Learning with Data Heterogeneity Haoxiang Ye et.al. 2506.09438 null
2025-06-11 When Is Diversity Rewarded in Cooperative Multi-Agent Learning? Michael Amir et.al. 2506.09434 null
2025-06-11 A Call for Collaborative Intelligence: Why Human-Agent Systems Should Precede AI Autonomy Henry Peng Zou et.al. 2506.09420 link
2025-06-11 Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models Zongjie Li et.al. 2506.09396 null
2025-06-15 LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization Jiaqi Tang et.al. 2506.09373 null
2025-06-11 ContextBuddy: AI-Enhanced Contextual Insights for Security Alert Investigation (Applied to Intrusion Detection) Ronal Singh et.al. 2506.09365 null
2025-06-11 Intelligent System of Emergent Knowledge: A Coordination Fabric for Billions of Minds Moshi Wei et.al. 2506.09335 null
2025-06-11 Multi-Agent Language Models: Advancing Cooperation, Coordination, and Adaptation Arjun Vaithilingam Sudhakar et.al. 2506.09331 null
2025-06-10 UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench Boxi Yu et.al. 2506.09289 link
2025-06-10 Improved Approximate EFX Guarantees for Multigraphs Alireza Kaviani et.al. 2506.09288 null
2025-06-10 Learning The Minimum Action Distance Lorenzo Steccanella et.al. 2506.09276 null
2025-06-10 Uncertainty Prioritized Experience Replay Rodrigo Carrasco-Davis et.al. 2506.09270 null
2025-06-10 Agent-based Condition Monitoring Assistance with Multimodal Industrial Database Retrieval Augmented Generation Karl Löwenmark et.al. 2506.09247 null
2025-06-10 Robust Noise Attenuation via Adaptive Pooling of Transformer Outputs Greyson Brothers et.al. 2506.09215 null
2025-06-10 Optimal Task Offloading with Firm Deadlines for Mobile Edge Computing Systems Khai Doan et.al. 2506.09180 null
2025-06-10 Robot-Gated Interactive Imitation Learning with Adaptive Intervention Mechanism Haoyuan Cai et.al. 2506.09176 link
2025-06-10 MultiNet: An Open-Source Software Toolkit & Benchmark Suite for the Evaluation and Adaptation of Multimodal Action Models Pranav Guruprasad et.al. 2506.09172 null
2025-06-10 Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search Samuel Holt et.al. 2506.09171 null
2025-06-10 FAIRTOPIA: Envisioning Multi-Agent Guardianship for Disrupting Unfair AI Pipelines Athena Vakali et.al. 2506.09107 null
2025-06-10 FinHEAR: Human Expertise and Adaptive Risk-Aware Temporal Reasoning for Financial Decision-Making Jiaxiang Chen et.al. 2506.09080 null
2025-06-08 BG-HOP: A Bimanual Generative Hand-Object Prior Sriram Krishna et.al. 2506.09068 link
2025-06-10 ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering Yuki Imajuku et.al. 2506.09050 link
2025-06-10 VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning Li Kang et.al. 2506.09049 null
2025-06-10 Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation Xiaowen Ma et.al. 2506.09046 null
2025-06-10 The Decoupled Risk Landscape in Performative Prediction Javier Sanguino et.al. 2506.09044 null
2025-06-10 Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Scheduling System Yuan Guo et.al. 2506.08972 null
2025-06-10 Towards Robust Deep Reinforcement Learning against Environmental State Perturbation Chenxu Wang et.al. 2506.08961 null
2025-06-10 What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities Wendong Bu et.al. 2506.08933 null
2025-06-10 Enhancing generalizability of model discovery across parameter space with multi-experiment equation learning (ME-EQL) Maria-Veronica Ciocanel et.al. 2506.08916 link
2025-06-10 Intention-Conditioned Flow Occupancy Models Chongyi Zheng et.al. 2506.08902 link
2025-06-10 Pairwise similarity method for majority domination problem N. I. Shushko et.al. 2506.08886 null
2025-06-10 Deploying SICNav in the Field: Safe and Interactive Crowd Navigation using MPC and Bilevel Optimization Sepehr Samavi et.al. 2506.08851 null
2025-06-10 Agile Reinforcement Learning for Real-Time Task Scheduling in Edge Computing Amin Avan et.al. 2506.08850 link
2025-06-11 Design Patterns for Securing LLM Agents against Prompt Injections Luca Beurer-Kellner et.al. 2506.08837 null
2025-06-10 Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents Irene Testini et.al. 2506.08800 null
2025-06-10 Improved LLM Agents for Financial Document Question Answering Nelvin Tan et.al. 2506.08726 null
2025-06-10 PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly Liang Ma et.al. 2506.08708 null
2025-06-10 Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs Šimon Sedláček et.al. 2506.08633 null
2025-06-10 Modular Recurrence in Contextual MDPs for Universal Morphology Control Laurens Engwegen et.al. 2506.08630 null
2025-06-10 Geometric Hyperscanning under Active Inference Nicolas Hinrichs et.al. 2506.08599 null
2025-06-10 HGFormer: A Hierarchical Graph Transformer Framework for Two-Stage Colonel Blotto Games via Reinforcement Learning Yang Lv et.al. 2506.08580 null
2025-06-10 Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations Yibo Cui et.al. 2506.08566 null
2025-06-10 FEDTAIL: Federated Long-Tailed Domain Generalization with Sharpness-Guided Gradient Matching Sunny Gupta et.al. 2506.08518 null
2025-06-12 MasHost Builds It All: Autonomous Multi-Agent System Directed by Reinforcement Learning Kuo Yang et.al. 2506.08507 null
2025-06-10 Learning to Lead: Incentivizing Strategic Agents in the Dark Yuchen Wu et.al. 2506.08438 null
2025-06-10 Attention-based Learning for 3D Informative Path Planning Rui Zhao et.al. 2506.08434 null
2025-06-12 CAF-I: A Collaborative Multi-Agent Framework for Enhanced Irony Detection with Large Language Models Ziqi. Liu et.al. 2506.08430 null
2025-06-10 Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy Utkarsh Pratiush et.al. 2506.08423 link
2025-06-11 TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration Weiya Li et.al. 2506.08403 link
2025-06-10 Reinforce LLM Reasoning through Multi-Agent Reflection Yurun Yuan et.al. 2506.08379 null
2025-06-10 Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models Wentao Shi et.al. 2506.08352 link
2025-06-11 Your Agent Can Defend Itself against Backdoor Attacks Li Changjiang et.al. 2506.08336 null
2025-06-10 ORFS-agent: Tool-Using Agents for Chip Design Optimization Amur Ghose et.al. 2506.08332 null
2025-06-10 Understanding Software Engineering Agents Through the Lens of Traceability: An Empirical Study Ira Ceka et.al. 2506.08311 null
2025-06-11 HiBerNAC: Hierarchical Brain-emulated Robotic Neural Agent Collective for Disentangling Complex Manipulation Hongjun Wu et.al. 2506.08296 null
2025-06-09 From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information? Zhanke Zhou et.al. 2506.08295 link
2025-06-09 From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium Xie Yi et.al. 2506.08292 link
2025-06-09 Scaling Laws of Motion Forecasting and Planning -- A Technical Report Mustafa Baniodeh et.al. 2506.08228 null
2025-06-09 Interpreting Agent Behaviors in Reinforcement-Learning-Based Cyber-Battle Simulation Platforms Jared Claypoole et.al. 2506.08192 null
2025-06-09 Anomaly, Class Division, and Decoupling in Wealth Dynamics Jaeseok Hur et.al. 2506.08175 null
2025-06-09 Ego-centric Learning of Communicative World Models for Autonomous Driving Hang Wang et.al. 2506.08149 null
2025-06-09 EconWebArena: Benchmarking Autonomous Agents on Economic Tasks in Realistic Web Environments Zefang Liu et.al. 2506.08136 null
2025-06-09 SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents Subhrangshu Nandi et.al. 2506.08119 null
2025-06-09 Cognitive Weave: Synthesizing Abstracted Knowledge with a Spatio-Temporal Resonance Graph Akash Vishwakarma et.al. 2506.08098 link
2025-06-09 Towards AI-assisted Neutrino Flavor Theory Design Jason Benjamin Baretz et.al. 2506.08080 link
2025-06-08 UAVs Meet Agentic AI: A Multidomain Survey of Autonomous Aerial Intelligence and Agentic UAVs Ranjan Sapkota et.al. 2506.08045 null
2025-06-09 GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior Penghao Wu et.al. 2506.08012 null
2025-06-09 Dreamland: Controllable World Creation with Simulator and Generative Models Sicheng Mo et.al. 2506.08006 null
2025-06-09 Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System Fan Yang et.al. 2506.07997 null
2025-06-09 $τ^2$ -Bench: Evaluating Conversational Agents in a Dual-Control Environment Victor Barres et.al. 2506.07982 link
2025-06-09 Realistic Urban Traffic Generator using Decentralized Federated Learning for the SUMO simulator Alberto Bazán-Guillén et.al. 2506.07980 null
2025-06-10 Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction Junhong Shen et.al. 2506.07976 link
2025-06-09 HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization Hongzheng Chen et.al. 2506.07972 link
2025-06-09 Diffusion of Responsibility in Collective Decision Making Pavel Naumov et.al. 2506.07935 null
2025-06-09 LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement Dimitris Panagopoulos et.al. 2506.07915 null
2025-06-09 A distributed motion planning approach to cooperative underwater acoustic source tracking and pursuit Andrea Tiranti et.al. 2506.07877 null
2025-06-09 Simulating nationwide coupled disease and fear spread in an agent-based model Joy Kitson et.al. 2506.07842 null
2025-06-09 Control strategies and trends to equilibrium for kinetic models of opinion dynamics driven by social activity Andrea Bondesan et.al. 2506.07840 null
2025-06-09 Decentralizing Multi-Agent Reinforcement Learning with Temporal Causal Information Jan Corazza et.al. 2506.07829 null
2025-06-11 A Proposal to Extend the Common Model of Cognition with Metacognition John Laird et.al. 2506.07807 null
2025-06-13 Agent Semantics, Semantic Spacetime, and Graphical Reasoning Mark Burgess et.al. 2506.07756 null
2025-06-09 Deep Equivariant Multi-Agent Control Barrier Functions Nikolaos Bousias et.al. 2506.07755 null
2025-06-09 Delay Optimization in Remote ID-Based UAV Communication via BLE and Wi-Fi Switching Yian Zhu et.al. 2506.07715 null
2025-06-09 QUITE: A Query Rewrite System Beyond Rules with LLM Agents Yuyang Song et.al. 2506.07675 null
2025-06-09 MCPWorld: A Unified Benchmarking Testbed for API, GUI, and Hybrid Computer Use Agents Yunhe Yan et.al. 2506.07672 null
2025-06-09 SWE-Dev: Building Software Engineering Agents with Training and Inference Scaling Haoran Wang et.al. 2506.07636 null
2025-06-09 Blending Participatory Design and Artificial Awareness for Trustworthy Autonomous Vehicles Ana Tanevska et.al. 2506.07633 null
2025-06-09 MalGEN: A Generative Agent Framework for Modeling Malicious Software in Cybersecurity Bikash Saha et.al. 2506.07586 null
2025-06-09 Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models Ramakrishna Appicharla et.al. 2506.07583 null
2025-06-11 SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems Peiran Li et.al. 2506.07564 null
2025-06-12 CheMatAgent: Enhancing LLMs for Chemistry and Materials Science through Tree-Search Based Tool Learning Mengsong Wu et.al. 2506.07551 link
2025-06-09 Curriculum Learning With Counterfactual Group Relative Policy Advantage For Multi-Agent Reinforcement Learning Weiqiang Jin et.al. 2506.07548 link
2025-06-09 Fractional Collisions: A Framework for Risk Estimation of Counterfactual Conflicts using Autonomous Driving Behavior Simulations Sreeja Roy-Singh et.al. 2506.07540 null
2025-06-09 Coordinating Search-Informed Reasoning and Reasoning-Guided Search in Claim Verification Qisheng Hu et.al. 2506.07528 null
2025-06-09 IntenTest: Stress Testing for Intent Integrity in API-Calling LLM Agents Shiwei Feng et.al. 2506.07524 null
2025-06-09 Taking Flight with Dialogue: Enabling Natural Language Control for PX4-based Drone Agent Shoon Kit Lim et.al. 2506.07509 link
2025-06-09 Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models Mickel Liu et.al. 2506.07468 link
2025-06-09 Efficient Generation of Diverse Cooperative Agents with World Models Yi Loo et.al. 2506.07450 null
2025-06-09 Generate Realistic Test Scenes for V2X Communication Systems An Guo et.al. 2506.07419 null
2025-06-11 MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models Philip R. Liu et.al. 2506.07400 link
2025-06-09 G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems Guibin Zhang et.al. 2506.07398 link
2025-06-09 From Static to Adaptive Defense: Federated Multi-Agent Deep Reinforcement Learning-Driven Moving Target Defense Against DoS Attacks in UAV Swarm Networks Yuyang Zhou et.al. 2506.07392 link
2025-06-09 Shapley-Coop: Credit Assignment for Emergent Cooperation in Self-Interested LLM Agents Yun Hua et.al. 2506.07388 null
2025-06-09 Extended Version of "Distributed Adaptive Resilient Consensus Control for Uncertain Nonlinear Multiagent Systems Against Deception Attacks" Mengze Yu et.al. 2506.07374 null
2025-06-09 Decentralized Optimization on Compact Submanifolds by Quantized Riemannian Gradient Tracking Jun Chen et.al. 2506.07351 null
2025-06-09 MapBERT: Bitwise Masked Modeling for Real-Time Semantic Mapping Generation Yijie Deng et.al. 2506.07350 null
2025-06-09 Distributed Risk-Sensitive Safety Filters for Uncertain Discrete-Time Systems Armin Lederer et.al. 2506.07347 null
2025-06-09 Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation Yijie Deng et.al. 2506.07338 null
2025-06-09 Digital Twin-based Smart Manufacturing: Dynamic Line Reconfiguration for Disturbance Handling Bo Fu et.al. 2506.07332 null
2025-06-08 SCGAgent: Recreating the Benefits of Reasoning Models for Secure Code Generation with Agentic Workflows Rebecca Saul et.al. 2506.07313 null
2025-06-08 Multi-Step Guided Diffusion for Image Restoration on Edge Devices: Toward Lightweight Perception in Embodied AI Aditya Chakravarty et.al. 2506.07286 null
2025-06-08 Secondary Stakeholders in AI: Fighting for, Brokering, and Navigating Agency Leah Hope Ajmani et.al. 2506.07281 null
2025-06-08 A Cramér-von Mises Approach to Incentivizing Truthful Data Sharing Alex Clinton et.al. 2506.07272 null
2025-06-08 Question Answering under Temporal Conflict: Evaluating and Organizing Evolving Knowledge with LLMs Atahan Özer et.al. 2506.07270 null
2025-06-08 Learn as Individuals, Evolve as a Team: Multi-agent LLMs Adaptation in Embodied Environments Xinran Li et.al. 2506.07232 null
2025-06-08 LLM-Enhanced Rapid-Reflex Async-Reflect Embodied Agent for Real-Time Decision-Making in Dynamically Changing Environments Yangqing Zheng et.al. 2506.07223 null
2025-06-08 BIMgent: Towards Autonomous Building Modeling via Computer-use Agents Zihan Deng et.al. 2506.07217 null
2025-06-08 Adaptive Consensus with Exponential Decay Woocheol Choi et.al. 2506.07203 null
2025-06-08 Efficient RL-based Cache Vulnerability Exploration by Penalizing Useless Agent Actions Kanato Nakanishi et.al. 2506.07200 null
2025-06-08 Exploring Effective Strategies for Building a Customised GPT Agent for Coding Classroom Dialogues Luwei Bai et.al. 2506.07194 null
2025-06-08 Value-Set Iteration: Computing Optimal Correlated Equilibria in Infinite-Horizon Multi-Player Stochastic Games Jiarui Gan et.al. 2506.07186 null
2025-06-12 Delegation with Costly Inspection Mohammad T. Hajiaghayi et.al. 2506.07162 null
2025-06-08 Mind the Web: The Security of Web Use Agents Avishag Shapira et.al. 2506.07153 null
2025-06-08 BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite Liyang Chen et.al. 2506.07116 null
2025-06-08 Theorem-of-Thought: A Multi-Agent Framework for Abductive, Deductive, and Inductive Reasoning in Language Models Samir Abdaljalil et.al. 2506.07106 null
2025-06-08 Decentralized Optimization with Amplified Privacy via Efficient Communication Wei Huo et.al. 2506.07102 null
2025-06-08 On the Generalization of Data-Assisted Control in port-Hamiltonian Systems (DAC-pH) Mostafa Eslami et.al. 2506.07079 null
2025-06-08 A Layered Self-Supervised Knowledge Distillation Framework for Efficient Multimodal Learning on the Edge Tarique Dahri et.al. 2506.07055 null
2025-06-08 QForce-RL: Quantized FPGA-Optimized Reinforcement Learning Compute Engine Anushka Jha et.al. 2506.07046 null
2025-06-08 Accelerating Two-Dimensional Materials Research via a Universal Interatomic Potential and Large Language Model Agent Haidi Wang et.al. 2506.07043 null
2025-06-08 MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks Sanjoy Chowdhury et.al. 2506.07016 null
2025-06-08 Deep RL Needs Deep Behavior Analysis: Exploring Implicit Planning by Model-Free Agents in Open-Ended Environments Riley Simmons-Edler et.al. 2506.06981 null
2025-06-08 Near Optimal Non-asymptotic Sample Complexity of 1-Identification Zitian Li et.al. 2506.06978 null
2025-06-08 Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning Subhojyoti Mukherjee et.al. 2506.06964 null
2025-06-08 Deontically Constrained Policy Improvement in Reinforcement Learning Agents Alena Makarova et.al. 2506.06959 null
2025-06-08 Position: Simulating Society Requires Simulating Thought Chance Jiajie Li et.al. 2506.06958 null
2025-06-07 An Agentic Framework for Autonomous Metamaterial Modeling and Inverse Design Darui Lu et.al. 2506.06935 null
2025-06-07 Boosting LLM Reasoning via Spontaneous Self-Correction Xutong Zhao et.al. 2506.06923 null
2025-06-07 Multimodal Spatial Language Maps for Robot Navigation and Manipulation Chenguang Huang et.al. 2506.06862 null
2025-06-07 DONUT: A Decoder-Only Model for Trajectory Prediction Markus Knoche et.al. 2506.06854 null
2025-06-07 United Minds or Isolated Agents? Exploring Coordination of LLMs under Cognitive Load Theory HaoYang Shang et.al. 2506.06843 null
2025-06-07 AI-Generated Compromises for Coalition Formation Eyal Briman et.al. 2506.06837 null
2025-06-07 Is Optimal Transport Necessary for Inverse Reinforcement Learning? Zixuan Dong et.al. 2506.06793 null
2025-06-07 Learning What Matters Now: A Dual-Critic Context-Aware RL Framework for Priority-Driven Information Gain Dimitris Panagopoulos et.al. 2506.06786 null
2025-06-07 AI PsyRoom: Artificial Intelligence Platform for Segmented Yearning and Reactive Outcome Optimization Method Yigui Feng et.al. 2506.06740 null
2025-06-07 WorldLLM: Improving LLMs' world modeling using curiosity-driven theory-making Guillaume Levy et.al. 2506.06725 null
2025-06-07 Contextual Experience Replay for Self-Improvement of Language Agents Yitao Liu et.al. 2506.06698 null
2025-06-07 Self-Adapting Improvement Loops for Robotic Learning Calvin Luo et.al. 2506.06658 null
2025-06-07 Active Test-time Vision-Language Navigation Heeju Ko et.al. 2506.06630 null
2025-06-06 AI Simulation by Digital Twins: Systematic Survey, Reference Framework, and Mapping to a Standardized Architecture Xiaoran Liu et.al. 2506.06580 null
2025-06-11 Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce Yijia Shao et.al. 2506.06576 null
2025-06-12 The Optimization Paradox in Clinical AI Multi-Agent Systems Suhana Bedi et.al. 2506.06574 link
2025-06-06 Enhancing Robot Safety via MLLM-Based Semantic Interpretation of Failure Data Aryaman Gupta et.al. 2506.06570 null
2025-06-06 Adapting Under Fire: Multi-Agent Reinforcement Learning for Adversarial Drift in Network Security Emilia Rivas et.al. 2506.06565 null
2025-06-06 KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes Eugenie Lai et.al. 2506.06541 link
2025-06-06 ScriptDoctor: Automatic Generation of PuzzleScript Games via Large Language Models and Tree Search Sam Earle et.al. 2506.06524 null
2025-06-06 Improving LLM-Powered EDA Assistants with RAFT Luyao Shi et.al. 2506.06500 null
2025-06-06 Fake Friends and Sponsored Ads: The Risks of Advertising in Conversational Search Jacob Erickson et.al. 2506.06447 null
2025-06-06 Improving choice model specification using reinforcement learning Gabriel Nova et.al. 2506.06410 null
2025-06-04 CPS-Guard: Framework for Dependability Assurance of AI- and LLM-Based Cyber-Physical Systems Trisanth Srinivasan et.al. 2506.06381 null
2025-06-06 PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time Weizhi Zhang et.al. 2506.06254 null
2025-06-06 Longer Lists Yield Better Matchings Yuri Faenza et.al. 2506.06217 null
2025-06-06 Can Theoretical Physics Research Benefit from Language Agents? Sirui Lu et.al. 2506.06214 null
2025-06-06 A Theoretical Study of (Hyper) Self-Attention through the Lens of Interactions: Representation, Training, Generalization Muhammed Ustaomeroglu et.al. 2506.06179 null
2025-06-06 Does It Run and Is That Enough? Revisiting Text-to-Chart Generation with a Multi-Agent Approach James Ford et.al. 2506.06175 null
2025-06-06 The Lock-in Hypothesis: Stagnation by Algorithm Tianyi Alex Qiu et.al. 2506.06166 null
2025-06-06 (AI peers) are people learning from the same standpoint: Perception of AI characters in a Collaborative Science Investigation Eunhye Grace Ko et.al. 2506.06165 null
2025-06-06 Personalized Large Language Models Can Increase the Belief Accuracy of Social Networks Adiba Mahbub Proma et.al. 2506.06153 null
2025-06-06 CCLSTM: Coupled Convolutional Long-Short Term Memory Network for Occupancy Flow Forecasting Peter Lengyel et.al. 2506.06128 null
2025-06-06 Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library Weixun Wang et.al. 2506.06122 null
2025-06-06 VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning Zikang Wang et.al. 2506.06097 null
2025-06-06 On-board Mission Replanning for Adaptive Cooperative Multi-Robot Systems Elim Kwan et.al. 2506.06094 null
2025-06-06 Self driving algorithm for an active four wheel drive racecar Gergely Bari et.al. 2506.06077 null
2025-06-06 Conversational Interfaces for Parametric Conceptual Architectural Design: Integrating Mixed Reality with LLM-driven Interaction Ruochen Ji et.al. 2506.06066 null
2025-06-06 Modeling human reputation-seeking behavior in a spatio-temporally complex public good provision game Edward Hughes et.al. 2506.06032 null
2025-06-06 When to Trust Context: Self-Reflective Debates for Context Reliability Zeqi Zhou et.al. 2506.06020 null
2025-06-06 AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search Yu Li et.al. 2506.06017 null
2025-06-06 Propose or Vote: A simple Democratic Procedure Hans Gersbach et.al. 2506.05998 null
2025-06-06 Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning Yuheng Lei et.al. 2506.05985 link
2025-06-06 MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks Zonglin Wu et.al. 2506.05982 link
2025-06-10 CrimeMind: Simulating Urban Crime with Multi-Modal LLM Agents Qingbin Zeng et.al. 2506.05981 null
2025-06-06 Quantum Checkers: The Development and Analysis of a Quantum Combinatorial Game Marien Raat et.al. 2506.05962 null
2025-06-06 Learning Deterministic Policies with Policy Gradients in Constrained Markov Decision Processes Alessandro Montenegro et.al. 2506.05953 null
2025-06-06 Policy Optimization for Continuous-time Linear-Quadratic Graphon Mean Field Games Philipp Plank et.al. 2506.05894 null
2025-06-06 CodeContests+: High-Quality Test Case Generation for Competitive Programming Zihan Wang et.al. 2506.05817 null
2025-06-06 MAPLE: Multi-Agent Adaptive Planning with Long-Term Memory for Table Reasoning Ye Bai et.al. 2506.05813 null
2025-06-06 Trajectory Entropy: Modeling Game State Stability from Multimodality Trajectory Prediction Yesheng Zhang et.al. 2506.05810 null
2025-06-06 To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt Zhilong Wang et.al. 2506.05739 null
2025-06-06 Hybrid Stabilization Protocol for Cross-Chain Digital Assets Using Adaptor Signatures and AI-Driven Arbitrage Shengwei You et.al. 2506.05708 null
2025-06-06 Multi-Project Contracts Tal Alon et.al. 2506.05705 null
2025-06-06 Action-Adaptive Continual Learning: Enabling Policy Generalization under Dynamic Action Spaces Chaofan Pan et.al. 2506.05702 null
2025-06-06 Ordering-disordering dynamics of the voter model under random external bias Roni Muslim et.al. 2506.05669 null
2025-06-06 A Modular Haptic Display with Reconfigurable Signals for Personalized Information Transfer Antonio Alvarez Valdivia et.al. 2506.05648 null
2025-06-06 Diffusive Spreading Across Dynamic Mitochondrial Network Architectures Keaton B. Holt et.al. 2506.05643 null
2025-06-09 Toward Greater Autonomy in Materials Discovery Agents: Unifying Planning, Physics, and Scientists Lianhao Zhou et.al. 2506.05616 null
2025-06-05 Beating the Logarithmic Barrier for the Subadditive Maximin Share Problem Masoud Seddighin et.al. 2506.05613 null
2025-06-05 OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation Ziyi Wang et.al. 2506.05606 null
2025-06-05 Stochastic maximum principle for optimal control problem of non exchangeable mean field systems Idris Kharroubi et.al. 2506.05595 null
2025-06-05 Collaborative Learning in Agentic Systems: A Collective AI is Greater Than the Sum of Its Parts Saptarshi Nath et.al. 2506.05577 link
2025-06-05 Applying Informer for Option Pricing: A Transformer-Based Approach Feliks Bańka et.al. 2506.05565 null
2025-06-05 Improving LLMs with a knowledge from databases Petr Máša et.al. 2506.05560 null
2025-06-05 Agentomics-ML: Autonomous Machine Learning Experimentation Agent for Genomic and Transcriptomic Data Vlastimil Martinek et.al. 2506.05542 link
2025-06-05 SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms Arnesh Batra et.al. 2506.05538 link
2025-06-05 Quantum circuits as a game: A reinforcement learning agent for quantum compilation and its application to reconfigurable neutral atom arrays Kouhei Nakaji et.al. 2506.05536 null
2025-06-05 Avoiding Death through Fear Intrinsic Conditioning Rodney Sanchez et.al. 2506.05529 null
2025-06-05 Sequence Modeling for N-Agent Ad Hoc Teamwork Caroline Wang et.al. 2506.05527 null
2025-06-05 Towards Data Systems That Are Business Semantic-Centric and AI Agents-Assisted Cecil Pang et.al. 2506.05520 null
2025-06-05 Speech Neurophysiology in Realistic Contexts: Big Hype or Big Leap? Giovanni M. Di Liberto et.al. 2506.05494 null
2025-06-05 A MARL-based Approach for Easing MAS Organization Engineering Julien Soulé et.al. 2506.05437 null
2025-06-05 Robustness Evaluation for Video Models with Reinforcement Learning Ashwin Ramesh Babu et.al. 2506.05431 null
2025-06-05 Mixture-of-Experts Meets In-Context Reinforcement Learning Wenhao Wu et.al. 2506.05426 null
2025-06-05 Constructive Symbolic Reinforcement Learning via Intuitionistic Logic and Goal-Chaining Inference Andrei T. Patrascu et.al. 2506.05422 null
2025-06-03 Rational Superautotrophic Diplomacy (SupraAD); A Conceptual Framework for Alignment Based on Interdisciplinary Findings on the Fundamentals of Cognition Andrea Morris et.al. 2506.05389 null
2025-06-05 Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games Niv Eckhaus et.al. 2506.05309 link
2025-06-05 ProRefine: Inference-time Prompt Refinement with Textual Feedback Deepak Pandita et.al. 2506.05305 null
2025-06-05 Control Tax: The Price of Keeping AI in Check Mikhail Terekhov et.al. 2506.05296 null
2025-06-05 A Smooth Sea Never Made a Skilled $\texttt{SAILOR}$ : Robust Imitation via Learning to Search Arnav Kumar Jain et.al. 2506.05294 link
2025-06-05 Tight analyses of first-order methods with error feedback Daniel Berg Thomsen et.al. 2506.05271 link
2025-06-06 Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams Mohammed Almutairi et.al. 2506.05265 null
2025-06-05 Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning Dravyansh Sharma et.al. 2506.05252 null
2025-06-05 Towards Language-Augmented Multi-Agent Deep Reinforcement Learning Maxime Toquebiau et.al. 2506.05236 null
2025-06-05 A Framework for Ethical Judgment of Smart City Applications Weichen Shi et.al. 2506.05172 null
2025-06-05 An emergence-oriented approach to cyclic pursuit Zhaozhan Yao et.al. 2506.05157 null
2025-06-05 Truly Self-Improving Agents Require Intrinsic Metacognitive Learning Tennison Liu et.al. 2506.05109 null
2025-06-05 LLM-Guided Scenario-based GUI Testing Shengcheng Yu et.al. 2506.05079 null
2025-06-05 Hierarchical Language Models for Semantic Navigation and Manipulation in an Aerial-Ground Robotic System Haokun Liu et.al. 2506.05020 null
2025-06-05 ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development Zhenran Xu et.al. 2506.05010 link
2025-06-05 QiMeng: Fully Automated Hardware and Software Design for Processor Chip Rui Zhang et.al. 2506.05007 null
2025-06-05 Agentic AI for Intent-Based Industrial Automation Marcos Lima Romero et.al. 2506.04980 link
2025-06-05 Optimization for Semantic-Aware Resource Allocation under CPT-based Utilities Symeon Vaidanis et.al. 2506.04952 null
2025-06-05 Goal-Oriented Semantic Resource Allocation with Cumulative Prospect Theoretic Agents Symeon Vaidanis et.al. 2506.04947 null
2025-06-05 No Trade Under Verifiable Information Spyros Galanis et.al. 2506.04944 null
2025-06-05 Energentic Intelligence: From Self-Sustaining Systems to Enduring Artificial Life Atahan Karagoz et.al. 2506.04916 null
2025-06-05 Efficient Path Planning and Task Allocation Algorithm for Boolean Specifications Ioana Hustiu et.al. 2506.04881 link
2025-06-05 LLMs for sensory-motor control: Combining in-context and iterative learning Jônata Tyska Carvalho et.al. 2506.04867 link
2025-06-05 Towards a Multi-Agent Simulation of Cyber-attackers and Cyber-defenders Battles Julien Soulé et.al. 2506.04849 null
2025-06-05 Oversight Structures for Agentic AI in Public-Sector Organizations Chris Schmitz et.al. 2506.04836 null
2025-06-05 Safe Planning and Policy Optimization via World Model Learning Artem Latyshev et.al. 2506.04828 null
2025-06-05 Distributionally Robust Auction Design with Deferred Inspection Halil I. Bayrak et.al. 2506.04767 null
2025-06-05 SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMs Shuhan Xu et.al. 2506.04743 null
2025-06-05 Empowering Economic Simulation for Massively Multiplayer Online Games through Generative Agent-Based Modeling Bihan Xu et.al. 2506.04699 null
2025-06-05 Gen-n-Val: Agentic Image Data Generation and Validation Jing-En Huang et.al. 2506.04676 null
2025-06-05 E-bike agents: Large Language Model-Driven E-Bike Accident Analysis and Severity Prediction Zhichao Yang et.al. 2506.04654 null
2025-06-05 Agents of Change: Self-Evolving LLM Agents for Strategic Planning Nikolas Belle et.al. 2506.04651 null
2025-06-05 Flex-TravelPlanner: A Benchmark for Flexible Planning with Language Agents Juhyun Oh et.al. 2506.04649 link
2025-06-05 CHANCERY: Evaluating corporate governance reasoning capabilities in language models Lucas Irwin et.al. 2506.04636 null
2025-06-05 Composing Agents to Minimize Worst-case Risk Guruprerana Shabadi et.al. 2506.04632 null
2025-06-05 Enhancing Efficiency and Propulsion in Bio-mimetic Robotic Fish through End-to-End Deep Reinforcement Learning Xinyu Cui et.al. 2506.04627 null
2025-06-05 Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning Haochen Zhang et.al. 2506.04626 null
2025-06-05 Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning Zhiyuan Ma et.al. 2506.04625 null
2025-06-05 Subjective Perspectives within Learned Representations Predict High-Impact Innovation Likun Cao et.al. 2506.04616 null
2025-06-05 SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents Alexander Huang-Menders et.al. 2506.04606 null
2025-06-05 Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning Ziqi Jia et.al. 2506.04595 null
2025-06-05 Demonstrations of Integrity Attacks in Multi-Agent Systems Can Zheng et.al. 2506.04572 null
2025-06-05 OpenAg: Democratizing Agricultural Intelligence Srikanth Thudumu et.al. 2506.04571 null
2025-06-05 From Standalone LLMs to Integrated Intelligence: A Survey of Compound Al Systems Jiayi Chen et.al. 2506.04565 null
2025-06-04 SGN-CIRL: Scene Graph-based Navigation with Curriculum, Imitation, and Reinforcement Learning Nikita Oskolkov et.al. 2506.04505 null
2025-06-04 CogMath: Assessing LLMs' Authentic Mathematical Ability from a Human Cognitive Perspective Jiayu Liu et.al. 2506.04481 null
2025-06-04 MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale Ran Xu et.al. 2506.04405 null
2025-06-04 Unsupervised Meta-Testing with Conditional Neural Processes for Hybrid Meta-Reinforcement Learning Suzan Ece Ada et.al. 2506.04399 null
2025-06-04 Building a Few-Shot Cross-Domain Multilingual NLU Model for Customer Care Saurabh Kumar et.al. 2506.04389 null
2025-06-04 Replay Can Provably Increase Forgetting Yasaman Mahdaviyeh et.al. 2506.04377 null
2025-06-04 WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning Delong Chen et.al. 2506.04363 null
2025-06-04 The Cost of Dynamic Reasoning: Demystifying AI Agents and Test-Time Scaling from an AI Infrastructure Perspective Jiin Kim et.al. 2506.04301 null
2025-06-04 AUTOCT: Automating Interpretable Clinical Trial Prediction with LLM Agents Fengze Liu et.al. 2506.04293 null
2025-06-04 Automated Skill Discovery for Language Agents through Exploration and Iterative Feedback Yongjin Yang et.al. 2506.04287 null
2025-06-04 Autonomous Collaborative Scheduling of Time-dependent UAVs, Workers and Vehicles for Crowdsensing in Disaster Response Lei Han et.al. 2506.04276 null
2025-06-03 CORA: Coalitional Rational Advantage Decomposition for Multi-Agent Policy Gradients Mengda Ji et.al. 2506.04265 null
2025-06-04 OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis Junting Chen et.al. 2506.04217 link
2025-06-04 Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs Alex DeWeese et.al. 2506.04215 null
2025-06-06 TracLLM: A Generic Framework for Attributing Long Context LLMs Yanting Wang et.al. 2506.04202 link
2025-06-04 MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures Elena Zamaraeva et.al. 2506.04195 null
2025-06-04 SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models Yuhao Wu et.al. 2506.04180 null
2025-06-04 A primal-dual price-optimization method for computing equilibrium prices in mean-field games models Xu Wang et.al. 2506.04169 link
2025-06-04 Image Editing As Programs with Diffusion Models Yujia Hu et.al. 2506.04158 null
2025-06-05 macOSWorld: A Multilingual Interactive Benchmark for GUI Agents Pei Yang et.al. 2506.04135 link
2025-06-04 TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems Shaina Raza et.al. 2506.04133 null
2025-06-04 CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues Disha Sheshanarayana et.al. 2506.04131 null
2025-06-04 TextAtari: 100K Frames Game Playing with Language Agents Wenhao Li et.al. 2506.04098 link
2025-06-04 AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment Anastasiia Ivanova et.al. 2506.04089 link
2025-06-04 Optimal Transport-based Domain Alignment as a Preprocessing Step for Federated Learning Luiz Manella Pereira et.al. 2506.04071 null
2025-06-04 AI Agents for Conversational Patient Triage: Preliminary Simulation-Based Evaluation with Real-World EHR Data Sina Rashidian et.al. 2506.04032 null
2025-06-04 AgentMisalignment: Measuring the Propensity for Misaligned Behaviour in LLM-Based Agents Akshat Naik et.al. 2506.04018 null
2025-06-04 Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning Junqi Gao et.al. 2506.03939 link
2025-06-04 HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models Zhaolu Kang et.al. 2506.03922 link
2025-06-04 Causal Explanations Over Time: Articulated Reasoning for Interactive Environments Sebastian Rödling et.al. 2506.03915 null
2025-06-04 Jet-Feedback on kpc scales: a review Dipanjan Mukherjee et.al. 2506.03888 null
2025-06-04 PulseReddit: A Novel Reddit Dataset for Benchmarking MAS in High-Frequency Cryptocurrency Trading Qiuhan Han et.al. 2506.03861 null
2025-06-04 AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance Dhaval Patel et.al. 2506.03828 link
2025-06-04 Learning Equilibria in Matching Games with Bandit Feedback Andreas Athanasopoulos et.al. 2506.03802 null
2025-06-04 From Theory to Practice: Real-World Use Cases on Trustworthy LLM-Driven Process Modeling, Prediction and Automation Peter Pfeiffer et.al. 2506.03801 null
2025-06-04 Misalignment or misuse? The AGI alignment tradeoff Max Hellrigel-Holderbaum et.al. 2506.03755 null
2025-06-04 A Retrieval-Augmented Multi-Agent Framework for Psychiatry Diagnosis Mengxi Xiao et.al. 2506.03750 link
2025-06-04 AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives Aniruddh Sikdar et.al. 2506.03709 null
2025-06-04 Stability Notions for Hospital Residents with Sizes Haricharan Balasundaram et.al. 2506.03638 null
2025-06-04 Training Cross-Morphology Embodied AI Agents: From Practical Challenges to Theoretical Foundations Shaoshan Liu et.al. 2506.03613 link
2025-06-04 Orak: A Foundational Benchmark for Training and Evaluating LLM Agents on Diverse Video Games Dongmin Park et.al. 2506.03610 null
2025-06-08 Beamforming and Resource Allocation for Delay Optimization in RIS-Assisted OFDM Systems Yu Ma et.al. 2506.03586 null
2025-06-05 Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving Li Zeqiao et.al. 2506.03568 link
2025-06-04 From Virtual Agents to Robot Teams: A Multi-Robot Framework Evaluation in High-Stakes Healthcare Context Yuanchen Bai et.al. 2506.03546 null
2025-06-04 CogniPair: From LLM Chatbots to Conscious AI Agents -- GNWT-Based Multi-Agent Digital Twins for Social Pairing -- Dating & Hiring Applications Wanghao Ye et.al. 2506.03543 null
2025-06-04 Debate, Reflect, and Distill: Multi-Agent Feedback with Tree-Structured Preference Optimization for Efficient Language Model Enhancement Xiaofeng Zhou et.al. 2506.03541 null
2025-06-04 Go-Browse: Training Web Agents with Structured Exploration Apurva Gandhi et.al. 2506.03533 null
2025-06-04 GA-S $^3$ : Comprehensive Social Network Simulation with Group Agents Yunyao Zhang et.al. 2506.03532 link
2025-06-04 How Far Are We from Predicting Missing Modalities with Foundation Models? Guanzhou Ke et.al. 2506.03530 link
2025-06-04 Correlated equilibrium implementation: Navigating toward social optima with learning dynamics Soumen Banerjee et.al. 2506.03528 null
2025-06-04 Path Generation and Evaluation in Video Games: A Nonparametric Statistical Approach Daniel Campa et.al. 2506.03522 null
2025-06-04 VChatter: Exploring Generative Conversational Agents for Simulating Exposure Therapy to Reduce Social Anxiety Han Zhang et.al. 2506.03520 null
2025-06-04 SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models Arnab Debnath et.al. 2506.03516 null
2025-06-04 Computational Architects of Society: Quantum Machine Learning for Social Rule Genesis Shan Shan et.al. 2506.03503 null
2025-06-04 CORE: Constraint-Aware One-Step Reinforcement Learning for Simulation-Guided Neural Network Accelerator Design Yifeng Xiao et.al. 2506.03474 null
2025-06-03 The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks Walter Mayor et.al. 2506.03404 null
2025-06-03 Impact of Rankings and Personalized Recommendations in Marketplaces Omar Besbes et.al. 2506.03369 null
2025-06-03 A Differential Perspective on Distributional Reinforcement Learning Juan Sebastian Rojas et.al. 2506.03333 null
2025-06-03 Helpful Agent Meets Deceptive Judge: Understanding Vulnerabilities in Agentic Workflows Yifei Ming et.al. 2506.03332 null
2025-06-03 The Future of Continual Learning in the Era of Foundation Models: Three Key Directions Jack Bell et.al. 2506.03320 null
2025-06-03 FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes Christodoulos Constantinides et.al. 2506.03278 link
2025-06-03 NetPress: Dynamically Generated LLM Benchmarks for Network Applications Yajie Zhou et.al. 2506.03231 link
2025-06-03 Multiple-Frequencies Population-Based Training Waël Doulazmi et.al. 2506.03225 null
2025-06-02 Q-ARDNS-Multi: A Multi-Agent Quantum Reinforcement Learning Framework with Meta-Cognitive Adaptation for Complex 3D Environments Umberto Gonçalves de Sousa et.al. 2506.03205 null
2025-06-03 GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Qianhui Wu et.al. 2506.03143 null
2025-06-03 Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning Yinjie Wang et.al. 2506.03136 link
2025-06-03 Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff Sophie Greenwood et.al. 2506.03102 null
2025-06-03 EgoVLM: Policy Optimization for Egocentric Video Understanding Ashwin Vinod et.al. 2506.03097 link
2025-06-03 DPO Learning with LLMs-Judge Signal for Computer Use Agents Man Luo et.al. 2506.03095 null
2025-06-03 Provable Reinforcement Learning from Human Feedback with an Unknown Link Function Qining Zhang et.al. 2506.03066 null
2025-06-03 MAEBE: Multi-Agent Emergent Behavior Framework Sinem Erisken et.al. 2506.03053 null
2025-06-03 EDEN: Entorhinal Driven Egocentric Navigation Toward Robotic Deployment Mikolaj Walczak et.al. 2506.03046 null
2025-06-06 Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective Jintian Shao et.al. 2506.03038 null
2025-06-03 TestAgent: An Adaptive and Intelligent Expert for Human Assessment Junhao Yu et.al. 2506.03032 null
2025-06-03 Coding Agents with Multimodal Browsing are Generalist Problem Solvers Aditya Bharat Soni et.al. 2506.03011 null
2025-06-03 DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models Jiarui Wang et.al. 2506.03007 null
2025-06-03 A Multi-Agent Framework for Mitigating Dialect Biases in Privacy Policy Question-Answering Systems Đorđe Klisura et.al. 2506.02998 null
2025-06-03 Mapping Student-AI Interaction Dynamics in Multi-Agent Learning Environments: Supporting Personalised Learning and Reducing Performance Gaps Zhanxin Hao et.al. 2506.02993 null
2025-06-03 Mitigating Manipulation and Enhancing Persuasion: A Reflective Multi-Agent Approach for Legal Argument Generation Li Zhang et.al. 2506.02992 null
2025-06-03 Adaptive Graph Pruning for Multi-Agent Communication Boyi Li et.al. 2506.02951 null
2025-06-03 Abstract Counterfactuals for Language Model Agents Edoardo Pona et.al. 2506.02946 null
2025-06-08 Hallucination to Consensus: Multi-Agent LLMs for End-to-End Test Generation with Accurate Oracles Qinghua Xu et.al. 2506.02943 null
2025-06-03 ThinkTank: A Framework for Generalizing Domain-Specific AI Agent Systems into Universal Collaborative Intelligence Platforms Praneet Sai Madhu Surabhi et.al. 2506.02931 link
2025-06-03 Large Processor Chip Model Kaiyan Chang et.al. 2506.02929 null
2025-06-03 The Limits of Predicting Agents from Behaviour Alexis Bellot et.al. 2506.02923 null
2025-06-03 Text-guided Generation of Efficient Personalized Inspection Plans Xingpeng Sun et.al. 2506.02917 null
2025-06-03 A Continual Offline Reinforcement Learning Benchmark for Navigation Tasks Anthony Kobanda et.al. 2506.02883 null
2025-06-03 It's the Thought that Counts: Evaluating the Attempts of Frontier LLMs to Persuade on Harmful Topics Matthew Kowal et.al. 2506.02873 null
2025-06-03 Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights Mathieu Andreux et.al. 2506.02865 null
2025-06-03 CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech Helin Wang et.al. 2506.02863 null
2025-06-03 ATAG: AI-Agent Application Threat Assessment with Attack Graphs Parth Atulbhai Gandhi et.al. 2506.02859 null
2025-06-03 Ensemble-MIX: Enhancing Sample Efficiency in Multi-Agent RL Using Ensemble Methods Tom Danino et.al. 2506.02841 null
2025-06-03 On dual-rate consensus under transmission delays David Umsonst et.al. 2506.02840 null
2025-06-03 DeepShop: A Benchmark for Deep Research Shopping Agents Yougang Lyu et.al. 2506.02839 null
2025-06-03 TaxAgent: How Large Language Model Designs Fiscal Policy Jizhou Wang et.al. 2506.02838 null
2025-06-03 Solving the Pod Repositioning Problem with Deep Reinforced Adaptive Large Neighborhood Search Lin Xie et.al. 2506.02746 null
2025-06-03 Why do AI agents communicate in human language? Pengcheng Zhou et.al. 2506.02739 null
2025-06-03 Benchmarking and Advancing Large Language Models for Local Life Services Xiaochong Lan et.al. 2506.02720 null
2025-06-03 Heterogeneous Group-Based Reinforcement Learning for LLM-based Multi-Agent Systems Guanzhong Chen et.al. 2506.02718 null
2025-06-04 MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching Liang Yue et.al. 2506.02689 null
2025-06-03 Decompose, Plan in Parallel, and Merge: A Novel Paradigm for Large Language Models based Planning with Multiple Constraints Zhengdong Lu et.al. 2506.02683 null
2025-06-03 Bounded confidence dynamics generates opinion cascades on a growing scale-free network David Hernandez et.al. 2506.02669 null
2025-06-03 FAuNO: Semi-Asynchronous Federated Reinforcement Learning Framework for Task Offloading in Edge Systems Frederico Metelo et.al. 2506.02668 null
2025-06-04 Non-exchangeable evolutionary and mean field games and their applications H. Yoshioka et.al. 2506.02644 null
2025-06-03 Compositional Learning for Modular Multi-Agent Self-Organizing Networks Qi Liao et.al. 2506.02616 null
2025-06-04 Multi Layered Autonomy and AI Ecologies in Robotic Art Installations Baoyang Chen et.al. 2506.02606 null
2025-06-03 Computational adversarial risk analysis for general security games Jose Manuel Camacho et.al. 2506.02603 null
2025-06-03 A Hybrid Approach to Indoor Social Navigation: Integrating Reactive Local Planning and Proactive Global Planning Arnab Debnath et.al. 2506.02593 null
2025-06-03 CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at Scale Zhun Wang et.al. 2506.02548 link
2025-06-03 Attention Knows Whom to Trust: Attention-based Trust Management for LLM Multi-Agent Systems Pengfei He et.al. 2506.02546 null
2025-06-03 VerificAgent: Integrating Expert Knowledge and Fact-Checked Memory for Robust Domain-Specific Task Planning Thong Q. Nguyen et.al. 2506.02539 null
2025-06-03 Think Twice, Act Once: A Co-Evolution Framework of LLM and RL for Large-Scale Decision Making Xu Wan et.al. 2506.02522 null
2025-06-03 To Embody or Not: The Effect Of Embodiment On User Perception Of LLM-based Conversational Agents Kyra Wang et.al. 2506.02514 link
2025-06-03 AURA: Agentic Upskilling via Reinforced Abstractions Alvin Zhu et.al. 2506.02507 null
2025-06-03 VPI-Bench: Visual Prompt Injection Attacks for Computer-Use Agents Tri Cao et.al. 2506.02456 link
2025-06-03 Multimodal DeepResearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework Zhaorui Yang et.al. 2506.02454 null
2025-06-03 From Anger to Joy: How Nationality Personas Shape Emotion Attribution in Large Language Models Mahammed Kamruzzaman et.al. 2506.02431 null
2025-06-04 Comparative Analysis of AI Agent Architectures for Entity Relationship Classification Maryam Berijanian et.al. 2506.02426 link
2025-06-03 VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments Zelai Xu et.al. 2506.02387 null
2025-06-03 Multi-agent Markov Entanglement Shuze Chen et.al. 2506.02385 null
2025-06-03 Evaluating LLM Agent Adherence to Hierarchical Safety Principles: A Lightweight Benchmark for Probing Foundational Controllability Components Ram Potham et.al. 2506.02357 null
2025-06-03 DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization Jeonghun Kang et.al. 2506.02351 null
2025-06-02 LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback Thai Hoang et.al. 2506.02298 null
2025-06-02 Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction Samuel Li et.al. 2506.02265 null
2025-06-02 Composable Building Blocks for Controllable and Transparent Interactive AI Systems Sebe Vanbrabant et.al. 2506.02262 null
2025-06-02 Stochastically Dominant Peer Prediction Yichi Zhang et.al. 2506.02259 null
2025-06-02 Optimal Coordination of Flexible DERs in Local Energy and Flexibility Markets to Ensure Social Equity Niloofar Pourghaderi et.al. 2506.02179 null
2025-06-02 Reflection-Based Memory For Web navigation Agents Ruhana Azam et.al. 2506.02158 null
2025-06-02 Small Language Models are the Future of Agentic AI Peter Belcak et.al. 2506.02153 null
2025-06-04 The Unified Cognitive Consciousness Theory for Language Models: Anchoring Semantics, Thresholds of Activation, and Emergent Reasoning Edward Y. Chang et.al. 2506.02139 null
2025-06-02 Descriptive History Representations: Learning Representations by Answering Questions Guy Tennenholtz et.al. 2506.02125 null
2025-06-02 Enhancing Interpretability of Quantum-Assisted Blockchain Clustering via AI Agent-Based Qualitative Analysis Yun-Cheng Tsai et.al. 2506.02068 null
2025-06-01 The Measurement Imbalance in Agentic AI Evaluation Undermines Industry Productivity Claims Kiana Jafari Meimandi et.al. 2506.02064 null
2025-06-01 Will Agents Replace Us? Perceptions of Autonomous Multi-Agent AI Nikola Balic et.al. 2506.02055 link
2025-06-01 Phenotypic Profile-Informed Generation of Drug-Like Molecules via Dual-Channel Variational Autoencoders Hui Liu et.al. 2506.02051 null
2025-06-01 Decoupled Hierarchical Reinforcement Learning with State Abstraction for Discrete Grids Qingyu Xiao et.al. 2506.02050 null
2025-06-01 EvoGit: Decentralized Code Evolution via Git-Based Multi-Agent Collaboration Beichen Huang et.al. 2506.02049 link
2025-06-01 Improving LLM Agents with Reinforcement Learning on Cryptographic CTF Challenges Lajos Muzsai et.al. 2506.02048 null
2025-05-31 Beyond the Protocol: Unveiling Attack Vectors in the Model Context Protocol Ecosystem Hao Song et.al. 2506.02040 link
2025-06-02 WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks Atsuyuki Miyai et.al. 2506.01952 null
2025-06-02 Should Decision-Makers Reveal Classifiers in Online Strategic Classification? Han Shao et.al. 2506.01936 null
2025-06-02 Online Competitive Information Gathering for Partially Observable Trajectory Games Mel Krusniak et.al. 2506.01927 null
2025-06-02 COALESCE: Economic and Security Dynamics of Skill-Based Task Outsourcing Among Team of Autonomous LLM Agents Manish Bhatt et.al. 2506.01900 null
2025-06-02 WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue Yaoyao Qian et.al. 2506.01881 link
2025-06-02 Pearl: Automatic Code Optimization Using Deep Reinforcement Learning Djamel Rassem Lamouri et.al. 2506.01880 null
2025-06-02 CONFETTI: Conversational Function-Calling Evaluation Through Turn-Level Interactions Tamer Alkhouli et.al. 2506.01859 null
2025-06-02 Beyond Static Responses: Multi-Agent LLM Systems as a New Paradigm for Social Science Research Jennifer Haase et.al. 2506.01839 null
2025-06-02 The Ultimate Test of Superintelligent AI Agents: Can an AI Balance Care and Control in Asymmetric Relationships? Djallel Bouneffouf et.al. 2506.01813 null
2025-06-02 A Study on the MCP x A2A Framework for Enhancing Interoperability of LLM-based Autonomous Agents Cheonsu Jeong et.al. 2506.01804 null
2025-06-02 Enhancing Customer Service Chatbots with Context-Aware NLU through Selective Attention and Multi-task Learning Subhadip Nandi et.al. 2506.01781 null
2025-06-02 Thinking in Character: Advancing Role-Playing Agents with Role-Aware Reasoning Yihong Tang et.al. 2506.01748 null
2025-06-02 Self-Challenging Language Model Agents Yifei Zhou et.al. 2506.01716 null
2025-06-02 A Descriptive and Normative Theory of Human Beliefs in RLHF Sylee Dandekar et.al. 2506.01692 null
2025-06-02 Geometry Meets Incentives: Sample-Efficient Incentivized Exploration with Linear Contexts Benjamin Schiffer et.al. 2506.01685 null
2025-06-02 A Hierarchical Bin Packing Framework with Dual Manipulators via Heuristic Search and Deep Reinforcement Learning Beomjoon Lee et.al. 2506.01628 null
2025-06-02 Social Cooperation in Conversational AI Agents Mustafa Mert Çelikok et.al. 2506.01624 null
2025-06-02 MAGIK: Mapping to Analogous Goals via Imagination-enabled Knowledge Transfer Ajsal Shereef Palattuparambil et.al. 2506.01623 null
2025-06-02 General agents need world models Jonathan Richens et.al. 2506.01622 null
2025-06-02 MLA-Trust: Benchmarking Trustworthiness of Multimodal LLM Agents in GUI Environments Xiao Yang et.al. 2506.01616 null
2025-06-02 Trajectory First: A Curriculum for Discovering Diverse Policies Cornelius V. Braun et.al. 2506.01568 null
2025-06-02 EvolveNav: Self-Improving Embodied Reasoning for LLM-Based Vision-Language Navigation Bingqian Lin et.al. 2506.01551 null
2025-06-03 LAMARL: LLM-Aided Multi-Agent Reinforcement Learning for Cooperative Policy Generation Guobin Zhu et.al. 2506.01538 null
2025-06-03 Quantum Agents Eldar Sultanow et.al. 2506.01536 null
2025-06-03 STORM-BORN: A Challenging Mathematical Derivations Dataset Curated via a Human-in-the-Loop Multi-Agent Framework Wenhao Liu et.al. 2506.01531 link
2025-06-02 FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents Bobo Li et.al. 2506.01520 null
2025-06-02 PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization Zouying Cao et.al. 2506.01475 null
2025-06-02 Agentic AI and Multiagentic: Are We Reinventing the Wheel? V. Botti et.al. 2506.01463 null
2025-06-02 Agentic Episodic Control Xidong Yang et.al. 2506.01442 null
2025-06-02 Distinguishing Autonomous AI Agents from Collaborative Agentic Systems: A Comprehensive Framework for Understanding Modern Intelligent Architectures Prashik Buddhaghosh Bansod et.al. 2506.01438 null
2025-06-02 FinRobot: Generative Business Process AI Agents for Enterprise Resource Planning in Finance Hongyang Yang et.al. 2506.01423 null
2025-06-02 SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation Rafael Flor-Rodríguez et.al. 2506.01418 link
2025-06-02 Sparse Imagination for Efficient Visual World Model Planning Junha Chun et.al. 2506.01392 null
2025-06-02 AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning Zhong Zhang et.al. 2506.01391 link
2025-06-02 Follow the Flow: Fine-grained Flowchart Attribution with Neurosymbolic Agents Manan Suri et.al. 2506.01344 null
2025-06-02 Enhancing Interpretable Image Classification Through LLM Agents and Conditional Concept Bottleneck Models Yiwen Jiang et.al. 2506.01334 null
2025-06-02 An Empirical Study of Group Conformity in Multi-Agent Systems Min Choi et.al. 2506.01332 null
2025-06-02 ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding Yiyang Zhou et.al. 2506.01300 null
2025-06-02 RAISE: Reasoning Agent for Interactive SQL Exploration Fernando Granado et.al. 2506.01273 null
2025-06-02 CleanS2S: Single-file Framework for Proactive Speech-to-Speech Interaction Yudong Lu et.al. 2506.01268 null
2025-06-02 Comprehensive Vulnerability Analysis is Necessary for Trustworthy LLM-MAS Pengfei He et.al. 2506.01245 null
2025-06-01 A mean field game model with non-local spatial interactions and resources accumulation Daria Ghilli et.al. 2506.01200 null
2025-06-04 Test Automation for Interactive Scenarios via Promptable Traffic Simulation Augusto Mondelli et.al. 2506.01199 null
2025-06-01 Near-feasible Fair Allocations in Two-sided Markets Javier Cembrano et.al. 2506.01178 null
2025-06-01 GraphPad: Inference-Time 3D Scene Graph Updates for Embodied Question Answering Muhammad Qasim Ali et.al. 2506.01174 null
2025-06-01 Towards Fusion of Neural Audio Codec-based Representations with Spectral for Heart Murmur Classification via Bandit-based Cross-Attention Mechanism Orchid Chetia Phukan et.al. 2506.01148 null
2025-06-01 DeepVerse: 4D Autoregressive Video Generation as a World Model Junyi Chen et.al. 2506.01103 null
2025-06-01 Modular Speaker Architecture: A Framework for Sustaining Responsibility and Contextual Integrity in Multi-Agent AI Communication Khe-Han Toh et.al. 2506.01095 null
2025-06-01 The Coming Crisis of Multi-Agent Misalignment: AI Alignment Must Be a Dynamic and Social Process Florian Carichon et.al. 2506.01080 null
2025-06-01 SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models Thinh Pham et.al. 2506.01062 null
2025-06-04 MCP-Zero: Proactive Toolchain Construction for LLM Agents from Scratch Xiang Fei et.al. 2506.01056 null
2025-06-01 Simple Prompt Injection Attacks Can Leak Personal Data Observed by LLM Agents During Task Execution Meysam Alizadeh et.al. 2506.01055 null
2025-06-01 Robust and Safe Multi-Agent Reinforcement Learning Framework with Communication for Autonomous Vehicles Keshawn Smith et.al. 2506.00982 null
2025-06-01 HMPC-assisted Adversarial Inverse Reinforcement Learning for Smart Home Energy Management Jiadong He et.al. 2506.00898 null
2025-06-01 Toward a Theory of Agents as Tool-Use Decision-Makers Hongru Wang et.al. 2506.00886 null
2025-06-01 CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching Leying Zhang et.al. 2506.00885 null
2025-06-01 Can AI Master Econometrics? Evidence from Econometrics AI Agent on Expert-Level Tasks Qiang Chen et.al. 2506.00856 null
2025-06-01 Federated Deep Reinforcement Learning-Driven O-RAN for Automatic Multirobot Reconfiguration Faisal Ahmed et.al. 2506.00822 null
2025-06-01 Action Dependency Graphs for Globally Optimal Coordinated Reinforcement Learning Jianglin Ding et.al. 2506.00797 null
2025-06-01 Predicting Empirical AI Research Outcomes with Language Models Jiaxin Wen et.al. 2506.00794 null
2025-06-01 CO-OPERA: A Human-AI Collaborative Playwriting Tool to Support Creative Storytelling for Interdisciplinary Drama Education Xuejiao Ma et.al. 2506.00791 link
2025-06-01 CoP: Agentic Red-teaming for Large Language Models using Composition of Principles Chen Xiong et.al. 2506.00781 null
2025-05-31 Alignment Revisited: Are Large Language Models Consistent in Stated and Revealed Preferences? Zhuojun Gu et.al. 2506.00751 null
2025-05-31 DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments Chiyu Zhang et.al. 2506.00739 link
2025-05-31 Adaptive Plane Reformatting for 4D Flow MRI using Deep Reinforcement Learning Javier Bisbal et.al. 2506.00727 null
2025-05-31 Browser Fingerprinting Using WebAssembly Mordechai Guri et.al. 2506.00719 null
2025-05-31 An LLM Agent for Functional Bug Detection in Network Protocols Mingwei Zheng et.al. 2506.00714 link
2025-05-31 Adaptive Traffic-Following Scheme for Orderly Distributed Control of Multi-Vehicle Systems Anahita Jain et.al. 2506.00703 null
2025-06-04 Optimizing Sensory Neurons: Nonlinear Attention Mechanisms for Accelerated Convergence in Permutation-Invariant Neural Networks for Reinforcement Learning Junaid Muzaffar et.al. 2506.00691 null
2025-05-31 AgentAuditor: Human-Level Safety and Security Evaluation for LLM Agents Hanjun Luo et.al. 2506.00641 null
2025-05-31 Social Construction of Urban Space: Understanding Neighborhood Boundaries Using Rental Listings Adam Visokay et.al. 2506.00634 null
2025-05-31 The Disparate Effects of Partial Information in Bayesian Strategic Learning Srikanth Avasarala et.al. 2506.00627 null
2025-06-04 RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents Jingyi Yang et.al. 2506.00618 null
2025-05-31 PAKTON: A Multi-Agent Framework for Question Answering in Long Legal Agreements Petros Raptopoulos et.al. 2506.00608 link
2025-05-31 Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn Hongyao Tang et.al. 2506.00592 null
2025-05-31 Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs Yufa Zhou et.al. 2506.00577 link
2025-05-31 ORAN-GUIDE: RAG-Driven Prompt Learning for LLM-Augmented Reinforcement Learning in O-RAN Network Slicing Fatemeh Lotfi et.al. 2506.00576 null
2025-05-31 Prompt-Tuned LLM-Augmented DRL for Dynamic O-RAN Network Slicing Fatemeh Lotfi et.al. 2506.00574 null
2025-05-31 MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning Peng Xia et.al. 2506.00555 null
2025-05-31 Two-Sided Manipulation Games in Stable Matching Markets Hadi Hosseini et.al. 2506.00554 null
2025-05-31 AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation Ming Wang et.al. 2506.00551 link
2025-05-31 Towards Multi-dimensional Evaluation of LLM Summarization across Domains and Languages Hyangsuk Min et.al. 2506.00549 null
2025-05-31 Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline Zhaoying Wang et.al. 2506.00546 null
2025-06-04 ARIA: Training Language Agents with Intention-Driven Reward Aggregation Ruihan Yang et.al. 2506.00539 null
2025-05-31 Temac: Multi-Agent Collaboration for Automated Web GUI Testing Chenxu Liu et.al. 2506.00520 null
2025-05-31 Goal-Aware Identification and Rectification of Misinformation in Multi-Agent Systems Zherui Li et.al. 2506.00509 null
2025-05-31 Reinforcement Learning for Hanabi Nina Cohen et.al. 2506.00458 null
2025-05-31 RLAE: Reinforcement Learning-Assisted Ensemble for LLMs Yuqian Fu et.al. 2506.00439 null
2025-05-31 Enabling Chatbots with Eyes and Ears: An Immersive Multimodal Conversation System for Dynamic Interactions Jihyoung Jang et.al. 2506.00421 null
2025-05-31 World Models for Cognitive Agents: Transforming Edge Intelligence in Future Networks Changyuan Zhao et.al. 2506.00417 null
2025-05-31 LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks Yi Yang et.al. 2506.00411 null
2025-05-31 Sensor Fusion Methods for Gaussian Mixture Models Ishan Paranjape et.al. 2506.00383 null
2025-05-31 Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents Xiao Yu et.al. 2506.00320 null
2025-05-30 Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model Oliver Mortensen et.al. 2506.00286 null
2025-05-30 MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility Yexiao He et.al. 2506.00235 null
2025-05-30 Sorrel: A simple and flexible framework for multi-agent reinforcement learning Rebekah A. Gelpí et.al. 2506.00228 link
2025-05-30 REIC: RAG-Enhanced Intent Classification at Scale Ziji Zhang et.al. 2506.00210 null
2025-05-30 When GPT Spills the Tea: Comprehensive Assessment of Knowledge File Leakage in GPTs Xinyue Shen et.al. 2506.00197 null
2025-05-30 Breakpoint: Scalable evaluation of system-level reasoning in LLM code agents Kaivalya Hariharan et.al. 2506.00172 null
2025-06-03 A novel sensitivity analysis method for agent-based models stratifies in-silico tumor spheroid simulations Edward H. Rohr et.al. 2506.00168 null
2025-05-30 Werewolf: A Straightforward Game Framework with TTS for Improved User Engagement Qihui Fan et.al. 2506.00160 null
2025-05-30 MRDust: Wireless Implant Data Uplink & Localization via Magnetic Resonance Image Modulation Biqi Rebekah Zhao et.al. 2506.00143 null
2025-05-30 Autonomous Behavior and Whole-Brain Dynamics Emerge in Embodied Zebrafish Agents with Model-based Intrinsic Motivation Reece Keller et.al. 2506.00138 null
2025-05-30 A Reinforcement Learning-Based Telematic Routing Protocol for the Internet of Underwater Things Mohammadhossein Homaei et.al. 2506.00133 null
2025-05-30 Adapting Offline Reinforcement Learning with Online Delays Simon Sinong Zhan et.al. 2506.00131 null
2025-05-30 Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents Yaxin Luo et.al. 2505.24878 link
2025-05-30 Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks Tajamul Ashraf et.al. 2505.24876 link
2025-05-30 VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software Brandon Man et.al. 2505.24838 link
2025-05-30 Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation Yucheng Zhou et.al. 2505.24787 link
2025-06-02 EXP-Bench: Can AI Conduct AI Research Experiments? Patrick Tser Jern Kon et.al. 2505.24785 link
2025-05-30 Emergent Dynamics of Active Systems on Curved Environments Euan D. Mackay et.al. 2505.24730 null
2025-05-30 CoRet: Improved Retriever for Code Editing Fabio Fehr et.al. 2505.24715 null
2025-05-30 Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting Wei Chen et.al. 2505.24710 link
2025-05-30 Towards a unified user modeling language for engineering human centered AI systems Aaron Conrardy et.al. 2505.24697 null
2025-05-30 Multiple LLM Agents Debate for Equitable Cultural Alignment Dayeon Ki et.al. 2505.24671 link
2025-05-30 Black-box Adversarial Attacks on CNN-based SLAM Algorithms Maria Rafaela Gkeka et.al. 2505.24654 null
2025-05-30 Online Budget-Feasible Mechanism Design with Predictions Georgios Amanatidis et.al. 2505.24624 null
2025-05-30 Distributed Intelligence in the Computing Continuum with Active Inference Victor Casamayor Pujol et.al. 2505.24618 null
2025-05-30 When Harry Meets Superman: The Role of The Interlocutor in Persona-Based Dialogue Generation Daniela Occhipinti et.al. 2505.24613 null
2025-06-02 AutoChemSchematic AI: A Closed-Loop, Physics-Aware Agentic Framework for Auto-Generating Chemical Process and Instrumentation Diagrams Sakhinana Sagar Srinivas et.al. 2505.24584 null
2025-05-30 NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization Hyuntak Kim et.al. 2505.24575 null
2025-05-30 CREFT: Sequential Multi-Agent LLM for Character Relation Extraction Ye Eun Chun et.al. 2505.24553 null
2025-05-30 Melding the Serverless Control Plane with the Conventional Cluster Manager for Speed and Compatibility Leonid Kondrashov et.al. 2505.24551 null
2025-05-30 Online Fair Division with Additional Information Tzeh Yuan Neoh et.al. 2505.24503 null
2025-05-30 RMoA: Optimizing Mixture-of-Agents through Diversity Maximization and Residual Compensation Zhentao Xie et.al. 2505.24442 link
2025-05-30 P: A Universal Measure of Predictive Intelligence David Gamez et.al. 2505.24426 null
2025-05-30 Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer Yilun Kong et.al. 2505.24378 link
2025-05-30 Unifying Language Agent Algorithms with Graph-based Orchestration Engine for Reproducible Agent Research Qianqian Zhang et.al. 2505.24354 null
2025-05-30 Context-Aware Sentiment Forecasting via LLM-based Multi-Perspective Role-Playing Agents Fanhang Man et.al. 2505.24331 link
2025-05-30 Online Fair Allocations with Binary Valuations and Beyond Yuanyuan Wang et.al. 2505.24321 null
2025-05-30 ROAD: Responsibility-Oriented Reward Design for Reinforcement Learning in Autonomous Driving Yongming Chen et.al. 2505.24317 null
2025-05-30 R3DM: Enabling Role Discovery and Diversity Through Dynamics Models in Multi-agent Reinforcement Learning Harsh Goel et.al. 2505.24265 link
2025-05-30 Effects of Theory of Mind and Prosocial Beliefs on Steering Human-Aligned Behaviors of LLMs in Ultimatum Games Neemesh Yadav et.al. 2505.24255 link
2025-05-30 Rethinking Continual Learning with Progressive Neural Collapse Zheng Wang et.al. 2505.24254 null
2025-05-30 Proactive Guidance of Multi-Turn Conversation in Industrial Search Xiaoyu Li et.al. 2505.24251 null
2025-05-30 An Adversary-Resistant Multi-Agent LLM System via Credibility Scoring Sana Ebrahimi et.al. 2505.24239 null
2025-05-30 SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems Xu He et.al. 2505.24201 null
2025-05-30 Learning Gentle Humanoid Locomotion and End-Effector Stabilization Control Yitang Li et.al. 2505.24198 link
2025-05-30 Learning API Functionality from Demonstrations for Tool-based Agents Bhrij Patel et.al. 2505.24197 null
2025-05-30 Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control Zijie Xu et.al. 2505.24161 null
2025-05-30 Don't Just Follow MLLM Plans: Robust and Efficient Planning for Open-world Agents Seungjoon Lee et.al. 2505.24157 null
2025-05-30 Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction Chenyou Fan et.al. 2505.24156 null
2025-05-30 Biological Pathway Guided Gene Selection Through Collaborative Reinforcement Learning Ehtesamul Azim et.al. 2505.24155 link
2025-05-30 Distributed Neural Policy Gradient Algorithm for Global Convergence of Networked Multi-Agent Reinforcement Learning Pengcheng Dai et.al. 2505.24113 null
2025-05-30 Deception in Oligopoly Games via Adaptive Nash Seeking Systems Michael Tang et.al. 2505.24112 null
2025-05-29 mRAG: Elucidating the Design Space of Multi-modal Retrieval-Augmented Generation Chan-Wei Hu et.al. 2505.24073 null
2025-05-29 Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning Jiashun Liu et.al. 2505.24061 null
2025-05-29 LLM Agents Should Employ Security Principles Kaiyuan Zhang et.al. 2505.24019 null
2025-05-29 ConversAR: Exploring Embodied LLM-Powered Group Conversations in Augmented Reality for Second Language Learners Jad Bendarkawi et.al. 2505.24000 null
2025-05-29 Multi-RAG: A Multimodal Retrieval-Augmented Generation System for Adaptive Video Understanding Mingyang Mao et.al. 2505.23990 null
2025-05-29 Rules, agents and order Amalia Puente et.al. 2505.23985 null
2025-05-29 Information Structure in Mappings: An Approach to Learning, Representation, and Generalisation Henry Conklin et.al. 2505.23960 null
2025-05-29 Estimating Misreporting in the Presence of Genuine Modification: A Causal Perspective Dylan Zapzalka et.al. 2505.23954 null
2025-05-29 Enhancing LLM-Based Code Generation with Complexity Metrics: A Feedback-Driven Approach Melika Sepidband et.al. 2505.23953 null
2025-05-29 InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback Boyuan Chen et.al. 2505.23950 null
2025-05-29 Lessons Learned: A Multi-Agent Framework for Code LLMs to Learn and Improve Yuanzhe Liu et.al. 2505.23946 null
2025-05-29 ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents Feiteng Fang et.al. 2505.23923 null
2025-05-29 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation Mengkang Hu et.al. 2505.23885 link
2025-05-29 Combining Deep Architectures for Information Gain estimation and Reinforcement Learning for multiagent field exploration Emanuele Masiero et.al. 2505.23865 null
2025-05-29 DATD3: Depthwise Attention Twin Delayed Deep Deterministic Policy Gradient For Model Free Reinforcement Learning Under Output Feedback Control Wuhao Wang et.al. 2505.23857 null
2025-05-29 Large Language Model-Based Agents for Automated Research Reproducibility: An Exploratory Study in Alzheimer's Disease Nic Dobbins et.al. 2505.23852 null
2025-05-28 Seven Security Challenges That Must be Solved in Cross-domain Multi-agent LLM Systems Ronny Ko et.al. 2505.23847 null
2025-05-28 Scalable, Symbiotic, AI and Non-AI Agent Based Parallel Discrete Event Simulations Atanu Barai et.al. 2505.23846 null
2025-05-28 GeneBreaker: Jailbreak Attacks against DNA Language Models with Pathogenicity Guidance Zaixi Zhang et.al. 2505.23839 link
2025-05-28 CoMaPOI: A Collaborative Multi-Agent Framework for Next POI Prediction Bridging the Gap Between Trajectory and Language Lin Zhong et.al. 2505.23837 null
2025-05-28 Large Language Models Often Know When They Are Being Evaluated Joe Needham et.al. 2505.23836 null
2025-05-28 Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective Qingchuan Ma et.al. 2505.23833 link
2025-05-28 Privacy-Preserving Inconsistency Measurement Carl Corea et.al. 2505.23825 null
2025-05-27 Aligning LLMs by Predicting Preferences from User Writing Samples Stéphane Aroca-Ouellette et.al. 2505.23815 null
2025-05-29 From Chat Logs to Collective Insights: Aggregative Question Answering Wentao Zhang et.al. 2505.23765 null
2025-05-29 ZeroGUI: Automating Online GUI Learning at Zero Human Cost Chenyu Yang et.al. 2505.23762 link
2025-05-29 ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks Akashah Shabbir et.al. 2505.23752 link
2025-05-29 ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering Zexi Liu et.al. 2505.23723 link
2025-05-29 COBRA: Contextual Bandit Algorithm for Ensuring Truthful Strategic Agents Arun Verma et.al. 2505.23720 null
2025-05-29 From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems Zeinab Nezami et.al. 2505.23710 null
2025-05-29 Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics Ran Zhang et.al. 2505.23695 link
2025-05-29 ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork Caroline Wang et.al. 2505.23686 link
2025-05-31 GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents Manish Shetty et.al. 2505.23671 link
2025-05-29 Initial Luminally Deposited FGF4 Critically Influences Blastocyst Patterning Michael A. Ramirez-Sierra et.al. 2505.23650 null
2025-05-29 Securing AI Agents with Information-Flow Control Manuel Costa et.al. 2505.23643 link
2025-05-29 MCP Safety Training: Learning to Refuse Falsely Benign MCP Exploits using Improved Preference Alignment John Halloran et.al. 2505.23634 null
2025-06-02 MAPLE: A Mobile Agent with Persistent Finite State Machines for Structured Task Reasoning Linqiang Guo et.al. 2505.23596 null
2025-05-29 SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents Kunlun Zhu et.al. 2505.23559 link
2025-05-29 Going from a Representative Agent to Counterfactuals in Combinatorial Choice Yanqiu Ruan et.al. 2505.23546 null
2025-05-29 TRAP: Targeted Redirecting of Agentic Preferences Hangoo Kang et.al. 2505.23518 null
2025-05-29 PhysicsNeRF: Physics-Guided 3D Reconstruction from Sparse Views Mohamed Rayan Barhdadi et.al. 2505.23481 link
2025-05-29 Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns Xiang Li et.al. 2505.23474 null
2025-05-29 On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment Safwan Labbi et.al. 2505.23459 null
2025-05-29 Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents Zhejian Yang et.al. 2505.23450 null
2025-06-01 Emergent Risk Awareness in Rational Agents under Resource Constraints Daniel Jarne Ornia et.al. 2505.23436 null
2025-05-29 From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents Tobias Lindenbauer et.al. 2505.23422 link
2025-06-01 SWE-bench Goes Live! Linghao Zhang et.al. 2505.23419 link
2025-05-29 Agent Interpolation for Knowledge Marta Bílková et.al. 2505.23401 null
2025-05-29 GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning Jusheng Zhang et.al. 2505.23399 null
2025-05-29 Grower-in-the-Loop Interactive Reinforcement Learning for Greenhouse Climate Control Maxiu Xiao et.al. 2505.23355 null
2025-05-29 Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems Xu Shen et.al. 2505.23352 link
2025-06-02 ScEdit: Script-based Assessment of Knowledge Editing Xinye Li et.al. 2505.23291 link
2025-05-29 Wireless Agentic AI with Retrieval-Augmented Multimodal Semantic Perception Guangyuan Liu et.al. 2505.23275 null
2025-05-29 Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion Chunlong Xie et.al. 2505.23266 null
2025-05-29 Achieving Equitability with Subsidy Yuanyuan Wang et.al. 2505.23251 null
2025-05-29 Context-Aware Semantic Communication for the Wireless Networks Guangyuan Liu et.al. 2505.23249 null
2025-05-29 OSS-UAgent: An Agent-based Usability Evaluation Framework for Open Source Software Lingkai Meng et.al. 2505.23239 link
2025-05-29 TrackVLA: Embodied Visual Tracking in the Wild Shaoan Wang et.al. 2505.23189 null
2025-05-29 Cross-Task Experiential Learning on LLM-based Multi-Agent Collaboration Yilong Li et.al. 2505.23187 null
2025-05-29 Conceptual Framework Toward Embodied Collective Adaptive Intelligence Fan Wang et.al. 2505.23153 null
2025-05-29 Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners Michal Nauman et.al. 2505.23150 null
2025-05-29 PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents Haoyu Chen et.al. 2505.23130 null
2025-05-29 Learning to Incentivize in Repeated Principal-Agent Problems with Adversarial Agent Arrivals Junyan Liu et.al. 2505.23124 null
2025-05-29 A Constructed Response: Designing and Choreographing Robot Arm Movements in Collaborative Dance Improvisation Xiaoyu Chang et.al. 2505.23090 null
2025-05-29 Second Opinion Matters: Towards Adaptive Clinical AI via the Consensus of Expert Model Ensemble Amit Kumthekar et.al. 2505.23075 null
2025-05-29 CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents Zhen Xiang et.al. 2505.23055 link
2025-05-29 AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models Jinchuan Zhang et.al. 2505.23020 link
2025-06-01 Stairway to Success: Zero-Shot Floor-Aware Object-Goal Navigation via LLM-Driven Coarse-to-Fine Exploration Zeying Gong et.al. 2505.23019 null
2025-05-29 A Practical Approach for Building Production-Grade Conversational Agents with Workflow Graphs Chiwan Park et.al. 2505.23006 null
2025-05-29 LLM Agents for Bargaining with Utility-based Feedback Jihwan Oh et.al. 2505.22998 null
2025-05-29 Verify-in-the-Graph: Entity Disambiguation Enhancement for Complex Claim Verification with Interactive Graph Representation Hoang Pham et.al. 2505.22993 null
2025-05-29 MenTeR: A fully-automated Multi-agenT workflow for end-to-end RF/Analog Circuits Netlist Design Pin-Han Chen et.al. 2505.22990 null
2025-05-29 Free Lunch for User Experience: Crowdsourcing Agents for Scalable User Studies Siyang Liu et.al. 2505.22981 null
2025-05-29 Learning Recommender Mechanisms for Bayesian Stochastic Games Bengisu Guresti et.al. 2505.22979 null
2025-05-29 MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary Programming Chengqi Zheng et.al. 2505.22967 null
2025-05-29 ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind Peixuan Han et.al. 2505.22961 link
2025-05-29 Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness Yongjin Yang et.al. 2505.22960 null
2025-05-29 Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents Jenny Zhang et.al. 2505.22954 link
2025-05-28 WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning Yuchen Zhuang et.al. 2505.22942 null
2025-05-28 A Smart-Contract to Resolve Multiple Equilibrium in Intermediated Trade Daniel Aronoff et.al. 2505.22940 null
2025-05-28 On the Resolution of Stochastic MPECs over Networks: Distributed Implicit Zeroth-Order Gradient Tracking Methods Mohammadjavad Ebrahimi et.al. 2505.22916 null
2025-05-28 Learning to Charge More: A Theoretical Study of Collusion by Q-Learning Agents Cristian Chica et.al. 2505.22909 null
2025-05-28 Conversational Alignment with Artificial Intelligence in Context Rachel Katharine Sterken et.al. 2505.22907 null
2025-05-30 Causal-PIK: Causality-based Physical Reasoning with a Physics-Informed Kernel Carlota Parés-Morlans et.al. 2505.22861 null
2025-05-28 Operationalizing CaMeL: Strengthening LLM Defenses for Enterprise Deployment Krti Tallam et.al. 2505.22852 null
2025-05-28 RocqStar: Leveraging Similarity-driven Retrieval and Agentic Systems for Rocq generation Nikita Khramov et.al. 2505.22846 null
2025-05-28 A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems Jonghan Lim et.al. 2505.22814 null
2025-05-28 First Steps Towards Overhearing LLM Agents: A Case Study With Dungeons & Dragons Gameplay Andrew Zhu et.al. 2505.22809 link
2025-05-28 Dynamic Task Adaptation for Multi-Robot Manufacturing Systems with Large Language Models Jonghan Lim et.al. 2505.22804 null
2025-05-28 Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games Antonio Ocello et.al. 2505.22781 null
2025-05-28 MEDAL: A Framework for Benchmarking LLMs as Multilingual Open-Domain Chatbots and Dialogue Evaluators John Mendonça et.al. 2505.22777 link
2025-05-28 Calibrated Value-Aware Model Learning with Stochastic Environment Models Claas Voelcker et.al. 2505.22772 null
2025-05-28 Enhancing Lifelong Multi-Agent Path-finding by Using Artificial Potential Fields Arseniy Pertzovsky et.al. 2505.22753 null
2025-05-28 HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer Qi Cai et.al. 2505.22705 link
2025-05-28 Design and testing of an agent chatbot supporting decision making with public transport data Luca Fantin et.al. 2505.22698 null
2025-05-28 When Does Neuroevolution Outcompete Reinforcement Learning in Transfer Learning Tasks? Eleni Nisioti et.al. 2505.22696 link
2025-05-28 LLM-ODDR: A Large Language Model Framework for Joint Order Dispatching and Driver Repositioning Tengfei Lyu et.al. 2505.22695 null
2025-05-28 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model Wenbo Hu et.al. 2505.22657 null
2025-05-28 Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents Michael Kirchhof et.al. 2505.22655 null
2025-05-28 WebDancer: Towards Autonomous Information Seeking Agency Jialong Wu et.al. 2505.22648 link
2025-06-01 FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control Younggyo Seo et.al. 2505.22642 null
2025-05-28 LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents Rui Li et.al. 2505.22634 null
2025-05-28 HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined in HDDL with OpenAI Gym Ngoc La et.al. 2505.22597 link
2025-05-28 GitGoodBench: A Novel Benchmark For Evaluating Agentic Performance On Git Tobias Lindenbauer et.al. 2505.22583 link
2025-05-30 Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems Hoang Pham et.al. 2505.22571 null
2025-05-28 Universal Visuo-Tactile Video Understanding for Embodied Interaction Yifan Xie et.al. 2505.22566 null
2025-05-28 Training RL Agents for Multi-Objective Network Defense Tasks Andres Molina-Markham et.al. 2505.22531 null
2025-05-28 AI instructional agent improves student's perceived learner control and learning outcome: empirical evidence from a randomized controlled trial Fei Qin et.al. 2505.22526 null
2025-05-28 From Strangers to Assistants: Fast Desire Alignment for Embodied Agent-User Adaptation Yuanfei Wang et.al. 2505.22503 null
2025-05-28 EvolveSearch: An Iterative Self-Evolving Search Agent Dingchu Zhang et.al. 2505.22501 null
2025-05-28 Human-Centered Human-AI Collaboration (HCHAC) Qi Gao et.al. 2505.22477 null
2025-05-29 Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems Jiaxi Yang et.al. 2505.22467 null
2025-05-28 AI Mathematician: Towards Fully Automated Frontier Mathematical Research Yuanhang Liu et.al. 2505.22451 null
2025-05-28 COSMOS: A Data-Driven Probabilistic Time Series simulator for Chemical Plumes across Spatial Scales Arunava Nag et.al. 2505.22436 link
2025-05-28 Exact Algorithms and Lower Bounds for Forming Coalitions of Constrained Maximum Size Foivos Fioravantes et.al. 2505.22384 null
2025-05-28 AgentDNS: A Root Domain Naming System for LLM Agents Enfang Cui et.al. 2505.22368 null
2025-05-28 From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications Feibo Jiang et.al. 2505.22311 null
2025-05-28 Voice CMS: updating the knowledge base of a digital assistant through conversation Grzegorz Wolny et.al. 2505.22303 null
2025-05-29 YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction Mingzhuang Wang et.al. 2505.22250 null
2025-05-28 Efficient Leave-one-out Approximation in LLM Multi-agent Debate Based on Introspection Yue Cui et.al. 2505.22192 null
2025-05-28 MONSTR: Model-Oriented Neutron Strain Tomographic Reconstruction Mohammad Samin Nur Chowdhury et.al. 2505.22187 null
2025-05-28 Online Fair Division for Personalized $2$ -Value Instances Georgios Amanatidis et.al. 2505.22174 null
2025-05-28 Oryx: a Performant and Scalable Algorithm for Many-Agent Coordination in Offline MARL Claude Formanek et.al. 2505.22151 null
2025-05-28 Lifted Forward Planning in Relational Factored Markov Decision Processes with Concurrent Actions Florian Andreas Marwitz et.al. 2505.22147 null
2025-05-28 Sentiment Simulation using Generative AI Agents Melrose Tia et.al. 2505.22125 null
2025-05-30 VIRAL: Vision-grounded Integration for Reward design And Learning Valentin Cuzin-Rambaud et.al. 2505.22092 link
2025-05-28 AudioGenie: A Training-Free Multi-Agent Framework for Diverse Multimodality-to-Multiaudio Generation Yan Rong et.al. 2505.22053 null
2025-05-28 Reinforced Reasoning for Embodied Planning Di Wu et.al. 2505.22050 null
2025-05-28 VulBinLLM: LLM-powered Vulnerability Detection for Stripped Binaries Nasir Hussain et.al. 2505.22010 null
2025-05-28 Efficiently Enhancing General Agents With Hierarchical-categorical Memory Changze Qiao et.al. 2505.22006 null
2025-05-28 Reward-Independent Messaging for Decentralized Multi-Agent Reinforcement Learning Naoto Yoshida et.al. 2505.21985 null
2025-05-28 Pearl: A Multimodal Culturally-Aware Arabic Instruction Dataset Fakhraddin Alwajih et.al. 2505.21979 null
2025-05-29 DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation Tianjun Gu et.al. 2505.21969 link
2025-05-28 MapStory: LLM-Powered Text-Driven Map Animation Prototyping with Human-in-the-Loop Editing Aditya Gunturu et.al. 2505.21966 null
2025-05-28 UI-Evol: Automatic Knowledge Evolving for Computer Use Agents Ziyun Zhang et.al. 2505.21964 null
2025-05-28 LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents Taro Yano et.al. 2505.21963 null
2025-05-28 Properties of zero-determinant strategies in multichannel games Masahiko Ueda et.al. 2505.21952 null
2025-06-01 RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments Zeyi Liao et.al. 2505.21936 link
2025-05-28 Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference Yue Zhu et.al. 2505.21919 null
2025-05-31 Modeling and Optimizing User Preferences in AI Copilots: A Comprehensive Survey and Taxonomy Saleh Afzoon et.al. 2505.21907 null
2025-05-28 Co-Saving: Resource Aware Multi-Agent Collaboration for Software Development Rennai Qiu et.al. 2505.21898 null
2025-05-28 Incorporating LLMs for Large-Scale Urban Complex Mobility Simulation Yu-Lun Song et.al. 2505.21880 null
2025-06-02 GETReason: Enhancing Image Context Extraction through Hierarchical Multi-Agent Reasoning Shikhhar Siingh et.al. 2505.21863 null
2025-05-27 AI Agent Governance: A Field Guide Jam Kraprayoon et.al. 2505.21808 null
2025-05-27 Events and their Localisation are Relative to a Lab V. Vilasini et.al. 2505.21797 null
2025-05-27 Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation Tharindu Kumarage et.al. 2505.21784 null
2025-05-27 BehaviorSFT: Behavioral Token Conditioning for Clinical Agents Across the Proactivity Spectrum Yubin Kim et.al. 2505.21757 null
2025-05-27 AI-Supported Platform for System Monitoring and Decision-Making in Nuclear Waste Management with Large Language Models Dongjune Chang et.al. 2505.21741 null
2025-05-27 Deep Reinforcement Learning Agents are not even close to Human Intelligence Quentin Delfosse et.al. 2505.21731 null
2025-05-27 On Reconfigurable Bisimulation, with an Application to the Distributed Synthesis Problem Yehia Abd Alrahman et.al. 2505.21672 null
2025-05-27 Classifying and Clustering Trading Agents Mateusz Wilinski et.al. 2505.21662 link
2025-05-27 PreGenie: An Agentic Framework for High-quality Visual Presentation Generation Xiaojie Xu et.al. 2505.21660 null
2025-05-27 Herd Behavior: Investigating Peer Influence in LLM-based Multi-Agent Systems Young-Min Cho et.al. 2505.21588 null
2025-05-27 AITEE -- Agentic Tutor for Electrical Engineering Christopher Knievel et.al. 2505.21582 link
2025-05-27 RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving Huacan Wang et.al. 2505.21577 link
2025-05-27 ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools Zhucong Li et.al. 2505.21569 null
2025-05-26 Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design Framework Julien Soulé et.al. 2505.21559 null
2025-05-26 Fermionic operatorial model of a system with competitive and cooperative interactions M. Gorgone et.al. 2505.21554 null
2025-05-27 Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making Yihan Wang et.al. 2505.21503 null
2025-05-27 AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery Haowei Wang et.al. 2505.21499 link
2025-05-27 Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers Wei Pang et.al. 2505.21497 link
2025-05-27 UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents Han Xiao et.al. 2505.21496 link
2025-05-27 Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming Yang Yang et.al. 2505.21486 null
2025-05-27 Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration Zijun Liu et.al. 2505.21471 link
2025-05-27 Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO Muzhi Zhu et.al. 2505.21457 null
2025-05-27 Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks Francesco Cozzi et.al. 2505.21426 link
2025-05-27 GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation Naizhu Jin et.al. 2505.21425 null
2025-05-27 Autonomous Multi-Modal LLM Agents for Treatment Planning in Focused Ultrasound Ablation Surgery Lina Zhao et.al. 2505.21418 null
2025-05-27 A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment Brett Bissey et.al. 2505.21414 null
2025-05-27 MRSD: Multi-Resolution Skill Discovery for HRL Agents Shashank Sharma et.al. 2505.21410 null
2025-05-27 Breaking co-existence: zealotry vs. nonlinear social impact Christopher R. Kitching et.al. 2505.21407 null
2025-05-27 AutoJudger: An Agent-Driven Framework for Efficient Benchmarking of MLLMs Xuanwen Ding et.al. 2505.21389 link
2025-05-27 Distributed equilibrium seeking in aggregative games: linear convergence under singular perturbations lens Guido Carnevale et.al. 2505.21386 null
2025-05-27 Evaluating LLM Adaptation to Sociodemographic Factors: User Profile vs. Dialogue History Qishuai Zhong et.al. 2505.21362 link
2025-05-28 PEDANTIC: A Dataset for the Automatic Examination of Definiteness in Patent Claims Valentin Knappich et.al. 2505.21342 null
2025-05-27 Large Language Models Miss the Multi-Agent Mark Emanuele La Malfa et.al. 2505.21298 null
2025-05-27 Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework Saman Marandi et.al. 2505.21291 null
2025-05-27 PACT: A Contract-Theoretic Framework for Pricing Agentic AI Services Powered by Large Language Models Ya-Ting Yang et.al. 2505.21286 null
2025-05-27 XBOUND: Exploring the Capability Boundaries of Device-Control Agents through Trajectory Tree Exploration Shaoqing Zhang et.al. 2505.21279 null
2025-05-27 Data-Driven Cellular Mobility Management via Bayesian Optimization and Reinforcement Learning Mohamed Benzaghta et.al. 2505.21249 null
2025-05-27 Breaking the Performance Ceiling in Complex Reinforcement Learning requires Inference Strategies Felix Chalumeau et.al. 2505.21236 null
2025-05-27 Quantum AIXI: Universal Intelligence via Quantum Information Elija Perrier et.al. 2505.21170 null
2025-05-27 GGBond: Growing Graph-Based AI-Agent Society for Socially-Aware Recommender Simulation Hailin Zhong et.al. 2505.21154 null
2025-05-27 IKMo: Image-Keyframed Motion Generation with Trajectory-Pose Conditioned Motion Diffusion Model Yang Zhao et.al. 2505.21146 null
2025-05-27 Creativity in LLM-based Multi-Agent Systems: A Survey Yi-Cheng Lin et.al. 2505.21116 null
2025-05-27 Simulating Ethics: Using LLM Debate Panels to Model Deliberation on Medical Dilemmas Hazem Zohny et.al. 2505.21112 null
2025-05-27 CXXCrafter: An LLM-Based Agent for Automated C/C++ Open Source Software Building Zhengmin Yu et.al. 2505.21069 null
2025-05-27 Agent-Environment Alignment via Automated Interface Generation Kaiming Liu et.al. 2505.21055 null
2025-05-27 RefAV: Towards Planning-Centric Scenario Mining Cainan Davidson et.al. 2505.20981 link
2025-05-27 Identifying Super Spreaders in Multilayer Networks Michał Czuba et.al. 2505.20980 null
2025-05-28 Towards Conversational Development Environments: Using Theory-of-Mind and Multi-Agent Architectures for Requirements Refinement Keheliya Gallaba et.al. 2505.20973 null
2025-05-27 Semantic Communication meets System 2 ML: How Abstraction, Compositionality and Emergent Languages Shape Intelligence Mehdi Bennis et.al. 2505.20964 null
2025-05-27 Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective Yang Zhang et.al. 2505.20922 link
2025-05-27 Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation Pingrui Zhang et.al. 2505.20897 link
2025-05-27 Reinforcement Learning-based Sequential Route Recommendation for System-Optimal Traffic Assignment Leizhen Wang et.al. 2505.20889 null
2025-05-27 MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems Kai Chen et.al. 2505.20824 link
2025-05-27 MT-Mol:Multi Agent System with Tool-based Reasoning for Molecular Optimization Hyomin Kim et.al. 2505.20820 null
2025-05-27 Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective Krishna Singh Rajput et.al. 2505.20816 null
2025-05-27 Can Agents Fix Agent Issues? Alfin Wijaya Rahardja et.al. 2505.20749 null
2025-05-27 RRO: LLM Agent Optimization Through Rising Reward Trajectories Zilong Wang et.al. 2505.20737 null
2025-05-27 E2E Process Automation Leveraging Generative AI and IDP-Based Automation Agent: A Case Study on Corporate Expense Processing Cheonsu Jeong et.al. 2505.20733 null
2025-05-27 SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution Hanlin Wang et.al. 2505.20732 link
2025-05-27 ManiTaskGen: A Comprehensive Task Generator for Benchmarking and Improving Vision-Language Agents on Embodied Decision-Making Liu Dai et.al. 2505.20726 null
2025-05-27 A reinforcement learning agent for maintenance of deteriorating systems with increasingly imperfect repairs Alberto Pliego Marugán et.al. 2505.20725 null
2025-05-28 VLM Can Be a Good Assistant: Enhancing Embodied Visual Tracking with Self-Improving Vision-Language Models Kui Wu et.al. 2505.20718 null
2025-05-27 Hierarchical Instruction-aware Embodied Visual Tracking Kui Wu et.al. 2505.20710 null
2025-05-27 Berk-Nash Rationalizability Ignacio Esponda et.al. 2505.20708 null
2025-05-27 GIFARC: Synthetic Dataset for Leveraging Human-Intuitive Analogies to Elevate AI Reasoning Woochang Sim et.al. 2505.20672 null
2025-05-27 LLM-Guided Reinforcement Learning: Addressing Training Bottlenecks through Policy Modulation Heng Tan et.al. 2505.20671 null
2025-05-27 MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning Zikang Guo et.al. 2505.20670 null
2025-05-30 AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage Xuanle Zhao et.al. 2505.20662 link
2025-05-27 BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism Qinzhuo Wu et.al. 2505.20660 null
2025-05-27 An Optimisation Framework for Unsupervised Environment Design Nathan Monette et.al. 2505.20659 null
2025-05-27 CoderAgent: Simulating Student Behavior for Personalized Programming Learning with Large Language Models Yi Zhan et.al. 2505.20642 null
2025-05-27 IndustryEQA: Pushing the Frontiers of Embodied Question Answering in Industrial Scenarios Yifan Li et.al. 2505.20640 null
2025-05-27 Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration Sibo Xiao et.al. 2505.20625 null
2025-05-29 The challenge of hidden gifts in multi-agent reinforcement learning Dane Malenfant et.al. 2505.20579 null
2025-05-26 Synergising Hierarchical Data Centers and Power Networks: A Privacy-Preserving Approach Junhong Liu et.al. 2505.20575 null
2025-05-26 xChemAgents: Agentic AI for Explainable Quantum Chemistry Can Polat et.al. 2505.20574 link
2025-05-26 Byzantine-Resilient Distributed P2P Energy Trading via Spatial-Temporal Anomaly Detection Junhong Liu et.al. 2505.20567 null
2025-05-26 Learning a Pessimistic Reward Model in RLHF Yinglun Xu et.al. 2505.20556 null
2025-05-28 Trade among moral agents with information asymmetries José Ignacio Rivero-Wildemauwe et.al. 2505.20551 null
2025-05-26 Project Riley: Multimodal Multi-Agent LLM Collaboration with Emotional Reasoning and Voting Ana Rita Ortigoso et.al. 2505.20521 null
2025-05-26 CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic Yuxuan Sun et.al. 2505.20510 null
2025-05-26 Reconceptualizing Smart Microscopy: From Data Collection to Knowledge Creation by Multi-Agent Integration P. S. Kesavan et.al. 2505.20466 null
2025-05-26 OSVI-WM: One-Shot Visual Imitation for Unseen Tasks using World-Model-Guided Trajectory Generation Raktim Gautam Goswami et.al. 2505.20425 null
2025-05-26 RetroMotion: Retrocausal Motion Forecasting Models are Instructable Royden Wagner et.al. 2505.20414 link
2025-05-26 SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents Ibragim Badertdinov et.al. 2505.20411 link
2025-05-26 Algorithmic Control Improves Residential Building Energy and EV Management when PV Capacity is High but Battery Capacity is Low Lennart Ullner et.al. 2505.20377 null
2025-05-26 VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection Zeyi Huang et.al. 2505.20289 null
2025-05-26 Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution Jiahao Qiu et.al. 2505.20286 link
2025-05-27 MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability Weiqi Wu et.al. 2505.20285 link
2025-05-26 OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction Haonan Zhang et.al. 2505.20277 link
2025-05-26 Ten Principles of AI Agent Economics Ke Yang et.al. 2505.20273 null
2025-05-26 syftr: Pareto-Optimal Generative AI Alexander Conway et.al. 2505.20266 link
2025-05-26 On Path to Multimodal Historical Reasoning: HistBench and HistAgent Jiahao Qiu et.al. 2505.20246 link
2025-05-26 Shutdownable Agents through POST-Agency Elliott Thornley et.al. 2505.20203 null
2025-05-26 THiNK: Can Large Language Models Think-aloud? Yongan Yu et.al. 2505.20184 link
2025-05-26 The Problem of Algorithmic Collisions: Mitigating Unforeseen Risks in a Connected World Maurice Chiodo et.al. 2505.20181 null
2025-05-27 MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents Ziming Wei et.al. 2505.20148 link
2025-05-26 Agentic 3D Scene Generation with Spatially Contextualized VLMs Xinhang Liu et.al. 2505.20129 null
2025-05-26 Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers Zhengliang Shi et.al. 2505.20128 link
2025-05-26 Agentic AI Process Observability: Discovering Behavioral Variability Fabiana Fournier et.al. 2505.20127 null
2025-05-26 Agents Require Metacognitive and Strategic Reasoning to Succeed in the Coming Labor Markets Simpson Zhang et.al. 2505.20120 null
2025-05-27 TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent Dominik Meier et.al. 2505.20118 link
2025-05-26 MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning Thang Nguyen et.al. 2505.20096 null
2025-05-26 SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale Qi Li et.al. 2505.20094 null
2025-05-26 REARANK: Reasoning Re-ranking Agent via Reinforcement Learning Le Zhang et.al. 2505.20046 link
2025-05-26 Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking Yihan Chen et.al. 2505.20023 null
2025-05-26 WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback Minda Hu et.al. 2505.20013 null
2025-05-26 The Many Challenges of Human-Like Agents in Virtual Game Environments Maciej Świechowski et.al. 2505.20011 null
2025-05-26 Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents Tao Wu et.al. 2505.19997 null
2025-05-26 The residual maximin share Uriel Feige et.al. 2505.19961 null
2025-05-26 MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research Hui Chen et.al. 2505.19955 link
2025-05-26 Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval Rong-Cheng Tu et.al. 2505.19952 null
2025-05-26 Signed Angle Rigid Graphs for Network Localization and Formation Control Jinpeng Huang et.al. 2505.19945 null
2025-05-26 Subtle Risks, Critical Failures: A Framework for Diagnosing Physical Safety of LLMs for Embodied Decision Making Yejin Son et.al. 2505.19933 null
2025-05-27 Evaluating AI cyber capabilities with crowdsourced elicitation Artem Petrov et.al. 2505.19915 null
2025-05-26 EMAC+: Embodied Multimodal Agent for Collaborative Planning with VLM+LLM Shuang Ao et.al. 2505.19905 null
2025-05-26 ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows Qiushi Sun et.al. 2505.19897 null
2025-05-26 Large Language Models as Autonomous Spacecraft Operators in Kerbal Space Program Alejandro Carrasco et.al. 2505.19896 link
2025-05-26 Deep Active Inference Agents for Delayed and Long-Horizon Environments Yavar Taheri Yeganeh et.al. 2505.19867 link
2025-05-26 DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning Leander Diaz-Bone et.al. 2505.19850 link
2025-05-26 Multi-Agent Reinforcement Learning in Cybersecurity: From Fundamentals to Applications Christoph R. Landolt et.al. 2505.19837 null
2025-05-26 SecVulEval: Benchmarking LLMs for Real-World C/C++ Vulnerability Detection Md Basim Uddin Ahmed et.al. 2505.19828 link
2025-05-26 Integrating emotional intelligence, memory architecture, and gestures to achieve empathetic humanoid robot interaction in an educational setting Fuze Sun et.al. 2505.19803 null
2025-05-26 Opinion dynamics for an increasing population of agents. A symmetric continuous agent model Ioannis Markou et.al. 2505.19791 null
2025-05-26 TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning Yuhui Chen et.al. 2505.19769 null
2025-05-26 T^2Agent A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search Xing Cui et.al. 2505.19768 null
2025-05-26 RFTF: Reinforcement Fine-tuning for Embodied Agents with Temporal Feedback Junyang Shu et.al. 2505.19767 null
2025-05-26 Agentic Predictor: Performance Prediction for Agentic Workflows via Multi-View Encoding Patara Trirat et.al. 2505.19764 link
2025-05-26 Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning Zican Hu et.al. 2505.19761 link
2025-05-26 NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering Ruisheng Cao et.al. 2505.19754 null
2025-05-26 ReChisel: Effective Automatic Chisel Code Generation by LLM with Reflection Juxin Niu et.al. 2505.19734 link
2025-05-26 Extremum Flow Matching for Offline Goal Conditioned Reinforcement Learning Quentin Rouxel et.al. 2505.19717 null
2025-05-28 JEDI: Latent End-to-end Diffusion Mitigates Agent-Human Performance Asymmetry in Model-Based Reinforcement Learning Jing Yu Lim et.al. 2505.19698 null
2025-05-26 Large Language Models for Planning: A Comprehensive and Systematic Survey Pengfei Cao et.al. 2505.19683 link
2025-05-26 FieldWorkArena: Agentic AI Benchmark for Real Field Work Tasks Atsunori Moteki et.al. 2505.19662 null
2025-05-26 Select, Read, and Write: A Multi-Agent Framework of Full-Text-based Related Work Generation Xiaochuan Liu et.al. 2505.19647 link
2025-05-26 Adaptive Episode Length Adjustment for Multi-agent Reinforcement Learning Byunghyun Yoo et.al. 2505.19637 null
2025-05-26 DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue Yichun Feng et.al. 2505.19630 link
2025-05-28 AgentRecBench: Benchmarking LLM Agent-based Personalized Recommender Systems Yu Shang et.al. 2505.19623 null
2025-05-26 Multi-Agent Collaboration via Evolving Orchestration Yufan Dang et.al. 2505.19591 null
2025-05-26 LLM-Agent-Controller: A Universal Multi-Agent Large Language Model System as a Control Engineer Rasoul Zahedifar et.al. 2505.19567 null
2025-05-26 AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and Healthcare Ying Xiao et.al. 2505.19562 link
2025-05-26 Towards Multi-Granularity Memory Association and Selection for Long-Term Conversational Agents Derong Xu et.al. 2505.19549 link
2025-05-26 DoctorRAG: Medical RAG Fusing Knowledge with Patient Analogy through Textual Gradients Yuxing Lu et.al. 2505.19538 null
2025-05-26 Fox in the Henhouse: Supply-Chain Backdoor Attacks Against Reinforcement Learning Shijie Liu et.al. 2505.19532 null
2025-05-26 Benchmarking and Enhancing LLM Agents in Localizing Linux Kernel Bugs Zhenhao Zhou et.al. 2505.19489 link
2025-05-26 VLMLight: Traffic Signal Control via Vision-Language Meta-Control and Dual-Branch Reasoning Maonan Wang et.al. 2505.19486 null
2025-05-26 Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs Hao Kang et.al. 2505.19481 link
2025-05-26 Judging with Many Minds: Do More Perspectives Mean Less Prejudice? Chiyu Ma et.al. 2505.19477 link
2025-05-26 Improving Recommendation Fairness without Sensitive Attributes Using Multi-Persona LLMs Haoran Xin et.al. 2505.19473 null
2025-05-26 Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI Ranjan Sapkota et.al. 2505.19443 null
2025-05-26 Task Memory Engine: Spatial Memory for Robust Multi-Step LLM Agents Ye Ye et.al. 2505.19436 link
2025-05-26 Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression Peijie Dong et.al. 2505.19433 link
2025-05-26 Frictional Agent Alignment Framework: Slow Down and Don't Break Things Abhijnan Nath et.al. 2505.19428 link
2025-05-26 Fusion Intelligence for Digital Twinning AI Data Centers: A Synergistic GenAI-PhyAI Approach Ruihang Wang et.al. 2505.19409 null
2025-05-26 CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems Yan Wen et.al. 2505.19405 null
2025-05-27 DiffVLA: Vision-Language Guided Diffusion Planning for Autonomous Driving Anqing Jiang et.al. 2505.19381 null
2025-05-26 Belief Attribution as Mental Explanation: The Role of Accuracy, Informativity, and Causality Lance Ying et.al. 2505.19376 null
2025-05-27 Prompting Decision Transformers for Zero-Shot Reach-Avoid Policies Kevin Li et.al. 2505.19337 null
2025-05-25 What do Blind and Low-Vision People Really Want from Assistive Smart Devices? Comparison of the Literature with a Focus Study Bhanuka Gamage et.al. 2505.19325 null
2025-05-25 Making Teams and Influencing Agents: Efficiently Coordinating Decision Trees for Interpretable Multi-Agent Reinforcement Learning Rex Chen et.al. 2505.19316 null
2025-05-25 Retrieval-Augmented Generation for Service Discovery: Chunking Strategies and Benchmarking Robin D. Pesl et.al. 2505.19310 null
2025-05-25 A Novel Zero-Trust Identity Framework for Agentic AI: Decentralized Authentication and Fine-Grained Access Control Ken Huang et.al. 2505.19301 null
2025-05-25 A likelihood-based Bayesian inference framework for the calibration of and selection between stochastic velocity-jump models Arianna Ceccarelli et.al. 2505.19292 null
2025-05-25 A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning Yuzheng Hu et.al. 2505.19281 link
2025-05-25 A General Theory of Risk Sharing Vasily Melnikov et.al. 2505.19276 null
2025-05-25 Agentic Information Theory: Ergodicity and Intrinsic Semantics of Information Processes James P. Crutchfield et.al. 2505.19275 null
2025-05-25 ALRPHFS: Adversarially Learned Risk Patterns with Hierarchical Fast & Slow Reasoning for Robust Agent Defense Shiyu Xiang et.al. 2505.19260 null
2025-05-25 DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research João Coelho et.al. 2505.19253 null
2025-05-25 Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees Sourav Ganguly et.al. 2505.19238 null
2025-05-25 Sensorimotor features of self-awareness in multimodal large language models Iñaki Dellibarda Varela et.al. 2505.19237 null
2025-05-25 GUARDIAN: Safeguarding LLM Multi-Agent Collaborations with Temporal Graph Modeling Jialong Zhou et.al. 2505.19234 null
2025-05-25 Numerical Analysis of Damage Evolution in Open Hole CFRP Laminates Modified with Electrospun Self Healing Diels Alder Interleaves Marianna Chantzi et.al. 2505.19232 null
2025-05-25 Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding Shiyue Wang et.al. 2505.19219 null
2025-05-25 Omni-Perception: Omnidirectional Collision Avoidance for Legged Locomotion in Dynamic Environments Zifan Wang et.al. 2505.19214 null
2025-05-25 When Ethics and Payoffs Diverge: LLM Agents in Morally Charged Social Dilemmas Steffen Backmann et.al. 2505.19212 link
2025-05-25 SpeakStream: Streaming Text-to-Speech with Interleaved Data Richard He Bai et.al. 2505.19206 null
2025-05-25 OptiMindTune: A Multi-Agent Framework for Intelligent Hyperparameter Optimization Meher Bhaskar Madiraju et.al. 2505.19205 link
2025-05-27 Structuring the Unstructured: A Multi-Agent System for Extracting and Querying Financial KPIs and Guidance Chanyeol Choi et.al. 2505.19197 null
2025-05-27 When Two LLMs Debate, Both Think They'll Win Pradyumna Shyama Prasad et.al. 2505.19184 null
2025-05-25 Investigating Pedagogical Teacher and Student LLM Agents: Genetic Adaptation Meets Retrieval Augmented Generation Across Learning Style Debdeep Sanyal et.al. 2505.19173 null
2025-05-25 Amplifying Human Creativity and Problem Solving with AI Through Generative Collective Intelligence Thomas P. Kehler et.al. 2505.19167 null
2025-05-25 The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework Feiran Liu et.al. 2505.19139 null
2025-05-25 Incentivizing High-Quality Human Annotations with Golden Questions Shang Liu et.al. 2505.19134 null
2025-05-25 Agentic Visualization: Extracting Agent-based Design Patterns from Visualization Systems Vaishali Dhanoa et.al. 2505.19101 null
2025-05-25 ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World Runliang Niu et.al. 2505.19095 link
2025-05-25 A Systematic Classification of Vulnerabilities in MoveEVM Smart Contracts (MWC) Selçuk Topal et.al. 2505.19047 null
2025-05-25 SANNet: A Semantic-Aware Agentic AI Networking Framework for Multi-Agent Cross-Layer Coordination Yong Xiao et.al. 2505.18946 null
2025-05-25 MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Xuanming Zhang et.al. 2505.18943 link
2025-05-24 Beyond Domain Randomization: Event-Inspired Perception for Visually Robust Adversarial Imitation from Videos Andrea Ramazzina et.al. 2505.18899 link
2025-05-24 Security Concerns for Large Language Models: A Survey Miles Q. Li et.al. 2505.18889 null
2025-05-24 Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach Yuchen Wu et.al. 2505.18882 null
2025-05-24 SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes Dicong Qiu et.al. 2505.18881 null
2025-05-24 CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions Kung-Hsiang Huang et.al. 2505.18878 link
2025-05-24 Guided by Guardrails: Control Barrier Functions as Safety Instructors for Robotic Learning Maeva Guerrier et.al. 2505.18858 null
2025-05-24 Multi-Party Conversational Agents: A Survey Sagar Sapkota et.al. 2505.18845 null
2025-05-24 Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning Jinzheng Li et.al. 2505.18831 null
2025-05-24 LiteCUA: Computer as MCP Server for Computer-Use Agent on AIOS Kai Mei et.al. 2505.18829 link
2025-05-24 Agent-Based Decentralized Energy Management of EV Charging Station with Solar Photovoltaics via Multi-Agent Reinforcement Learning Jiarong Fan et.al. 2505.18750 null
2025-05-27 $C^3$ -Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking Peijie Yu et.al. 2505.18746 link
2025-05-24 Reward-Driven Interaction: Enhancing Proactive Dialogue Agents through User Satisfaction Prediction Wei Shen et.al. 2505.18731 null
2025-05-24 AI-Researcher: Autonomous Scientific Innovation Jiabin Tang et.al. 2505.18705 link
2025-05-24 LLM-QFL: Distilling Large Language Model for Quantum Federated Learning Dev Gurung et.al. 2505.18656 link
2025-05-24 SEW: Self-Evolving Agentic Workflows for Automated Code Generation Siwei Liu et.al. 2505.18646 link
2025-05-24 DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation Zhihao Jia et.al. 2505.18630 null
2025-05-24 A representation theorem for events within lattice structures of state-spaces Alex A. T. Rathke et.al. 2505.18615 null
2025-05-27 Debate-to-Detect: Reformulating Misinformation Detection as a Real-World Debate with Large Language Models Chen Han et.al. 2505.18596 null
2025-05-24 MisoDICE: Multi-Agent Imitation from Unlabeled Mixed-Quality Demonstrations The Viet Bui et.al. 2505.18595 null
2025-05-24 Bayesian Meta-Reinforcement Learning with Laplace Variational Recurrent Networks Joery A. de Vries et.al. 2505.18591 link
2025-05-24 Removal of Hallucination on Hallucination: Debate-Augmented RAG Wentao Hu et.al. 2505.18581 link
2025-05-24 MASTER: Multi-Agent Security Through Exploration of Roles and Topological Structures -- A Comprehensive Framework Yifan Zhu et.al. 2505.18572 null
2025-05-24 Benchmarking Poisoning Attacks against Retrieval-Augmented Generation Baolei Zhang et.al. 2505.18543 null
2025-05-24 MRGAgents: A Multi-Agent Framework for Improved Medical Report Generation with Med-LVLMs Pengyu Wang et.al. 2505.18530 null
2025-05-24 Grounding Bodily Awareness in Visual Representations for Efficient Policy Learning Junlin Wang et.al. 2505.18487 link
2025-05-24 Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services Guoheng Sun et.al. 2505.18471 null
2025-05-27 A Survey of LLM $\times$ DATA Xuanhe Zhou et.al. 2505.18458 link
2025-05-24 EdgeAgentX: A Novel Framework for Agentic AI at the Edge in Military Communication Networks Abir Ray et.al. 2505.18457 null
2025-05-24 A numerical demonstration of dynamic stall control Sarasija Sudharsan et.al. 2505.18449 null
2025-05-24 Finite-Time Global Optimality Convergence in Deep Neural Actor-Critic Methods for Decentralized Multi-Agent Reinforcement Learning Zhiyao Zhang et.al. 2505.18433 null
2025-05-23 Reinforcement Learning for Ballbot Navigation in Uneven Terrain Achkan Salehi et.al. 2505.18417 link
2025-05-23 DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding Yue Jiang et.al. 2505.18411 link
2025-05-23 An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems Fangqiao Tian et.al. 2505.18397 null
2025-05-23 Dynamic Risk Assessments for Offensive Cybersecurity Agents Boyi Wei et.al. 2505.18384 link
2025-05-23 Hard Negative Mining for Domain-Specific Retrieval in Enterprise Systems Hansa Meghwani et.al. 2505.18366 null
2025-05-23 Persona Alchemy: Designing, Evaluating, and Implementing Psychologically-Grounded LLM Agents for Diverse Stakeholder Representation Sola Kim et.al. 2505.18351 null
2025-05-23 The Cell Must Go On: Agar.io for Continual Reinforcement Learning Mohamed A. Mohamed et.al. 2505.18347 link
2025-05-23 Diffusion Self-Weighted Guidance for Offline Reinforcement Learning Augusto Tagle et.al. 2505.18345 null
2025-05-23 CrashAgent: Crash Scenario Generation via Multi-modal Reasoning Miao Li et.al. 2505.18341 null
2025-05-23 Towards Natural Language Communication for Cooperative Autonomous Driving via Self-Play Jiaxun Cui et.al. 2505.18334 null
2025-05-23 Single-agent or Multi-agent Systems? Why Not Both? Mingyan Gao et.al. 2505.18286 null
2025-05-23 Collaborative Memory: Multi-User Memory Sharing in LLM Agents with Dynamic Access Control Alireza Rezazadeh et.al. 2505.18279 null
2025-05-23 BEDI: A Comprehensive Benchmark for Evaluating Embodied Agents on UAVs Mingning Guo et.al. 2505.18229 link
2025-05-23 Implementing Agents in JavaScript Timotheus Kampik et.al. 2505.18228 null
2025-05-23 IDA-Bench: Evaluating LLMs on Interactive Guided Data Analysis Hanyu Li et.al. 2505.18223 link
2025-05-23 CoMet: Metaphor-Driven Covert Communication for Multi-Agent Language Games Shuhang Xu et.al. 2505.18218 link
2025-05-23 LA-RCS: LLM-Agent-Based Robot Control System TaekHyun Park et.al. 2505.18214 null
2025-05-23 Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find Owen Bianchi et.al. 2505.18148 null
2025-05-23 Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading Mohamed Swailem et.al. 2505.18145 null
2025-05-23 Gaming Tool Preferences in Agentic LLMs Kazem Faghih et.al. 2505.18135 link
2025-05-23 ProgRM: Build Better GUI Agents with Progress Rewards Danyang Zhang et.al. 2505.18121 null
2025-05-23 Facility Location with Public Locations and Private Doubly-Peaked Costs Richard Cole et.al. 2505.18114 null
2025-05-23 ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework Lisheng Huang et.al. 2505.18105 link
2025-05-23 Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL Joey Hong et.al. 2505.18098 null
2025-05-23 Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding Xiaoyi Zhang et.al. 2505.18079 null
2025-05-23 Linear Mixture Distributionally Robust Markov Decision Processes Zhishuai Liu et.al. 2505.18044 null
2025-05-27 Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective Jintian Shao et.al. 2505.17997 null
2025-05-23 Survival Games: Human-LLM Strategic Showdowns under Severe Resource Scarcity Zhihong Chen et.al. 2505.17937 link
2025-05-23 Formalizing Embeddedness Failures in Universal Artificial Intelligence Cole Wyeth et.al. 2505.17882 null
2025-05-23 Best Group Identification in Multi-Objective Bandits Mohammad Shahverdikondori et.al. 2505.17869 null
2025-05-23 DesignX: Human-Competitive Algorithm Designer for Black-Box Optimization Hongshu Guo et.al. 2505.17866 null
2025-05-23 Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities Ziwei Zhou et.al. 2505.17862 link
2025-05-23 Superplatforms Have to Attack AI Agents Jianghao Lin et.al. 2505.17861 null
2025-05-23 Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning Nicolas Castanet et.al. 2505.17830 null
2025-05-23 Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models Xuchen Pan et.al. 2505.17826 link
2025-05-23 Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour Bálint Gyevnár et.al. 2505.17801 null
2025-05-23 DialogXpert: Driving Intelligent and Emotion-Aware Conversations through Online Value-Based Reinforcement Learning with LLM Priors Tazeek Bin Abdur Rakib et.al. 2505.17795 null
2025-05-23 The Real Barrier to LLM Agent Usability is Agentic ROI Weiwen Liu et.al. 2505.17767 null
2025-05-23 HRSim: An agent-based simulation platform for high-capacity ride-sharing services Wang Chen et.al. 2505.17758 link
2025-05-23 Feasible Action Space Reduction for Quantifying Causal Responsibility in Continuous Spatial Interactions Ashwin George et.al. 2505.17739 link
2025-05-23 Automating Safety Enhancement for LLM-based Agents with Synthetic Risk Scenarios Xueyang Zhou et.al. 2505.17735 null
2025-05-23 URB -- Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles Ahmet Onur Akman et.al. 2505.17734 null
2025-05-23 Get Experience from Practice: LLM Agents with Record & Replay Erhu Feng et.al. 2505.17716 null
2025-05-23 Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek Xueyang Li et.al. 2505.17702 null
2025-05-23 Star-like thermoresponsive microgels: a new class of soft nanocolloids Elisa Ballin et.al. 2505.17700 null
2025-05-23 Rethinking Agent Design: From Top-Down Workflows to Bottom-Up Skill Evolution Jiawei Du et.al. 2505.17673 null
2025-05-23 Simulating Macroeconomic Expectations using LLM Agents Jianhao Lin et.al. 2505.17648 null
2025-05-23 HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning Chuhao Zhou et.al. 2505.17645 null
2025-05-27 TransBench: Breaking Barriers for Transferable Graphical User Interface Agents in Dynamic Digital Environments Yuheng Lu et.al. 2505.17629 link
2025-05-23 CAS-IQA: Teaching Vision-Language Models for Synthetic Angiography Quality Assessment Bo Wang et.al. 2505.17619 null
2025-05-23 Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments Qingyu Lu et.al. 2505.17616 link
2025-05-23 Distilling LLM Agent into Small Models with Retrieval and Code Tools Minki Kang et.al. 2505.17612 link
2025-05-23 Learning Equilibria from Data: Provably Efficient Multi-Agent Imitation Learning Till Freihaut et.al. 2505.17610 null
2025-05-23 Controlled Agentic Planning & Reasoning for Mechanism Synthesis João Pedro Gandarela et.al. 2505.17607 null
2025-05-23 AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model Tijmen de Haan et.al. 2505.17592 null
2025-05-23 USTBench: Benchmarking and Dissecting Spatiotemporal Reasoning of LLMs as Urban Agents Siqi Lai et.al. 2505.17572 null
2025-05-26 Novobo: Supporting Teachers' Peer Learning of Instructional Gestures by Teaching a Mentee AI-Agent Together Jiaqi Jiang et.al. 2505.17557 null
2025-05-23 Probe by Gaming: A Game-based Benchmark for Assessing Conceptual Knowledge in LLMs Shuhang Xu et.al. 2505.17512 null
2025-05-23 Multi-agent Systems for Misinformation Lifecycle : Detection, Correction And Source Identification Aditya Gautam et.al. 2505.17511 null
2025-05-23 The Discovery Engine: A Framework for AI-Driven Synthesis and Navigation of Scientific Knowledge Landscapes Vladimir Baulin et.al. 2505.17500 null
2025-05-23 PD $^3$ : A Project Duplication Detection Framework via Adapted Multi-Agent Debate Dezheng Bao et.al. 2505.17492 null
2025-05-23 MARCO: Meta-Reflection with Cross-Referencing for Code Reasoning Yusheng Zhao et.al. 2505.17481 null
2025-05-23 Hydra: Structured Cross-Source Enhanced Large Language Model Reasoning Xingyu Tan et.al. 2505.17464 null
2025-05-23 LLM-BSCVM: An LLM-Based Blockchain Smart Contract Vulnerability Management Framework Yanli Jin et.al. 2505.17416 link
2025-05-23 Emergence of Anti-chemotactic Flocking in Active Biomimetic Colloids Joseph D. Lopes et.al. 2505.17394 null
2025-05-23 Curriculum Guided Reinforcement Learning for Efficient Multi Hop Retrieval Augmented Generation Yuelyu Ji et.al. 2505.17391 null
2025-05-23 Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition Zichen Wang et.al. 2505.17379 null
2025-05-22 A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety Ankita Kushwaha et.al. 2505.17342 null
2025-05-22 Partner Modelling Emerges in Recurrent Agents (But Only When It Matters) Ruaridh Mon-Williams et.al. 2505.17323 null
2025-05-22 Control of Renewable Energy Communities using AI and Real-World Data Tiago Fonseca et.al. 2505.17321 null
2025-05-22 Search Wisely: Mitigating Sub-optimal Agentic Searches By Reducing Uncertainty Peilin Wu et.al. 2505.17281 null
2025-05-22 ConvoyNext: A Scalable Testbed Platform for Cooperative Autonomous Vehicle Systems Hossein Maghsoumi et.al. 2505.17275 link
2025-05-22 Navigating Polytopes with Safety: A Control Barrier Function Approach Tamas G. Molnar et.al. 2505.17270 link
2025-05-22 Backdoors in DRL: Four Environments Focusing on In-distribution Triggers Chace Ashcraft et.al. 2505.17248 null
2025-05-22 Personalizing Student-Agent Interactions Using Log-Contextualized Retrieval Augmented Generation (RAG) Clayton Cohn et.al. 2505.17238 null
2025-05-22 ExeSQL: Self-Taught Text-to-SQL Models with Execution-Driven Bootstrapping for SQL Dialects Jipeng Zhang et.al. 2505.17231 null
2025-05-22 RetroChat: Designing for the Preservation of Past Digital Experiences Suifang Zhou et.al. 2505.17208 null
2025-05-22 LengthLogD: A Length-Stratified Ensemble Framework for Enhanced Peptide Lipophilicity Prediction via Multi-Scale Feature Integration Shuang Wu et.al. 2505.17198 null
2025-05-22 Can Large Language Models Design Biological Weapons? Evaluating Moremi Bio Gertrude Hattoh et.al. 2505.17154 null
2025-05-22 LLM-Powered Agents for Navigating Venice's Historical Cadastre Tristan Karch et.al. 2505.17148 null
2025-05-22 RAP: Runtime-Adaptive Pruning for LLM Inference Huanrong Liu et.al. 2505.17138 null
2025-05-21 Swarm Intelligence Enhanced Reasoning: A Density-Driven Framework for LLM-Based Multi-Agent Optimization Ying Zhu et.al. 2505.17115 null
2025-05-21 CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution Minghao Shao et.al. 2505.17107 link
2025-05-21 P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark Tao Sun et.al. 2505.17104 link
2025-05-20 Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization Yihong Wu et.al. 2505.17086 null
2025-05-22 SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding Haoning Wu et.al. 2505.17012 link
2025-05-22 X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs Rui Ye et.al. 2505.16997 link
2025-05-22 MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems Rui Ye et.al. 2505.16988 link
2025-05-22 T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning Amartya Chakraborty et.al. 2505.16986 null
2025-05-22 Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine Adib Bazgir et.al. 2505.16982 null
2025-05-22 Know the Ropes: A Heuristic Strategy for LLM-based Multi-Agent System Design Zhenkun Li et.al. 2505.16979 null
2025-05-22 SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development Yaxin Du et.al. 2505.16975 link
2025-05-22 Modeling Inequality in Complex Networks of Strategic Agents using Iterative Game-Theoretic Transactions Mayank Kejriwal et.al. 2505.16966 null
2025-05-22 Cracking Aegis: An Adversarial LLM-based Game for Raising Awareness of Vulnerabilities in Privacy Protection Jiaying Fu et.al. 2505.16954 null
2025-05-22 A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization Shengyu Feng et.al. 2505.16952 null
2025-05-22 AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios Yunjia Qi et.al. 2505.16944 link
2025-05-25 NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification NovelSeek Team et.al. 2505.16938 link
2025-05-22 Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning Bosung Kim et.al. 2505.16928 null
2025-05-22 Risk-Averse Reinforcement Learning with Itakura-Saito Loss Igor Udovichenko et.al. 2505.16925 null
2025-05-22 RealEngine: Simulating Autonomous Driving in Realistic Context Junzhe Jiang et.al. 2505.16902 link
2025-05-22 Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks Hongyuan Tao et.al. 2505.16901 null
2025-05-22 Identifying, Evaluating, and Mitigating Risks of AI Thought Partnerships Kerem Oktar et.al. 2505.16899 null
2025-05-22 Hydrogen peroxide electrogeneration from O2 electroreduction: a review focusing on carbon electrocatalysts and environmental applications Aline B. Trench et.al. 2505.16887 null
2025-05-22 Strategically Linked Decisions in Long-Term Planning and Reinforcement Learning Alihan Hüyük et.al. 2505.16833 null
2025-05-22 From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization Haonian Ji et.al. 2505.16832 link
2025-05-22 GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent Bin Xie et.al. 2505.16827 link
2025-05-22 LLM-Based Emulation of the Radio Resource Control Layer: Towards AI-Native RAN Protocols Ziming liu et.al. 2505.16821 null
2025-05-22 A modular framework for automated evaluation of procedural content generation in serious games with deep reinforcement learning agents Eleftherios Kalafatis et.al. 2505.16801 null
2025-05-22 Fuzzy Information Evolution with Three-Way Decision in Social Network Group Decision-Making Qianlei Jia et.al. 2505.16781 null
2025-05-22 Sequential Monte Carlo for Policy Optimization in Continuous POMDPs Hany Abdulsamad et.al. 2505.16732 null
2025-05-22 MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models Xuanqi Gao et.al. 2505.16700 null
2025-05-22 CoNav: Collaborative Cross-Modal Reasoning for Embodied Navigation Haihong Hao et.al. 2505.16663 link
2025-05-22 O $^2$ -Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering Jianbiao Mei et.al. 2505.16582 link
2025-05-22 How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning Max Weltevrede et.al. 2505.16581 null
2025-05-22 Large Language Model-Empowered Interactive Load Forecasting Yu Zuo et.al. 2505.16577 null
2025-05-22 EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions Spencer Hong et.al. 2505.16576 link
2025-05-22 Is Your LLM-Based Multi-Agent a Reliable Real-World Planner? Exploring Fraud Detection in Travel Planning Junchi Yao et.al. 2505.16557 null
2025-05-22 Psychology-driven LLM Agents for Explainable Panic Prediction on Social Media during Sudden Disaster Events Mengzhu Liu et.al. 2505.16455 null
2025-05-22 Beyond Static Testbeds: An Interaction-Centric Agent Simulation Platform for Dynamic Recommender Systems Song Jin et.al. 2505.16429 null
2025-05-22 Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach Xiaoran Yin et.al. 2505.16422 null
2025-05-22 WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning Zhepei Wei et.al. 2505.16421 link
2025-05-22 VL-SAFE: Vision-Language Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving Yansong Qu et.al. 2505.16377 null
2025-05-22 Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance Taeyoon Kwon et.al. 2505.16348 null
2025-05-22 Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions Marc Brooks et.al. 2505.16311 null
2025-05-22 No Black Boxes: Interpretable and Interactable Predictive Healthcare with Knowledge-Enhanced Agentic Causal Discovery Xiaoxue Han et.al. 2505.16288 null
2025-05-22 ARPO:End-to-End Policy Optimization for GUI Agents with Experience Replay Fanbin Lu et.al. 2505.16282 link
2025-05-22 HiMATE: A Hierarchical Multi-Agent Framework for Machine Translation Evaluation Shijie Zhang et.al. 2505.16281 null
2025-05-22 Spatio-temporal agent-based modelling of malaria Camelia R. Walker et.al. 2505.16240 link
2025-05-22 CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering Yuren Mao et.al. 2505.16229 null
2025-05-22 Velocity Completion Task and Method for Event-based Player Positional Data in Soccer Rikuhei Umemoto et.al. 2505.16199 null
2025-05-22 Fairness and Efficiency in Human-Agent Teams: An Iterative Algorithm Design Approach Mai Lee Chang et.al. 2505.16171 null
2025-05-22 LLM-Powered AI Agent Systems and Their Applications in Industry Guannan Liang et.al. 2505.16120 null
2025-05-22 BioDSA-1K: Benchmarking Data Science Agents for Biomedical Research Zifeng Wang et.al. 2505.16100 null
2025-05-24 Reinforcement Learning for Stock Transactions Ziyi Zhou et.al. 2505.16099 null
2025-05-22 Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development Ming Shen et.al. 2505.16086 null
2025-05-21 A Distributed Local Energy Market Clearing Framework Using a Two-Loop ADMM Method Milad Kabirifar et.al. 2505.16070 null
2025-05-21 How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior Zidi Xiong et.al. 2505.16067 link
2025-05-21 Bayesian adaptive randomization in the I-SPY2.2 sequential multiple assignment randomized trial Peter Norwood et.al. 2505.16047 null
2025-05-21 Towards improved pest management of the soybean aphid Urvashi Verma et.al. 2505.16013 null
2025-05-21 Position: Agentic Systems Constitute a Key Component of Next-Generation Intelligent Image Processing Jinjin Gu et.al. 2505.16007 null
2025-05-21 MAPS: A Multilingual Benchmark for Global Agent Performance and Security Omer Hofman et.al. 2505.15935 null
2025-05-21 ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation Tony Montes et.al. 2505.15928 link
2025-05-21 Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition Dong Won Lee et.al. 2505.15922 null
2025-05-21 Text-to-Pipeline: Bridging Natural Language and Data Preparation Pipelines Yuhang Ge et.al. 2505.15874 null
2025-05-23 InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation Yunjia Xi et.al. 2505.15872 null
2025-05-21 AutoData: A Multi-Agent System for Open Web Data Collection Tianyi Ma et.al. 2505.15859 link
2025-05-21 Large Language Model-Powered Agent for C to Rust Code Translation HoHyun Sim et.al. 2505.15858 null
2025-05-21 Simulating Prosocial Behavior and Social Contagion in LLM Agents under Institutional Interventions Yujia Zhou et.al. 2505.15857 link
2025-05-22 GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents Yuqi Zhou et.al. 2505.15810 link
2025-05-21 The Agentic Economy David M. Rothschild et.al. 2505.15799 null
2025-05-22 HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving Zhiwen Chen et.al. 2505.15793 null
2025-05-21 Solving General-Utility Markov Decision Processes in the Single-Trial Regime with Online Planning Pedro P. Santos et.al. 2505.15782 null
2025-05-21 Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses Xiaoxue Yang et.al. 2505.15738 link
2025-05-21 DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning Gaurav Srivastava et.al. 2505.15734 null
2025-05-21 Quantum Dots as Functional Nanosystems for Enhanced Biomedical Applications Pronama Biswas et.al. 2505.15705 null
2025-05-21 HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning Xiaodong Mei et.al. 2505.15703 null
2025-05-21 Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives Milad Kazemi et.al. 2505.15693 null
2025-05-21 From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems Xiuchao Sui et.al. 2505.15685 link
2025-05-21 Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model Ke Hu et.al. 2505.15670 null
2025-05-21 Improved power methods for computing eigenvalues of dual quaternion Hermitian matrices Yongjun Chen et.al. 2505.15584 null
2025-05-21 The equilibrium price of bubble assets Charles Bertucci et.al. 2505.15578 null
2025-05-21 Temporal Spectrum Cartography in Low-Altitude Economy Networks: A Generative AI Framework with Multi-Agent Learning Changyuan Zhao et.al. 2505.15571 null
2025-05-21 Riemannian EXTRA: Communication-efficient decentralized optimization over compact submanifolds with data heterogeneity Jiayuan Wu et.al. 2505.15537 null
2025-05-21 Collaborative Problem-Solving in an Optimization Game Isidora Jeknic et.al. 2505.15490 link
2025-05-21 Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL Xintong Zhang et.al. 2505.15436 null
2025-05-21 X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic System Peng Wang et.al. 2505.15372 link
2025-05-21 Multiple Weaks Win Single Strong: Large Language Models Ensemble Weak Reinforcement Learning Agents into a Supreme One Yiwen Song et.al. 2505.15306 null
2025-05-22 AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving Kangan Qian et.al. 2505.15298 null
2025-05-21 Agent-based Liquidity Risk Modelling for Financial Markets Perukrishnen Vytelingum et.al. 2505.15296 null
2025-05-21 LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models Qianyue Hao et.al. 2505.15293 null
2025-05-21 Web-Shepherd: Advancing PRMs for Reinforcing Web Agents Hyungjoo Chae et.al. 2505.15277 link
2025-05-21 AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection Jiatao Li et.al. 2505.15261 null
2025-05-24 ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search Hyunseok Lee et.al. 2505.15259 null
2025-05-21 Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets Idriss Malek et.al. 2505.15251 null
2025-05-21 BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems Andy K. Zhang et.al. 2505.15216 null
2025-05-21 ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection Jeonghye Kim et.al. 2505.15182 null
2025-05-21 R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint Optimization Yuante Li et.al. 2505.15155 link
2025-05-21 lmgame-Bench: How Good are LLMs at Playing Games? Lanxiang Hu et.al. 2505.15146 link
2025-05-21 Multicrossmodal Automated Agent for Integrating Diverse Materials Science Data Adib Bazgir et.al. 2505.15132 null
2025-05-21 On Discounted Infinite-Time Mean Field Games Zeyu Yang et.al. 2505.15131 null
2025-05-21 An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents Bowen Jin et.al. 2505.15117 link
2025-05-21 A Risk Taxonomy for Evaluating AI-Powered Psychotherapy Agents Ian Steenstra et.al. 2505.15108 null
2025-05-21 StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization Ziliang Wang et.al. 2505.15107 null
2025-05-21 Nek Minit: Harnessing Pragmatic Metacognitive Prompting for Explainable Sarcasm Detection of Australian and Indian English Ishmanbir Singh et.al. 2505.15095 null
2025-05-21 Agentic Feature Augmentation: Unifying Selection and Generation with Teaming, Planning, and Memories Nanxu Gong et.al. 2505.15076 null
2025-05-21 ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges Cheng Qian et.al. 2505.15068 link
2025-05-21 UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking Sarfraz Ahmad et.al. 2505.15063 link
2025-05-21 AsynFusion: Towards Asynchronous Latent Consistency Models for Decoupled Whole-Body Audio-Driven Avatars Tianbao Zhang et.al. 2505.15058 null
2025-05-21 PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration Yingming Pu et.al. 2505.15047 link
2025-05-21 Toward Task Capable Active Matter: Learning to Avoid Clogging in Confined Collectives via Collisions Kehinde O. Aina et.al. 2505.15033 null
2025-05-21 COSMIC: Enabling Full-Stack Co-Design and Optimization of Distributed Machine Learning Systems Aditi Raju et.al. 2505.15020 null
2025-05-21 HAVA: Hybrid Approach to Value-Alignment through Reward Weighing for Reinforcement Learning Kryspin Varys et.al. 2505.15011 link
2025-05-21 Meta-Design Matters: A Self-Design Multi-Agent System Zixuan Ke et.al. 2505.14996 null
2025-05-20 JARVIS: A Multi-Agent Code Assistant for High-Quality EDA Script Generation Ghasem Pasandi et.al. 2505.14978 null
2025-05-20 MedBrowseComp: Benchmarking Medical Deep Research and Computer Use Shan Chen et.al. 2505.14963 null
2025-05-20 Characteristic scales and adaptation in higher-order contagions Giulio Burgio et.al. 2505.14930 link
2025-05-20 Think, Reflect, Create: Metacognitive Learning for Zero-Shot Robotic Planning with LLMs Wenjie Lin et.al. 2505.14899 null
2025-05-20 On the Day They Experience: Awakening Self-Sovereign Experiential AI Agents Botao Amber Hu et.al. 2505.14893 null
2025-05-20 Strategic Planning and Rationalizing on Trees Make LLMs Better Debaters Danqing Wang et.al. 2505.14886 null
2025-05-20 Unremarkable to Remarkable AI Agent: Exploring Boundaries of Agent Intervention for Adults With and Without Cognitive Impairment Mai Lee Chang et.al. 2505.14872 null
2025-05-20 MAATS: A Multi-Agent Automated Translation System Based on MQM Evaluation Xi Wang et.al. 2505.14848 link
2025-05-20 Beyond Symmetry in Repeated Games with Restarts Henry Fleischmann et.al. 2505.14847 null
2025-05-20 Cooperative Bargaining Games Without Utilities: Mediated Solutions from Direction Oracles Kushagra Gupta et.al. 2505.14817 link
2025-05-20 Integrating Field of View in Human-Aware Collaborative Planning Ya-Chuan Hsu et.al. 2505.14805 null
2025-05-20 $\texttt{LLINBO}$ : Trustworthy LLM-in-the-Loop Bayesian Optimization Chih-Yu Chang et.al. 2505.14756 link
2025-05-20 R&D-Agent: Automating Data-Driven AI Solution Building Through LLM-Powered Automated Research, Development, and Evolution Xu Yang et.al. 2505.14738 link
2025-05-20 The Evolution of Alpha in Finance Harnessing Human Insight and LLM Agents Mohammad Rubyet Islam et.al. 2505.14727 null
2025-05-20 NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search Sunhao Dai et.al. 2505.14680 null
2025-05-20 ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions Bufang Yang et.al. 2505.14668 null
2025-05-20 AI Agents in the Electricity Market Game with Cryptocurrency Transactions: A Post-Terminator Analysis Microsoft Copilot et.al. 2505.14612 null
2025-05-20 Agent Context Protocols Enhance Collective Inference Devansh Bhardwaj et.al. 2505.14569 null
2025-05-20 Multi-agent Reinforcement Learning vs. Fixed-Time Control for Traffic Signal Optimization: A Simulation Study Saahil Mahato et.al. 2505.14544 link
2025-05-20 A Logic of General Attention Using Edge-Conditioned Event Models (Extended Version) Gaia Belardinelli et.al. 2505.14539 null
2025-05-20 Energy-Efficient Deep Reinforcement Learning with Spiking Transformers Mohammad Irfan Uddin et.al. 2505.14533 null
2025-05-22 BACON: A fully explainable AI model with graded logic for decision making problems Haishi Bai et.al. 2505.14510 null
2025-05-20 Design and Evaluation of a Microservices Cloud Framework for Online Travel Platforms Biman Barua et.al. 2505.14508 null
2025-05-20 Security of Distributed Gradient Descent Against Byzantine Agents Sribalaji C. Anand et.al. 2505.14473 null
2025-05-20 Interpretable Reinforcement Learning for Load Balancing using Kolmogorov-Arnold Networks Kamal Singh et.al. 2505.14459 null
2025-05-21 Robustness Evaluation of Graph-based News Detection Using Network Structural Information Xianghua Zeng et.al. 2505.14453 null
2025-05-23 Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-Powered Mobile GUI Agents Pengzhou Cheng et.al. 2505.14418 null
2025-05-20 Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation Peter Baile Chen et.al. 2505.14398 null
2025-05-20 Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds Gaël Gendron et.al. 2505.14396 link
2025-05-20 Information-optimal measurement: From fixed sampling protocols to adaptive spectroscopy J. Schroeder et.al. 2505.14364 null
2025-05-20 DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning Ziwei Zheng et.al. 2505.14362 link
2025-05-20 PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech Dialogs Sho Inoue et.al. 2505.14356 link
2025-05-20 Empowering LLMs in Task-Oriented Dialogues: A Domain-Independent Multi-Agent Framework and Fine-Tuning Strategy Zihao Feng et.al. 2505.14299 null
2025-05-20 EVA: Red-Teaming GUI Agents via Evolving Indirect Prompt Injection Yijie Lu et.al. 2505.14289 null
2025-05-20 Visual Agentic Reinforcement Fine-Tuning Ziyu Liu et.al. 2505.14246 link
2025-05-20 Safety Devolution in AI Agents Cheng Yu et.al. 2505.14215 null
2025-05-20 Embedded Mean Field Reinforcement Learning for Perimeter-defense Game Li Wang et.al. 2505.14209 null
2025-05-20 DSMentor: Enhancing Data Science Agents with Curriculum Learning and Online Knowledge Accumulation He Wang et.al. 2505.14163 null
2025-05-20 MM-Agent: LLM as Agents for Real-world Mathematical Modeling Problem Fan Liu et.al. 2505.14148 link
2025-05-20 s3: You Don't Need That Much Data to Train a Search Agent via RL Pengcheng Jiang et.al. 2505.14146 link
2025-05-20 Building a Stable Planner: An Extended Finite State Machine Based Planning Module for Mobile GUI Agent Fanglin Mo et.al. 2505.14141 null
2025-05-20 MAS-KCL: Knowledge component graph structure learning with large language model-based agentic workflow Yuan-Hao Jiang et.al. 2505.14126 null
2025-05-20 A novel approach to process TRISO nuclear fuel using plasma-aided chemistry Tobias Chemnitz et.al. 2505.14108 null
2025-05-20 Beyond Chains: Bridging Large Language Models and Knowledge Bases in Complex Question Answering Yihua Zhu et.al. 2505.14099 null
2025-05-20 Personalized and Resilient Distributed Learning Through Opinion Dynamics Luca Ballotta et.al. 2505.14081 null
2025-05-22 BAR: A Backward Reasoning based Agent for Complex Minecraft Tasks Weihong Du et.al. 2505.14079 link
2025-05-22 Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning Wenlin Zhang et.al. 2505.14069 link
2025-05-20 Exploring Temporal Graphs with Frequent and Regular Edges Duncan Adamson et.al. 2505.14046 null
2025-05-20 Divide by Question, Conquer by Agent: SPLIT-RAG with Question-Driven Graph Partitioning Ruiyi Yang et.al. 2505.13994 null
2025-05-20 CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring Jiamin Su et.al. 2505.13965 null
2025-05-20 MultiDrive: A Co-Simulation Framework Bridging 2D and 3D Driving Simulation for AV Software Validation Marc Kaufeld et.al. 2505.13959 link
2025-05-20 Memory-Centric Embodied Question Answer Mingliang Zhai et.al. 2505.13948 null
2025-05-20 MLZero: A Multi-Agent System for End-to-end Machine Learning Automation Haoyang Fang et.al. 2505.13941 link
2025-05-20 DrugPilot: LLM-based Parameterized Reasoning Agent for Drug Discovery Kun Li et.al. 2505.13940 link
2025-05-21 CLEVER: A Curated Benchmark for Formally Verified Code Generation Amitayush Thakur et.al. 2505.13938 link
2025-05-20 Efficient Agent Training for Computer Use Yanheng He et.al. 2505.13909 link
2025-05-21 Mobile-Agent-V: A Video-Guided Approach for Effortless and Efficient Operational Knowledge Injection in Mobile Automation Junyang Wang et.al. 2505.13887 null
2025-05-22 PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks Guobin Shen et.al. 2505.13862 link
2025-05-20 A Challenge to Build Neuro-Symbolic Video Agents Sahil Shah et.al. 2505.13851 link
2025-05-20 Toward Real-World Cooperative and Competitive Soccer with Quadrupedal Robot Teams Zhi Su et.al. 2505.13834 null
2025-05-20 Online Resource Sharing: Better Robust Guarantees via Randomized Strategies David X. Lin et.al. 2505.13824 link
2025-05-20 Structured Agent Distillation for Large Language Model Jun Liu et.al. 2505.13820 null
2025-05-20 RAG/LLM Augmented Switching Driven Polymorphic Metaheuristic Framework Faramarz Safi Esfahani et.al. 2505.13808 null
2025-05-19 Model Cards for AI Teammates: Comparing Human-AI Team Familiarization Methods for High-Stakes Environments Ryan Bowers et.al. 2505.13773 link
2025-05-19 Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis Ruiquan Huang et.al. 2505.13768 null
2025-05-21 Simulation Agent: A Framework for Integrating Simulation and Large Language Models for Enhanced Decision-Making Jacob Kleiman et.al. 2505.13761 null
2025-05-19 Benchmarking MOEAs for solving continuous multi-objective RL problems Carlos Hernández et.al. 2505.13726 link
2025-05-19 Revenue-Optimal Efficient Mechanism Design with General Type Spaces Siddharth Prasad et.al. 2505.13687 null
2025-05-19 MAFA: A multi-agent framework for annotation Mahmood Hegazy et.al. 2505.13668 null
2025-05-19 Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents Karina Zainullina et.al. 2505.13652 null
2025-05-19 Non-Obvious Manipulability in Additively Separable and Fractional Hedonic Games Diodato Ferraioli et.al. 2505.13642 null
2025-05-19 Incentivizing Truthful Language Models via Peer Elicitation Games Baiting Chen et.al. 2505.13636 link
2025-05-19 Q ${}^2$ Forge: Minting Competency Questions and SPARQL Queries for Question-Answering Over Knowledge Graphs Yousouf Taghzouti et.al. 2505.13572 null
2025-05-19 Learning Dynamics of RNNs in Closed-Loop Environments Yoav Ger et.al. 2505.13567 link
2025-05-19 Counter-Inferential Behavior in Natural and Artificial Cognitive Systems Serge Dolgikh et.al. 2505.13551 null
2025-05-19 Prompt Stability Matters: Evaluating and Optimizing Auto-Generated Prompt in General-Purpose Systems Ke Chen et.al. 2505.13546 null
2025-05-19 Origin-Destination Pattern Effects on Large-Scale Mixed Traffic Control via Multi-Agent Reinforcement Learning Muyang Fan et.al. 2505.13543 link
2025-05-18 LLM-Based User Simulation for Low-Knowledge Shilling Attacks on Recommender Systems Shengkang Gu et.al. 2505.13528 null
2025-05-18 ACPs: Agent Collaboration Protocols for the Internet of Agents Jun Liu et.al. 2505.13523 null
2025-05-17 HALO: Hierarchical Autonomous Logic-Oriented Orchestration for Multi-Agent LLM Systems Zhipeng Hou et.al. 2505.13516 link
2025-05-16 Can AI Freelancers Compete? Benchmarking Earnings, Reliability, and Task Success at Scale David Noever et.al. 2505.13511 null
2025-05-16 An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents Ayesha Amjad et.al. 2505.13504 null
2025-05-19 G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning Liang Chen et.al. 2505.13426 link
2025-05-20 A Dataless Reinforcement Learning Approach to Rounding Hyperplane Optimization for Max-Cut Gabriel Malikal et.al. 2505.13405 null
2025-05-19 Robin: A multi-agent system for automating scientific discovery Ali Essam Ghareeb et.al. 2505.13400 null
2025-05-19 Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges Hongru Wang et.al. 2505.13328 null
2025-05-19 Synthesis of Communication Policies for Multi-Agent Systems Robust to Communication Restrictions Saleh Soudijani et.al. 2505.13311 null
2025-05-19 TimeSeriesGym: A Scalable Benchmark for (Time Series) Machine Learning Engineering Agents Yifu Cai et.al. 2505.13291 link
2025-05-19 Hybrid Voting-Based Task Assignment in Modular Construction Scenarios Daniel Weiner et.al. 2505.13278 null
2025-05-19 From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery Tianshi Zheng et.al. 2505.13259 link
2025-05-19 Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability Jingyi Ren et.al. 2505.13258 link
2025-05-19 Composing Dextrous Grasping and In-hand Manipulation via Scoring with a Reinforcement Learning Critic Lennart Röstel et.al. 2505.13253 null
2025-05-19 Agentic Publications: An LLM-Driven Framework for Interactive Scientific Publishing, Supplementing Traditional Papers with AI-Powered Knowledge Systems Roberto Pugliese et.al. 2505.13246 null
2025-05-19 Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis Tianbao Xie et.al. 2505.13227 null
2025-05-19 Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities Lili Zhang et.al. 2505.13195 null
2025-05-19 When a Reinforcement Learning Agent Encounters Unknown Unknowns Juntian Zhu et.al. 2505.13188 null
2025-05-19 Information Science Principles of Machine Learning: A Causal Chain Meta-Framework Based on Formalized Information Mapping Jianfeng Xu et.al. 2505.13182 null
2025-05-19 Fixing 7,400 Bugs for 1$: Cheap Crash-Site Program Repair Han Zheng et.al. 2505.13103 null
2025-05-19 The Hidden Dangers of Browsing AI Agents Mykyta Mudryi et.al. 2505.13076 null
2025-05-19 CAIM: Development and Evaluation of a Cognitive AI Memory Framework for Long-Term Interaction with Intelligent Agents Rebecca Westhäußer et.al. 2505.13044 null
2025-05-19 Adversarial Reasoning for Repair Based on Inferred Program Intent He Ye et.al. 2505.13008 null
2025-05-20 From Assistants to Adversaries: Exploring the Security Risks of Mobile LLM Agents Liangxuan Wu et.al. 2505.12981 null
2025-05-19 Improved Approximation Ratio for Strategyproof Facility Location on a Cycle Krzysztof Rogowski et.al. 2505.12943 null
2025-05-20 Leveraging LLM Inconsistency to Boost Pass@k Performance Uri Dalal et.al. 2505.12938 null
2025-05-19 The Traitors: Deception and Trust in Multi-Agent Language Model Simulations Pedro M. P. Curvo et.al. 2505.12923 link
2025-05-19 PyFCG: Fluid Construction Grammar in Python Paul Van Eecke et.al. 2505.12920 null
2025-05-19 Power Allocation for Delay Optimization in Device-to-Device Networks: A Graph Reinforcement Learning Approach Hao Fang et.al. 2505.12902 null
2025-05-19 From Grunts to Grammar: Emergent Language from Cooperative Foraging Maytus Piriyajitakonkij et.al. 2505.12872 null
2025-05-19 GEM: Gaussian Embedding Modeling for Out-of-Distribution Detection in GUI Agents Zheng Wu et.al. 2505.12842 link
2025-05-19 Reasoning BO: Enhancing Bayesian Optimization with Long-Context Reasoning Power of LLMs Zhuo Yang et.al. 2505.12833 null
2025-05-19 Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning Wei-Chen Liao et.al. 2505.12811 null
2025-05-19 Mixture Policy based Multi-Hop Reasoning over N-tuple Temporal Knowledge Graphs Zhongni Hou et.al. 2505.12788 null
2025-05-19 Forewarned is Forearmed: A Survey on Large Language Model-based Agents in Autonomous Cyberattacks Minrui Xu et.al. 2505.12786 null
2025-05-19 Your Offline Policy is Not Trustworthy: Bilevel Reinforcement Learning for Sequential Portfolio Optimization Haochen Yuan et.al. 2505.12759 null
2025-05-19 Confidence-Regulated Generative Diffusion Models for Reliable AI Agent Migration in Vehicular Metaverses Yingkai Kang et.al. 2505.12710 null
2025-05-19 PLAICraft: Large-Scale Time-Aligned Vision-Speech-Action Dataset for Embodied AI Yingchen He et.al. 2505.12707 null
2025-05-19 AutoMat: Enabling Automated Crystal Structure Reconstruction from Microscopy via Agentic Tool Use Yaotian Yang et.al. 2505.12650 link
2025-05-19 Two out of Three (ToT): using self-consistency to make robust predictions Jung Hoon Lee et.al. 2505.12642 null
2025-05-19 Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents Yunseok Jang et.al. 2505.12632 null
2025-05-19 Dual-Agent Reinforcement Learning for Automated Feature Generation Wanfu Gao et.al. 2505.12628 link
2025-05-19 Lightweight and Effective Preference Construction in PIBT for Large-Scale Multi-Agent Pathfinding Keisuke Okumura et.al. 2505.12623 null
2025-05-19 HIL: Hybrid Imitation Learning of Diverse Parkour Skills from Videos Jiashun Wang et.al. 2505.12619 null
2025-05-19 Action-Dependent Optimality-Preserving Reward Shaping Grant C. Forbes et.al. 2505.12611 null
2025-05-19 The Hamiltonian of Poly-matrix Zero-sum Games Toshihiro Ota et.al. 2505.12609 link
2025-05-19 Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis Yifan Hu et.al. 2505.12597 link
2025-05-19 AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection Tiankai Yang et.al. 2505.12594 link
2025-05-18 A Survey of Attacks on Large Language Models Wenrui Xu et.al. 2505.12567 null
2025-05-18 ESC-Judge: A Framework for Comparing Emotional Support Conversational Agents Navid Madani et.al. 2505.12531 null
2025-05-18 InnateCoder: Learning Programmatic Options with Foundation Models Rubens O. Moraes et.al. 2505.12508 link
2025-05-18 Optimal Task and Motion Planning for Autonomous Systems Using Petri Nets Zhou He et.al. 2505.12503 null
2025-05-18 ALAS: A Stateful Multi-LLM Agent Framework for Disruption-Aware Planning Edward Y. Chang et.al. 2505.12501 null
2025-05-18 UIShift: Enhancing VLM-based GUI Agents through Self-supervised Reinforcement Learning Longxi Gao et.al. 2505.12493 null
2025-05-18 Proposal for Improving Google A2A Protocol: Safeguarding Sensitive Data in Multi-Agent Systems Yedidel Louck et.al. 2505.12490 null
2025-05-18 Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems Haochun Wang et.al. 2505.12467 null
2025-05-18 Resolving Latency and Inventory Risk in Market Making with Reinforcement Learning Junzhe Jiang et.al. 2505.12465 null
2025-05-18 BadNAVer: Exploring Jailbreak Attacks On Vision-and-Language Navigation Wenqi Lyu et.al. 2505.12443 null
2025-05-20 IP Leakage Attacks Targeting LLM-Based Multi-Agent Systems Liwen Wang et.al. 2505.12442 null
2025-05-18 Learning to Play Like Humans: A Framework for LLM Adaptation in Interactive Fiction Games Jinming Zhang et.al. 2505.12439 null
2025-05-20 Steady-State Strategy Synthesis for Swarms of Autonomous Agents Martin Jonáš et.al. 2505.12406 null
2025-05-18 Automated Profile Inference with Language Model Agents Yuntao Du et.al. 2505.12402 link
2025-05-18 MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks Yinghao Zhu et.al. 2505.12371 link
2025-05-18 Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning Xinbin Yuan et.al. 2505.12370 link
2025-05-18 A universal policy wrapper with guarantees Anton Bolychev et.al. 2505.12354 null
2025-05-18 Enhancing User-Oriented Proactivity in Open-Domain Dialogues with Critic Guidance Yufeng Wang et.al. 2505.12334 null
2025-05-18 Robust Planning for Autonomous Driving via Mixed Adversarial Diffusion Predictions Albert Zhao et.al. 2505.12327 null
2025-05-18 BeliefNest: A Joint Action Simulator for Embodied Agents with Theory of Mind Rikunari Sagara et.al. 2505.12321 link
2025-05-18 Scene-Adaptive Motion Planning with Explicit Mixture of Experts and Interaction-Oriented Optimization Hongbiao Zhu et.al. 2505.12311 null
2025-05-18 Enhance Mobile Agents Thinking Process Via Iterative Preference Learning Kun Huang et.al. 2505.12299 null
2025-05-18 LAMeTA: Intent-Aware Agentic Network Optimization via a Large AI Model-Empowered Two-Stage Approach Yinqiu Liu et.al. 2505.12247 null
2025-05-18 Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents Shuo Han et.al. 2505.12204 null
2025-05-20 LLM-DSE: Searching Accelerator Parameters with LLM Agents Hanyu Wang et.al. 2505.12188 link
2025-05-17 LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs Omar Choukrani et.al. 2505.12135 link
2025-05-17 Towards Sustainability in 6G Network Slicing with Energy-Saving and Optimization Methods Rodrigo Moreira et.al. 2505.12132 null
2025-05-17 Scalable Time-Tagged Data Acquisition for Entanglement Distribution in Quantum Networks Abderrahim Amlou et.al. 2505.12102 null
2025-05-17 Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents Tiannuo Yang et.al. 2505.12065 link
2025-05-17 AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research Renqi Chen et.al. 2505.12039 null
2025-05-17 Incentivize Contribution and Learn Parameters Too: Federated Learning with Strategic Data Owners Drashthi Doshi et.al. 2505.12010 null
2025-05-17 SOCIA: An End-to-End Agentic Framework for Automated Cyber-Physical-Social Simulator Generation Yuncheng Hua et.al. 2505.12006 null
2025-05-17 Interactional Fairness in LLM Multi-Agent Systems: An Evaluation Framework Ruta Binkyte et.al. 2505.12001 null
2025-05-17 Task Scheduling in Space-Air-Ground Uniformly Integrated Networks with Ripple Effects Chuan Huang et.al. 2505.11974 null
2025-05-17 MARVEL: Multi-Agent RTL Vulnerability Extraction using Large Language Models Luca Collini et.al. 2505.11963 null
2025-05-17 CrafText Benchmark: Advancing Instruction Following in Complex Multimodal Open-Ended World Zoya Volovikova et.al. 2505.11962 null
2025-05-17 LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners Junhao Zheng et.al. 2505.11942 link
2025-05-17 Modèles de Substitution pour les Modèles à base d'Agents : Enjeux, Méthodes et Applications Paul Saves et.al. 2505.11912 link
2025-05-17 Benchmarking LLMs in an Embodied Environment for Blue Team Threat Hunting Xiaoqun Liu et.al. 2505.11901 null
2025-05-17 Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents Weikai Xu et.al. 2505.11891 null
2025-05-17 AR Secretary Agent: Real-time Memory Augmentation via LLM-powered Augmented Reality Glasses Raphaël A. El Haddad et.al. 2505.11888 null
2025-05-20 Aux-Think: Exploring Reasoning Strategies for Data-Efficient Vision-Language Navigation Shuo Wang et.al. 2505.11886 null
2025-05-17 Position Paper: Bounded Alignment: What (Not) To Expect From AGI Agents Ali A. Minai et.al. 2505.11866 null
2025-05-17 Learning Pareto-Optimal Rewards from Noisy Preferences: A Framework for Multi-Objective Inverse Reinforcement Learning Kalyan Cherukuri et.al. 2505.11864 null
2025-05-17 RVTBench: A Benchmark for Visual Reasoning Tasks Yiqing Shen et.al. 2505.11838 link
2025-05-17 Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment Siliang Zeng et.al. 2505.11821 null
2025-05-17 BELLE: A Bi-Level Multi-Agent Reasoning Framework for Multi-Hop Question Answering Taolin Zhang et.al. 2505.11811 null
2025-05-17 Retrospex: Language Agent Meets Offline Reinforcement Learning Critic Yufei Xiang et.al. 2505.11807 link
2025-05-17 Robustness of Incentive Mechanisms Against System Misspecification in Congestion Games Chih-Yuan Chiu et.al. 2505.11791 null
2025-05-17 OMAC: A Broad Optimization Framework for LLM-Based Multi-Agent Collaboration Shijun Li et.al. 2505.11765 link
2025-05-16 REMOR: Automated Peer Review Generation with LLM Reasoning and Multi-Objective Reinforcement Learning Pawin Taechoyotin et.al. 2505.11718 null
2025-05-16 EnvInjection: Environmental Prompt Injection Attack to Multi-modal Web Agents Xilong Wang et.al. 2505.11717 null
2025-05-16 Unveiling the Black Box: A Multi-Layer Framework for Explaining Reinforcement Learning-Based Cyber Agents Diksha Goel et.al. 2505.11708 null
2025-05-16 Forensics of Error Rates of Quantum Hardware Rupshali Roy et.al. 2505.11706 null
2025-05-16 Ambiguity Resolution in Text-to-Structured Data Mapping Zhibo Hu et.al. 2505.11679 null
2025-05-16 Terminators: Terms of Service Parsing and Auditing Agents Maruf Ahmed Mridul et.al. 2505.11672 null
2025-05-16 Learning from Less: Guiding Deep Reinforcement Learning with Differentiable Symbolic Planning Zihan Ye et.al. 2505.11661 null
2025-05-16 PeerGuard: Defending Multi-Agent Systems Against Backdoor Attacks Through Mutual Reasoning Falong Fan et.al. 2505.11642 link
2025-05-20 Talk to Your Slides: Language-Driven Agents for Efficient Slide Editing Kyudan Jung et.al. 2505.11604 link
2025-05-16 Continuous Optimization for Feature Selection with Permutation-Invariant Embedding and Policy-Guided Search Rui Liu et.al. 2505.11601 null
2025-05-16 LLM Agents Are Hypersensitive to Nudges Manuel Cherep et.al. 2505.11584 null
2025-05-16 Toward Adaptive Categories: Dimensional Governance for Agentic AI Zeynep Engin et.al. 2505.11579 null
2025-05-15 Assessing Collective Reasoning in Multi-Agent LLMs via Hidden Profile Tasks Yuxuan Li et.al. 2505.11556 null
2025-05-14 TARGET: Benchmarking Table Retrieval for Generative Tasks Xingyu Ji et.al. 2505.11545 null
2025-05-16 Automatic Reward Shaping from Confounded Offline Data Mingxuan Li et.al. 2505.11478 null
2025-05-16 Signal attenuation enables scalable decentralized multi-agent reinforcement learning over networks Wesley A Suttle et.al. 2505.11461 null
2025-05-16 Robust Equilibria in Shared Resource Allocation via Strengthening Border's Theorem David X. Lin et.al. 2505.11431 null
2025-05-16 Can AI automatically analyze public opinion? A LLM agents-based agentic pipeline for timely public opinion analysis Jing Liu et.al. 2505.11401 null
2025-05-16 Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation Zihan Wang et.al. 2505.11383 link
2025-05-16 GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents Lingxiao Diao et.al. 2505.11368 null
2025-05-16 Long-Term Average Impulse Control with Mean Field Interactions K. L. Helmes et.al. 2505.11345 null
2025-05-16 Explaining Strategic Decisions in Multi-Agent Reinforcement Learning for Aerial Combat Tactics Ardian Selmonaj et.al. 2505.11311 null
2025-05-16 Diffusion Learning with Partial Agent Participation and Local Updates Elsa Rizk et.al. 2505.11307 null
2025-05-16 Meta-World+: An Improved, Standardized, RL Benchmark Reginald McLean et.al. 2505.11289 link
2025-05-16 TAIJI: MCP-based Multi-Modal Data Analytics on Data Lakes Chao Zhang et.al. 2505.11270 null
2025-05-19 Massive-STEPS: Massive Semantic Trajectories for Understanding POI Check-ins -- Dataset and Benchmarks Wilson Wongso et.al. 2505.11239 link
2025-05-16 Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation Donghoon Lee et.al. 2505.11221 link
2025-05-16 From Intent Discovery to Recognition with Topic Modeling and Synthetic Data Aaron Rodrigues et.al. 2505.11176 null
2025-05-19 Real-Time Verification of Embodied Reasoning for Generative Skill Acquisition Bo Yue et.al. 2505.11175 null
2025-05-16 MPMA: Preference Manipulation Attack Against Model Context Protocol Zihan Wang et.al. 2505.11154 null
2025-05-16 Bi-directional Recurrence Improves Transformer in Partially Observable Markov Decision Processes Ashok Arora et.al. 2505.11153 null
2025-05-16 Reinforcement Learning for AMR Charging Decisions: The Impact of Reward and Action Space Design Janik Bischoff et.al. 2505.11136 null
2025-05-16 Scalability of Reinforcement Learning Methods for Dispatching in Semiconductor Frontend Fabs: A Comparison of Open-Source Models with Real Industry Datasets Patrick Stöckermann et.al. 2505.11135 null
2025-05-16 Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity Chan-Jan Hsu et.al. 2505.11107 null
2025-05-16 Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors Lang Feng et.al. 2505.11100 null
2025-05-16 LLM-Enhanced Symbolic Control for Safety-Critical Applications Amir

About

Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages