[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]
Usage instructions: here
Table of Contents
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2025-07-23 | DataWink: Reusing and Adapting SVG-based Visualization Examples with Large Multimodal Models | Liwenhan Xie et.al. | 2507.17734 | null |
| 2025-07-23 | BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems | Malsha Ashani Mahawatta Dona et.al. | 2507.17722 | null |
| 2025-07-23 | Symbiotic Agents: A Novel Paradigm for Trustworthy AGI-driven Networks | Ilias Chatzistefanidis et.al. | 2507.17695 | null |
| 2025-07-23 | Simulating multiple human perspectives in socio-ecological systems using large language models | Yongchao Zeng et.al. | 2507.17680 | null |
| 2025-07-23 | LTLZinc: a Benchmarking Framework for Continual Learning and Neuro-Symbolic Temporal Reasoning | Luca Salvatore Lorello et.al. | 2507.17482 | null |
| 2025-07-23 | ERMV: Editing 4D Robotic Multi-view images to enhance embodied agents | Chang Nie et.al. | 2507.17462 | null |
| 2025-07-23 | IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird's-Eye View Perception | Haichuan Li et.al. | 2507.17445 | null |
| 2025-07-23 | Fair Compromises in Participatory Budgeting: a Multi-Agent Deep Reinforcement Learning Approach | Hugh Adams et.al. | 2507.17433 | null |
| 2025-07-23 | CAPRI-CT: Causal Analysis and Predictive Reasoning for Image Quality Optimization in Computed Tomography | Sneha George Gnanakalavathy et.al. | 2507.17420 | null |
| 2025-07-23 | Residual Prophet Inequalities | Jose Correa et.al. | 2507.17391 | null |
| 2025-07-23 | DynaSearcher: Dynamic Knowledge Graph Augmented Search Agent via Multi-Reward Reinforcement Learning | Chuzhan Hao et.al. | 2507.17365 | null |
| 2025-07-23 | DeMo++: Motion Decoupling for Autonomous Driving | Bozhou Zhang et.al. | 2507.17342 | null |
| 2025-07-23 | HuNavSim 2.0 | Miguel Escudero-Jiménez et.al. | 2507.17317 | null |
| 2025-07-23 | EarthLink: Interpreting Climate Signals with Self-Evolving AI Agents | Zijie Guo et.al. | 2507.17311 | null |
| 2025-07-23 | Compliance Brain Assistant: Conversational Agentic AI for Assisting Compliance Tasks in Enterprise Environments | Shitong Zhu et.al. | 2507.17289 | null |
| 2025-07-23 | Leveraging Knowledge Graphs and LLM Reasoning to Identify Operational Bottlenecks for Warehouse Planning Assistance | Rishi Parekh et.al. | 2507.17273 | null |
| 2025-07-23 | Agent Identity Evals: Measuring Agentic Identity | Elija Perrier et.al. | 2507.17257 | null |
| 2025-07-23 | LLM Meets the Sky: Heuristic Multi-Agent Reinforcement Learning for Secure Heterogeneous UAV Networks | Lijie Zheng et.al. | 2507.17188 | null |
| 2025-07-23 | Optimal Calibrated Signaling in Digital Auctions | Zhicheng Du et.al. | 2507.17187 | null |
| 2025-07-23 | FinGAIA: An End-to-End Benchmark for Evaluating AI Agents in Finance | Lingfeng Zeng et.al. | 2507.17186 | null |
| 2025-07-23 | Regret Minimization in Population Network Games: Vanishing Heterogeneity and Convergence to Equilibria | Die Hu et.al. | 2507.17183 | null |
| 2025-07-23 | JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction | Fangze Lin et.al. | 2507.17152 | null |
| 2025-07-23 | CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards | Cheng Liu et.al. | 2507.17147 | null |
| 2025-07-23 | Resilient Multi-Agent Negotiation for Medical Supply Chains:Integrating LLMs and Blockchain for Transparent Coordination | Mariam ALMutairi et.al. | 2507.17134 | null |
| 2025-07-23 | Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance | Yufei He et.al. | 2507.17131 | null |
| 2025-07-23 | Stochastically Structured Reservoir Computers for Financial and Economic System Identification | Lendy Banegas et.al. | 2507.17115 | null |
| 2025-07-22 | Deformable Cluster Manipulation via Whole-Arm Policy Learning | Jayadeep Jacob et.al. | 2507.17085 | null |
| 2025-07-22 | VL-CLIP: Enhancing Multimodal Recommendations via Visual Grounding and LLM-Augmented CLIP Embeddings | Ramin Giahi et.al. | 2507.17080 | null |
| 2025-07-22 | Approximation Techniques for the Reconstruction of the Probability Measure and the Coupling Parameters in a Curie-Weiss Model for Large Populations | Miguel Ballesteros et.al. | 2507.17073 | null |
| 2025-07-22 | Risk In Context: Benchmarking Privacy Leakage of Foundation Models in Synthetic Tabular Data Generation | Jessup Byun et.al. | 2507.17066 | null |
| 2025-07-22 | Parallelism Meets Adaptiveness: Scalable Documents Understanding in Multi-Agent LLM Systems | Chengxuan Xia et.al. | 2507.17061 | null |
| 2025-07-22 | Shared Control of Holonomic Wheelchairs through Reinforcement Learning | Jannis Bähler et.al. | 2507.17055 | null |
| 2025-07-22 | New Mechanisms in Flex Distribution for Bounded Suboptimal Multi-Agent Path Finding | Shao-Hung Chan et.al. | 2507.17054 | null |
| 2025-07-22 | Evaluating Uncertainty and Quality of Visual Language Action-enabled Robots | Pablo Valle et.al. | 2507.17049 | null |
| 2025-07-22 | Modeling for the Growth of Unorganized Retailing in the Presence of Organized and E-Retailing in Indian Pharmaceutical Industry | Koushik Mondal et.al. | 2507.17023 | null |
| 2025-07-22 | Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge? | Arduin Findeis et.al. | 2507.17015 | null |
| 2025-07-22 | Quantitative convergence for displacement monotone Mean Field Games of control | Joe Jackson et.al. | 2507.17014 | null |
| 2025-07-22 | Towards Autonomous Sustainability Assessment via Multimodal AI Agents | Zhihan Zhang et.al. | 2507.17012 | null |
| 2025-07-22 | On-chip stencil lithography for superconducting qubits | Roudy Hanna et.al. | 2507.17005 | null |
| 2025-07-22 | Hierarchical Reinforcement Learning Framework for Adaptive Walking Control Using General Value Functions of Lower-Limb Sensor Signals | Sonny T. Jones et.al. | 2507.16983 | null |
| 2025-07-22 | Text-to-SPARQL Goes Beyond English: Multilingual Question Answering Over Knowledge Graphs through Human-Inspired Reasoning | Aleksandr Perevalov et.al. | 2507.16971 | null |
| 2025-07-22 | Fundamental limits of distributed covariance matrix estimation via a conditional strong data processing inequality | Mohammad Reza Rahmani et.al. | 2507.16953 | null |
| 2025-07-22 | Multi-agent Reinforcement Learning for Robotized Coral Reef Sample Collection | Daniel Correa et.al. | 2507.16941 | null |
| 2025-07-22 | AURA: A Multi-Modal Medical Agent for Understanding, Reasoning & Annotation | Nima Fathi et.al. | 2507.16940 | null |
| 2025-07-22 | Budget Allocation Policies for Real-Time Multi-Agent Path Finding | Raz Beck et.al. | 2507.16874 | null |
| 2025-07-21 | Reinforcement Learning in hyperbolic space for multi-step reasoning | Tao Xu et.al. | 2507.16864 | null |
| 2025-07-21 | MobileUse: A GUI Agent with Hierarchical Reflection for Autonomous Mobile Operation | Ning Li et.al. | 2507.16853 | null |
| 2025-07-21 | Dynamic Simulation Framework for Disinformation Dissemination and Correction With Social Bots | Boyu Qiao et.al. | 2507.16848 | null |
| 2025-07-22 | ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning | Chi-Pin Huang et.al. | 2507.16815 | null |
| 2025-07-22 | LingBench++: A Linguistically-Informed Benchmark and Reasoning Framework for Multi-Step and Cross-Cultural Inference with LLMs | Da-Chen Lian et.al. | 2507.16809 | null |
| 2025-07-23 | Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning | Yanjun Zheng et.al. | 2507.16802 | null |
| 2025-07-23 | Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent | Xiaoyu Zhan et.al. | 2507.16799 | null |
| 2025-07-22 | Uncertainty-Aware Knowledge Transformers for Peer-to-Peer Energy Trading with Multi-Agent Reinforcement Learning | Mian Ibad Ali Shah et.al. | 2507.16796 | null |
| 2025-07-22 | Generalized non-reciprocal phase transitions in multipopulation systems | Cheyne Weis et.al. | 2507.16763 | null |
| 2025-07-22 | AI-enhanced conversational agents for personalized asthma support Factors for engagement, value and efficacy | Laura Moradbakhti et.al. | 2507.16735 | null |
| 2025-07-23 | Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with constraints | Zhenyun Yin et.al. | 2507.16727 | null |
| 2025-07-22 | RAVine: Reality-Aligned Evaluation for Agentic Search | Yilong Xu et.al. | 2507.16725 | null |
| 2025-07-22 | Screen2AX: Vision-Based Approach for Automatic macOS Accessibility Generation | Viktor Muryn et.al. | 2507.16704 | null |
| 2025-07-22 | FOGNITE: Federated Learning-Enhanced Fog-Cloud Architecture | Somayeh Sobati-M et.al. | 2507.16668 | null |
| 2025-07-22 | Hybrid Reward-Driven Reinforcement Learning for Efficient Quantum Circuit Synthesis | Sara Giordano et.al. | 2507.16641 | null |
| 2025-07-22 | Novel Multi-Agent Action Masked Deep Reinforcement Learning for General Industrial Assembly Lines Balancing Problems | Ali Mohamed Ali et.al. | 2507.16635 | null |
| 2025-07-22 | Augmenting Von Neumann's Architecture for an Intelligent Future | Rajpreet Singh et.al. | 2507.16628 | null |
| 2025-07-22 | CTSL: Codebook-based Temporal-Spatial Learning for Accurate Non-Contrast Cardiac Risk Prediction Using Cine MRIs | Haoyang Su et.al. | 2507.16612 | null |
| 2025-07-22 | Smooth Games of Configuration in the Linear-Quadratic Setting | Jesse Milzman et.al. | 2507.16611 | null |
| 2025-07-22 | Pyramid Hierarchical Masked Diffusion Model for Imaging Synthesis | Xiaojiao Xiao et.al. | 2507.16579 | null |
| 2025-07-22 | Evaluating Social Acceptance of eXtended Reality (XR) Agent Technology: A User Study (Extended Version) | Megha Quamara et.al. | 2507.16562 | null |
| 2025-07-22 | A Distributed Actor-Critic Algorithm for Fixed-Time Consensus in Nonlinear Multi-Agent Systems | Aria Delshad et.al. | 2507.16520 | null |
| 2025-07-22 | Analogy making as amortised model construction | David G. Nagy et.al. | 2507.16511 | null |
| 2025-07-22 | Agentic RAG with Knowledge Graphs for Complex Multi-Hop Reasoning in Real-World Applications | Jean Lelong et.al. | 2507.16507 | null |
| 2025-07-22 | Arbitrage Tactics in the Local Markets via Hierarchical Multi-agent Reinforcement Learning | Haoyang Zhang et.al. | 2507.16479 | null |
| 2025-07-22 | Adaptive Bayesian Single-Shot Quantum Sensing | Ivana Nikoloska et.al. | 2507.16477 | null |
| 2025-07-22 | Towards Enforcing Company Policy Adherence in Agentic Workflows | Naama Zwerdling et.al. | 2507.16459 | null |
| 2025-07-22 | Distributed Oscillatory Guidance for Formation Flight of Fixed-Wing Drones | Yang Xu et.al. | 2507.16458 | null |
| 2025-07-23 | RIS-aided Latent Space Alignment for Semantic Channel Equalization | Tomás Hüttebräucker et.al. | 2507.16450 | null |
| 2025-07-22 | From model-based learning to model-free behaviour with Meta-Interpretive Learning | Stassa Patsantzis et.al. | 2507.16434 | null |
| 2025-07-22 | LLM-Driven Collaborative Model for Untangling Commits via Explicit and Implicit Dependency Reasoning | Bo Hou et.al. | 2507.16395 | null |
| 2025-07-22 | Application of LLM Guided Reinforcement Learning in Formation Control with Collision Avoidance | Chenhao Yao et.al. | 2507.16382 | null |
| 2025-07-22 | COMPASS: Cooperative Multi-Agent Persistent Monitoring using Spatio-Temporal Attention Network | Xingjian Zhang et.al. | 2507.16306 | null |
| 2025-07-22 | ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry | Tianze Xu et.al. | 2507.16280 | null |
| 2025-07-22 | Multi-Agent Reinforcement Learning for Sample-Efficient Deep Neural Network Mapping | Srivatsan Krishnan et.al. | 2507.16249 | null |
| 2025-07-22 | FinResearchBench: A Logic Tree based Agent-as-a-Judge Evaluation Framework for Financial Research Agents | Run Sun et.al. | 2507.16248 | null |
| 2025-07-22 | Voice-based AI Agents: Filling the Economic Gaps in Digital Health Delivery | Bo Wen et.al. | 2507.16229 | null |
| 2025-07-22 | Unbeatable imitation of a friend | Masahiko Ueda et.al. | 2507.16221 | null |
| 2025-07-22 | Best-of-Both-Worlds Guarantees with Fairer Endings | Telikepalli Kavitha et.al. | 2507.16209 | null |
| 2025-07-22 | CHIMERA: Compressed Hybrid Intelligence for Twin-Model Enhanced Multi-Agent Deep Reinforcement Learning for Multi-Functional RIS-Assisted Space-Air-Ground Integrated Networks | Li-Hsiang Shen et.al. | 2507.16204 | null |
| 2025-07-22 | SVAgent: AI Agent for Hardware Security Verification Assertion | Rui Guo et.al. | 2507.16203 | null |
| 2025-07-22 | RealBench: Benchmarking Verilog Generation Models with Real-World IP Designs | Pengwei Jin et.al. | 2507.16200 | null |
| 2025-07-22 | Do Large Language Models Have a Planning Theory of Mind? Evidence from MindGames: a Multi-Step Persuasion Task | Jared Moore et.al. | 2507.16196 | null |
| 2025-07-22 | Emergent Cognitive Convergence via Implementation: A Structured Loop Reflecting Four Theories of Mind (A Position Paper) | Myung Ho Kim et.al. | 2507.16184 | null |
| 2025-07-22 | Benchmarking LLM Privacy Recognition for Social Robot Decision Making | Dakota Sullivan et.al. | 2507.16124 | null |
| 2025-07-21 | Expert-Guided LLM Reasoning for Battery Discovery: From AI-Driven Hypothesis to Synthesis and Characterization | Shengchao Liu et.al. | 2507.16110 | null |
| 2025-07-21 | Deep Researcher with Test-Time Diffusion | Rujun Han et.al. | 2507.16075 | null |
| 2025-07-21 | Asymptotic consensus with transmission and reaction delay: an overview | Jan Haskovec et.al. | 2507.16072 | null |
| 2025-07-21 | Is memory all you need? Data-driven Mori-Zwanzig modeling of Lagrangian particle dynamics in turbulent flows | Xander de Wit et.al. | 2507.16058 | null |
| 2025-07-23 | Making REST APIs Agent-Ready: From OpenAPI to Model Context Protocol Servers for Tool-Augmented LLMs | Meriem Mastouri et.al. | 2507.16044 | null |
| 2025-07-21 | A Pilot Study on LLM-Based Agentic Translation from Android to iOS: Pitfalls and Insights | Zhili Zeng et.al. | 2507.16037 | null |
| 2025-07-21 | Minor Embedding for Quantum Annealing with Reinforcement Learning | Riccardo Nembrini et.al. | 2507.16004 | null |
| 2025-07-21 | Automated Design of Structured Variational Quantum Circuits with Reinforcement Learning | Gloria Turati et.al. | 2507.16001 | null |
| 2025-07-21 | Red Supergiant Mass Loss and Mass-Loss Rates | Jacco Th. van Loon et.al. | 2507.15971 | null |
| 2025-07-23 | HyDRA: A Hybrid-Driven Reasoning Architecture for Verifiable Knowledge Graphs | Adrian Kaiser et.al. | 2507.15917 | null |
| 2025-07-21 | Towards Mitigation of Hallucination for LLM-empowered Agents: Progressive Generalization Bound Exploration and Watchdog Monitor | Siyuan Liu et.al. | 2507.15903 | null |
| 2025-07-21 | Advancing Responsible Innovation in Agentic AI: A study of Ethical Frameworks for Household Automation | Joydeep Chandra et.al. | 2507.15901 | null |
| 2025-07-20 | Integrating Reason-Based Moral Decision-Making in the Reinforcement Learning Architecture | Lisa Dargasz et.al. | 2507.15895 | null |
| 2025-07-20 | StaAgent: An Agentic Framework for Testing Static Analyzers | Elijah Nnorom et.al. | 2507.15892 | null |
| 2025-07-19 | AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs? | Ori Press et.al. | 2507.15887 | null |
| 2025-07-18 | ADEPTS: A Capability Framework for Human-Centered Agent Design | Pierluca D'Oro et.al. | 2507.15885 | null |
| 2025-07-21 | LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra | Seth Karten et.al. | 2507.15815 | null |
| 2025-07-21 | Density control of multi-agent swarms via bio-inspired leader-follower plasticity | Gian Carlo Maffettone et.al. | 2507.15781 | null |
| 2025-07-21 | A Framework for Analyzing Abnormal Emergence in Service Ecosystems Through LLM-based Agent Intention Mining | Yifan Shen et.al. | 2507.15770 | null |
| 2025-07-21 | GasAgent: A Multi-Agent Framework for Automated Gas Optimization in Smart Contracts | Jingyi Zheng et.al. | 2507.15761 | null |
| 2025-07-21 | Towards physician-centered oversight of conversational diagnostic AI | Elahe Vedadi et.al. | 2507.15743 | null |
| 2025-07-21 | General Matching Games | Felipe Garrido-Lucero et.al. | 2507.15737 | null |
| 2025-07-21 | Competitive Algorithms for Cooperative Multi-Agent Ski-Rental Problems | Xuchuang Wang et.al. | 2507.15727 | null |
| 2025-07-21 | Agentic AI for autonomous anomaly management in complex systems | Reza Vatankhah Barenji et.al. | 2507.15676 | null |
| 2025-07-21 | BugScope: Learn to Find Bugs Like Human | Jinyao Guo et.al. | 2507.15671 | null |
| 2025-07-21 | Asynchronous Collective Tree Exploration: a Distributed Algorithm, and a new Lower Bound | Romain Cosson et.al. | 2507.15658 | null |
| 2025-07-21 | Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training | Kailai Yang et.al. | 2507.15640 | null |
| 2025-07-21 | TacticCraft: Natural Language-Driven Tactical Adaptation for StarCraft II | Weiyu Ma et.al. | 2507.15618 | null |
| 2025-07-21 | Why can't Epidemiology be automated (yet)? | David Bann et.al. | 2507.15617 | null |
| 2025-07-21 | DHEvo: Data-Algorithm Based Heuristic Evolution for Generalizable MILP Solving | Zhihao Zhang et.al. | 2507.15615 | null |
| 2025-07-21 | Red-Team Multi-Agent Reinforcement Learning for Emergency Braking Scenario | Yinsong Chen et.al. | 2507.15587 | null |
| 2025-07-21 | FlowForge: Guiding the Creation of Multi-agent Workflows with Design Space Visualization as a Thinking Scaffold | Pan Hao et.al. | 2507.15559 | null |
| 2025-07-21 | PhysGym: Benchmarking LLMs in Interactive Physics Discovery with Controlled Priors | Yimeng Chen et.al. | 2507.15550 | null |
| 2025-07-21 | HAMLET: Hyperadaptive Agent-based Modeling for Live Embodied Theatrics | Sizhou Chen et.al. | 2507.15518 | null |
| 2025-07-21 | The Constitutional Controller: Doubt-Calibrated Steering of Compliant Agents | Simon Kohaut et.al. | 2507.15478 | null |
| 2025-07-21 | The Emergence of Deep Reinforcement Learning for Path Planning | Thanh Thi Nguyen et.al. | 2507.15469 | null |
| 2025-07-23 | Solving nonconvex Hamilton--Jacobi--Isaacs equations with PINN-based policy iteration | Hee Jun Yang et.al. | 2507.15455 | null |
| 2025-07-21 | EgoPrune: Efficient Token Pruning for Egomotion Video Reasoning in Embodied Agent | Jiaao Li et.al. | 2507.15428 | null |
| 2025-07-21 | PhishIntentionLLM: Uncovering Phishing Website Intentions through Multi-Agent Retrieval-Augmented Generation | Wenhao Li et.al. | 2507.15419 | null |
| 2025-07-21 | RAD: Retrieval High-quality Demonstrations to Enhance Decision-making | Lu Guo et.al. | 2507.15356 | null |
| 2025-07-21 | One Step is Enough: Multi-Agent Reinforcement Learning based on One-Step Policy Optimization for Order Dispatch on Ride-Sharing Platforms | Zijian Zhao et.al. | 2507.15351 | null |
| 2025-07-21 | QSAF: A Novel Mitigation Framework for Cognitive Degradation in Agentic AI | Hammad Atta et.al. | 2507.15330 | null |
| 2025-07-21 | Strategically Robust Game Theory via Optimal Transport | Nicolas Lanzetti et.al. | 2507.15325 | null |
| 2025-07-21 | Butterfly Effects in Toolchains: A Comprehensive Analysis of Failed Parameter Filling in LLM Tool-Agent Systems | Qian Xiong et.al. | 2507.15296 | null |
| 2025-07-21 | Mixture of Autoencoder Experts Guidance using Unlabeled and Incomplete Data for Exploration in Reinforcement Learning | Elias Malomgré et.al. | 2507.15287 | null |
| 2025-07-21 | Event-Triggered Resilient Consensus of Networked Euler-Lagrange Systems Under Byzantine Attacks | Yuliang Fu et.al. | 2507.15283 | null |
| 2025-07-21 | IM-Chat: A Multi-agent LLM-based Framework for Knowledge Transfer in Injection Molding Industry | Junhyeong Lee et.al. | 2507.15268 | null |
| 2025-07-21 | SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search | Xiaofeng Shi et.al. | 2507.15245 | null |
| 2025-07-21 | FaultLine: Automated Proof-of-Vulnerability Generation Using LLM Agents | Vikram Nitin et.al. | 2507.15241 | null |
| 2025-07-21 | Solving Formal Math Problems by Decomposition and Iterative Reflection | Yichi Zhou et.al. | 2507.15225 | null |
| 2025-07-21 | EchoVoices: Preserving Generational Voices and Memories for Seniors and Children | Haiying Xu et.al. | 2507.15221 | null |
| 2025-07-21 | PromptArmor: Simple yet Effective Prompt Injection Defenses | Tianneng Shi et.al. | 2507.15219 | null |
| 2025-07-21 | 1H Polarization above 60% at room temperature by triplet dynamic nuclear polarization | Kenichiro Tateishi et.al. | 2507.15217 | null |
| 2025-07-21 | Personalized 3D Myocardial Infarct Geometry Reconstruction from Cine MRI with Explicit Cardiac Motion Modeling | Yilin Lyu et.al. | 2507.15194 | null |
| 2025-07-21 | Joint-Local Grounded Action Transformation for Sim-to-Real Transfer in Multi-Agent Traffic Control | Justin Turnau et.al. | 2507.15174 | null |
| 2025-07-20 | STL-GO: Spatio-Temporal Logic with Graph Operators for Distributed Systems with Multiple Network Topologies | Yiqi Zhao et.al. | 2507.15147 | null |
| 2025-07-20 | Can We Move Freely in NEOM's The Line? An Agent-Based Simulation of Human Mobility in a Futuristic Smart City | Abderaouf Bahi et.al. | 2507.15143 | null |
| 2025-07-20 | Statistical state dynamics based study of turbulent Eady fronts. Part 2. Finite amplitude equilibria | Eojin Kim et.al. | 2507.15134 | null |
| 2025-07-20 | Initialization-driven neural generation and training for high-dimensional optimal control and first-order mean field games | Mouhcine Assouli et.al. | 2507.15126 | null |
| 2025-07-20 | From Kicking to Causality: Simulating Infant Agency Detection with a Robust Intrinsic Reward | Xia Xu et.al. | 2507.15106 | null |
| 2025-07-20 | Search-Based Autonomous Vehicle Motion Planning Using Game Theory | Pouya Panahandeh et.al. | 2507.15088 | null |
| 2025-07-20 | WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization | Zhengwei Tao et.al. | 2507.15061 | null |
| 2025-07-20 | LibLMFuzz: LLM-Augmented Fuzz Target Generation for Black-box Libraries | Ian Hardgrove et.al. | 2507.15058 | null |
| 2025-07-20 | EduThink4AI: Translating Educational Critical Thinking into Multi-Agent LLM Systems | Xinmeng Hou et.al. | 2507.15015 | null |
| 2025-07-20 | The Rise of AI Teammates in Software Engineering (SE) 3.0: How Autonomous Coding Agents Are Reshaping Software Engineering | Hao Li et.al. | 2507.15003 | null |
| 2025-07-20 | LLM-Enhanced Multi-Agent Reinforcement Learning with Expert Workflow for Real-Time P2P Energy Trading | Chengwei Lou et.al. | 2507.14995 | null |
| 2025-07-20 | Think Like an Engineer: A Neuro-Symbolic Collaboration Agent for Generative Software Requirements Elicitation and Self-Review | Sai Zhang et.al. | 2507.14969 | null |
| 2025-07-20 | STEPC: A Pixel-wise Nonuniformity Correction Framework for Photon-Counting CT in Multi-material Imaging Scenarios | Enze Zhou et.al. | 2507.14963 | null |
| 2025-07-20 | Probing EFX via PMMS: (Non-)Existence Results in Discrete Fair Division | Jarosław Byrka et.al. | 2507.14957 | null |
| 2025-07-20 | Echoes of the Land: An Interactive Installation Based on Physical Model of Earthquake | Ivan C. H. Liu et.al. | 2507.14947 | null |
| 2025-07-20 | Byzantine-Robust Decentralized Coordination of LLM Agents | Yongrae Jo et.al. | 2507.14928 | null |
| 2025-07-20 | Redefining Elderly Care with Agentic AI: Challenges and Opportunities | Ruhul Amin Khalil et.al. | 2507.14912 | null |
| 2025-07-20 | TriCLIP-3D: A Unified Parameter-Efficient Framework for Tri-Modal 3D Visual Grounding based on CLIP | Fan Li et.al. | 2507.14904 | null |
| 2025-07-20 | Learning Nonlinear Causal Reductions to Explain Reinforcement Learning Policies | Armin Kekić et.al. | 2507.14901 | null |
| 2025-07-20 | InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis | Jiale Liu et.al. | 2507.14899 | null |
| 2025-07-20 | AgentFly: Extensible and Scalable Reinforcement Learning for LM Agents | Renxi Wang et.al. | 2507.14897 | null |
| 2025-07-20 | Hierarchical Multi-Agent Reinforcement Learning with Control Barrier Functions for Safety-Critical Autonomous Systems | H. M. Sabbir Ahmad et.al. | 2507.14850 | null |
| 2025-07-20 | Manipulating LLM Web Agents with Indirect Prompt Injection Attack via HTML Accessibility Tree | Sam Johnson et.al. | 2507.14799 | null |
| 2025-07-19 | Towards AI Urban Planner in the Age of GenAI, LLMs, and Agentic AI | Yanjie Fu et.al. | 2507.14730 | null |
| 2025-07-19 | Simulating Chirality: Solving Distance- |
Brati Mondal et.al. | 2507.14723 | null |
| 2025-07-19 | Configurable multi-agent framework for scalable and realistic testing of llm-based agents | Sai Wang et.al. | 2507.14705 | null |
| 2025-07-19 | WSI-Agents: A Collaborative Multi-Agent System for Multi-Modal Whole Slide Image Analysis | Xinheng Lyu et.al. | 2507.14680 | null |
| 2025-07-19 | When Autonomy Goes Rogue: Preparing for Risks of Multi-Agent Collusion in Social Systems | Qibing Ren et.al. | 2507.14660 | null |
| 2025-07-19 | Learning to Communicate in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence | Faizan Contractor et.al. | 2507.14658 | null |
| 2025-07-19 | Agentic Satellite-Augmented Low-Altitude Economy and Terrestrial Networks: A Survey on Generative Approaches | Xiaozheng Gao et.al. | 2507.14633 | null |
| 2025-07-19 | Towards a Proactive Autoscaling Framework for Data Stream Processing at the Edge using GRU and Transfer Learning | Eugene Armah et.al. | 2507.14597 | null |
| 2025-07-19 | Amico: An Event-Driven Modular Framework for Persistent and Embedded Autonomy | Hongyi Yang et.al. | 2507.14513 | null |
| 2025-07-19 | Federated Reinforcement Learning in Heterogeneous Environments | Ukjo Hwang et.al. | 2507.14487 | null |
| 2025-07-22 | Routine: A Structural Planning Framework for LLM Agent System in Enterprise | Guancheng Zeng et.al. | 2507.14447 | null |
| 2025-07-18 | NetIntent: Leveraging Large Language Models for End-to-End Intent-Based SDN Automation | Md. Kamrul Hossain et.al. | 2507.14398 | null |
| 2025-07-18 | Adaptive Multi-Agent Reasoning via Automated Workflow Generation | Humza Sami et.al. | 2507.14393 | null |
| 2025-07-18 | Text-to-SQL for Enterprise Data Analytics | Albert Chen et.al. | 2507.14372 | null |
| 2025-07-18 | Stable matchings with switching costs | Boris Pittel et.al. | 2507.14362 | null |
| 2025-07-18 | FedStrategist: A Meta-Learning Framework for Adaptive and Robust Aggregation in Federated Learning | Md Rafid Haque et.al. | 2507.14322 | null |
| 2025-07-18 | Semantic Segmentation based Scene Understanding in Autonomous Vehicles | Ehsan Rassekh et.al. | 2507.14303 | null |
| 2025-07-18 | Distributed consensus-based observer design for target state estimation with bearing measurements | Marcelo Jacinto et.al. | 2507.14300 | null |
| 2025-07-18 | Age of Information Minimization in UAV-Enabled Integrated Sensing and Communication Systems | Yu Bai et.al. | 2507.14299 | null |
| 2025-07-18 | WebGuard: Building a Generalizable Guardrail for Web Agents | Boyuan Zheng et.al. | 2507.14293 | null |
| 2025-07-18 | DREAMS: Density Functional Theory Based Research Engine for Agentic Materials Simulation | Ziqi Wang et.al. | 2507.14267 | null |
| 2025-07-18 | Beyond DNS: Unlocking the Internet of AI Agents via the NANDA Index and Verified AgentFacts | Ramesh Raskar et.al. | 2507.14263 | null |
| 2025-07-17 | Towards an ABM on Proactive Community Adaptation for Climate Change | Önder Gürcan et.al. | 2507.14233 | null |
| 2025-07-17 | Intent-Based Network for RAN Management with Large Language Models | Fransiscus Asisi Bimo et.al. | 2507.14230 | null |
| 2025-07-18 | DPMT: Dual Process Multi-scale Theory of Mind Framework for Real-time Human-AI Collaboration | Xiyun Li et.al. | 2507.14088 | null |
| 2025-07-18 | Collaborative Rational Speech Act: Pragmatic Reasoning for Multi-Turn Dialog | Lautaro Estienne et.al. | 2507.14063 | null |
| 2025-07-23 | Well-posedness and propagation of chaos for multi-agent models with strategies and diffusive effects | Alessandro Baldi et.al. | 2507.14058 | null |
| 2025-07-18 | Online MMS Allocation for Chores | Jiaxin Song et.al. | 2507.14039 | null |
| 2025-07-18 | Architecting Human-AI Cocreation for Technical Services -- Interaction Modes and Contingency Factors | Jochen Wulf et.al. | 2507.14034 | null |
| 2025-07-18 | Byzantine-resilient federated online learning for Gaussian process regression | Xu Zhang et.al. | 2507.14021 | null |
| 2025-07-18 | DreamScene: 3D Gaussian-based End-to-end Text-to-3D Scene Generation | Haoran Li et.al. | 2507.13985 | null |
| 2025-07-18 | A Multi-Objective Optimization framework for Decentralized Learning with coordination constraints | Roberto Morales et.al. | 2507.13983 | null |
| 2025-07-18 | Bottom-up Domain-specific Superintelligence: A Reliable Knowledge Graph is What We Need | Bhishma Dedhia et.al. | 2507.13966 | null |
| 2025-07-18 | NeHMO: Neural Hamilton-Jacobi Reachability Learning for Decentralized Safe Multi-Agent Motion Planning | Qingyi Chen et.al. | 2507.13940 | null |
| 2025-07-18 | Marcel: A Lightweight and Open-Source Conversational Agent for University Student Support | Jan Trienes et.al. | 2507.13937 | null |
| 2025-07-18 | Reframing attention as a reinforcement learning problem for causal discovery | Turan Orujlu et.al. | 2507.13920 | null |
| 2025-07-18 | Advanced X-rays techniques for research-oriented high-resolution imaging of articular cartilage: a scoping review | Simone Fantoni et.al. | 2507.13854 | null |
| 2025-07-18 | Impact of homophily in adherence to anti-epidemic measures on the spread of infectious diseases in social networks | Piotr Bentkowski et.al. | 2507.13848 | null |
| 2025-07-18 | Causal Knowledge Transfer for Multi-Agent Reinforcement Learning in Dynamic Environments | Kathrin Korte et.al. | 2507.13846 | null |
| 2025-07-18 | Principles and Reasons Behind Automated Vehicle Decisions in Ethically Ambiguous Everyday Scenarios | Lucas Elbert Suryana et.al. | 2507.13837 | null |
| 2025-07-18 | Conformal Data Contamination Tests for Trading or Sharing of Data | Martin V. Vejling et.al. | 2507.13835 | null |
| 2025-07-18 | Scalable Submodular Policy Optimization via Pruned Submodularity Graph | Aditi Anand et.al. | 2507.13834 | null |
| 2025-07-18 | CodeEdu: A Multi-Agent Collaborative Platform for Personalized Coding Education | Jianing Zhao et.al. | 2507.13814 | null |
| 2025-07-18 | From Extraction to Synthesis: Entangled Heuristics for Agent-Augmented Strategic Reasoning | Renato Ghisellini et.al. | 2507.13768 | null |
| 2025-07-21 | Navigating the Lobbying Landscape: Insights from Opinion Dynamics Models | Daniele Giachini et.al. | 2507.13767 | null |
| 2025-07-18 | AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework | Yu Yao et.al. | 2507.13729 | null |
| 2025-07-18 | CogniQ-H: A Soft Hierarchical Reinforcement Learning Paradigm for Automated Data Preparation | Jing Chang et.al. | 2507.13710 | null |
| 2025-07-18 | Minimum Clustering of Matrices Based on Phase Alignment | Honghao Wu et.al. | 2507.13678 | null |
| 2025-07-18 | Improved particle swarm optimization algorithm: multi-target trajectory optimization for swarm drones | Minze Li et.al. | 2507.13647 | null |
| 2025-07-18 | Differential Privacy in Kernelized Contextual Bandits via Random Projections | Nikola Pavlovic et.al. | 2507.13639 | null |
| 2025-07-17 | Evolving Neural Controllers for Xpilot-AI Racing Using Neuroevolution of Augmenting Topologies | Jim O'Connor et.al. | 2507.13549 | null |
| 2025-07-17 | Human-Like Trajectories Generation via Receding Horizon Tracking Applied to the TickTacking Interface | Daniele Masti et.al. | 2507.13528 | null |
| 2025-07-17 | Humans learn to prefer trustworthy AI over human partners | Yaomin Jiang et.al. | 2507.13524 | null |
| 2025-07-17 | GraphTrafficGPT: Enhancing Traffic Management Through Graph-Based AI Agent Coordination | Nabil Abdelaziz Ferhat Taleb et.al. | 2507.13511 | null |
| 2025-07-17 | Model-free Reinforcement Learning for Model-based Control: Towards Safe, Interpretable and Sample-efficient Agents | Thomas Banker et.al. | 2507.13491 | null |
| 2025-07-17 | LightAutoDS-Tab: Multi-AutoML Agentic System for Tabular Data | Aleksey Lapin et.al. | 2507.13413 | null |
| 2025-07-21 | A Survey of Context Engineering for Large Language Models | Lingrui Mei et.al. | 2507.13334 | null |
| 2025-07-17 | N Bugs on a Circle | Josh Briley et.al. | 2507.13333 | null |
| 2025-07-17 | Multi-Agent Synergy-Driven Iterative Visual Narrative Synthesis | Wang Xi et.al. | 2507.13285 | null |
| 2025-07-20 | Analysis Theory of Data Economy: Dataization, Technological Progress and Dynamic General Equilibrium | Yongheng Hu et.al. | 2507.13274 | null |
| 2025-07-17 | RemVerse: Supporting Reminiscence Activities for Older Adults through AI-Assisted Virtual Reality | Ruohao Li et.al. | 2507.13247 | null |
| 2025-07-17 | GEMMAS: Graph-based Evaluation Metrics for Multi Agent Systems | Jisoo Lee et.al. | 2507.13190 | null |
| 2025-07-17 | Black Box Deployed -- Functional Criteria for Artificial Moral Agents in the LLM Era | Matthew E. Brophy et.al. | 2507.13175 | null |
| 2025-07-17 | Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback | Suzie Kim et.al. | 2507.13171 | null |
| 2025-07-17 | Prompt Injection 2.0: Hybrid AI Threats | Jeremy McHugh et.al. | 2507.13169 | null |
| 2025-07-17 | SE-VLN: A Self-Evolving Vision-Language Navigation Framework Based on Multimodal Large Language Models | Xiangyu Dong et.al. | 2507.13152 | null |
| 2025-07-17 | RIDAS: A Multi-Agent Framework for AI-RAN with Representation- and Intention-Driven Agents | Kuiyuan Ding et.al. | 2507.13140 | null |
| 2025-07-17 | Governance, productivity and economic development | Cuong Le Van et.al. | 2507.13099 | null |
| 2025-07-17 | iReDev: A Knowledge-Driven Multi-Agent Framework for Intelligent Requirements Development | Dongming Jin et.al. | 2507.13081 | null |
| 2025-07-17 | Intelligent Virtual Sonographer (IVS): Enhancing Physician-Robot-Patient Communication | Tianyu Song et.al. | 2507.13052 | null |
| 2025-07-17 | What Can Robots Teach Us About Trust and Reliance? An interdisciplinary dialogue between Social Sciences and Social Robotics | Julien Wacquez et.al. | 2507.13041 | null |
| 2025-07-17 | MAD-Spear: A Conformity-Driven Prompt Injection Attack on Multi-Agent Debate Systems | Yu Cui et.al. | 2507.13038 | null |
| 2025-07-17 | Lower Bound for Online MMS Assignment of Indivisible Chores | Masoud Seddighin et.al. | 2507.12984 | null |
| 2025-07-17 | Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning | Giwon Lee et.al. | 2507.12977 | null |
| 2025-07-21 | LaViPlan : Language-Guided Visual Path Planning with RLVR | Hayeon Oh et.al. | 2507.12911 | null |
| 2025-07-17 | Autonomous Resource Management in Microservice Systems via Reinforcement Learning | Yujun Zou et.al. | 2507.12879 | null |
| 2025-07-20 | Information-Theoretic Aggregation of Ethical Attributes in Simulated-Command | Taylan Akay et.al. | 2507.12862 | null |
| 2025-07-17 | Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering | Muhammad Fadhil Ginting et.al. | 2507.12846 | null |
| 2025-07-17 | Machine-Readable Ads: Accessibility and Trust Patterns for AI Web Agents interacting with Online Advertisements | Joel Nitu et.al. | 2507.12844 | null |
| 2025-07-22 | Assessing Adaptive World Models in Machines with Novel Games | Lance Ying et.al. | 2507.12821 | null |
| 2025-07-17 | From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning | Gaurav Chaudhary et.al. | 2507.12815 | null |
| 2025-07-17 | MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models | Zhiwei Liu et.al. | 2507.12806 | null |
| 2025-07-17 | Imitating Mistakes in a Learning Companion AI Agent for Online Peer Learning | Sosui Moribe et.al. | 2507.12801 | null |
| 2025-07-17 | City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning | Penglei Sun et.al. | 2507.12795 | null |
| 2025-07-17 | A Comprehensive Survey of Electronic Health Record Modeling: From Deep Learning Approaches to Large Language Models | Weijieying Ren et.al. | 2507.12774 | null |
| 2025-07-17 | Autonomy for Older Adult-Agent Interaction | Jiaxin An et.al. | 2507.12767 | null |
| 2025-07-17 | Public Evaluation on Potential Social Impacts of Fully Autonomous Cybernetic Avatars for Physical Support in Daily-Life Environments: Large-Scale Demonstration and Survey at Avatar Land | Lotfi El Hafi et.al. | 2507.12741 | null |
| 2025-07-17 | Competition Erases Simplicity: Tight Regret Bounds for Uniform Pricing with Multiple Buyers | Houshuang Chen et.al. | 2507.12733 | null |
| 2025-07-17 | Strategy Adaptation in Large Language Model Werewolf Agents | Fuya Nakamori et.al. | 2507.12732 | null |
| 2025-07-17 | Identification of Authoritative Nodes and Dismantling of Illicit Networks Using a Novel Metric for Measuring Strength of a Graph | Kartikeya Kansal et.al. | 2507.12711 | null |
| 2025-07-16 | Fly, Fail, Fix: Iterative Game Repair with Reinforcement Learning and Large Multimodal Models | Alex Zook et.al. | 2507.12666 | null |
| 2025-07-16 | NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting | Kuangshi Ai et.al. | 2507.12621 | null |
| 2025-07-16 | A Survey of Explainable Reinforcement Learning: Targets, Methods and Needs | Léo Saulières et.al. | 2507.12599 | null |
| 2025-07-16 | The Impact of Social Attractiveness on Casual Group Formation: Power-Law Group Sizes and Suppressed Percolation | Matheus S. Mariano et.al. | 2507.12585 | null |
| 2025-07-20 | Can Mental Imagery Improve the Thinking Capabilities of AI Systems? | Slimane Larabi et.al. | 2507.12555 | null |
| 2025-07-15 | FOUNDER: Grounding Foundation Models in World Models for Open-Ended Embodied Decision Making | Yucen Wang et.al. | 2507.12496 | null |
| 2025-07-15 | MR-LDM -- The Merge-Reactive Longitudinal Decision Model: Game Theoretic Human Decision Modeling for Interactive Sim Agents | Dustin Holley et.al. | 2507.12494 | null |
| 2025-07-15 | On multiagent online problems with predictions | Gabriel Istrate et.al. | 2507.12486 | null |
| 2025-07-14 | AI-Powered Math Tutoring: Platform for Personalized and Adaptive Education | Jarosław A. Chudziak et.al. | 2507.12484 | null |
| 2025-07-16 | Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data | Chandana Cheerla et.al. | 2507.12425 | null |
| 2025-07-16 | Modeling Feasible Locomotion of Nanobots for Cancer Detection and Treatment | Noble Harasha et.al. | 2507.12400 | null |
| 2025-07-16 | Beyond Single Models: Enhancing LLM Detection of Ambiguity in Requests through Debate | Ana Davila et.al. | 2507.12370 | null |
| 2025-07-21 | GitChameleon 2.0: Evaluating AI Code Generation Against Python Library Version Incompatibilities | Diganta Misra et.al. | 2507.12367 | null |
| 2025-07-16 | Social polarization promoted by sparse higher-order interactions | Hugo Pérez-Martínez et.al. | 2507.12325 | null |
| 2025-07-17 | Next-Gen Museum Guides: Autonomous Navigation and Visitor Interaction with an Agentic Robot | Luca Garello et.al. | 2507.12273 | null |
| 2025-07-16 | Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes | Johann Frei et.al. | 2507.12261 | null |
| 2025-07-16 | Toward a Behavioural Translation Style Space: Simulating the Temporal Dynamics of Affect, Behaviour, and Cognition in Human Translation Production | Michael Carl et.al. | 2507.12208 | null |
| 2025-07-16 | BenchRL-QAS: Benchmarking reinforcement learning algorithms for quantum architecture search | Azhar Ikhtiarudin et.al. | 2507.12189 | null |
| 2025-07-16 | Fast and Scalable Game-Theoretic Trajectory Planning with Intentional Uncertainties | Zhenmin Huang et.al. | 2507.12174 | null |
| 2025-07-16 | Convergence Rate of Generalized Nash Equilibrium Learning in Strongly Monotone Games with Linear Constraints | Tatiana Tatarenko et.al. | 2507.12112 | null |
| 2025-07-16 | Topology Enhanced MARL for Multi-Vehicle Cooperative Decision-Making of CAVs | Ye Han et.al. | 2507.12110 | null |
| 2025-07-16 | Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics | Muleilan Pei et.al. | 2507.12083 | null |
| 2025-07-16 | Evaluating the Ability of Large Language Models to Reason about Cardinal Directions, Revisited | Anthony G Cohn et.al. | 2507.12059 | null |
| 2025-07-16 | Contracting with a Mechanism Designer | Tian Bai et.al. | 2507.12054 | null |
| 2025-07-16 | ARRC: Explainable, Workflow-Integrated Recommender for Sustainable Resource Optimization Across the Edge-Cloud Continuum | Brian-Frederik Jahnke et.al. | 2507.12032 | null |
| 2025-07-16 | QAS-QTNs: Curriculum Reinforcement Learning-Driven Quantum Architecture Search for Quantum Tensor Networks | Siddhant Dutta et.al. | 2507.12013 | null |
| 2025-07-16 | Understanding visual attention beehind bee-inspired UAV navigation | Pranav Rajbhandari et.al. | 2507.11992 | null |
| 2025-07-17 | Aime: Towards Fully-Autonomous Multi-Agent Framework | Yexuan Shi et.al. | 2507.11988 | null |
| 2025-07-16 | Value-Based Large Language Model Agent Simulation for Mutual Evaluation of Trust and Interpersonal Closeness | Yuki Sakamoto et.al. | 2507.11979 | null |
| 2025-07-16 | Online Training and Pruning of Deep Reinforcement Learning Networks | Valentin Frank Ingmar Guenter et.al. | 2507.11975 | null |
| 2025-07-16 | Graph Representations for Reading Comprehension Analysis using Large Language Model and Eye-Tracking Biomarker | Yuhong Zhang et.al. | 2507.11972 | null |
| 2025-07-16 | IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving | Kanghyun Ryu et.al. | 2507.11940 | null |
| 2025-07-16 | From Generative to Episodic: Sample-Efficient Replicable Reinforcement Learning | Max Hopkins et.al. | 2507.11926 | null |
| 2025-07-16 | Hybrid Conformal Prediction-based Risk-Aware Model Predictive Planning in Dense, Uncertain Environments | Jeongyong Yang et.al. | 2507.11920 | null |
| 2025-07-16 | CoCre-Sam (Kokkuri-san): Modeling Ouija Board as Collective Langevin Dynamics Sampling from Fused Language Models | Tadahiro Taniguchi et.al. | 2507.11906 | null |
| 2025-07-16 | Extremal Testing for Network Software using LLMs | Rathin Singha et.al. | 2507.11898 | null |
| 2025-07-16 | Generative Intelligence Systems in the Flow of Group Emotions | Fernando Koch et.al. | 2507.11831 | null |
| 2025-07-16 | The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and Scientist | Haoxuan Zhang et.al. | 2507.11810 | null |
| 2025-07-16 | New allocation rule based on graph structures and their application to economic phenomena | Taiki Yamada et.al. | 2507.11808 | null |
| 2025-07-15 | Large-scale distributed synchronization systems, using a cancel-on-completion redundancy mechanism | Alexander Stolyar et.al. | 2507.11779 | null |
| 2025-07-15 | A Cellular Automata Approach to Donation Game | Marcin Kowalik et.al. | 2507.11744 | null |
| 2025-07-15 | Let's Think in Two Steps: Mitigating Agreement Bias in MLLMs with Self-Grounded Verification | Moises Andrade et.al. | 2507.11662 | null |
| 2025-07-15 | STAGED: A Multi-Agent Neural Network for Learning Cellular Interaction Dynamics | Joao F. Rocha et.al. | 2507.11660 | null |
| 2025-07-15 | VISTA: Monocular Segmentation-Based Mapping for Appearance and View-Invariant Global Localization | Hannah Shafferman et.al. | 2507.11653 | null |
| 2025-07-15 | General Modular Harness for LLM Agents in Multi-Turn Gaming Environments | Yuxuan Zhang et.al. | 2507.11633 | null |
| 2025-07-15 | AI, Humans, and Data Science: Optimizing Roles Across Workflows and the Workforce | Richard Timpone et.al. | 2507.11597 | null |
| 2025-07-14 | Consumer Law for AI Agents | Christoph Busch et.al. | 2507.11567 | null |
| 2025-07-14 | Emergent Heterogeneous Swarm Control Through Hebbian Learning | Fuda van Diggelen et.al. | 2507.11566 | null |
| 2025-07-14 | A Model Aware AIGC Task Offloading Algorithm in IIoT Edge Computing | Xin Wang et.al. | 2507.11560 | null |
| 2025-07-15 | DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering | Yinsheng Li et.al. | 2507.11527 | null |
| 2025-07-15 | Opinion dynamics: Statistical physics and beyond | Michele Starnini et.al. | 2507.11521 | null |
| 2025-07-15 | AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air | Shiyi Yang et.al. | 2507.11515 | null |
| 2025-07-15 | On the Complexity of the Optimal Correlated Equilibria in Extensive-Form Games | Vincent Cheval et.al. | 2507.11509 | null |
| 2025-07-15 | LF: Online Multi-Robot Path Planning Meets Optimal Trajectory Control | Ajay Shankar et.al. | 2507.11464 | null |
| 2025-07-15 | EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes | LG AI Research et.al. | 2507.11407 | null |
| 2025-07-15 | From Production Logistics to Smart Manufacturing: The Vision for a New RoboCup Industrial League | Supun Dissanayaka et.al. | 2507.11402 | null |
| 2025-07-20 | Dr.Copilot: A Multi-Agent Prompt Optimized Assistant for Improving Patient-Doctor Communication in Romanian | Andrei Niculae et.al. | 2507.11299 | null |
| 2025-07-15 | Taming Uncertainty via Automation: Observing, Analyzing, and Optimizing Agentic AI Systems | Dany Moshkovich et.al. | 2507.11277 | null |
| 2025-07-15 | An Empirical Study of Multi-Agent RAG for Real-World University Admissions Counseling | Anh Nguyen-Duc et.al. | 2507.11272 | null |
| 2025-07-15 | Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound | Tal Fiskus et.al. | 2507.11269 | null |
| 2025-07-15 | An Agentic Flow for Finite State Machine Extraction using Prompt Chaining | Fares Wael et.al. | 2507.11222 | null |
| 2025-07-15 | Fair Contracts | Matteo Castiglioni et.al. | 2507.11214 | null |
| 2025-07-15 | Role-Playing LLM-Based Multi-Agent Support Framework for Detecting and Addressing Family Communication Bias | Rushia Harada et.al. | 2507.11210 | null |
| 2025-07-15 | Temperature and Persona Shape LLM Agent Consensus With Minimal Accuracy Gains in Qualitative Coding | Conrad Borchers et.al. | 2507.11198 | null |
| 2025-07-15 | Quantized Rank Reduction: A Communications-Efficient Federated Learning Scheme for Network-Critical Applications | Dimitrios Kritsiolis et.al. | 2507.11183 | null |
| 2025-07-15 | AI Agent Architecture for Decentralized Trading of Alternative Assets | Ailiya Borjigin et.al. | 2507.11117 | null |
| 2025-07-15 | Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander | Li Wang et.al. | 2507.11079 | null |
| 2025-07-17 | SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks | Pavel Adamenko et.al. | 2507.11059 | null |
| 2025-07-16 | Journalism-Guided Agentic In-Context Learning for News Stance Detection | Dahyun Lee et.al. | 2507.11049 | null |
| 2025-07-15 | Value of History in Social Learning: Applications to Markets for History | Hiroto Sato et.al. | 2507.11029 | null |
| 2025-07-15 | DS@GT at eRisk 2025: From prompts to predictions, benchmarking early depression detection with conversational agent based assessments and temporal attention models | Anthony Miyaguchi et.al. | 2507.10958 | null |
| 2025-07-15 | A Learning Framework For Cooperative Collision Avoidance of UAV Swarms Leveraging Domain Knowledge | Shuangyao Huang et.al. | 2507.10913 | null |
| 2025-07-15 | Lessons Learned from Evaluation of LLM based Multi-agents in Safer Therapy Recommendation | Yicong Wu et.al. | 2507.10911 | null |
| 2025-07-15 | NavComposer: Composing Language Instructions for Navigation Trajectories through Action-Scene-Object Modularization | Zongtao He et.al. | 2507.10894 | null |
| 2025-07-15 | Start from the End: A Framework for Computational Policy Exploration to Inform Effective and Geospatially Consistent Interventions applied to COVID-19 in St. Louis | David O'Gara et.al. | 2507.10870 | null |
| 2025-07-14 | LLM-Guided Agentic Object Detection for Open-World Understanding | Furkan Mumcu et.al. | 2507.10844 | null |
| 2025-07-14 | Past, Present and Future: Exploring Adaptive AI in Software Development Bots | Omar Elsisi et.al. | 2507.10822 | null |
| 2025-07-14 | Semantic Context for Tool Orchestration | Robert Müller et.al. | 2507.10820 | null |
| 2025-07-14 | Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection | Huiyi Wang et.al. | 2507.10814 | null |
| 2025-07-14 | React to This (RTT): A Nonverbal Turing Test for Embodied AI | Chuxuan Zhang et.al. | 2507.10812 | null |
| 2025-07-14 | Warehouse Spatial Question Answering with LLM Agent | Hsiang-Wei Huang et.al. | 2507.10778 | null |
| 2025-07-14 | RCG: Safety-Critical Scenario Generation for Robust Autonomous Driving via Real-World Crash Grounding | Benjamin Stoler et.al. | 2507.10749 | null |
| 2025-07-14 | Ground-Compose-Reinforce: Tasking Reinforcement Learning Agents through Formal Language | Andrew C. Li et.al. | 2507.10741 | null |
| 2025-07-14 | Bridging Brains and Machines: A Unified Frontier in Neuroscience, Artificial Intelligence, and Neuromorphic Systems | Sohan Shankar et.al. | 2507.10722 | null |
| 2025-07-14 | Exploring User Security and Privacy Attitudes and Concerns Toward the Use of General-Purpose LLM Chatbots for Mental Health | Jabari Kwesi et.al. | 2507.10695 | null |
| 2025-07-14 | Vision Language Action Models in Robotic Manipulation: A Systematic Review | Muhayy Ud Din et.al. | 2507.10672 | null |
| 2025-07-16 | From Semantic Web and MAS to Agentic AI: A Unified Narrative of the Web of Agents | Tatiana Petrova et.al. | 2507.10644 | null |
| 2025-07-14 | Enhancing the Capabilities of Large Language Models for API calls through Knowledge Graphs | Ye Yang et.al. | 2507.10630 | null |
| 2025-07-14 | Game Theory Meets LLM and Agentic AI: Reimagining Cybersecurity for the Age of Intelligent Threats | Quanyan Zhu et.al. | 2507.10621 | null |
| 2025-07-13 | Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks | Oluwaseyi Giwa et.al. | 2507.10619 | null |
| 2025-07-13 | LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents | Zihe Yan et.al. | 2507.10610 | null |
| 2025-07-12 | Emergence of Hierarchical Emotion Organization in Large Language Models | Bo Zhao et.al. | 2507.10599 | null |
| 2025-07-11 | ARPaCCino: An Agentic-RAG for Policy as Code Compliance | Francesco Romeo et.al. | 2507.10584 | null |
| 2025-07-11 | An Offline Mobile Conversational Agent for Mental Health Support: Learning from Emotional Dialogues and Psychological Texts with Student-Centered Evaluation | Vimaleswar A et.al. | 2507.10580 | null |
| 2025-07-16 | Truth Sleuth and Trend Bender: AI Agents to fact-check YouTube videos and influence opinions | Cécile Logé et.al. | 2507.10577 | null |
| 2025-07-14 | EmbRACE-3K: Embodied Reasoning and Action in Complex Environments | Mingxian Lin et.al. | 2507.10548 | null |
| 2025-07-14 | Graph World Model | Tao Feng et.al. | 2507.10539 | null |
| 2025-07-14 | DeepResearch |
Jennifer D'Souza et.al. | 2507.10522 | null |
| 2025-07-14 | An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments | Mikko Korkiakoski et.al. | 2507.10469 | null |
| 2025-07-14 | Logic layer Prompt Control Injection (LPCI): A Novel Security Vulnerability Class in Agentic Systems | Hammad Atta et.al. | 2507.10457 | null |
| 2025-07-14 | Negative entropy and non-equilibrium Euclidean shell | Yang An et.al. | 2507.10450 | null |
| 2025-07-14 | Am I on the Right Track? What Can Predicted Query Performance Tell Us about the Search Behaviour of Agentic RAG | Fangzheng Tian et.al. | 2507.10411 | null |
| 2025-07-14 | Machine-Learning to Trust | Ran Spiegler et.al. | 2507.10363 | null |
| 2025-07-14 | Toolsuite for Implementing Multiagent Systems Based on Communication Protocols | Amit K. Chopra et.al. | 2507.10324 | null |
| 2025-07-14 | Prompt Informed Reinforcement Learning for Visual Coverage Path Planning | Venkat Margapuri et.al. | 2507.10284 | null |
| 2025-07-14 | Toward Real-World Table Agents: Capabilities, Workflows, and Design Principles for LLM-based Table Intelligence | Jiaming Tian et.al. | 2507.10281 | null |
| 2025-07-14 | ToMacVF : Temporal Macro-action Value Factorization for Asynchronous Multi-Agent Reinforcement Learning | Wenjing Zhang et.al. | 2507.10251 | null |
| 2025-07-14 | Should We Ever Prefer Decision Transformer for Offline Reinforcement Learning? | Yumi Omori et.al. | 2507.10174 | null |
| 2025-07-14 | Play Style Identification Using Low-Level Representations of Play Traces in MicroRTS | Ruizhe Yu Xia et.al. | 2507.10172 | null |
| 2025-07-14 | Simulating Biases for Interpretable Fairness in Offline and Online Classifiers | Ricardo Inácio et.al. | 2507.10154 | null |
| 2025-07-14 | Adaptability in Multi-Agent Reinforcement Learning: A Framework and Unified Review | Siyi Hu et.al. | 2507.10142 | null |
| 2025-07-16 | A PBN-RL-XAI Framework for Discovering a "Hit-and-Run" Therapeutic Strategy in Melanoma | Zhonglin Liu et.al. | 2507.10136 | null |
| 2025-07-14 | Towards High Supervised Learning Utility Training Data Generation: Data Pruning and Column Reordering | Tung Sum Thomas Kwok et.al. | 2507.10088 | null |
| 2025-07-14 | Cultural Bias in Large Language Models: Evaluating AI Agents through Moral Questionnaires | Simon Münker et.al. | 2507.10073 | null |
| 2025-07-14 | Finetuning Deep Reinforcement Learning Policies with Evolutionary Strategies for Control of Underactuated Robots | Marco Calì et.al. | 2507.10030 | null |
| 2025-07-14 | The Man Behind the Sound: Demystifying Audio Private Attribute Profiling via Multimodal Large Language Model Agents | Lixu Wang et.al. | 2507.10016 | null |
| 2025-07-14 | On The Role of Intentionality in Knowledge Representation: Analyzing Scene Context for Cognitive Agents with a Tiny Language Model | Mark Burgess et.al. | 2507.10000 | null |
| 2025-07-17 | Predictive & Trust-based Multi-Agent Coordination | Venkatraman Renganathan et.al. | 2507.09997 | null |
| 2025-07-14 | Evolution of Fear and Social Rewards in Prey-Predator Relationship | Yuji Kanagawa et.al. | 2507.09992 | null |
| 2025-07-14 | Improving monotonic optimization in heterogeneous multi-agent reinforcement learning with optimal marginal deterministic policy gradient | Xiaoyang Yu et.al. | 2507.09989 | null |
| 2025-07-14 | Quantum measurement of work in mesoscopic systems | Anant Vijay Varma et.al. | 2507.09977 | null |
| 2025-07-14 | Generalized Quantal Response Equilibrium: Existence and Efficient Learning | Apurv Shukla et.al. | 2507.09928 | null |
| 2025-07-14 | Intelligent Task Management via Dynamic Multi-region Division in LEO Satellite Networks | Zixuan Song et.al. | 2507.09926 | null |
| 2025-07-14 | Energy-Stable Swarm-Based Inertial Algorithms for Optimization | Xuelong Gu et.al. | 2507.09909 | null |
| 2025-07-14 | Large Population Models | Ayush Chopra et.al. | 2507.09901 | null |
| 2025-07-14 | Towards Realistic and Interpretable Market Simulations: Factorizing Financial Power Law using Optimal Transport | Ryuji Hashimoto et.al. | 2507.09863 | null |
| 2025-07-14 | Multi-residual Mixture of Experts Learning for Cooperative Control in Multi-vehicle Systems | Vindula Jayawardana et.al. | 2507.09836 | null |
| 2025-07-20 | Active Probing with Multimodal Predictions for Motion Planning | Darshan Gadginmath et.al. | 2507.09822 | null |
| 2025-07-13 | An infinitesimal generator approach on weak convergence of regulated multi-class matching systems | Bowen Xie et.al. | 2507.09789 | null |
| 2025-07-13 | TinyTroupe: An LLM-powered Multiagent Persona Simulation Toolkit | Paulo Salem et.al. | 2507.09788 | null |
| 2025-07-13 | Toward accurate RUL and SOH estimation using reinforced graph-based PINNs enhanced with dynamic weights | Mohamadreza Akbari Pour et.al. | 2507.09766 | null |
| 2025-07-13 | IteraOptiRacing: A Unified Planning-Control Framework for Real-time Autonomous Racing for Iterative Optimal Performance | Yifan Zeng et.al. | 2507.09714 | null |
| 2025-07-13 | Token Compression Meets Compact Vision Transformers: A Survey and Comparative Evaluation for Edge AI | Phat Nguyen et.al. | 2507.09702 | null |
| 2025-07-13 | Networked Information Aggregation via Machine Learning | Michael Kearns et.al. | 2507.09683 | null |
| 2025-07-13 | Negotiating Comfort: Simulating Personality-Driven LLM Agents in Shared Residential Social Networks | Ann Nedime Nese Rende et.al. | 2507.09657 | null |
| 2025-07-13 | humancompatible.interconnect: Testing Properties of Repeated Uses of Interconnections of AI Systems | Rodion Nazarov et.al. | 2507.09626 | null |
| 2025-07-13 | On the existence of EFX allocations for goods | Ujjwal Kumar et.al. | 2507.09600 | null |
| 2025-07-17 | THOR: Transformer Heuristics for On-Demand Retrieval | Isaac Shi et.al. | 2507.09592 | null |
| 2025-07-13 | eSapiens: A Platform for Secure and Auditable Retrieval-Augmented Generation | Isaac Shi et.al. | 2507.09588 | null |
| 2025-07-13 | AICrypto: A Comprehensive Benchmark For Evaluating Cryptography Capabilities of Large Language Models | Yu Wang et.al. | 2507.09580 | null |
| 2025-07-13 | On Probabilistic Assignment Rules | Sreedurga Gogulapati et.al. | 2507.09550 | null |
| 2025-07-13 | Existence of Fair and Efficient Allocation of Indivisible Chores | Ryoga Mahara et.al. | 2507.09544 | null |
| 2025-07-13 | Learning to Control Dynamical Agents via Spiking Neural Networks and Metropolis-Hastings Sampling | Ali Safa et.al. | 2507.09540 | null |
| 2025-07-13 | Self-supervised Pretraining for Integrated Prediction and Planning of Automated Vehicles | Yangang Ren et.al. | 2507.09537 | null |
| 2025-07-13 | TruckV2X: A Truck-Centered Perception Dataset | Tenghui Xie et.al. | 2507.09505 | null |
| 2025-07-13 | GoalfyMax: A Protocol-Driven Multi-Agent System for Intelligent Experience Entities | Siyi Wu et.al. | 2507.09497 | null |
| 2025-07-13 | GenAI-based Multi-Agent Reinforcement Learning towards Distributed Agent Intelligence: A Generative-RL Agent Perspective | Hang Wang et.al. | 2507.09495 | null |
| 2025-07-13 | Evaluating LLMs on Sequential API Call Through Automated Test Generation | Yuheng Huang et.al. | 2507.09481 | null |
| 2025-07-16 | Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs | Yangning Li et.al. | 2507.09477 | null |
| 2025-07-13 | Incentive-Aware Dynamic Resource Allocation under Long-Term Cost Constraints | Yan Dai et.al. | 2507.09473 | null |
| 2025-07-13 | MobiWorld: World Models for Mobile Wireless Network | Haoye Chai et.al. | 2507.09462 | null |
| 2025-07-13 | Intermediate Interaction Strategies for Collective Behavior | Y. Kikuchi et.al. | 2507.09457 | null |
| 2025-07-13 | Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions | Yuanhong Zheng et.al. | 2507.09446 | null |
| 2025-07-12 | Contracting a crowd of heterogeneous agents | Guillermo Alonso Alvarez et.al. | 2507.09415 | null |
| 2025-07-12 | Adaptive Social Learning using Theory of Mind | Lance Ying et.al. | 2507.09409 | null |
| 2025-07-12 | LLM-Stackelberg Games: Conjectural Reasoning Equilibria and Their Applications to Spearphishing | Quanyan Zhu et.al. | 2507.09407 | null |
| 2025-07-12 | Knowledge Conceptualization Impacts RAG Efficacy | Chris Davis Jaldi et.al. | 2507.09389 | null |
| 2025-07-12 | Constrained Style Learning from Imperfect Demonstrations under Task Optimality | Kehan Wen et.al. | 2507.09371 | null |
| 2025-07-15 | Simulation for All: A Step-by-Step Cookbook for Developing Human-Centered Multi-Agent Transportation Simulators | Shiva Azimi et.al. | 2507.09367 | null |
| 2025-07-12 | When Developer Aid Becomes Security Debt: A Systematic Analysis of Insecure Behaviors in LLM Coding Agents | Matous Kozak et.al. | 2507.09329 | null |
| 2025-07-12 | StockSim: A Dual-Mode Order-Level Simulator for Evaluating Multi-Agent LLMs in Financial Markets | Charidimos Papadakis et.al. | 2507.09255 | null |
| 2025-07-12 | Hide-and-Shill: A Reinforcement Learning Framework for Market Manipulation Detection in Symphony-a Decentralized Multi-Agent System | Ronghua Shi et.al. | 2507.09179 | null |
| 2025-07-12 | Continual Reinforcement Learning by Planning with Online World Models | Zichen Liu et.al. | 2507.09177 | null |
| 2025-07-12 | RAMA: Retrieval-Augmented Multi-Agent Framework for Misinformation Detection in Multimodal Fact-Checking | Shuo Yang et.al. | 2507.09174 | null |
| 2025-07-12 | Tactile-VLA: Unlocking Vision-Language-Action Model's Physical Knowledge for Tactile Generalization | Jialei Huang et.al. | 2507.09160 | null |
| 2025-07-12 | Egalitarian-equivalent and strategy-proof mechanisms in homogeneous multi-object allocation problems | Hinata Kurashita et.al. | 2507.09152 | null |
| 2025-07-12 | A Study of Value-Aware Eigenoptions | Harshil Kotamreddy et.al. | 2507.09127 | null |
| 2025-07-12 | Proactive AI-and-RAN Workload Orchestration in O-RAN Architectures for 6G Networks | Syed Danial Ali Shah et.al. | 2507.09124 | null |
| 2025-07-12 | AInsight: Augmenting Expert Decision-Making with On-the-Fly Insights Grounded in Historical Data | Mohammad Abolnejadian et.al. | 2507.09100 | null |
| 2025-07-12 | Transformer based Collaborative Reinforcement Learning for Fluid Antenna System (FAS)-enabled 3D UAV Positioning | Xiaoren Xu et.al. | 2507.09094 | null |
| 2025-07-12 | Learning from Synthetic Labs: Language Models as Auction Participants | Anand Shah et.al. | 2507.09083 | null |
| 2025-07-11 | Infinite Video Understanding | Dell Zhang et.al. | 2507.09068 | null |
| 2025-07-11 | SetupBench: Assessing Software Engineering Agents' Ability to Bootstrap Development Environments | Avi Arora et.al. | 2507.09063 | null |
| 2025-07-11 | Behavioral Exploration: Learning to Explore via In-Context Adaptation | Andrew Wagenmaker et.al. | 2507.09041 | null |
| 2025-07-11 | Accelerating Drug Discovery Through Agentic AI: A Multi-Agent Approach to Laboratory Automation in the DMTA Cycle | Yao Fehlis et.al. | 2507.09023 | null |
| 2025-07-11 | How to Train a Leader: Hierarchical Reasoning in Multi-Agent LLMs | Andrew Estornell et.al. | 2507.08960 | null |
| 2025-07-15 | Bridging Literature and the Universe Via A Multi-Agent Large Language Model System | Xiaowen Zhang et.al. | 2507.08958 | null |
| 2025-07-11 | Optimizing Sequential Multi-Step Tasks with Parallel LLM Agents | Enhao Zhang et.al. | 2507.08944 | null |
| 2025-07-10 | AirScape: An Aerial Generative World Model with Motion Controllability | Baining Zhao et.al. | 2507.08885 | null |
| 2025-07-10 | Agent-based visualization of streaming text | Jordan Riley Benson et.al. | 2507.08884 | null |
| 2025-07-11 | NeuralOS: Towards Simulating Operating Systems via Neural Generative Models | Luke Rivard et.al. | 2507.08800 | null |
| 2025-07-11 | SPLASH! Sample-efficient Preference-based inverse reinforcement learning for Long-horizon Adversarial tasks from Suboptimal Hierarchical demonstrations | Peter Crowley et.al. | 2507.08707 | null |
| 2025-07-11 | elsciRL: Integrating Language Solutions into Reinforcement Learning Problem Settings | Philip Osborne et.al. | 2507.08705 | null |
| 2025-07-11 | Introspection of Thought Helps AI Agents | Haoran Sun et.al. | 2507.08664 | null |
| 2025-07-11 | Safe Deep Reinforcement Learning for Resource Allocation with Peak Age of Information Violation Guarantees | Berire Gunes Reyhan et.al. | 2507.08653 | null |
| 2025-07-11 | DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images | Haoran Sun et.al. | 2507.08648 | null |
| 2025-07-11 | OnlineBEV: Recurrent Temporal Fusion in Bird's Eye View Representations for Multi-Camera 3D Perception | Junho Koh et.al. | 2507.08644 | null |
| 2025-07-11 | Agentic Large Language Models for Conceptual Systems Engineering and Design | Soheyl Massoudi et.al. | 2507.08619 | null |
| 2025-07-11 | AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs | Florian Grötschla et.al. | 2507.08616 | null |
| 2025-07-11 | Emergent Natural Language with Communication Games for Improving Image Captioning Capabilities without Additional Data | Parag Dutta et.al. | 2507.08610 | null |
| 2025-07-11 | Unlocking Speech Instruction Data Potential with Query Rewriting | Yonghua Hei et.al. | 2507.08603 | null |
| 2025-07-11 | To Trade or Not to Trade: An Agentic Approach to Estimating Market Risk Improves Trading Decisions | Dimitrios Emmanoulopoulos et.al. | 2507.08584 | null |
| 2025-07-11 | SAM2RL: Towards Reinforcement Learning Memory Control in Segment Anything Model 2 | Alen Adamyan et.al. | 2507.08548 | null |
| 2025-07-11 | Recursive Reward Aggregation | Yuting Tang et.al. | 2507.08537 | null |
| 2025-07-11 | Occlusion-Guided Feature Purification Learning via Reinforced Knowledge Distillation for Occluded Person Re-Identification | Yufei Zheng et.al. | 2507.08520 | null |
| 2025-07-11 | The stability of bi-polarization on dynamical directed graphs: an emergent game perspective | Yakun Wang et.al. | 2507.08449 | null |
| 2025-07-11 | Finding Common Ground: Using Large Language Models to Detect Agreement in Multi-Agent Decision Conferences | Selina Heller et.al. | 2507.08440 | null |
| 2025-07-11 | Age of Information Optimization in Laser-charged UAV-assisted IoT Networks: A Multi-agent Deep Reinforcement Learning Method | Geng Sun et.al. | 2507.08429 | null |
| 2025-07-11 | A Survey of Large Language Models in Discipline-specific Research: Challenges, Methods and Opportunities | Lu Xiang et.al. | 2507.08425 | null |
| 2025-07-11 | Temperature Measurement in Agent Systems | Christoph J. Börner et.al. | 2507.08394 | null |
| 2025-07-11 | Multi-Agent LLMs as Ethics Advocates in AI-Based Systems | Asma Yamani et.al. | 2507.08392 | null |
| 2025-07-11 | Online Pre-Training for Offline-to-Online Reinforcement Learning | Yongjae Shin et.al. | 2507.08387 | null |
| 2025-07-11 | Exploring Design of Multi-Agent LLM Dialogues for Research Ideation | Keisuke Ueda et.al. | 2507.08350 | null |
| 2025-07-11 | What Factors Affect LLMs and RLLMs in Financial Question Answering? | Peng Wang et.al. | 2507.08339 | null |
| 2025-07-11 | MK2 at PBIG Competition: A Prompt Generation Solution | Yuzheng Xu et.al. | 2507.08335 | null |
| 2025-07-11 | CRMAgent: A Multi-Agent LLM System for E-Commerce CRM Message Template Generation | Yinzhu Quan et.al. | 2507.08325 | null |
| 2025-07-15 | KAT-V1: Kwai-AutoThink Technical Report | Zizheng Zhan et.al. | 2507.08297 | null |
| 2025-07-11 | Agent Safety Alignment via Reinforcement Learning | Zeyang Sha et.al. | 2507.08270 | null |
| 2025-07-11 | Giving AI Agents Access to Cryptocurrency and Smart Contracts Creates New Vectors of AI Harm | Bill Marino et.al. | 2507.08249 | null |
| 2025-07-11 | Advancing AI Capabilities and Evolving Labor Outcomes | Jacob Dominski et.al. | 2507.08244 | null |
| 2025-07-10 | Effect of Static vs. Conversational AI-Generated Messages on Colorectal Cancer Screening Intent: a Randomized Controlled Trial | Neil K. R. Sehgal et.al. | 2507.08211 | null |
| 2025-07-10 | From Curiosity to Competence: How World Models Interact with the Dynamics of Exploration | Fryderyk Mantiuk et.al. | 2507.08210 | null |
| 2025-07-10 | Reasoning and Behavioral Equilibria in LLM-Nash Games: From Mindsets to Actions | Quanyan Zhu et.al. | 2507.08208 | null |
| 2025-07-10 | A Dynamic Stackelberg Game Framework for Agentic AI Defense Against LLM Jailbreaking | Zhengye Han et.al. | 2507.08207 | null |
| 2025-07-10 | KP-A: A Unified Network Knowledge Plane for Catalyzing Agentic Network Intelligence | Yun Tang et.al. | 2507.08164 | null |
| 2025-07-10 | Code with Me or for Me? How Increasing AI Automation Transforms Developer Workflows | Valerie Chen et.al. | 2507.08149 | null |
| 2025-07-10 | AI for NONMEM Coding in Pharmacometrics Research and Education: Shortcut or Pitfall? | Wenhao Zheng et.al. | 2507.08144 | null |
| 2025-07-10 | Noise-Enabled Goal Attainment in Crowded Collectives | Lucy Liu et.al. | 2507.08100 | null |
| 2025-07-10 | Multi-Scale Network Dynamics and Systemic Risk: A Model Context Protocol Approach to Financial Markets | Avishek Bhandari et.al. | 2507.08065 | null |
| 2025-07-10 | MCPmed: A Call for MCP-Enabled Bioinformatics Web Services for LLM-Driven Discovery | Matthias Flotho et.al. | 2507.08055 | null |
| 2025-07-09 | AblationBench: Evaluating Automated Planning of Ablations in Empirical AI Research | Talor Abramovich et.al. | 2507.08038 | null |
| 2025-07-14 | PyVision: Agentic Vision with Dynamic Tooling | Shitian Zhao et.al. | 2507.07998 | null |
| 2025-07-10 | OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding | JingLi Lin et.al. | 2507.07984 | null |
| 2025-07-15 | Reinforcement Learning with Action Chunking | Qiyang Li et.al. | 2507.07969 | null |
| 2025-07-10 | MIRIX: Multi-Agent Memory System for LLM-Based Agents | Yu Wang et.al. | 2507.07957 | null |
| 2025-07-10 | Agentic Retrieval of Topics and Insights from Earnings Calls | Anant Gupta et.al. | 2507.07906 | null |
| 2025-07-11 | The Trust Fabric: Decentralized Interoperability and Economic Coordination for the Agentic Web | Sree Bhargavi Balija et.al. | 2507.07901 | null |
| 2025-07-10 | Automating MD simulations for Proteins using Large language Models: NAMD-Agent | Achuth Chandrasekhar et.al. | 2507.07887 | null |
| 2025-07-10 | DocCHA: Towards LLM-Augmented Interactive Online diagnosis System | Xinyi Liu et.al. | 2507.07870 | null |
| 2025-07-10 | "So, Tell Me About Your Policy...": Distillation of interpretable policies from Deep Reinforcement Learning agents | Giovanni Dispoto et.al. | 2507.07848 | null |
| 2025-07-10 | Perceptual Distortions and Autonomous Representation Learning in a Minimal Robotic System | David Warutumo et.al. | 2507.07845 | null |
| 2025-07-10 | BEAVER: Building Environments with Assessable Variation for Evaluating Multi-Objective Reinforcement Learning | Ruohong Liu et.al. | 2507.07769 | null |
| 2025-07-10 | Beyond Connectivity: Higher-Order Network Framework for Capturing Memory-Driven Mobility Dynamics | Chen Zhang et.al. | 2507.07727 | null |
| 2025-07-10 | Multi-agent Reinforcement Learning-based In-place Scaling Engine for Edge-cloud Systems | Jovan Prodanov et.al. | 2507.07671 | null |
| 2025-07-10 | Upper Expected Meeting Times for Interdependent Stochastic Agents | Marco Sangalli et.al. | 2507.07626 | null |
| 2025-07-10 | Position: We Need An Algorithmic Understanding of Generative AI | Oliver Eberle et.al. | 2507.07544 | null |
| 2025-07-10 | Toward Real-World Chinese Psychological Support Dialogues: CPsDD Dataset and a Co-Evolving Multi-Agent System | Yuanchen Shi et.al. | 2507.07509 | null |
| 2025-07-10 | The Pandora's Box Problem with Sequential Inspections | Ali Aouad et.al. | 2507.07508 | null |
| 2025-07-15 | Hallucination Stations: On Some Basic Limitations of Transformer-Based Language Models | Varin Sikka et.al. | 2507.07505 | null |
| 2025-07-11 | StarDojo: Benchmarking Open-Ended Behaviors of Agentic Multimodal LLMs in Production-Living Simulations with Stardew Valley | Weihao Tan et.al. | 2507.07445 | null |
| 2025-07-10 | SAND: Boosting LLM Agents with Self-Taught Action Deliberation | Yu Xia et.al. | 2507.07441 | null |
| 2025-07-12 | DrugMCTS: a drug repurposing framework combining multi-agent, RAG and Monte Carlo Tree Search | Zerui Yang et.al. | 2507.07426 | null |
| 2025-07-10 | KVFlow: Efficient Prefix Caching for Accelerating LLM-Based Multi-Agent Workflows | Zaifeng Pan et.al. | 2507.07400 | null |
| 2025-07-10 | PILOC: A Pheromone Inverse Guidance Mechanism and Local-Communication Framework for Dynamic Target Search of Multi-Agent in Unknown Environments | Hengrui Liu et.al. | 2507.07376 | null |
| 2025-07-11 | FLoRA: An Advanced AI-Powered Engine to Facilitate Hybrid Human-AI Regulated Learning | Xinyu Li et.al. | 2507.07362 | null |
| 2025-07-09 | Optimizing Model Splitting and Device Task Assignment for Deceptive Signal Assisted Private Multi-hop Split Learning | Dongyu Wei et.al. | 2507.07323 | null |
| 2025-07-09 | Optimizing Communication and Device Clustering for Clustered Federated Learning with Differential Privacy | Dongyu Wei et.al. | 2507.07320 | null |
| 2025-07-09 | Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation | Anirban Saha Anik et.al. | 2507.07307 | null |
| 2025-07-09 | ViDove: A Translation Agent System with Multimodal Context and Memory-Augmented Reasoning | Yichen Lu et.al. | 2507.07306 | null |
| 2025-07-09 | Application of LLMs to Multi-Robot Path Planning and Task Allocation | Ashish Kumar et.al. | 2507.07302 | null |
| 2025-07-09 | LangNavBench: Evaluation of Natural Language Understanding in Semantic Navigation | Sonia Raychaudhuri et.al. | 2507.07299 | null |
| 2025-07-09 | The Impact of Background Speech on Interruption Detection in Collaborative Groups | Mariah Bradford et.al. | 2507.07280 | null |
| 2025-07-09 | Convergence and Robustness Bounds for Distributed Asynchronous Shortest-Path | Jared Miller et.al. | 2507.07263 | null |
| 2025-07-11 | Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Licong Xu et.al. | 2507.07257 | null |
| 2025-07-09 | Combining Pre-Trained Models for Enhanced Feature Representation in Reinforcement Learning | Elia Piccoli et.al. | 2507.07197 | null |
| 2025-07-09 | Evaluating Retrieval-Augmented Generation Agents for Autonomous Scientific Discovery in Astrophysics | Xueqing Xu et.al. | 2507.07155 | null |
| 2025-07-09 | 4KAgent: Agentic Any Image to 4K Super-Resolution | Yushen Zuo et.al. | 2507.07105 | null |
| 2025-07-09 | Graph-Based Complexity Metrics for Multi-Agent Curriculum Learning: A Validated Approach to Task Ordering in Cooperative Coordination Environments | Farhaan Ebadulla et.al. | 2507.07074 | null |
| 2025-07-09 | Robust signal decompositions on the circle | Aral Kose et.al. | 2507.07007 | null |
| 2025-07-09 | Federated Learning-based MARL for Strengthening Physical-Layer Security in B5G Networks | Deemah H. Tashman et.al. | 2507.06997 | null |
| 2025-07-09 | The User-Centric Geo-Experience: An LLM-Powered Framework for Enhanced Planning, Navigation, and Dynamic Adaptation | Jieren Deng et.al. | 2507.06993 | null |
| 2025-07-09 | Optimizing Cognitive Networks: Reinforcement Learning Meets Energy Harvesting Over Cascaded Channels | Deemah H. Tashman et.al. | 2507.06981 | null |
| 2025-07-09 | Exploring LLMs for Predicting Tutor Strategy and Student Outcomes in Dialogues | Fareya Ikram et.al. | 2507.06910 | null |
| 2025-07-09 | MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection | Ziyan Liu et.al. | 2507.06908 | null |
| 2025-07-09 | SemRaFiner: Panoptic Segmentation in Sparse and Noisy Radar Point Clouds | Matthias Zeller et.al. | 2507.06906 | null |
| 2025-07-09 | Designing Adaptive Algorithms Based on Reinforcement Learning for Dynamic Optimization of Sliding Window Size in Multi-Dimensional Data Streams | Abolfazl Zarghani et.al. | 2507.06901 | null |
| 2025-07-09 | VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation | Ziang Ye et.al. | 2507.06899 | null |
| 2025-07-09 | Toward Neurodivergent-Aware Productivity: A Systems and AI-Based Human-in-the-Loop Framework for ADHD-Affected Professionals | Raghavendra Deshmukh et.al. | 2507.06864 | null |
| 2025-07-11 | The Dark Side of LLMs Agent-based Attacks for Complete Computer Takeover | Matteo Lupinacci et.al. | 2507.06850 | null |
| 2025-07-10 | Artificial Generals Intelligence: Mastering Generals.io with Reinforcement Learning | Matej Straka et.al. | 2507.06825 | null |
| 2025-07-09 | Comparing Dialectical Systems: Contradiction and Counterexample in Belief Change (Extended Version) | Uri Andrews et.al. | 2507.06798 | null |
| 2025-07-09 | Multi-Task Multi-Agent Reinforcement Learning via Skill Graphs | Guobin Zhu et.al. | 2507.06690 | null |
| 2025-07-09 | Peer influence breaks ergodicity in an opinion dynamics model with external information | Federica De Domenico et.al. | 2507.06661 | null |
| 2025-07-09 | Growing Trees with an Agent: Accelerating RRTs with Learned, Multi-Step Episodic Exploration | Xinyu Wu et.al. | 2507.06605 | null |
| 2025-07-09 | Generalization in Reinforcement Learning for Radio Access Networks | Burak Demirel et.al. | 2507.06602 | null |
| 2025-07-15 | A Mathematical Theory of Discursive Networks | Juan B. Gutiérrez et.al. | 2507.06565 | null |
| 2025-07-09 | SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments | Tianshun Li et.al. | 2507.06564 | null |
| 2025-07-09 | On the Hardness of Unsupervised Domain Adaptation: Optimal Learners and Information-Theoretic Perspective | Zhiyi Dong et.al. | 2507.06552 | null |
| 2025-07-09 | ILNet: Trajectory Prediction with Inverse Learning Attention for Enhancing Intention Capture | Mingjin Zeng et.al. | 2507.06531 | null |
| 2025-07-09 | InvestAlign: Overcoming Data Scarcity in Aligning Large Language Models with Investor Decision-Making Processes under Herd Behavior | Huisheng Wang et.al. | 2507.06528 | null |
| 2025-07-09 | Gradientsys: A Multi-Agent LLM Scheduler with ReAct Orchestration | Xinyuan Song et.al. | 2507.06520 | null |
| 2025-07-13 | Prediction-Augmented Mechanism Design for Weighted Facility Location | Yangguang Shi et.al. | 2507.06509 | null |
| 2025-07-09 | Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings | Russell Taylor et.al. | 2507.06506 | null |
| 2025-07-09 | Learning To Communicate Over An Unknown Shared Network | Shivangi Agarwal et.al. | 2507.06499 | null |
| 2025-07-09 | Learning Japanese with Jouzu: Interaction Outcomes with Stylized Dialogue Fictional Agents | Zackary Rackauckas et.al. | 2507.06483 | null |
| 2025-07-09 | Foundation Model Self-Play: Open-Ended Strategy Innovation via Foundation Models | Aaron Dharna et.al. | 2507.06466 | null |
| 2025-07-08 | Eyes on the Road, Mind Beyond Vision: Context-Aware Multi-modal Enhanced Risk Anticipation | Jiaxun Zhang et.al. | 2507.06444 | null |
| 2025-07-08 | Distributed Optimization of Finite Condition Number for Laplacian Matrix in Multi-Agent Systems | Yicheng Xu et.al. | 2507.06440 | null |
| 2025-07-08 | Experience-Centric Resource Management in ISAC Networks: A Digital Agent-Assisted Approach | Xinyu Huang et.al. | 2507.06436 | null |
| 2025-07-08 | Representing Prompting Patterns with PDL: Compliance Agent Case Study | Mandana Vaziri et.al. | 2507.06396 | null |
| 2025-07-08 | VoI-aware Scheduling Schemes for Multi-Agent Formation Control | Federico Chiariotti et.al. | 2507.06392 | null |
| 2025-07-08 | Solving the Constrained Random Disambiguation Path Problem via Lagrangian Relaxation and Graph Reduction | Li Zhou et.al. | 2507.06346 | null |
| 2025-07-08 | Bridging AI and Software Security: A Comparative Vulnerability Assessment of LLM Agent Deployment Paradigms | Tarek Gasmi et.al. | 2507.06323 | null |
| 2025-07-08 | Too Human to Model:The Uncanny Valley of LLMs in Social Simulation -- When Generative Language Agents Misalign with Modelling Principles | Yongchao Zeng et.al. | 2507.06310 | null |
| 2025-07-08 | A Survey of Multi Agent Reinforcement Learning: Federated Learning and Cooperative and Noncooperative Decentralized Regimes | Kemboi Cheruiyot et.al. | 2507.06278 | null |
| 2025-07-11 | Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities | Gheorghe Comanici et.al. | 2507.06261 | null |
| 2025-07-10 | Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving | Xiangru Tang et.al. | 2507.06229 | null |
| 2025-07-08 | Aligned Textual Scoring Rules | Yuxuan Lu et.al. | 2507.06221 | null |
| 2025-07-08 | Evaluation of Habitat Robotics using Large Language Models | William Li et.al. | 2507.06157 | null |
| 2025-07-08 | OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety | Sanidhya Vijayvargiya et.al. | 2507.06134 | null |
| 2025-07-08 | A Directed Lazy Random Walk Model to Three-Way Dynamic Matching Problem | Souvik Roy et.al. | 2507.06126 | null |
| 2025-07-08 | On Lockean beliefs that are deductively closed and minimal change | Tommaso Flaminio et.al. | 2507.06042 | null |
| 2025-07-08 | Conditional Multi-Stage Failure Recovery for Embodied Agents | Youmna Farag et.al. | 2507.06016 | null |
| 2025-07-08 | From General Relation Patterns to Task-Specific Decision-Making in Continual Multi-Agent Coordination | Chang Yao et.al. | 2507.06004 | null |
| 2025-07-08 | Multi-Agent Debate Strategies to Enhance Requirements Engineering with Large Language Models | Marc Oriol et.al. | 2507.05981 | null |
| 2025-07-08 | CogniPlay: a work-in-progress Human-like model for General Game Playing | Aloïs Rautureau et.al. | 2507.05868 | null |
| 2025-07-08 | Constella: Supporting Storywriters' Interconnected Character Creation through LLM-based Multi-Agents | Syemin Park et.al. | 2507.05820 | null |
| 2025-07-08 | Just Say Better or Worse: A Human-AI Collaborative Framework for Medical Image Segmentation Without Manual Annotations | Yizhe Zhang et.al. | 2507.05815 | null |
| 2025-07-10 | GTA1: GUI Test-time Scaling Agent | Yan Yang et.al. | 2507.05791 | null |
| 2025-07-08 | On the detection of medium inhomogeneity by contrast agent: wave scattering models and numerical implementations | Zhe Wang et.al. | 2507.05773 | null |
| 2025-07-08 | An autonomous agent for auditing and improving the reliability of clinical AI models | Lukas Kuhn et.al. | 2507.05755 | null |
| 2025-07-08 | An efficiency ordering of k-price auctions under complete information | Sumit Goel et.al. | 2507.05738 | null |
| 2025-07-08 | Large Language Models for Agent-Based Modelling: Current and possible uses across the modelling cycle | Loïs Vanhée et.al. | 2507.05723 | null |
| 2025-07-08 | MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment | Yucheng Shi et.al. | 2507.05720 | null |
| 2025-07-08 | Agentic-R1: Distilled Dual-Strategy Reasoning | Weihua Du et.al. | 2507.05707 | null |
| 2025-07-08 | R-VLM: Region-Aware Vision Language Model for Precise GUI Grounding | Joonhyung Park et.al. | 2507.05673 | null |
| 2025-07-08 | ECom-Bench: Can LLM Agent Resolve Real-World E-commerce Customer Support Issues? | Haoxin Wang et.al. | 2507.05639 | null |
| 2025-07-08 | LLMs are Introvert | Litian Zhang et.al. | 2507.05638 | null |
| 2025-07-08 | How Not to Detect Prompt Injections with an LLM | Sarthak Choudhary et.al. | 2507.05630 | null |
| 2025-07-08 | Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study | Ibne Farabi Shihab et.al. | 2507.05619 | null |
| 2025-07-08 | Density Discontinuity Regression | Surya T Tokdar et.al. | 2507.05581 | null |
| 2025-07-08 | Preemptive Solving of Future Problems: Multitask Preplay in Humans and Machines | Wilka Carvalho et.al. | 2507.05561 | null |
| 2025-07-09 | AI Agent Smart Contract Exploit Generation | Arthur Gervais et.al. | 2507.05558 | null |
| 2025-07-07 | Evolutionary and Coevolutionary Multi-Agent Design Choices and Dynamics | Erik Hemberg et.al. | 2507.05534 | null |
| 2025-07-07 | Conversational Education at Scale: A Multi-LLM Agent Workflow for Procedural Learning and Pedagogic Quality Assessment | Jiahuan Pei et.al. | 2507.05528 | null |
| 2025-07-07 | Cultivating Multimodal Intelligence: Interpretive Reasoning and Agentic RAG Approaches to Dermatological Diagnosis | Karishma Thakrar et.al. | 2507.05520 | null |
| 2025-07-09 | Empowering Healthcare Practitioners with Language Models: Structuring Speech Transcripts in Two Real-World Clinical Applications | Jean-Philippe Corbeil et.al. | 2507.05517 | null |
| 2025-07-07 | Deep Research Comparator: A Platform For Fine-grained Human Annotations of Deep Research Agents | Prahaladh Chandrahasan et.al. | 2507.05495 | null |
| 2025-07-07 | Constraint Hypergraphs as a Unifying Framework for Digital Twins | John Morris et.al. | 2507.05494 | null |
| 2025-07-07 | Inaugural MOASEI Competition at AAMAS'2025: A Technical Report | Ceferino Patino et.al. | 2507.05469 | null |
| 2025-07-07 | 2048: Reinforcement Learning in a Delayed Reward Environment | Prady Saligram et.al. | 2507.05465 | null |
| 2025-07-07 | A Systematization of Security Vulnerabilities in Computer Use Agents | Daniel Jones et.al. | 2507.05445 | null |
| 2025-07-07 | Motion Generation: A Survey of Generative Approaches and Benchmarks | Aliasghar Khani et.al. | 2507.05419 | null |
| 2025-07-07 | MindFlow: Revolutionizing E-commerce Customer Support with Multimodal LLM Agents | Ming Gong et.al. | 2507.05330 | null |
| 2025-07-07 | AGACCI : Affiliated Grading Agents for Criteria-Centric Interface in Educational Coding Contexts | Kwangsuk Park et.al. | 2507.05321 | null |
| 2025-07-07 | OASBuilder: Generating OpenAPI Specifications from Online API Documentation with Large Language Models | Koren Lazar et.al. | 2507.05316 | null |
| 2025-07-10 | Fuzzy Classification Aggregation for a Continuum of Agents | Zijun Meng et.al. | 2507.05297 | null |
| 2025-07-05 | A LLM-Driven Multi-Agent Systems for Professional Development of Mathematics Teachers | Kaiqi Yang et.al. | 2507.05292 | null |
| 2025-07-03 | A Fuzzy Supervisor Agent Design for Clinical Reasoning Assistance in a Multi-Agent Educational Clinical Scenario Simulation | Weibing Zheng et.al. | 2507.05275 | null |
| 2025-07-07 | Spatio-Temporal LLM: Reasoning about Environments and Actions | Haozhen Zheng et.al. | 2507.05258 | null |
| 2025-07-07 | Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions | Yuanzhe Hu et.al. | 2507.05257 | null |
| 2025-07-07 | From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving | Fabian Konstantinidis et.al. | 2507.05254 | null |
| 2025-07-07 | Action Space Reduction Strategies for Reinforcement Learning in Autonomous Driving | Elahe Delavari et.al. | 2507.05251 | null |
| 2025-07-07 | Modeling Latent Partner Strategies for Adaptive Zero-Shot Human-Agent Collaboration | Benjamin Li et.al. | 2507.05244 | null |
| 2025-07-08 | SciMaster: Towards General-Purpose Scientific AI Agents, Part I. X-Master as Foundation: Can We Lead on Humanity's Last Exam? | Jingyi Chai et.al. | 2507.05241 | null |
| 2025-07-07 | StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling | Meng Wei et.al. | 2507.05240 | null |
| 2025-07-12 | MedGemma Technical Report | Andrew Sellergren et.al. | 2507.05201 | null |
| 2025-07-07 | CREW-WILDFIRE: Benchmarking Agentic Multi-Agent Collaborations at Scale | Jonathan Hyun et.al. | 2507.05178 | null |
| 2025-07-07 | Vector Cost Bimatrix Games with Applications to Autonomous Racing | Benjamin R. Toaz et.al. | 2507.05171 | null |
| 2025-07-07 | Critiques of World Models | Eric Xing et.al. | 2507.05169 | null |
| 2025-07-07 | Macroscopic Structural Light Absorbers | Jan M. Kaster et.al. | 2507.05152 | null |
| 2025-07-07 | Effects of Unplanned Incoming Flights on Airport Relief Processes after a Major Natural Disaster | Luka Van de Sype et.al. | 2507.05150 | null |
| 2025-07-07 | LERa: Replanning with Visual Feedback in Instruction Following | Svyatoslav Pchelintsev et.al. | 2507.05135 | null |
| 2025-07-07 | Optimal Consumption-Investment for General Utility with a Drawdown Constraint over a Finite-Time Horizon | Chonghu Guan et.al. | 2507.05115 | null |
| 2025-07-07 | Beyond Features: How Dataset Design Influences Multi-Agent Trajectory Prediction Performance | Tobias Demmler et.al. | 2507.05098 | null |
| 2025-07-07 | Perspectives on How Sociology Can Advance Theorizing about Human-Chatbot Interaction and Developing Chatbots for Social Good | Celeste Campos-Castillo et.al. | 2507.05030 | null |
| 2025-07-07 | Linking Homeostasis to Reinforcement Learning: Internal State Control of Motivated Behavior | Naoto Yoshida et.al. | 2507.04998 | null |
| 2025-07-07 | From Autonomy to Agency: Agentic Vehicles for Human-Centered Mobility Systems | Jiangbo Yu et.al. | 2507.04996 | null |
| 2025-07-07 | Leadership Detection via Time-Lagged Correlation-Based Network Inference | Thayanne França da Silva et.al. | 2507.04917 | null |
| 2025-07-07 | MARBLE: A Multi-Agent Rule-Based LLM Reasoning Engine for Accident Severity Prediction | Kaleem Ullah Qasim et.al. | 2507.04893 | null |
| 2025-07-07 | Fine-tuning on simulated data outperforms prompting for agent tone of voice | Ingo Marquardt et.al. | 2507.04889 | null |
| 2025-07-07 | Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning | Giwon Lee et.al. | 2507.04790 | null |
| 2025-07-07 | Training-free Generation of Temporally Consistent Rewards from VLMs | Yinuo Zhao et.al. | 2507.04789 | null |
| 2025-07-07 | FurniMAS: Language-Guided Furniture Decoration using Multi-Agent System | Toan Nguyen et.al. | 2507.04770 | null |
| 2025-07-07 | Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking | Maria Damanaki et.al. | 2507.04762 | null |
| 2025-07-07 | LLM-based Question-Answer Framework for Sensor-driven HVAC System Interaction | Sungmin Lee et.al. | 2507.04748 | null |
| 2025-07-07 | Who's the Mole? Modeling and Detecting Intention-Hiding Malicious Agents in LLM-Based Multi-Agent Systems | Yizhe Xie et.al. | 2507.04724 | null |
| 2025-07-07 | UrbanMind: Towards Urban General Intelligence via Tool-Enhanced Retrieval-Augmented Generation and Multilevel Optimization | Kai Yang et.al. | 2507.04706 | null |
| 2025-07-07 | Interpretable Reward Modeling with Active Concept Bottlenecks | Sonia Laguna et.al. | 2507.04695 | null |
| 2025-07-07 | Quantitative Single-particle Profiling of Extracellular Vesicles via Fluorescent Nanoparticle Tracking Analysis | Yiting Liu et.al. | 2507.04655 | null |
| 2025-07-07 | LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction | Yixin Yan et.al. | 2507.04634 | null |
| 2025-07-07 | Equilibrium Strategies for the N-agent Mean-Variance Investment Problem over a Random Horizon | Xiaoqing Liang et.al. | 2507.04611 | null |
| 2025-07-07 | VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents | Rui Meng et.al. | 2507.04590 | null |
| 2025-07-08 | Greedy Dynamic Matching | Nick Arnosti et.al. | 2507.04551 | null |
| 2025-07-06 | Grounded Gesture Generation: Language, Motion, and Space | Anna Deichler et.al. | 2507.04522 | null |
| 2025-07-09 | Constant-Approximate and Constant-Strategyproof Two-Facility Location | Elijah Journey Fullerton et.al. | 2507.04485 | null |
| 2025-07-06 | Agentic Distributed Computing | Ajay D. Kshemkalyani et.al. | 2507.04459 | null |
| 2025-07-06 | "Hi AirStar, Guide Me to the Badminton Court." | Ziqin Wang et.al. | 2507.04430 | null |
| 2025-07-06 | MOMENTS: A Comprehensive Multimodal Benchmark for Theory of Mind | Emilio Villa-Cueva et.al. | 2507.04415 | null |
| 2025-07-06 | Multimedia Verification Through Multi-Agent Deep Research Multimodal Large Language Models | Huy Hoan Le et.al. | 2507.04410 | null |
| 2025-07-06 | Inverse Reinforcement Learning using Revealed Preferences and Passive Stochastic Optimization | Vikram Krishnamurthy et.al. | 2507.04396 | null |
| 2025-07-08 | MOD-X: A Modular Open Decentralized eXchange Framework proposal for Heterogeneous Interoperable Artificial Intelligence Agents | Georgios Ioannides et.al. | 2507.04376 | null |
| 2025-07-06 | Adaptive Malware Detection using Sequential Feature Selection: A Dueling Double Deep Q-Network (D3QN) Framework for Intelligent Classification | Naseem Khan et.al. | 2507.04372 | null |
| 2025-07-06 | WebSynthesis: World-Model-Guided MCTS for Efficient WebUI-Trajectory Synthesis | Yifei Gao et.al. | 2507.04370 | null |
| 2025-07-06 | Mission-Aligned Learning-Informed Control of Autonomous Systems: Formulation and Foundations | Vyacheslav Kungurtsev et.al. | 2507.04356 | null |
| 2025-07-06 | Wavelet Policy: Lifting Scheme for Policy Learning in Long-Horizon Tasks | Hao Huang et.al. | 2507.04331 | null |
| 2025-07-06 | Covalently Integrated CNT@rGO for Superior Conductivity and Cycling Stability in Lithium-Ion Batterie | Junwen Tang et.al. | 2507.04296 | null |
| 2025-07-06 | SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement | Liwen Xiao et.al. | 2507.04263 | null |
| 2025-07-06 | Hijacking JARVIS: Benchmarking Mobile GUI Agents against Unprivileged Third Parties | Guohong Liu et.al. | 2507.04227 | null |
| 2025-07-05 | Gathering Teams of Bounded Memory Agents on a Line | Younan Gao et.al. | 2507.04172 | null |
| 2025-07-05 | Comparative Evaluation of VR-Enabled Robots and Human Operators for Targeted Disease Management in Vineyards | Hasan Seyyedhasani et.al. | 2507.04167 | null |
| 2025-07-08 | Adaptive Two-sided Assortment Optimization: Revenue Maximization | Mohammadreza Ahmadnejadsaein et.al. | 2507.04156 | null |
| 2025-07-05 | Learning Humanoid Arm Motion via Centroidal Momentum Regularized Multi-Agent Reinforcement Learning | Ho Jae Lee et.al. | 2507.04140 | null |
| 2025-07-05 | BYOKG-RAG: Multi-Strategy Graph Retrieval for Knowledge Graph Question Answering | Costas Mavromatis et.al. | 2507.04127 | null |
| 2025-07-05 | Enhancing Robustness of LLM-Driven Multi-Agent Systems through Randomized Smoothing | Jinwei Hu et.al. | 2507.04105 | null |
| 2025-07-05 | How to Train Your LLM Web Agent: A Statistical Diagnosis | Dheeraj Vattikonda et.al. | 2507.04103 | null |
| 2025-07-05 | Dynamic Asset Pricing with α-MEU Model | Jiacheng Fan et.al. | 2507.04093 | null |
| 2025-07-05 | Accurate and Efficient World Modeling with Masked Latent Transformers | Maxime Burchi et.al. | 2507.04075 | null |
| 2025-07-05 | Efficiency through Evolution, A Darwinian Approach to Agent-Based Economic Forecast Modeling | Martin Jaraiz et.al. | 2507.04074 | null |
| 2025-07-05 | HAWK: A Hierarchical Workflow Framework for Multi-Agent Collaboration | Yuyang Cheng et.al. | 2507.04067 | null |
| 2025-07-05 | TopoMAS: Large Language Model Driven Topological Materials Multiagent System | Baohua Zhang et.al. | 2507.04053 | null |
| 2025-07-05 | Breaking Imitation Bottlenecks: Reinforced Diffusion Powers Diverse Trajectory Generation | Ziying Song et.al. | 2507.04049 | null |
| 2025-07-05 | Move to Understand a 3D Scene: Bridging Visual Grounding and Exploration for Efficient and Versatile Embodied Navigation | Ziyu Zhu et.al. | 2507.04047 | null |
| 2025-07-05 | Ready Jurist One: Benchmarking Language Agents for Legal Intelligence in Dynamic Environments | Zheng Jia et.al. | 2507.04037 | null |
| 2025-07-05 | PresentAgent: Multimodal Agent for Presentation Video Generation | Jingwei Shi et.al. | 2507.04036 | null |
| 2025-07-05 | Exploring a Gamified Personality Assessment Method through Interaction with Multi-Personality LLM Agents | Baiqiao Zhang et.al. | 2507.04005 | null |
| 2025-07-05 | MalVol-25: A Diverse, Labelled and Detailed Volatile Memory Dataset for Malware Detection and Response Testing and Validation | Dipo Dunsin et.al. | 2507.03993 | null |
| 2025-07-05 | Fair and Efficient Allocation of Indivisible Mixed Manna | Siddharth Barman et.al. | 2507.03946 | null |
| 2025-07-05 | CortexDebate: Debating Sparsely and Equally for Multi-Agent Debate | Yiliu Sun et.al. | 2507.03928 | null |
| 2025-07-05 | Agent Exchange: Shaping the Future of AI Agent Economics | Yingxuan Yang et.al. | 2507.03904 | null |
| 2025-07-05 | Uncovering Systemic and Environment Errors in Autonomous Systems Using Differential Testing | Rahil P Mehta et.al. | 2507.03870 | null |
| 2025-07-04 | Participatory Evolution of Artificial Life Systems via Semantic Feedback | Shuowen Li et.al. | 2507.03839 | null |
| 2025-07-04 | Leveraging Large Language Models for Tacit Knowledge Discovery in Organizational Contexts | Gianlucca Zuin et.al. | 2507.03811 | null |
| 2025-07-04 | Generating Novelty in Open-World Multi-Agent Strategic Board Games | Mayank Kejriwal et.al. | 2507.03802 | null |
| 2025-07-04 | Learning Dark Souls Combat Through Pixel Input With Neuroevolution | Jim O'Connor et.al. | 2507.03793 | null |
| 2025-07-04 | Less is More: Empowering GUI Agent with Context-Aware Simplification | Gongwei Chen et.al. | 2507.03730 | null |
| 2025-07-04 | Agent-Based Detection and Resolution of Incompleteness and Ambiguity in Interactions with Large Language Models | Riya Naik et.al. | 2507.03726 | null |
| 2025-07-09 | Can LLMs Play Ô Ăn Quan Game? A Study of Multi-Step Planning and Decision Making | Sang Quang Nguyen et.al. | 2507.03711 | null |
| 2025-07-04 | Towards Machine Theory of Mind with Large Language Model-Augmented Inverse Planning | Rebekah A. Gelpí et.al. | 2507.03682 | null |
| 2025-07-04 | STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and Benchmarking | Tek Raj Chhetri et.al. | 2507.03674 | null |
| 2025-07-04 | Recon, Answer, Verify: Agents in Search of Truth | Satyam Shukla et.al. | 2507.03671 | null |
| 2025-07-04 | When Does Diversity Matter? A Unified Framework for Binary-Choice Dynamics | Arkadiusz Jędrzejewski et.al. | 2507.03665 | null |
| 2025-07-04 | Is It Time To Treat Prompts As Code? A Multi-Use Case Study For Prompt Optimization Using DSPy | Francisca Lemos et.al. | 2507.03620 | null |
| 2025-07-04 | EvoAgentX: An Automated Framework for Evolving Agentic Workflows | Yingxu Wang et.al. | 2507.03616 | null |
| 2025-07-04 | On characterization and existence of a constrained correlated equilibria in Markov games | Tingting Ni et.al. | 2507.03502 | null |
| 2025-07-09 | Reinforcement Learning-based Feature Generation Algorithm for Scientific Data | Meng Xiao et.al. | 2507.03498 | null |
| 2025-07-04 | AI-VaxGuide: An Agentic RAG-Based LLM for Vaccination Decisions | Abdellah Zeggai et.al. | 2507.03493 | null |
| 2025-07-04 | Explainable Information Retrieval in the Audit Domain | Alexander Frummet et.al. | 2507.03479 | null |
| 2025-07-04 | REAL: Benchmarking Abilities of Large Language Models for Housing Transactions and Services | Kexin Zhu et.al. | 2507.03477 | null |
| 2025-07-04 | Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis | Weitong Zhang et.al. | 2507.03460 | null |
| 2025-07-04 | ElliottAgents: A Natural Language-Driven Multi-Agent System for Stock Market Analysis and Prediction | Jarosław A. Chudziak et.al. | 2507.03435 | null |
| 2025-07-04 | Lessons from a Chimp: AI "Scheming" and the Quest for Ape Language | Christopher Summerfield et.al. | 2507.03409 | null |
| 2025-07-04 | Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky | Ashutosh Hathidara et.al. | 2507.03336 | null |
| 2025-07-04 | Mirror in the Model: Ad Banner Image Generation via Reflective Multi-LLM and Multi-modal Agents | Zhao Wang et.al. | 2507.03326 | null |
| 2025-07-04 | GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation | Himanshu Dutta et.al. | 2507.03311 | null |
| 2025-07-04 | Dyn-O: Building Structured World Models with Object-Centric Representations | Zizhao Wang et.al. | 2507.03298 | null |
| 2025-07-04 | LTLCrit: A Temporal Logic-based LLM Critic for Safe and Efficient Embodied Agents | Anand Gokhale et.al. | 2507.03293 | null |
| 2025-07-04 | Conformal Information Pursuit for Interactively Guiding Large Language Models | Kwan Ho Ryan Chan et.al. | 2507.03279 | null |
| 2025-07-04 | GDGB: A Benchmark for Generative Dynamic Text-Attributed Graph Learning | Jie Peng et.al. | 2507.03267 | null |
| 2025-07-04 | Coalitional stability under myopic expectations and externalities | Agustin G. Bonifacio et.al. | 2507.03259 | null |
| 2025-07-04 | CodeAgents: A Token-Efficient Framework for Codified Multi-Agent Reasoning in LLMs | Bruce Yang et.al. | 2507.03254 | null |
| 2025-07-03 | SI-Agent: An Agentic Framework for Feedback-Driven Generation and Tuning of Human-Readable System Instructions for Large Language Models | Jeshwanth Challagundla et.al. | 2507.03223 | null |
| 2025-07-03 | In vivo imaging of central nervous system fluid spaces using synchrotron radiation-based micro computed tomography | Marta Girona Alarcón et.al. | 2507.03186 | null |
| 2025-07-03 | Last-Iterate Convergence of No-Regret Learning for Equilibria in Bargaining Games | Serafina Kamp et.al. | 2507.03150 | null |
| 2025-07-03 | RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents | Peisong Wang et.al. | 2507.03112 | null |
| 2025-07-03 | From Turing to Tomorrow: The UK's Approach to AI Regulation | Oliver Ritchie et.al. | 2507.03050 | null |
| 2025-07-02 | Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains | Abhishek Verma et.al. | 2507.03026 | null |
| 2025-07-02 | OpenTable-R1: A Reinforcement Learning Augmented Tool Agent for Open-Domain Table Question Answering | Zipeng Qiu et.al. | 2507.03018 | null |
| 2025-07-10 | Establishing Best Practices for Building Rigorous Agentic Benchmarks | Yuxuan Zhu et.al. | 2507.02825 | null |
| 2025-07-03 | Moral Responsibility or Obedience: What Do We Want from AI? | Joseph Boland et.al. | 2507.02788 | null |
| 2025-07-06 | KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs | Yuzhang Xie et.al. | 2507.02773 | null |
| 2025-07-03 | Knowledge Protocol Engineering: A New Paradigm for AI in Domain-Specific Knowledge Work | Guangwei Zhang et.al. | 2507.02760 | null |
| 2025-07-03 | Defining and classifying models of groups: The social ontology of higher-order networks | Jonathan St-Onge et.al. | 2507.02758 | null |
| 2025-07-03 | Multi-agent Auditory Scene Analysis | Caleb Rascon et.al. | 2507.02755 | null |
| 2025-07-03 | Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks | Sizhe Chen et.al. | 2507.02735 | null |
| 2025-07-03 | Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving | Matthieu Zimmer et.al. | 2507.02726 | null |
| 2025-07-03 | A Forget-and-Grow Strategy for Deep Reinforcement Learning Scaling in Continuous Control | Zilin Kang et.al. | 2507.02712 | null |
| 2025-07-03 | Fluid Democracy in Federated Data Aggregation | Aditya Vema Reddy Kesari et.al. | 2507.02710 | null |
| 2025-07-03 | Control at Stake: Evaluating the Security Landscape of LLM-Driven Email Agents | Jiangrong Wu et.al. | 2507.02699 | null |
| 2025-07-03 | Multi-Agent Reinforcement Learning for Dynamic Pricing in Supply Chains: Benchmarking Strategic Agent Behaviours under Realistically Simulated Market Conditions | Thomas Hazenberg et.al. | 2507.02698 | null |
| 2025-07-03 | On the Convergence of Large Language Model Optimizer for Black-Box Network Management | Hoon Lee et.al. | 2507.02689 | null |
| 2025-07-03 | TUC-PPO: Team Utility-Constrained Proximal Policy Optimization for Spatial Public Goods Games | Zhaoqilin Yang et.al. | 2507.02675 | null |
| 2025-07-03 | Hey AI, Generate Me a Hardware Code! Agentic AI-based Hardware Design & Verification | Deepak Narayan Gadde et.al. | 2507.02660 | null |
| 2025-07-03 | Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search | Jiajie Jin et.al. | 2507.02652 | null |
| 2025-07-03 | On Efficient Bayesian Exploration in Model-Based Reinforcement Learning | Alberto Caron et.al. | 2507.02639 | null |
| 2025-07-03 | VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement Learning | Siran Chen et.al. | 2507.02626 | null |
| 2025-07-03 | Strategic Intelligence in Large Language Models: Evidence from evolutionary Game Theory | Kenneth Payne et.al. | 2507.02618 | null |
| 2025-07-03 | DynamiCare: A Dynamic Multi-Agent Framework for Interactive and Open-Ended Medical Decision-Making | Tianqi Shang et.al. | 2507.02616 | null |
| 2025-07-03 | WebSailor: Navigating Super-human Reasoning for Web Agent | Kuan Li et.al. | 2507.02592 | null |
| 2025-07-03 | AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench | Edan Toledo et.al. | 2507.02554 | null |
| 2025-07-03 | Are You Listening to Me? Fine-Tuning Chatbots for Empathetic Dialogue | Paulo Ricardo Knob et.al. | 2507.02537 | null |
| 2025-07-03 | A Late Collaborative Perception Framework for 3D Multi-Object and Multi-Source Association and Fusion | Maryem Fadili et.al. | 2507.02430 | null |
| 2025-07-03 | CyberRAG: An agentic RAG cyber attack classification and reporting tool | Francesco Blefari et.al. | 2507.02424 | null |
| 2025-07-03 | Improving Consistency in Vehicle Trajectory Prediction Through Preference Optimization | Caio Azevedo et.al. | 2507.02406 | null |
| 2025-07-03 | Deep Reinforcement Learning-Based DRAM Equalizer Parameter Optimization Using Latent Representations | Muhammad Usama et.al. | 2507.02365 | null |
| 2025-07-03 | OMS: On-the-fly, Multi-Objective, Self-Reflective Ad Keyword Generation via LLM Agent | Bowen Chen et.al. | 2507.02353 | null |
| 2025-07-03 | CineMyoPS: Segmenting Myocardial Pathologies from Cine Cardiac MR | Wangbin Ding et.al. | 2507.02289 | null |
| 2025-07-03 | MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent | Hongli Yu et.al. | 2507.02259 | null |
| 2025-07-03 | SurgVisAgent: Multimodal Agentic Model for Versatile Surgical Visual Enhancement | Zeyu Lei et.al. | 2507.02252 | null |
| 2025-07-04 | CoInfra: A Large-Scale Cooperative Infrastructure Perception System and Dataset in Adverse Weather | Minghao Ning et.al. | 2507.02245 | null |
| 2025-07-04 | Dilution, Diffusion and Symbiosis in Spatial Prisoner's Dilemma with Reinforcement Learning | Gustavo C. Mangold et.al. | 2507.02211 | null |
| 2025-07-08 | Average Action Efficiency Rises Monotonically in Self-Organizing Systems via Stochastic Least-Action Dynamics | Georgi Yordanov Georgiev et.al. | 2507.02209 | null |
| 2025-07-02 | Operator-Theoretic Methods for Differential Games | Craig Bakker et.al. | 2507.02203 | null |
| 2025-07-02 | Do Role-Playing Agents Practice What They Preach? Belief-Behavior Consistency in LLM-Based Simulations of Human Trust | Amogh Mannekote et.al. | 2507.02197 | null |
| 2025-07-02 | Enhancing COBOL Code Explanations: A Multi-Agents Approach Using Large Language Models | Fangjian Lei et.al. | 2507.02182 | null |
| 2025-07-02 | Towards Bio-Inspired Robotic Trajectory Planning via Self-Supervised RNN | Miroslav Cibula et.al. | 2507.02171 | null |
| 2025-07-02 | Synergizing Logical Reasoning, Knowledge Management and Collaboration in Multi-Agent LLM System | Adam Kostka et.al. | 2507.02170 | null |
| 2025-07-02 | The optimal degree for maximizing rumor spreading on a ring lattice | Ana C. Díaz Bacca et.al. | 2507.02141 | null |
| 2025-07-02 | PAL: Designing Conversational Agents as Scalable, Cooperative Patient Simulators for Palliative-Care Training | Neil K. R. Sehgal et.al. | 2507.02122 | null |
| 2025-07-02 | What Neuroscience Can Teach AI About Learning in Continuously Changing Environments | Daniel Durstewitz et.al. | 2507.02103 | null |
| 2025-07-02 | The Future is Agentic: Definitions, Perspectives, and Open Challenges of Multi-Agent Recommender Systems | Reza Yousefi Maragheh et.al. | 2507.02097 | null |
| 2025-07-02 | Measuring Scientific Capabilities of Language Models with a Systems Biology Dry Lab | Haonan Duan et.al. | 2507.02083 | null |
| 2025-07-02 | Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs | Mohammad Ali Alomrani et.al. | 2507.02076 | null |
| 2025-07-05 | RoboBrain 2.0 Technical Report | BAAI RoboBrain Team et.al. | 2507.02029 | null |
| 2025-07-01 | STELLA: Self-Evolving LLM Agent for Biomedical Research | Ruofan Jin et.al. | 2507.02004 | null |
| 2025-07-01 | Dynamic Strategy Adaptation in Multi-Agent Environments with Large Language Models | Shaurya Mallampati et.al. | 2507.02002 | null |
| 2025-07-04 | Towards a Playground to Democratize Experimentation and Benchmarking of AI Agents for Network Troubleshooting | Zhihao Wang et.al. | 2507.01997 | null |
| 2025-06-29 | Integrating Large Language Models in Financial Investments and Market Analysis: A Survey | Sedigheh Mahdavi et.al. | 2507.01990 | null |
| 2025-07-02 | The Thin Line Between Comprehension and Persuasion in LLMs | Adrian de Wynter et.al. | 2507.01936 | null |
| 2025-07-03 | Decision-Oriented Text Evaluation | Yu-Shiang Huang et.al. | 2507.01923 | null |
| 2025-07-02 | An in-silico lung phantom to assess the performance of pulmonary artery segmentation using angiogram | Sunder Neelakantan et.al. | 2507.01867 | null |
| 2025-07-02 | Bridging UI Design and chatbot Interactions: Applying Form-Based Principles to Conversational Agents | Sanjay Krishna Anbalagan et.al. | 2507.01862 | null |
| 2025-07-02 | TD-MPC-Opt: Distilling Model-Based Multi-Task Reinforcement Learning Agents | Dmytro Kuzmenko et.al. | 2507.01823 | null |
| 2025-07-06 | AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction | Bin Rao et.al. | 2507.01801 | null |
| 2025-07-02 | ECCV 2024 W-CODA: 1st Workshop on Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving | Kai Chen et.al. | 2507.01735 | null |
| 2025-07-02 | Agent Ideate: A Framework for Product Idea Generation from Patents Using Agentic AI | Gopichand Kanumolu et.al. | 2507.01717 | null |
| 2025-07-02 | Using Machine Learning to Compute Constrained Optimal Carbon Tax Rules | Felix Kübler et.al. | 2507.01704 | null |
| 2025-07-02 | AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness | Zixin Chen et.al. | 2507.01702 | null |
| 2025-07-02 | Exploring Advanced LLM Multi-Agent Systems Based on Blackboard Architecture | Bochen Han et.al. | 2507.01701 | null |
| 2025-07-02 | Quantum reinforcement learning in dynamic environments | Oliver Sefrin et.al. | 2507.01691 | null |
| 2025-07-02 | What does really matter in image goal navigation? | Gianluca Monaci et.al. | 2507.01667 | null |
| 2025-07-02 | Data Agent: A Holistic Architecture for Orchestrating Data+AI Ecosystems | Zhaoyan Sun et.al. | 2507.01599 | null |
| 2025-07-02 | Emotionally Intelligent Task-oriented Dialogue Systems: Architecture, Representation, and Optimisation | Shutong Feng et.al. | 2507.01594 | null |
| 2025-07-02 | Vision-Aided ISAC in Low-Altitude Economy Networks via De-Diffused Visual Priors | Yulan Gao et.al. | 2507.01574 | null |
| 2025-07-02 | Time-Varying Coverage Control: A Distributed Tracker-Planner MPC Framework | Patrick Benito Eberhard et.al. | 2507.01567 | null |
| 2025-07-02 | Chargax: A JAX Accelerated EV Charging Simulator | Koen Ponse et.al. | 2507.01522 | null |
| 2025-07-02 | Agent-as-Tool: A Study on the Hierarchical Decision Making with Reinforcement Learning | Yanfei Zhang et.al. | 2507.01489 | null |
| 2025-07-02 | BioMARS: A Multi-Agent Robotic System for Autonomous Biological Experiments | Yibo Qiu et.al. | 2507.01485 | null |
| 2025-07-02 | Using multi-agent architecture to mitigate the risk of LLM hallucinations | Abd Elrahman Amer et.al. | 2507.01446 | null |
| 2025-07-02 | Reinforcement Learning for Discrete-time LQG Mean Field Social Control Problems with Unknown Dynamics | Hanfang Zhang et.al. | 2507.01420 | null |
| 2025-07-02 | Evaluating LLM Agent Collusion in Double Auctions | Kushal Agrawal et.al. | 2507.01413 | null |
| 2025-07-02 | RALLY: Role-Adaptive LLM-Driven Yoked Navigation for Agentic UAV Swarms | Ziyao Wang et.al. | 2507.01378 | null |
| 2025-07-02 | AI Agents and Agentic AI-Navigating a Plethora of Concepts for Future Manufacturing | Yinwang Ren et.al. | 2507.01376 | null |
| 2025-07-02 | Context-Aware Code Wiring Recommendation with LLM-based Agent | Taiming Wang et.al. | 2507.01315 | null |
| 2025-07-02 | LANet: A Lane Boundaries-Aware Approach For Robust Trajectory Prediction | Muhammad Atta ur Rahman et.al. | 2507.01308 | null |
| 2025-07-02 | Optimal Dispersion Under Asynchrony | Debasish Pattanayak et.al. | 2507.01298 | null |
| 2025-07-05 | Frustratingly Simple Retrieval Improves Challenging, Reasoning-Intensive Benchmarks | Xinxi Lyu et.al. | 2507.01297 | null |
| 2025-07-02 | GAIus: Combining Genai with Legal Clauses Retrieval for Knowledge-based Assistant | Michał Matak et.al. | 2507.01259 | null |
| 2025-07-02 | AIGVE-MACS: Unified Multi-Aspect Commenting and Scoring Model for AI-Generated Video Evaluation | Xiao Liu et.al. | 2507.01255 | null |
| 2025-07-01 | Rethinking the Illusion of Thinking | Iñaki Dellibarda Varela et.al. | 2507.01231 | null |
| 2025-07-01 | SonoGym: High Performance Simulation for Challenging Surgical Tasks with Robotic Ultrasound | Yunke Ao et.al. | 2507.01152 | null |
| 2025-07-01 | Agentic AI in Product Management: A Co-Evolutionary Model | Nishant A. Parikh et.al. | 2507.01069 | null |
| 2025-06-30 | Epitome: Pioneering an Experimental Platform for AI-Social Science Integration | Jingjing Qu et.al. | 2507.01061 | null |
| 2025-06-30 | Optimizing Conversational Product Recommendation via Reinforcement Learning | Kang Liu et.al. | 2507.01060 | null |
| 2025-06-29 | Automated Vehicles Should be Connected with Natural Language | Xiangbo Gao et.al. | 2507.01059 | null |
| 2025-07-01 | Running Quantum Computers in Discovery Mode | Benedikt Placke et.al. | 2507.01013 | null |
| 2025-07-02 | GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning | GLM-V Team et.al. | 2507.01006 | null |
| 2025-07-01 | RTMap: Real-Time Recursive Mapping with Change Detection and Localization | Yuheng Du et.al. | 2507.00980 | null |
| 2025-07-01 | Enhancing LLM Agent Safety via Causal Influence Prompting | Dongyoon Hahm et.al. | 2507.00979 | null |
| 2025-07-01 | Decentralised Multi-Manager Fund Framework | Arman Abgaryan et.al. | 2507.00978 | null |
| 2025-07-01 | Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact | Rizwan Qureshi et.al. | 2507.00951 | null |
| 2025-07-01 | WebArXiv: Evaluating Multimodal Agents on Time-Invariant arXiv Tasks | Zihao Sun et.al. | 2507.00938 | null |
| 2025-07-01 | A Survey: Learning Embodied Intelligence from Physical Simulators and World Models | Xiaoxiao Long et.al. | 2507.00917 | null |
| 2025-07-01 | Large Language Model Powered Intelligent Urban Agents: Concepts, Capabilities, and Applications | Jindong Han et.al. | 2507.00914 | null |
| 2025-07-01 | MemeCMD: An Automatically Generated Chinese Multi-turn Dialogue Dataset with Contextually Retrieved Memes | Yuheng Wang et.al. | 2507.00891 | null |
| 2025-07-01 | TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation | Xi Xuan et.al. | 2507.00875 | null |
| 2025-07-01 | The Evolution of Altruistic Rationality Provides a Solution to Social Dilemmas via Rational Reciprocity | Mohammad Salahshour et.al. | 2507.00858 | null |
| 2025-07-01 | Enhancing Vehicular Platooning with Wireless Federated Learning: A Resource-Aware Control Framework | Beining Wu et.al. | 2507.00856 | null |
| 2025-07-01 | Ranking Quantilized Mean-Field Games with an Application to Early-Stage Venture Investments | Rinel Foguen Tchuendom et.al. | 2507.00853 | null |
| 2025-07-01 | SafeMobile: Chain-level Jailbreak Detection and Automated Evaluation for Multimodal Mobile Agents | Siyuan Liang et.al. | 2507.00841 | null |
| 2025-07-01 | Many LLMs Are More Utilitarian Than One | Anita Keshmirian et.al. | 2507.00814 | null |
| 2025-07-02 | Leveraging Genetic Algorithms for Efficient Demonstration Generation in Real-World Reinforcement Learning Environments | Tom Maus et.al. | 2507.00762 | null |
| 2025-07-01 | Generative Exaggeration in LLM Social Agents: Consistency, Bias, and Toxicity | Jacopo Nudo et.al. | 2507.00657 | null |
| 2025-07-01 | ChatHLS: Towards Systematic Design Automation and Optimization for High-Level Synthesis | Runkai Li et.al. | 2507.00642 | null |
| 2025-07-04 | Horus: A Protocol for Trustless Delegation Under Uncertainty | David Shi et.al. | 2507.00631 | null |
| 2025-07-01 | Quantum Circuit Structure Optimization for Quantum Reinforcement Learning | Seok Bin Son et.al. | 2507.00589 | null |
| 2025-07-01 | Collaborative Multi-Agent Reinforcement Learning Approach for Elastic Cloud Resource Scaling | Bruce Fang et.al. | 2507.00550 | null |
| 2025-07-01 | Rethinking Group Recommender Systems in the Era of Generative AI: From One-Shot Recommendations to Agentic Group Decision Support | Dietmar Jannach et.al. | 2507.00535 | null |
| 2025-07-01 | PNAct: Crafting Backdoor Attacks in Safe Reinforcement Learning | Weiran Guo et.al. | 2507.00485 | null |
| 2025-07-01 | ARIG: Autoregressive Interactive Head Generation for Real-time Conversations | Ying Guo et.al. | 2507.00472 | null |
| 2025-07-01 | Best Agent Identification for General Game Playing | Matthew Stephenson et.al. | 2507.00451 | null |
| 2025-07-01 | Novel Pigeon-inspired 3D Obstacle Detection and Avoidance Maneuver for Multi-UAV Systems | Reza Ahmadvand et.al. | 2507.00443 | null |
| 2025-07-01 | Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning | Maggie Huan et.al. | 2507.00432 | null |
| 2025-07-01 | Multi-Agent Coordination under Poisson Observations: A Global Game Approach | Marcos M. Vasconcelos et.al. | 2507.00424 | null |
| 2025-07-01 | Evolutionary Dynamics with Self-Interaction Learning in Networked Systems | Ziyan Zeng et.al. | 2507.00422 | null |
| 2025-07-01 | Minimal Construction of Graphs with Maximum Robustness | Haejoon Lee et.al. | 2507.00415 | null |
| 2025-07-01 | iPanda: An Intelligent Protocol Testing and Debugging Agent for Conformance Testing | Xikai Sun et.al. | 2507.00378 | null |
| 2025-07-01 | VTS-Guided AI Interaction Workflow for Business Insights | Sun Ding et.al. | 2507.00347 | null |
| 2025-06-30 | Control-Optimized Deep Reinforcement Learning for Artificially Intelligent Autonomous Systems | Oren Fivel et.al. | 2507.00268 | null |
| 2025-06-30 | Examining Reject Relations in Stimulus Equivalence Simulations | Alexis Carrillo et.al. | 2507.00265 | null |
| 2025-06-30 | Endogenous Network Structures with Precision and Dimension Choices | Nikhil Kumar et.al. | 2507.00249 | null |
| 2025-06-30 | LineRetriever: Planning-Aware Observation Reduction for Web Agents | Imene Kerboua et.al. | 2507.00210 | null |
| 2025-06-30 | BlackBoxToBlueprint: Extracting Interpretable Logic from Legacy Systems using Reinforcement Learning and Counterfactual Analysis | Vidhi Rathore et.al. | 2507.00180 | null |
| 2025-06-30 | AI-Governed Agent Architecture for Web-Trustworthy Tokenization of Alternative Assets | Ailiya Borjigin et.al. | 2507.00096 | null |
| 2025-06-30 | State and Memory is All You Need for Robust and Reliable AI Agents | Matthew Muhoberac et.al. | 2507.00081 | null |
| 2025-06-29 | VoyagerVision: Investigating the Role of Multi-modal Information for Open-ended Learning Systems | Ethan Smyth et.al. | 2507.00079 | null |
| 2025-07-01 | SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning | Bo Liu et.al. | 2506.24119 | null |
| 2025-06-30 | Protocol insecurity with finitely many sessions and XOR | R Ramanujam et.al. | 2506.24072 | null |
| 2025-06-30 | Agent.xpu: Efficient Scheduling of Agentic LLM Workloads on Heterogeneous SoC | Xinming Wei et.al. | 2506.24045 | null |
| 2025-06-30 | Ella: Embodied Social Agents with Lifelong Memory | Hongxin Zhang et.al. | 2506.24019 | null |
| 2025-06-30 | Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning | Seungjun Yi et.al. | 2506.23998 | null |
| 2025-06-30 | Harnessing AI Agents to Advance Research on Refugee Child Mental Health | Aditya Shrivastava et.al. | 2506.23992 | null |
| 2025-06-30 | LLM Agents Are the Antidote to Walled Gardens | Samuele Marro et.al. | 2506.23978 | null |
| 2025-06-30 | Flexible Moral Hazard Problems with Adverse Selection | Siwen Liu et.al. | 2506.23954 | null |
| 2025-06-30 | Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice | Akshit Kumar et.al. | 2506.23924 | null |
| 2025-06-30 | A Survey on Autonomy-Induced Security Risks in Large Model-Based Agents | Hang Su et.al. | 2506.23844 | null |
| 2025-06-30 | Sociophysics models inspired by the Ising model | Pratik Mullick et.al. | 2506.23837 | null |
| 2025-06-30 | Towards the "Digital Me": A vision of authentic Conversational Agents powered by personal Human Digital Twins | Lluís C. Coll et.al. | 2506.23826 | null |
| 2025-06-30 | Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning | Anton Andreychuk et.al. | 2506.23793 | null |
| 2025-07-01 | Synthetically Expressive: Evaluating gesture and voice for emotion and empathy in VR and 2D scenarios | Haoyang Du et.al. | 2506.23777 | null |
| 2025-06-30 | Leveraging a Multi-Agent LLM-Based System to Educate Teachers in Hate Incidents Management | Ewelina Gajewska et.al. | 2506.23774 | null |
| 2025-06-30 | A Survey of LLM-based Automated Program Repair: Taxonomies, Design Paradigms, and Applications | Boyang Yang et.al. | 2506.23749 | null |
| 2025-06-30 | DABstep: Data Agent Benchmark for Multi-step Reasoning | Alex Egg et.al. | 2506.23719 | null |
| 2025-06-30 | Agent4S: The Transformation of Research Paradigms from the Perspective of Large Language Models | Boyuan Zheng et.al. | 2506.23692 | null |
| 2025-06-30 | PokéAI: A Goal-Generating, Battle-Optimizing Multi-agent System for Pokemon Red | Zihao Liu et.al. | 2506.23689 | null |
| 2025-06-30 | Efficient Interleaved Speech Modeling through Knowledge Distillation | Mohammadmahdi Nouriborji et.al. | 2506.23670 | null |
| 2025-06-30 | L0: Reinforcement Learning to Become General Agents | Junjie Zhang et.al. | 2506.23667 | null |
| 2025-06-30 | Self-correcting Reward Shaping via Language Models for Reinforcement Learning Agents in Games | António Afonso et.al. | 2506.23626 | null |
| 2025-06-30 | Evaluating the Simulation of Human Personality-Driven Susceptibility to Misinformation with LLMs | Manuel Pratelli et.al. | 2506.23610 | null |
| 2025-06-30 | Evaluating Multi-Agent Defences Against Jailbreaking Attacks on Large Language Models | Maria Carolina Cornelia Wit et.al. | 2506.23576 | null |
| 2025-06-30 | CooT: Learning to Coordinate In-Context with Coordination Transformers | Huai-Chih Wang et.al. | 2506.23549 | null |
| 2025-06-30 | Thought-Augmented Planning for LLM-Powered Interactive Recommender Agent | Haocheng Yu et.al. | 2506.23485 | null |
| 2025-06-30 | NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments | Xuan Yao et.al. | 2506.23468 | null |
| 2025-06-30 | Accessible Data Access and Analysis by People who are Blind or Have Low Vision | Samuel Reinders et.al. | 2506.23443 | null |
| 2025-06-29 | Do LLMs Dream of Discrete Algorithms? | Claudionor Coelho Jr et.al. | 2506.23408 | null |
| 2025-06-29 | ATGen: A Framework for Active Text Generation | Akim Tsvigun et.al. | 2506.23342 | null |
| 2025-06-29 | IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering | Parker Liu et.al. | 2506.23329 | null |
| 2025-06-29 | InfGen: Scenario Generation as Next Token Group Prediction | Zhenghao Peng et.al. | 2506.23316 | null |
| 2025-06-29 | GATSim: Urban Mobility Simulation with Generative Agents | Qi Liu et.al. | 2506.23306 | null |
| 2025-06-29 | Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games | David Guzman Piedrahita et.al. | 2506.23276 | null |
| 2025-06-29 | FinStat2SQL: A Text2SQL Pipeline for Financial Statement Analysis | Quang Hung Nguyen et.al. | 2506.23273 | null |
| 2025-06-29 | From Prompt Injections to Protocol Exploits: Threats in LLM-Powered AI Agents Workflows | Mohamed Amine Ferrag et.al. | 2506.23260 | null |
| 2025-06-29 | Mode Collapse Happens: Evaluating Critical Interactions in Joint Trajectory Prediction Models | Maarten Hugenholtz et.al. | 2506.23164 | null |
| 2025-06-29 | Benchmarking Deep Search over Heterogeneous Enterprise Data | Prafulla Kumar Choubey et.al. | 2506.23139 | null |
| 2025-06-29 | Learning Motion Skills with Adaptive Assistive Curriculum Force in Humanoid Robots | Zhanxiang Cao et.al. | 2506.23125 | null |
| 2025-06-29 | Curious Causality-Seeking Agents Learn Meta Causal World | Zhiyu Zhao et.al. | 2506.23068 | null |
| 2025-06-29 | AURA: Agent for Understanding, Reasoning, and Automated Tool Use in Voice-Driven Tasks | Leander Melroy Maben et.al. | 2506.23049 | null |
| 2025-06-29 | SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions | Xianzhe Fan et.al. | 2506.23046 | null |
| 2025-06-28 | Fragile, Robust, and Antifragile: A Perspective from Parameter Responses in Reinforcement Learning Under Stress | Zain ul Abdeen et.al. | 2506.23036 | null |
| 2025-06-28 | A "Good" Regulator May Provide a World Model for Intelligent Systems | Bradly Alicea et.al. | 2506.23032 | null |
| 2025-06-28 | Scenario-Based Hierarchical Reinforcement Learning for Automated Driving Decision Making | M. Youssef Abdelhamid et.al. | 2506.23023 | null |
| 2025-06-28 | A Reinforcement Learning Approach for Optimal Control in Microgrids | Davide Salaorni et.al. | 2506.22995 | null |
| 2025-06-28 | Resilient-Native and Intelligent Next-Generation Wireless Systems: Key Enablers, Foundations, and Applications | Mehdi Bennis et.al. | 2506.22991 | null |
| 2025-06-28 | Agent-to-Agent Theory of Mind: Testing Interlocutor Awareness among Large Language Models | Younwoo Choi et.al. | 2506.22957 | null |
| 2025-06-28 | GamerAstra: Enhancing Video Game Accessibility for Blind and Low-Vision Players through a Multi-Agent AI Framework | Tianrun Qiu et.al. | 2506.22937 | null |
| 2025-06-28 | Safe Reinforcement Learning with a Predictive Safety Filter for Motion Planning and Control: A Drifting Vehicle Example | Bei Zhou et.al. | 2506.22894 | null |
| 2025-06-28 | Agentic Enterprise: AI-Centric User to User-Centric AI | Arpit Narechania et.al. | 2506.22893 | null |
| 2025-06-28 | CP-Guard: A Unified, Probability-Agnostic, and Adaptive Framework for Malicious Agent Detection and Defense in Multi-Agent Embodied Perception Systems | Senkang Hu et.al. | 2506.22890 | null |
| 2025-06-28 | Cooperation as Black Box: Conceptual Fluctuation and Diagnostic Tools for Misalignment in MAS | Shayak Nandi et.al. | 2506.22876 | null |
| 2025-06-28 | Momentum-based Accelerated Algorithm for Distributed Optimization under Sector-Bound Nonlinearity | Mohammadreza Doostmohammadian et.al. | 2506.22855 | null |
| 2025-07-02 | DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues | Kyochul Jang et.al. | 2506.22853 | null |
| 2025-06-28 | Knowledge Augmented Finetuning Matters in both RAG and Agent Based Dialog Systems | Yucheng Cai et.al. | 2506.22852 | null |
| 2025-06-28 | Actively induced supercoiling can slow down plasmid solutions by trapping the threading entanglements | Roman Staňo et.al. | 2506.22842 | null |
| 2025-06-28 | Memory as a Service (MaaS): Rethinking Contextual Memory as Service-Oriented Modules for Collaborative Agents | Haichang Li et.al. | 2506.22815 | null |
| 2025-06-28 | BayesLoRA: Task-Specific Uncertainty in Low-Rank Adapters | Cooper Doyle et.al. | 2506.22809 | null |
| 2025-06-28 | Trusted Routing for Blockchain-Enabled Low-Altitude Intelligent Networks | Sijie He et.al. | 2506.22745 | null |
| 2025-06-28 | Questions as cognitive filters | Willem Conradie et.al. | 2506.22735 | null |
| 2025-06-28 | FairMarket-RL: LLM-Guided Fairness Shaping for Multi-Agent Reinforcement Learning in Peer-to-Peer Markets | Shrenik Jadhav et.al. | 2506.22708 | null |
| 2025-06-28 | General Autonomous Cybersecurity Defense: Learning Robust Policies for Dynamic Topologies and Diverse Attackers | Arun Ramamurthy et.al. | 2506.22706 | null |
| 2025-06-27 | Knowledge-Guided Multi-Agent Framework for Automated Requirements Development: A Vision | Jiangping Huang et.al. | 2506.22656 | null |
| 2025-06-27 | URSA: The Universal Research and Scientific Agent | Michael Grosskopf et.al. | 2506.22653 | null |
| 2025-06-27 | QoS-aware State-Augmented Learnable Algorithm for Wireless Coexistence Parameter Management | Mohammad Reza Fasihi et.al. | 2506.22652 | null |
| 2025-06-27 | Entropy Regularized Belief Reporting | Elchin Suleymanov et.al. | 2506.22649 | null |
| 2025-06-27 | Ludax: A GPU-Accelerated Domain Specific Language for Board Games | Graham Todd et.al. | 2506.22609 | null |
| 2025-06-27 | RExBench: Can coding agents autonomously implement AI research extensions? | Nicholas Edwards et.al. | 2506.22598 | null |
| 2025-06-27 | Capacity Planning in Stable Matching with Truthful or Strategic Preference Uncertainty | Maria Bazotte et.al. | 2506.22560 | null |
| 2025-07-01 | Seamless Interaction: Dyadic Audiovisual Motion Modeling and Large-Scale Dataset | Vasu Agrawal et.al. | 2506.22554 | null |
| 2025-06-26 | Integrated Multimodal Sensing and Communication: Challenges, Technologies, and Architectures | Yubo Peng et.al. | 2506.22507 | null |
| 2025-06-30 | The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements | Bingchen Zhao et.al. | 2506.22419 | null |
| 2025-06-27 | Why Are Parsing Actions for Understanding Message Hierarchies Not Random? | Daichi Kato et.al. | 2506.22366 | null |
| 2025-06-27 | Reinforcement Learning with Physics-Informed Symbolic Program Priors for Zero-Shot Wireless Indoor Navigation | Tao Li et.al. | 2506.22365 | null |
| 2025-07-03 | Embodied AI Agents: Modeling the World | Pascale Fung et.al. | 2506.22355 | null |
| 2025-06-27 | Agent-based modeling and the sociology of money: some suggestions for refining monetary theory using social simulation | Eduardo Coltre Ferraciolli et.al. | 2506.22318 | null |
| 2025-06-27 | Artificial Intelligent Disobedience: Rethinking the Agency of Our Artificial Teammates | Reuth Mirsky et.al. | 2506.22276 | null |
| 2025-06-27 | Exploring Modularity of Agentic Systems for Drug Discovery | Laura van Weesep et.al. | 2506.22189 | null |
| 2025-06-27 | Autonomic Microservice Management via Agentic AI and MAPE-K Integration | Matteo Esposito et.al. | 2506.22185 | null |
| 2025-06-27 | A Different Approach to AI Safety: Proceedings from the Columbia Convening on Openness in Artificial Intelligence and AI Safety | Camille François et.al. | 2506.22183 | null |
| 2025-06-27 | ASVSim (AirSim for Surface Vehicles): A High-Fidelity Simulation Framework for Autonomous Surface Vehicle Research | Bavo Lesy et.al. | 2506.22174 | null |
| 2025-06-27 | Learning Distributed Safe Multi-Agent Navigation via Infinite-Horizon Optimal Graph Control | Fenglan Wang et.al. | 2506.22117 | null |
| 2025-06-27 | Flocking with random non-reciprocal interactions | Jiwon Choi et.al. | 2506.22060 | null |
| 2025-06-27 | Universal Retrieval for Multimodal Trajectory Modeling | Xuan Zhang et.al. | 2506.22056 | null |
| 2025-06-27 | TROFI: Trajectory-Ranked Offline Inverse Reinforcement Learning | Alessandro Sestini et.al. | 2506.22008 | null |
| 2025-06-27 | A MILP-Based Solution to Multi-Agent Motion Planning and Collision Avoidance in Constrained Environments | Akshay Jaitly et.al. | 2506.21982 | null |
| 2025-06-27 | SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model | Shuhan Tan et.al. | 2506.21976 | null |
| 2025-06-27 | Don't Trust Generative Agents to Mimic Communication on Social Networks Unless You Benchmarked their Empirical Realism | Simon Münker et.al. | 2506.21974 | null |
| 2025-06-27 | More Vulnerable than You Think: On the Stability of Tool-Integrated LLM Agents | Weimin Xiong et.al. | 2506.21967 | null |
| 2025-06-27 | CAL-RAG: Retrieval-Augmented Multi-Agent Generation for Content-Aware Layout Design | Najmeh Forouzandehmehr et.al. | 2506.21934 | null |
| 2025-06-27 | ARAG: Agentic Retrieval Augmented Generation for Personalized Recommendation | Reza Yousefi Maragheh et.al. | 2506.21931 | null |
| 2025-06-27 | SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding | Zhao Jin et.al. | 2506.21924 | null |
| 2025-06-27 | Advancements and Challenges in Continual Reinforcement Learning: A Comprehensive Review | Amara Zuffer et.al. | 2506.21899 | null |
| 2025-06-27 | Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation | Qiyue Gao et.al. | 2506.21876 | null |
| 2025-06-27 | A Survey of Continual Reinforcement Learning | Chaofan Pan et.al. | 2506.21872 | null |
| 2025-06-27 | GenEscape: Hierarchical Multi-Agent Generation of Escape Room Puzzles | Mengyi Shan et.al. | 2506.21839 | null |
| 2025-06-26 | When Networks Mislead: How Partisan Communication Undermines Democratic Decision-Making | Hsuan-Wei Lee et.al. | 2506.21820 | null |
| 2025-06-26 | CitySim: Modeling Urban Behaviors and City Dynamics with Large-Scale LLM-Driven Agent Simulation | Nicolas Bougie et.al. | 2506.21805 | null |
| 2025-06-26 | Adaptive Multipath-Based SLAM for Distributed MIMO Systems | Xuhong Li et.al. | 2506.21798 | null |
| 2025-06-26 | MobiVerse: Scaling Urban Mobility Simulation with Hybrid Lightweight Domain-Specific Generator and Large Language Models | Yifan Liu et.al. | 2506.21784 | null |
| 2025-06-26 | Simultaneously Fair Allocation of Indivisible Items Across Multiple Dimensions | Yasushi Kawase et.al. | 2506.21727 | null |
| 2025-06-26 | SEEA-R1: Tree-Structured Reinforcement Fine-Tuning for Self-Evolving Embodied Agents | Wanxin Tian et.al. | 2506.21669 | null |
| 2025-06-26 | Monetary Macro Accounting Theory | Renéee Menéndez et.al. | 2506.21651 | null |
| 2025-06-23 | TrajTok: Technical Report for 2025 Waymo Open Sim Agents Challenge | Zhiyuan Zhang et.al. | 2506.21618 | null |
| 2025-06-26 | Whole-Body Conditioned Egocentric Video Prediction | Yutong Bai et.al. | 2506.21552 | null |
| 2025-06-26 | PsyLite Technical Report | Fangjun Ding et.al. | 2506.21536 | null |
| 2025-07-03 | Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge | Boyu Gou et.al. | 2506.21506 | null |
| 2025-06-26 | From multi-allocations to allocations, with subadditive valuations | Uriel Feige et.al. | 2506.21493 | null |
| 2025-06-29 | Ad-Hoc Human-AI Coordination Challenge | Tin Dizdarević et.al. | 2506.21490 | null |
| 2025-06-26 | Reinforcement Learning for Optimal Control of Spin Magnetometers | Logan W. Cooke et.al. | 2506.21475 | null |
| 2025-06-26 | Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents | Tianyi Men et.al. | 2506.21252 | null |
| 2025-06-26 | Dynamic Risk-Aware MPPI for Mobile Robots in Crowds via Efficient Monte Carlo Approximations | Elia Trevisan et.al. | 2506.21205 | null |
| 2025-06-26 | Artificial Delegates Resolve Fairness Issues in Perpetual Voting with Partial Turnout | Apurva Shah et.al. | 2506.21186 | null |
| 2025-06-26 | Performance improvement of spatial semantic segmentation with enriched audio features and agent-based error correction for DCASE 2025 Challenge Task 4 | Jongyeon Park et.al. | 2506.21174 | null |
| 2025-06-26 | Curriculum-Guided Antifragile Reinforcement Learning for Secure UAV Deconfliction under Observation-Space Attacks | Deepak Kumar Panda et.al. | 2506.21129 | null |
| 2025-06-26 | GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction | Muleilan Pei et.al. | 2506.21121 | null |
| 2025-06-26 | Homogenization of Multi-agent Learning Dynamics in Finite-state Markov Games | Yann Kerzreho et.al. | 2506.21079 | null |
| 2025-06-26 | RL-Selector: Reinforcement Learning-Guided Data Selection via Redundancy Assessment | Suorong Yang et.al. | 2506.21037 | null |
| 2025-06-26 | Evidence-based diagnostic reasoning with multi-agent copilot for human pathology | Chengkuan Chen et.al. | 2506.20964 | null |
| 2025-06-26 | Beyond Reactive Safety: Risk-Aware LLM Alignment via Long-Horizon Simulation | Chenkai Sun et.al. | 2506.20949 | null |
| 2025-06-26 | ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks | Joshua H. Davis et.al. | 2506.20938 | null |
| 2025-06-26 | Quantum Reinforcement Learning Trading Agent for Sector Rotation in the Taiwan Stock Market | Chi-Sheng Chen et.al. | 2506.20930 | null |
| 2025-06-26 | LLM-guided Chemical Process Optimization with a Multi-Agent Approach | Tong Zeng et.al. | 2506.20921 | null |
| 2025-06-26 | FaSTA |
Advait Gupta et.al. | 2506.20911 | null |
| 2025-06-26 | Smoothness Meets Autobidding: Tight Price of Anarchy Bounds for Simultaneous First-Price Auctions | Riccardo Colini-Baldeschi et.al. | 2506.20908 | null |
| 2025-06-25 | Complex Model Transformations by Reinforcement Learning with Uncertain Human Guidance | Kyanna Dagenais et.al. | 2506.20883 | null |
| 2025-06-28 | Decide less, communicate more: On the construct validity of end-to-end fact-checking in medicine | Sebastian Joseph et.al. | 2506.20876 | null |
| 2025-06-25 | GPU Kernel Scientist: An LLM-Driven Framework for Iterative Kernel Optimization | Martin Andrews et.al. | 2506.20807 | null |
| 2025-06-25 | Poster: Enhancing GNN Robustness for Network Intrusion Detection via Agent-based Analysis | Zhonghao Zhan et.al. | 2506.20806 | null |
| 2025-06-25 | A Survey of AI for Materials Science: Foundation Models, LLM Agents, Datasets, and Tools | Minh-Hao Van et.al. | 2506.20743 | null |
| 2025-06-25 | MAGPIE: A dataset for Multi-AGent contextual PrIvacy Evaluation | Gurusha Juneja et.al. | 2506.20737 | null |
| 2025-06-25 | MMSearch-R1: Incentivizing LMMs to Search | Jinming Wu et.al. | 2506.20670 | null |
| 2025-06-25 | The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind | Andrei Lupu et.al. | 2506.20664 | null |
| 2025-06-25 | Memento: Note-Taking for Your Future Self | Chao Wan et.al. | 2506.20642 | null |
| 2025-06-25 | Towards Community-Driven Agents for Machine Learning Engineering | Sijie Li et.al. | 2506.20640 | null |
| 2025-06-25 | Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm | Baixiang Huang et.al. | 2506.20606 | null |
| 2025-06-25 | Fine-Tuning and Prompt Engineering of LLMs, for the Creation of Multi-Agent AI for Addressing Sustainable Protein Production Challenges | Alexander D. Kalian et.al. | 2506.20598 | null |
| 2025-06-25 | An Explicit Solution for the Problem of Optimal Investment with Random Endowment | Michael Donisch et.al. | 2506.20506 | null |
| 2025-06-25 | Engineering Sentience | Konstantin Demin et.al. | 2506.20504 | null |
| 2025-06-25 | Opinion Dynamics with Highly Oscillating Opinions | Víctor A. Vargas-Pérez et.al. | 2506.20472 | null |
| 2025-06-25 | An Agentic System for Rare Disease Diagnosis with Traceable Reasoning | Weike Zhao et.al. | 2506.20430 | null |
| 2025-06-25 | SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models | Dipayan Saha et.al. | 2506.20415 | null |
| 2025-06-26 | TAPS: Tool-Augmented Personalisation via Structured Tagging | Ekaterina Taktasheva et.al. | 2506.20409 | null |
| 2025-06-25 | A Visualization Framework for Exploring Multi-Agent-Based Simulations Case Study of an Electric Vehicle Home Charging Ecosystem | Kristoffer Christensen et.al. | 2506.20400 | null |
| 2025-06-27 | Mobile-R1: Towards Interactive Reinforcement Learning for VLM-Based Mobile Agent via Task-Level Rewards | Jihao Gu et.al. | 2506.20332 | null |
| 2025-06-26 | Finding the Easy Way Through -- the Probabilistic Gap Planner for Social Robot Navigation | Malte Probst et.al. | 2506.20320 | null |
| 2025-06-25 | Exact and approximate maximin share allocations in multi-graphs | George Christodoulou et.al. | 2506.20317 | null |
| 2025-06-25 | Language Modeling by Language Models | Junyan Cheng et.al. | 2506.20249 | null |
| 2025-06-25 | Autonomous Cyber Resilience via a Co-Evolutionary Arms Race within a Fortified Digital Twin Sandbox | Malikussaid et.al. | 2506.20102 | null |
| 2025-06-25 | PSALM-V: Automating Symbolic Planning in Interactive Visual Environments with Large Language Models | Wang Bill Zhu et.al. | 2506.20097 | null |
| 2025-06-25 | From Conversation to Orchestration: HCI Challenges and Opportunities in Interactive Multi-Agentic Systems | Sarah Schömbs et.al. | 2506.20091 | null |
| 2025-06-24 | Beyond Autocomplete: Designing CopilotLens Towards Transparent and Explainable AI Coding Agents | Runlong Ye et.al. | 2506.20062 | null |
| 2025-06-24 | Learning Instruction-Following Policies through Open-Ended Instruction Relabeling with Large Language Models | Zhicheng Zhang et.al. | 2506.20061 | null |
| 2025-06-26 | Consensus-Driven Uncertainty for Robotic Grasping based on RGB Perception | Eric C. Joyce et.al. | 2506.20045 | null |
| 2025-06-24 | Learning Bilateral Team Formation in Cooperative Multi-Agent Reinforcement Learning | Koorosh Moslemi et.al. | 2506.20039 | null |
| 2025-06-24 | Automated Generation of Diverse Courses of Actions for Multi-Agent Operations using Binary Optimization and Graph Learning | Prithvi Poddar et.al. | 2506.20031 | null |
| 2025-06-24 | Polynomial-Time Approximation Schemes via Utility Alignment: Unit-Demand Pricing and More | Robin Bowers et.al. | 2506.20030 | null |
| 2025-06-24 | QHackBench: Benchmarking Large Language Models for Quantum Code Generation Using PennyLane Hackathon Challenges | Abdul Basit et.al. | 2506.20008 | null |
| 2025-06-24 | Can One Safety Loop Guard Them All? Agentic Guard Rails for Federated Computing | Narasimha Raghavan Veeraragavan et.al. | 2506.20000 | null |
| 2025-06-24 | Doc2Agent: Scalable Generation of Tool-Using Agents from API Documentation | Xinyi Ni et.al. | 2506.19998 | null |
| 2025-07-02 | TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design | Geonwoo Cho et.al. | 2506.19997 | null |
| 2025-06-24 | Prover Agent: An Agent-based Framework for Formal Mathematical Proofs | Kaito Baba et.al. | 2506.19923 | null |
| 2025-06-24 | JoyAgents-R1: Joint Evolution Dynamics for Versatile Multi-LLM Agents with Reinforcement Learning | Ai Han et.al. | 2506.19846 | null |
| 2025-06-24 | MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration | Yucheng Zhou et.al. | 2506.19835 | null |
| 2025-06-24 | Curating art exhibitions using machine learning | Eurico Covas et.al. | 2506.19813 | null |
| 2025-06-24 | LLM-Based Social Simulations Require a Boundary | Zengqing Wu et.al. | 2506.19806 | null |
| 2025-06-24 | Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning | Menglong Zhang et.al. | 2506.19785 | null |
| 2025-06-24 | SAGE: Strategy-Adaptive Generation Engine for Query Rewriting | Teng Wang et.al. | 2506.19783 | null |
| 2025-06-24 | A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects | Shulan Ruan et.al. | 2506.19769 | null |
| 2025-06-24 | From Reproduction to Replication: Evaluating Research Agents with Progressive Code Masking | Gyeongwon James Kim et.al. | 2506.19724 | null |
| 2025-07-02 | A Survey of LLM-Driven AI Agent Communication: Protocols, Security Risks, and Defense Countermeasures | Dezhang Kong et.al. | 2506.19676 | null |
| 2025-06-24 | How trust networks shape students' opinions about the proficiency of artificially intelligent assistants | Yutong Bu et.al. | 2506.19655 | null |
| 2025-06-24 | HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions | Mrunmai Vivek Phatak et.al. | 2506.19639 | null |
| 2025-06-24 | Mobile oscillators in a mobile multi-cluster network | Venceslas Nguefoue Meli et.al. | 2506.19617 | null |
| 2025-06-24 | Position: Intelligent Science Laboratory Requires the Integration of Cognitive and Embodied AI | Sha Zhang et.al. | 2506.19613 | null |
| 2025-06-24 | Robotics Under Construction: Challenges on Job Sites | Haruki Uchiito et.al. | 2506.19597 | null |
| 2025-06-30 | Adaptive Domain Modeling with Language Models: A Multi-Agent Approach to Task Planning | Harisankar Babu et.al. | 2506.19592 | null |
| 2025-06-24 | Fake or Real, Can Robots Tell? Evaluating Embodied Vision-Language Models on Real and 3D-Printed Objects | Federico Tavella et.al. | 2506.19579 | null |
| 2025-06-24 | KnowMap: Efficient Knowledge-Driven Task Adaptation for LLMs | Kelin Fu et.al. | 2506.19527 | null |
| 2025-06-24 | MATE: LLM-Powered Multi-Agent Translation Environment for Accessibility Applications | Aleksandr Algazinov et.al. | 2506.19502 | null |
| 2025-06-24 | NaviAgent: Bilevel Planning on Tool Dependency Graphs for Function Calling | Yan Jiang et.al. | 2506.19500 | null |
| 2025-06-24 | SceneCrafter: Controllable Multi-View Driving Scene Editing | Zehao Zhu et.al. | 2506.19488 | null |
| 2025-06-24 | Dialogic Pedagogy for Large Language Models: Aligning Conversational AI with Proven Theories of Learning | Russell Beale et.al. | 2506.19484 | null |
| 2025-06-24 | LLM-based Multi-Agent System for Intelligent Refactoring of Haskell Code | Shahbaz Siddeeq et.al. | 2506.19481 | null |
| 2025-06-24 | Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System | Lixuan He et.al. | 2506.19433 | null |
| 2025-06-24 | Commander-GPT: Dividing and Routing for Multimodal Sarcasm Detection | Yazhou Zhang et.al. | 2506.19420 | null |
| 2025-06-24 | Center of Gravity-Guided Focusing Influence Mechanism for Multi-Agent Reinforcement Learning | Yisak Park et.al. | 2506.19417 | null |
| 2025-06-24 | Is an object-centric representation beneficial for robotic manipulation ? | Alexandre Chapin et.al. | 2506.19408 | null |
| 2025-06-24 | Do cell culturing influence the radiosensitizing effect of gold nanoparticles part 1: scrutinizing recent evidence for data consistency | Hans Rabus et.al. | 2506.19372 | null |
| 2025-06-24 | Computing Tree Structures in Anonymous Graphs via Mobile Agents | Prabhat Kumar Chand et.al. | 2506.19365 | null |
| 2025-06-24 | Distributed Interview Selection for Stable Matching in Large Random Markets | Richard Cole et.al. | 2506.19345 | null |
| 2025-06-26 | The Autonomy of the Lightning Network: A Mathematical and Economic Proof of Structural Decoupling from BTC | Craig Steven Wright et.al. | 2506.19333 | null |
| 2025-06-24 | Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs | Liang Zeng et.al. | 2506.19290 | null |
| 2025-06-24 | Robust Behavior Cloning Via Global Lipschitz Regularization | Shili Wu et.al. | 2506.19250 | null |
| 2025-06-24 | Augmenting Multi-Agent Communication with State Delta Trajectory | Yichen Tang et.al. | 2506.19209 | null |
| 2025-06-25 | Vertex addition to a ball graph with application to reliability and area coverage in autonomous swarms | Calum Buchanan et.al. | 2506.19197 | null |
| 2025-06-23 | Bayesian Evolutionary Swarm Architecture: A Formal Epistemic System Grounded in Truth-Based Competition | Craig Steven Wright et.al. | 2506.19191 | null |
| 2025-06-23 | Distilling Tool Knowledge into Language Models via Back-Translated Traces | Xingyue Huang et.al. | 2506.19171 | null |
| 2025-06-23 | AgenticControl: An Automated Control Design Framework Using Large Language Models | Mohammad Narimani et.al. | 2506.19160 | null |
| 2025-06-23 | Model Reference Adaptive Control of Networked Systems with State and Input Delays | Moh Kamalul Wafi et.al. | 2506.19138 | null |
| 2025-06-23 | Emergent collective dynamics from motile photokinetic organisms | J. Morales et.al. | 2506.19081 | null |
| 2025-06-23 | How brains build higher order representations of uncertainty | Megan A. K. Peters et.al. | 2506.19057 | null |
| 2025-06-26 | From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents | Weizhi Zhang et.al. | 2506.18959 | null |
| 2025-06-23 | A Comment On "The Illusion of Thinking": Reframing the Reasoning Cliff as an Agentic Gap | Sheraz Khan et.al. | 2506.18957 | null |
| 2025-06-23 | SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications | Jinyang Li et.al. | 2506.18951 | null |
| 2025-06-22 | Advanced Applications of Generative AI in Actuarial Science: Case Studies Beyond ChatGPT | Simon Hatzesberger et.al. | 2506.18942 | null |
| 2025-06-23 | Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models | Kiymet Akdemir et.al. | 2506.18900 | null |
| 2025-06-23 | Steering Conceptual Bias via Transformer Latent-Subspace Activation | Vansh Sharma et.al. | 2506.18887 | null |
| 2025-06-23 | GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM | Annika Thomas et.al. | 2506.18885 | null |
| 2025-06-23 | Broad Validity of the First-Order Approach in Moral Hazard | Eduardo Azevedo et.al. | 2506.18873 | null |
| 2025-06-25 | Offline Goal-Conditioned Reinforcement Learning with Projective Quasimetric Planning | Anthony Kobanda et.al. | 2506.18847 | null |
| 2025-06-23 | Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories | Islem Bouzenia et.al. | 2506.18824 | null |
| 2025-06-23 | Multi-Agent Online Control with Adversarial Disturbances | Anas Barakat et.al. | 2506.18814 | null |
| 2025-06-23 | Fair Allocation with Money: What is Your Objective? | Noga Klein Elmalem et.al. | 2506.18794 | null |
| 2025-06-23 | TRIZ Agents: A Multi-Agent LLM Approach for TRIZ-Based Innovation | Kamil Szczepanik et.al. | 2506.18783 | null |
| 2025-06-23 | Temporal Neural Cellular Automata: Application to modeling of contrast enhancement in breast MRI | Daniel M. Lang et.al. | 2506.18720 | null |
| 2025-06-23 | Safety-Aware Optimal Scheduling for Autonomous Masonry Construction using Collaborative Heterogeneous Aerial Robots | Marios-Nektarios Stamatopoulos et.al. | 2506.18697 | null |
| 2025-06-23 | MARL-MambaContour: Unleashing Multi-Agent Deep Reinforcement Learning for Active Contour Optimization in Medical Image Segmentation | Ruicheng Zhang et.al. | 2506.18679 | null |
| 2025-06-23 | MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation | Tianchen Deng et.al. | 2506.18678 | null |
| 2025-06-23 | Dual-level Behavioral Consistency for Inter-group and Intra-group Coordination in Multi-Agent Systems | Shuocun Yang et.al. | 2506.18651 | null |
| 2025-06-23 | Multi-Agent Reinforcement Learning for Inverse Design in Photonic Integrated Circuits | Yannik Mahlau et.al. | 2506.18627 | null |
| 2025-06-23 | Reply to "Emergent LLM behaviors are observationally equivalent to data leakage" | Ariel Flint Ashery et.al. | 2506.18600 | null |
| 2025-06-23 | Agentic Markets: Game Dynamics and Equilibrium in Markets with Learning Agents | Martin Bichler et.al. | 2506.18571 | null |
| 2025-06-23 | Efficient Beam Selection for ISAC in Cell-Free Massive MIMO via Digital Twin-Assisted Deep Reinforcement Learning | Jiexin Zhang et.al. | 2506.18560 | null |
| 2025-06-23 | T-CPDL: A Temporal Causal Probabilistic Description Logic for Developing Logic-RAG Agent | Hong Qing Yu et.al. | 2506.18559 | null |
| 2025-06-23 | Unilateral determination of causal order in a cyclic process | Ilyass Mejdoub et.al. | 2506.18540 | null |
| 2025-06-23 | Transformer World Model for Sample Efficient Multi-Agent Reinforcement Learning | Azad Deihim et.al. | 2506.18537 | null |
| 2025-06-23 | Standard Applicability Judgment and Cross-jurisdictional Reasoning: A RAG-based Framework for Medical Device Compliance | Yu Han et.al. | 2506.18511 | null |
| 2025-06-23 | Reliability-Adjusted Prioritized Experience Replay | Leonard S. Pleiss et.al. | 2506.18482 | null |
| 2025-06-23 | AViLA: Asynchronous Vision-Language Agent for Streaming Multimodal Data Interaction | Gengyuan Zhang et.al. | 2506.18472 | null |
| 2025-06-23 | Networked pointing system: Bearing-only target localization and pointing control | Shiyao Li et.al. | 2506.18460 | null |
| 2025-06-23 | A Motivational Architecture for Open-Ended Learning Challenges in Robots | Alejandro Romero et.al. | 2506.18454 | null |
| 2025-06-23 | GraspMAS: Zero-Shot Language-driven Grasp Detection with Multi-Agent System | Quang Nguyen et.al. | 2506.18448 | null |
| 2025-06-23 | A Large Language Model-based Multi-Agent Framework for Analog Circuits' Sizing Relationships Extraction | Chengjie Liu et.al. | 2506.18424 | null |
| 2025-06-23 | Robots and Children that Learn Together : Improving Knowledge Retention by Teaching Peer-Like Interactive Robots | Imene Tarakli et.al. | 2506.18365 | null |
| 2025-06-27 | Dynamic Knowledge Exchange and Dual-diversity Review: Concisely Unleashing the Potential of a Multi-Agent Research Team | Weilun Yu et.al. | 2506.18348 | null |
| 2025-06-23 | Use Property-Based Testing to Bridge LLM Code Generation and Validation | Lehan He et.al. | 2506.18315 | null |
| 2025-06-23 | A stochastic model for the diffusion of competing opinions with trend-following, opposition, and indifference | Manuel González-Navarrete et.al. | 2506.18313 | null |
| 2025-06-23 | Advanced For-Loop for QML algorithm search | FuTe Wong et.al. | 2506.18260 | null |
| 2025-06-22 | Wisdom of Crowds Through Myopic Self-Confidence Adaptation | Giacomo Como et.al. | 2506.18195 | null |
| 2025-06-22 | Mapping The Invisible Internet: Framework and Dataset | Siddique Abubakr Muntaka et.al. | 2506.18159 | null |
| 2025-06-22 | Chain-of-Memory: Enhancing GUI Agents for Cross-Application Navigation | Xinzge Gao et.al. | 2506.18158 | null |
| 2025-06-22 | CoachGPT: A Scaffolding-based Academic Writing Assistant | Fumian Chen et.al. | 2506.18149 | null |
| 2025-06-22 | Decentralized Consensus Inference-based Hierarchical Reinforcement Learning for Multi-Constrained UAV Pursuit-Evasion Game | Xiang Yuming et.al. | 2506.18126 | null |
| 2025-06-22 | Deep Research Agents: A Systematic Examination And Roadmap | Yuxuan Huang et.al. | 2506.18096 | null |
| 2025-06-27 | MUPA: Towards Multi-Path Agentic Reasoning for Grounded Video Question Answering | Jisheng Dang et.al. | 2506.18071 | null |
| 2025-06-26 | Graphs Meet AI Agents: Taxonomy, Progress, and Future Opportunities | Yuanchen Bei et.al. | 2506.18019 | null |
| 2025-06-22 | Ultra-Efficient Contracts: Breaking the Substitutes Barrier in Combinatorial Contracts | Michal Feldman et.al. | 2506.18008 | null |
| 2025-06-22 | An Axiomatization of the Random Priority Rule | Christian Basteck et.al. | 2506.17997 | null |
| 2025-06-22 | Non-Euclidean Enriched Contraction Theory for Monotone Operators and Monotone Dynamical Systems | Diego Deplano et.al. | 2506.17990 | null |
| 2025-06-22 | GeNIE: A Generalizable Navigation System for In-the-Wild Environments | Jiaming Wang et.al. | 2506.17960 | null |
| 2025-06-22 | ASTER: Adaptive Spatio-Temporal Early Decision Model for Dynamic Resource Allocation | Shulun Chen et.al. | 2506.17929 | null |
| 2025-06-22 | Learning, Reasoning, Refinement: A Framework for Kahneman's Dual-System Intelligence in GUI Agents | Jinjie Wei et.al. | 2506.17913 | null |
| 2025-06-22 | Towards Robust Fact-Checking: A Multi-Agent System with Advanced Evidence Retrieval | Tam Trinh et.al. | 2506.17878 | null |
| 2025-06-21 | Out of Control -- Why Alignment Needs Formal Control Theory (and an Alignment Control Stack) | Elija Perrier et.al. | 2506.17846 | null |
| 2025-06-21 | Reflective Verbal Reward Design for Pluralistic Alignment | Carter Blair et.al. | 2506.17834 | null |
| 2025-06-21 | Is Your Automated Software Engineer Trustworthy? | Noble Saji Mathews et.al. | 2506.17812 | null |
| 2025-06-21 | Bayesian Social Deduction with Graph-Informed Language Models | Shahab Rahimirad et.al. | 2506.17788 | null |
| 2025-06-21 | AnyMAC: Cascading Flexible Multi-Agent Collaboration via Next-Agent Prediction | Song Wang et.al. | 2506.17784 | null |
| 2025-06-21 | Toward Autonomous UI Exploration: The UIExplorer Benchmark | Andrei Cristian Nica et.al. | 2506.17779 | null |
| 2025-06-21 | Optimizing Exploration with a New Uncertainty Framework for Active SLAM Systems | Sebastian Sansoni et.al. | 2506.17775 | null |
| 2025-06-21 | PAGENT: Learning to Patch Software Engineering Agents | Haoran Xue et.al. | 2506.17772 | null |
| 2025-06-21 | CARTS: Collaborative Agents for Recommendation Textual Summarization | Jiao Chen et.al. | 2506.17765 | null |
| 2025-06-21 | Experimental Evidence for the Propagation and Preservation of Machine Discoveries in Human Populations | Levin Brinkmann et.al. | 2506.17741 | null |
| 2025-06-21 | Distributed Butterfly Analysis using Mobile Agents | Prabhat Kumar Chand et.al. | 2506.17721 | null |
| 2025-06-21 | Wealth Thermalization Hypothesis | Klaus M. Frahm et.al. | 2506.17720 | null |
| 2025-06-21 | Beyond Syntax: Action Semantics Learning for App Agents | Bohan Tang et.al. | 2506.17697 | null |
| 2025-06-21 | Network Heterogeneity and Value of Information | Kota Murayama et.al. | 2506.17660 | null |
| 2025-06-21 | Diffusion of Tracer Particles in Early Growing Biofilms. A Computer Simulation Study | Fabian A. Garcia Daza et.al. | 2506.17653 | null |
| 2025-06-21 | May the Feedback Be with You! Unlocking the Power of Feedback-Driven Deep Learning Framework Fuzzing via LLMs | Shaoyu Yang et.al. | 2506.17642 | null |
| 2025-06-21 | JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent | Yunlong Lin et.al. | 2506.17612 | null |
| 2025-06-26 | Taming the Untamed: Graph-Based Knowledge Retrieval and Reasoning for MLLMs to Conquer the Unknown | Bowen Wang et.al. | 2506.17589 | null |
| 2025-06-21 | Towards Zero-Shot Coordination between Teams of Agents: The N-XPlay Framework | Ava Abderezaei et.al. | 2506.17560 | null |
| 2025-06-24 | Breaking Single-Tester Limits: Multi-Agent LLMs for Multi-User Feature Testing | Sidong Feng et.al. | 2506.17539 | null |
| 2025-06-20 | Kaleidoscopic Teaming in Multi Agent Simulations | Ninareh Mehrabi et.al. | 2506.17514 | null |
| 2025-06-20 | A Grassroots Network and Community Roadmap for Interconnected Autonomous Science Laboratories for Accelerated Discovery | Rafael Ferreira da Silva et.al. | 2506.17510 | null |
| 2025-06-20 | From Unstructured Communication to Intelligent RAG: Multi-Agent Automation for Supply Chain Knowledge Bases | Yao Zhang et.al. | 2506.17484 | null |
| 2025-06-20 | General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting | Bernard Lange et.al. | 2506.17462 | null |
| 2025-06-20 | OmniReflect: Discovering Transferable Constitutions for LLM agents via Neuro-Symbolic Reflections | Manasa Bharadwaj et.al. | 2506.17449 | null |
| 2025-06-20 | Resource Rational Contractualism Should Guide AI Alignment | Sydney Levine et.al. | 2506.17434 | null |
| 2025-06-20 | UProp: Investigating the Uncertainty Propagation of LLMs in Multi-Step Agentic Decision-Making | Jinhao Duan et.al. | 2506.17419 | null |
| 2025-06-20 | Challenges in Grounding Language in the Real World | Peter Lindes et.al. | 2506.17375 | null |
| 2025-06-20 | Cash or Comfort? How LLMs Value Your Inconvenience | Mateusz Cedro et.al. | 2506.17367 | null |
| 2025-06-19 | Advanced Game-Theoretic Frameworks for Multi-Agent AI Challenges: A 2025 Outlook | Pavel Malinovskiy et.al. | 2506.17348 | null |
| 2025-06-19 | Adaptive Social Metaverse Streaming based on Federated Multi-Agent Deep Reinforcement Learning | Zijian Long et.al. | 2506.17342 | null |
| 2025-06-19 | AI is the Strategy: From Agentic AI to Autonomous Business Models onto Strategy in the Age of AI | René Bohnsack et.al. | 2506.17339 | null |
| 2025-06-24 | PBFT-Backed Semantic Voting for Multi-Agent Memory Pruning | Duong Bach et.al. | 2506.17338 | null |
| 2025-06-19 | Privacy-Preserving LLM Interaction with Socratic Chain-of-Thought Reasoning and Homomorphically Encrypted Vector Databases | Yubeen Bae et.al. | 2506.17336 | link |
| 2025-06-19 | LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Shuo Yan et.al. | 2506.17335 | null |
| 2025-06-19 | Beyond Prediction -- Structuring Epistemic Integrity in Artificial Reasoning Systems | Craig Steven Wright et.al. | 2506.17331 | null |
| 2025-06-18 | MAARTA:Multi-Agentic Adaptive Radiology Teaching Assistant | Akash Awasthi et.al. | 2506.17320 | null |
| 2025-06-18 | Context manipulation attacks : Web agents are susceptible to corrupted memory | Atharv Singh Patlan et.al. | 2506.17318 | null |
| 2025-06-18 | Can Large Language Models Be Trusted Paper Reviewers? A Feasibility Study | Chuanlei Li et.al. | 2506.17311 | null |
| 2025-06-17 | SafeRL-Lite: A Lightweight, Explainable, and Constrained Reinforcement Learning Library | Satyam Mishra et.al. | 2506.17297 | null |
| 2025-06-25 | VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning | Zhangyang Qi et.al. | 2506.17221 | null |
| 2025-06-20 | Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Xiuyu Yang et.al. | 2506.17213 | link |
| 2025-06-20 | Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems | Matias Martinez et.al. | 2506.17208 | null |
| 2025-06-20 | Towards AI Search Paradigm | Yuchen Li et.al. | 2506.17188 | null |
| 2025-06-20 | Capturing Misalignment | Pierfrancesco Guarino et.al. | 2506.17176 | null |
| 2025-06-20 | A Note on Proper Relational Structures | Adam Bjorndahl et.al. | 2506.17142 | null |
| 2025-06-20 | When Can Model-Free Reinforcement Learning be Enough for Thinking? | Josiah P. Hanna et.al. | 2506.17124 | null |
| 2025-06-20 | A general multi-stratum model for a nanofunctionalized releasing capsule: a computational study | Elia Onofri et.al. | 2506.17078 | null |
| 2025-06-20 | Behavior Driven Development for 3D Games | Fernando Pastor Ricós et.al. | 2506.17057 | null |
| 2025-06-20 | Scalable and Reliable Multi-agent Reinforcement Learning for Traffic Assignment | Leizhen Wang et.al. | 2506.17029 | null |
| 2025-06-20 | A Synthetic Benchmark for Collaborative 3D Semantic Occupancy Prediction in V2X Autonomous Driving | Hanlin Wu et.al. | 2506.17004 | null |
| 2025-06-20 | Elevating Styled Mahjong Agents with Learning from Demonstration | Lingfeng Li et.al. | 2506.16995 | null |
| 2025-06-20 | RAGentA: Multi-Agent Retrieval-Augmented Generation for Attributed Question Answering | Ines Besrour et.al. | 2506.16988 | link |
| 2025-06-20 | Formal Control for Uncertain Systems via Contract-Based Probabilistic Surrogates (Extended Version) | Oliver Schön et.al. | 2506.16971 | null |
| 2025-06-20 | LunarLoc: Segment-Based Global Localization on the Moon | Annika Thomas et.al. | 2506.16940 | link |
| 2025-06-20 | Do You Know What I Mean? A Syntactic Representation for Differential Bounded Awareness | Ani Guerdjikova et.al. | 2506.16901 | null |
| 2025-06-20 | Engineering Resilience: An Energy-Based Approach to Sustainable Behavioural Interventions | Arpitha Srivathsa Malavalli et.al. | 2506.16836 | null |
| 2025-06-20 | Integrating Traditional Technical Analysis with AI: A Multi-Agent LLM-Based Approach to Stock Market Forecasting | Michał Wawer et.al. | 2506.16813 | null |
| 2025-06-20 | Distributed Affine Formation Control of Linear Multi-agent Systems with Adaptive Event-triggering | Chenjun Liu et.al. | 2506.16797 | null |
| 2025-06-20 | Language-Informed Synthesis of Rational Agent Models for Grounded Theory-of-Mind Reasoning On-The-Fly | Lance Ying et.al. | 2506.16755 | null |
| 2025-06-20 | Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation | Kosuke Nakanishi et.al. | 2506.16753 | link |
| 2025-06-20 | A Scalable Post-Processing Pipeline for Large-Scale Free-Space Multi-Agent Path Planning with PiBT | Arjo Chakravarty et.al. | 2506.16748 | link |
| 2025-06-20 | Incentivizing High-quality Participation From Federated Learning Agents | Jinlong Pang et.al. | 2506.16731 | null |
| 2025-06-20 | DRARL: Disengagement-Reason-Augmented Reinforcement Learning for Efficient Improvement of Autonomous Driving Policy | Weitao Zhou et.al. | 2506.16720 | null |
| 2025-06-20 | Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation | Chenxu Wang et.al. | 2506.16718 | link |
| 2025-06-20 | Mean-field and Monte Carlo Analysis of Multi-Species Dynamics of agents | Eduardo Velasco Stock et.al. | 2506.16717 | null |
| 2025-06-20 | Exploring Traffic Simulation and Cybersecurity Strategies Using Large Language Models | Lu Gao et.al. | 2506.16699 | null |
| 2025-06-20 | Interpretable Low-Dimensional Modeling of Spatiotemporal Agent States for Decision Making in Football Tactics | Kenjiro Ide et.al. | 2506.16696 | null |
| 2025-06-20 | Closed curve covering and multiagent TSP ratios | Travis Dillon et.al. | 2506.16675 | null |
| 2025-06-19 | SemAgent: A Semantics Aware Program Repair Agent | Anvith Pabba et.al. | 2506.16650 | null |
| 2025-06-19 | Distribution Parameter Actor-Critic: Shifting the Agent-Environment Boundary for Diverse Action Spaces | Jiamin He et.al. | 2506.16608 | null |
| 2025-06-19 | AI-Driven Tools in Modern Software Quality Assurance: An Assessment of Benefits, Challenges, and Future Directions | Ihor Pysmennyi et.al. | 2506.16586 | link |
| 2025-06-19 | ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning | Zexi Liu et.al. | 2506.16499 | null |
| 2025-06-19 | Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support | Sophie Chiang et.al. | 2506.16473 | null |
| 2025-06-19 | StoryWriter: A Multi-Agent Framework for Long Story Generation | Haotian Xia et.al. | 2506.16445 | null |
| 2025-06-19 | Agentic Personalisation of Cross-Channel Marketing Experiences | Sami Abboud et.al. | 2506.16429 | null |
| 2025-06-19 | When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework | Zhen Xu et.al. | 2506.16411 | null |
| 2025-06-19 | IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks | Xiaoya Lu et.al. | 2506.16402 | null |
| 2025-06-19 | GoalLadder: Incremental Goal Discovery with Vision-Language Models | Alexey Zakharov et.al. | 2506.16396 | null |
| 2025-06-19 | AGC-Drive: A Large-Scale Dataset for Real-World Aerial-Ground Collaboration in Driving Scenarios | Yunhao Hou et.al. | 2506.16371 | link |
| 2025-06-19 | Data-Driven Policy Mapping for Safe RL-based Energy Management Systems | Theo Zangato et.al. | 2506.16352 | null |
| 2025-06-19 | Improved Exploration in GFlownets via Enhanced Epistemic Neural Networks | Sajan Muhammad et.al. | 2506.16313 | null |
| 2025-06-19 | M-Predictive Spliner: Enabling Spatiotemporal Multi-Opponent Overtaking for Autonomous Racing | Nadine Imholz et.al. | 2506.16301 | null |
| 2025-06-19 | Coordination of Electrical and Heating Resources by Self-Interested Agents | Rico Schrage et.al. | 2506.16277 | null |
| 2025-06-19 | VideoGAN-based Trajectory Proposal for Automated Vehicles | Annajoyce Mariani et.al. | 2506.16209 | link |
| 2025-06-19 | Solving Zero-Sum Convex Markov Games | Fivos Kalogiannis et.al. | 2506.16120 | null |
| 2025-06-19 | Towards AI-Driven RANs for 6G and Beyond: Architectural Advancements and Future Horizons | Mathushaharan Rathakrishnan et.al. | 2506.16070 | null |
| 2025-06-19 | Human-Centered Shared Autonomy for Motor Planning, Learning, and Control Applications | MH Farhadi et.al. | 2506.16044 | null |
| 2025-06-19 | OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents | Reyna Abhyankar et.al. | 2506.16042 | null |
| 2025-06-19 | DualTHOR: A Dual-Arm Humanoid Simulation Platform for Contingency-Aware Planning | Boyu Li et.al. | 2506.16012 | link |
| 2025-06-19 | SimuPanel: A Novel Immersive Multi-Agent System to Simulate Interactive Expert Panel Discussion | Xiangyang He et.al. | 2506.16010 | null |
| 2025-06-19 | HybridRAG-based LLM Agents for Low-Carbon Optimization in Low-Altitude Economy Networks | Jinbo Wen et.al. | 2506.15947 | null |
| 2025-06-19 | On the optimal regret of collaborative personalized linear bandits | Bruce Huang et.al. | 2506.15943 | null |
| 2025-06-19 | Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues | Myke C. Cohen et.al. | 2506.15928 | null |
| 2025-06-23 | From RAG to Agentic: Validating Islamic-Medicine Responses with LLM Agents | Mohammad Amaan Sayeed et.al. | 2506.15911 | null |
| 2025-06-18 | Fair Contracts in Principal-Agent Games with Heterogeneous Types | Jakub Tłuczek et.al. | 2506.15887 | null |
| 2025-06-18 | Modeling society with a responsible elite | Yana Tsodikova et.al. | 2506.15877 | null |
| 2025-06-18 | CooperRisk: A Driving Risk Quantification Pipeline with Multi-Agent Cooperative Perception and Prediction | Mingyue Lei et.al. | 2506.15868 | null |
| 2025-06-18 | Understanding Online Polarization Through Human-Agent Interaction in a Synthetic LLM-Based Social Network | Tim Donkers et.al. | 2506.15866 | null |
| 2025-06-18 | Improving Robotic Manipulation: Techniques for Object Pose Estimation, Accommodating Positional Uncertainty, and Disassembly Tasks from Examples | Viral Rasik Galaiya et.al. | 2506.15865 | null |
| 2025-06-18 | Learning to Coordinate Under Threshold Rewards: A Cooperative Multi-Agent Bandit Framework | Michael Ledford et.al. | 2506.15856 | null |
| 2025-06-18 | MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents | Zijian Zhou et.al. | 2506.15841 | null |
| 2025-06-18 | Context Matters! Relaxing Goals with LLMs for Feasible 3D Scene Planning | Emanuele Musumeci et.al. | 2506.15828 | null |
| 2025-06-18 | Heterogeneous Federated Reinforcement Learning Using Wasserstein Barycenters | Luiz Pereira et.al. | 2506.15825 | null |
| 2025-06-18 | Veracity: An Open-Source AI Fact-Checking System | Taylor Lynn Curtis et.al. | 2506.15794 | null |
| 2025-06-18 | Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation | Ruoyu Wang et.al. | 2506.15757 | null |
| 2025-06-18 | RecBayes: Recurrent Bayesian Ad Hoc Teamwork in Large Partially Observable Domains | João G. Ribeiro et.al. | 2506.15756 | null |
| 2025-06-23 | OAgents: An Empirical Study of Building Effective Agents | He Zhu et.al. | 2506.15741 | null |
| 2025-06-17 | SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Agents | Jonathan Kutasov et.al. | 2506.15740 | null |
| 2025-06-20 | Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence | Yining Hong et.al. | 2506.15677 | null |
| 2025-06-18 | Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers | Tommaso Green et.al. | 2506.15674 | link |
| 2025-06-18 | SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence | Yao Zhang et.al. | 2506.15672 | null |
| 2025-06-18 | PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection | Wenhao Li et.al. | 2506.15656 | null |
| 2025-06-18 | FindingDory: A Benchmark to Evaluate Memory in Embodied Agents | Karmesh Yadav et.al. | 2506.15635 | null |
| 2025-06-18 | The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games | Lyle Goodyear et.al. | 2506.15624 | null |
| 2025-06-18 | Multi-Agent, Multi-Scale Systems with the Koopman Operator | Craig Bakker et.al. | 2506.15589 | null |
| 2025-06-18 | Learning to flock in open space by avoiding collisions and staying together | Martino Brambati et.al. | 2506.15587 | null |
| 2025-06-18 | Managing Complex Failure Analysis Workflows with LLM-based Reasoning and Acting Agents | Aline Dobrovsky et.al. | 2506.15567 | null |
| 2025-06-18 | Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning | Roger Creus Castanyer et.al. | 2506.15544 | link |
| 2025-06-18 | Co-Creative Learning via Metropolis-Hastings Interaction between Humans and AI | Ryota Okumura et.al. | 2506.15468 | null |
| 2025-06-18 | AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need | Zhouhong Gu et.al. | 2506.15451 | link |
| 2025-06-18 | Understanding GUI Agent Localization Biases through Logit Sharpness | Xingjian Tao et.al. | 2506.15425 | null |
| 2025-06-18 | Reward Models in Deep Reinforcement Learning: A Survey | Rui Yu et.al. | 2506.15421 | null |
| 2025-06-18 | Multi-Timescale Gradient Sliding for Distributed Optimization | Junhui Zhang et.al. | 2506.15387 | null |
| 2025-06-18 | Tractable Graph Structures in EFX Orientation | Václav Blažej et.al. | 2506.15379 | null |
| 2025-06-18 | Efficient and Generalizable Environmental Understanding for Visual Navigation | Ruoyu Wang et.al. | 2506.15377 | null |
| 2025-06-18 | Learning to Maximize Quantum Neural Network Expressivity via Effective Rank | Juan Yao et.al. | 2506.15375 | null |
| 2025-06-18 | Designing Intent: A Multimodal Framework for Human-Robot Cooperation in Industrial Workspaces | Francesco Chiossi et.al. | 2506.15293 | null |
| 2025-06-18 | RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments | Yuchuan Fu et.al. | 2506.15253 | link |
| 2025-06-18 | Joint Computation Offloading and Resource Allocation for Uncertain Maritime MEC via Cooperation of UAVs and Vessels | Jiahao You et.al. | 2506.15225 | null |
| 2025-06-18 | Multi-Agent Reinforcement Learning for Autonomous Multi-Satellite Earth Observation: A Realistic Case Study | Mohamad A. Hady et.al. | 2506.15207 | null |
| 2025-06-18 | ImprovDML: Improved Trade-off in Private Byzantine-Resilient Distributed Machine Learning | Bing Liu et.al. | 2506.15181 | null |
| 2025-06-18 | From LLMs to MLLMs to Agents: A Survey of Emerging Paradigms in Jailbreak Attacks and Defenses within LLM Ecosystem | Yanxu Mao et.al. | 2506.15170 | null |
| 2025-06-18 | Efficient reallocation of indivisible resources: Pair-efficiency versus Pareto-efficiency | Pinaki Mandal et.al. | 2506.15169 | null |
| 2025-06-18 | LLM Agent for Hyper-Parameter Optimization | Wanzhe Wang et.al. | 2506.15167 | null |
| 2025-06-18 | Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs | Jing Yang Lee et.al. | 2506.15131 | null |
| 2025-06-19 | Local Differential Privacy for Distributed Stochastic Aggregative Optimization with Guaranteed Optimality | Ziqin Chen et.al. | 2506.15106 | null |
| 2025-06-18 | DyNaVLM: Zero-Shot Vision-Language Navigation System with Dynamic Viewpoints and Self-Refining Graph Memory | Zihe Ji et.al. | 2506.15096 | null |
| 2025-06-18 | EmojiVoice: Towards long-term controllable expressivity in robot speech | Paige Tuttösí et.al. | 2506.15085 | null |
| 2025-06-18 | HEAL: An Empirical Study on Hallucinations in Embodied Agents Driven by Large Language Models | Trishna Chakraborty et.al. | 2506.15065 | null |
| 2025-06-18 | 2BSDE with uncertain horizon and application to stochastic control in erratic environments | Alberto Gennaro et.al. | 2506.15037 | null |
| 2025-06-19 | Context Matters: Learning Generalizable Rewards via Calibrated Features | Alexandra Forsey-Smerek et.al. | 2506.15012 | null |
| 2025-06-17 | MEAL: A Benchmark for Continual Multi-Agent Reinforcement Learning | Tristan Tomilin et.al. | 2506.14990 | link |
| 2025-06-17 | Fair Algorithms with Probing for Multi-Agent Multi-Armed Bandits | Tianyi Xu et.al. | 2506.14988 | null |
| 2025-06-17 | OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents | Thomas Kuntz et.al. | 2506.14866 | link |
| 2025-06-17 | Cost-Efficient Serving of LLM Agents via Test-Time Plan Caching | Qizheng Zhang et.al. | 2506.14852 | null |
| 2025-06-13 | Recent Advances in Multi-Agent Human Trajectory Prediction: A Comprehensive Review | Céline Finet et.al. | 2506.14831 | null |
| 2025-06-17 | RobotSmith: Generative Robotic Tool Design for Acquisition of Complex Manipulation Skills | Chunru Lin et.al. | 2506.14763 | null |
| 2025-06-17 | Swarm-STL: A Framework for Motion Planning in Large-Scale, Multi-Swarm Systems | Shiyu Cheng et.al. | 2506.14749 | null |
| 2025-06-17 | AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes | Jiahao Qiu et.al. | 2506.14728 | null |
| 2025-06-17 | Linear Planar 3-SAT and Its Applications in Planning | Victorien Desbois et.al. | 2506.14713 | null |
| 2025-06-17 | AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions | Aishan Liu et.al. | 2506.14697 | null |
| 2025-06-17 | Factor-Graph-Based Passive Acoustic Navigation for Decentralized Cooperative Localization Using Bearing Elevation Depth Difference | Kalliyan Velasco et.al. | 2506.14690 | null |
| 2025-06-17 | Unified Software Engineering agent as AI Software Engineer | Leonhard Applis et.al. | 2506.14683 | null |
| 2025-06-17 | StreetLens: Enabling Human-Centered AI Agents for Neighborhood Assessment from Street View Imagery | Jina Kim et.al. | 2506.14670 | null |
| 2025-06-17 | SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning | Hexian Ni et.al. | 2506.14648 | null |
| 2025-06-17 | GenerationPrograms: Fine-grained Attribution with Executable Programs | David Wan et.al. | 2506.14580 | link |
| 2025-06-17 | Doppelgänger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack | Daewon Kang et.al. | 2506.14539 | null |
| 2025-06-17 | Automated Decision-Making on Networks with LLMs through Knowledge-Guided Evolution | Xiaohan Zheng et.al. | 2506.14529 | null |
| 2025-06-17 | SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks | Zijian Song et.al. | 2506.14512 | null |
| 2025-06-17 | Toward Safety-First Human-Like Decision Making for Autonomous Vehicles in Time-Varying Traffic Flow | Xiao Wang et.al. | 2506.14502 | null |
| 2025-06-17 | LLM-Powered Swarms: A New Frontier or a Conceptual Stretch? | Muhammad Atta Ur Rahman et.al. | 2506.14496 | null |
| 2025-06-17 | GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies | Jingqi Yang et.al. | 2506.14477 | link |
| 2025-06-17 | SimSpark: Interactive Simulation of Social Media Behaviors | Ziyue Lin et.al. | 2506.14476 | null |
| 2025-06-17 | Hamiltonian Formalism for Comparing Quantum and Classical Intelligence | Elija Perrier et.al. | 2506.14456 | null |
| 2025-06-17 | Active Digital Twins via Active Inference | Matteo Torzoni et.al. | 2506.14453 | null |
| 2025-06-17 | Adaptive Reinforcement Learning for Unobservable Random Delays | John Wikman et.al. | 2506.14411 | null |
| 2025-06-17 | System 0: Transforming Artificial Intelligence into a Cognitive Extension | Massimo Chiriatti et.al. | 2506.14376 | null |
| 2025-06-18 | ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies | Jinyan Yuan et.al. | 2506.14315 | null |
| 2025-06-17 | Expectation Confirmation Preference Optimization for Multi-Turn Conversational Recommendation Agent | Xueyang Feng et.al. | 2506.14302 | null |
| 2025-06-17 | ADRD: LLM-Driven Autonomous Driving Based on Rule-based Decision Systems | Fanzhi Zeng et.al. | 2506.14299 | null |
| 2025-06-17 | From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents | Seongbo Jang et.al. | 2506.14285 | link |
| 2025-06-17 | Mxplainer: Explain and Learn Insights by Imitating Mahjong Agents | Lingfeng Li et.al. | 2506.14246 | link |
| 2025-06-17 | A Novel Indicator for Quantifying and Minimizing Information Utility Loss of Robot Teams | Xiyu Zhao et.al. | 2506.14237 | null |
| 2025-06-17 | Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team | Md Tanzib Hosain et.al. | 2506.14234 | null |
| 2025-06-17 | AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents | Jingxu Xie et.al. | 2506.14205 | link |
| 2025-06-17 | MAS-LitEval : Multi-Agent System for Literary Translation Quality Assessment | Junghwan Kim et.al. | 2506.14199 | null |
| 2025-06-17 | Hierarchical Multi-Agent Reinforcement Learning-based Coordinated Spatial Reuse for Next Generation WLANs | Jiaming Yu et.al. | 2506.14187 | null |
| 2025-06-17 | Affective-CARA: A Knowledge Graph Driven Framework for Culturally Adaptive Emotional Intelligence in HCI | Nirodya Pussadeniya et.al. | 2506.14166 | null |
| 2025-06-17 | Light Aircraft Game : Basic Implementation and training results analysis | Hanzhong Cao et.al. | 2506.14164 | link |
| 2025-06-17 | Common Benchmarks Undervalue the Generalization Power of Programmatic Policies | Amirhossein Rajabpour et.al. | 2506.14162 | link |
| 2025-06-17 | StorySage: Conversational Autobiography Writing Powered by a Multi-Agent Framework | Shayan Talaei et.al. | 2506.14159 | null |
| 2025-06-17 | Dividing Conflicting Items Fairly | Ayumi Igarashi et.al. | 2506.14149 | null |
| 2025-06-17 | RadFabric: Agentic AI System with Reasoning Capability for Radiology | Wenting Chen et.al. | 2506.14142 | null |
| 2025-06-17 | FormGym: Doing Paperwork with Agents | Matthew Toles et.al. | 2506.14079 | null |
| 2025-06-17 | Comprehensive Verilog Design Problems: A Next-Generation Benchmark Dataset for Evaluating Large Language Models and Agents on RTL Design and Verification | Nathaniel Pinckney et.al. | 2506.14074 | link |
| 2025-06-16 | Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning | Martin Klissarov et.al. | 2506.14045 | null |
| 2025-06-16 | SimpleDoc: Multi-Modal Document Understanding with Dual-Cue Page Retrieval and Iterative Refinement | Chelsi Jain et.al. | 2506.14035 | link |
| 2025-06-16 | A Cooperative Contactless Object Transport with Acoustic Robots | Narsimlu Kemsaram et.al. | 2506.13957 | link |
| 2025-06-16 | ReinDSplit: Reinforced Dynamic Split Learning for Pest Recognition in Precision Agriculture | Vishesh Kumar Tanwar et.al. | 2506.13935 | null |
| 2025-06-16 | How Does LLM Reasoning Work for Code? A Survey and a Call to Action | Ira Ceka et.al. | 2506.13932 | null |
| 2025-06-16 | Spec2RTL-Agent: Automated Hardware Code Generation from Complex Specifications Using LLM Agent Systems | Zhongzhi Yu et.al. | 2506.13905 | null |
| 2025-06-16 | LocationReasoner: Evaluating LLMs on Real-World Site Selection Reasoning | Miho Koda et.al. | 2506.13841 | link |
| 2025-06-16 | Recent trends in socio-epidemic modelling: behaviours and their determinants | Daniele Proverbio et.al. | 2506.13837 | null |
| 2025-06-15 | The Reflexive Integrated Information Unit: A Differentiable Primitive for Artificial Consciousness | Gnankan Landry Regis N'guessan et.al. | 2506.13825 | link |
| 2025-06-15 | The Synthetic Mirror -- Synthetic Data at the Age of Agentic AI | Marcelle Momha et.al. | 2506.13818 | null |
| 2025-06-14 | DeepSeq: High-Throughput Single-Cell RNA Sequencing Data Labeling via Web Search-Augmented Agentic Generative AI Foundation Models | Saleem A. Al Dajani et.al. | 2506.13817 | null |
| 2025-06-13 | Investigating the Potential of Large Language Model-Based Router Multi-Agent Architectures for Foundation Design Automation: A Task Classification and Expert Selection Study | Sompote Youwai et.al. | 2506.13811 | null |
| 2025-06-13 | Causality in the human niche: lessons for machine learning | Richard D. Lange et.al. | 2506.13803 | null |
| 2025-06-13 | Enhancing Clinical Decision Support and EHR Insights through LLMs and the Model Context Protocol: An Open-Source MCP-FHIR Framework | Abul Ehtesham et.al. | 2506.13800 | null |
| 2025-06-16 | MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering | Arya Fayyazi et.al. | 2506.13755 | null |
| 2025-06-16 | PB |
Brahim Driss et.al. | 2506.13741 | null |
| 2025-06-16 | The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning | Jiashun Liu et.al. | 2506.13672 | null |
| 2025-06-16 | We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems | Junfeng Fang et.al. | 2506.13666 | link |
| 2025-06-16 | Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning | Shulin Tian et.al. | 2506.13654 | null |
| 2025-06-16 | xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations | Kaiyuan Chen et.al. | 2506.13651 | null |
| 2025-06-16 | Deceptive Path Planning: A Bayesian Game Approach | Violetta Rostobaya et.al. | 2506.13650 | null |
| 2025-06-16 | CAMS: A CityGPT-Powered Agentic Framework for Urban Human Mobility Simulation | Yuwei Du et.al. | 2506.13599 | null |
| 2025-06-16 | Agent Capability Negotiation and Binding Protocol (ACNBP) | Ken Huang et.al. | 2506.13590 | link |
| 2025-06-16 | Non-exchangeable mean-field theory for adaptive weights: propagation of chaos and graphon sampling lemma | Datong Zhou et.al. | 2506.13587 | null |
| 2025-06-16 | Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes | Bernhard Hilpert et.al. | 2506.13583 | null |
| 2025-06-17 | A Production Scheduling Framework for Reinforcement Learning Under Real-World Constraints | Jonathan Hoss et.al. | 2506.13566 | link |
| 2025-06-16 | Learning Swing-up Maneuvers for a Suspended Aerial Manipulation Platform in a Hierarchical Control Framework | Hemjyoti Das et.al. | 2506.13478 | null |
| 2025-06-16 | Language Agents for Hypothesis-driven Clinical Decision Making with Reinforcement Learning | David Bani-Harouni et.al. | 2506.13474 | null |
| 2025-06-16 | A Two-stage Optimization Method for Wide-range Single-electron Quantum Magnetic Sensing | Shiqian Guo et.al. | 2506.13469 | null |
| 2025-06-16 | Towards a Formal Specification for Self-organized Shape Formation in Swarm Robotics | YR Darr et.al. | 2506.13453 | null |
| 2025-06-16 | Learning to Explore in Diverse Reward Settings via Temporal-Difference-Error Maximization | Sebastian Griesbach et.al. | 2506.13345 | link |
| 2025-06-16 | Towards Pervasive Distributed Agentic Generative AI -- A State of The Art | Gianni Molinari et.al. | 2506.13324 | null |
| 2025-06-16 | RL-Guided MPC for Autonomous Greenhouse Control | Salim Msaad et.al. | 2506.13278 | null |
| 2025-06-16 | Screen Reader Users in the Vibe Coding Era: Adaptation, Empowerment, and New Accessibility Landscape | Nan Chen et.al. | 2506.13270 | null |
| 2025-06-16 | Reconstruction-free magnetic control of DIII-D plasma with deep reinforcement learning | G. F. Subbotin et.al. | 2506.13267 | null |
| 2025-06-16 | COME: Adding Scene-Centric Forecasting Control to Occupancy World Model | Yining Shi et.al. | 2506.13260 | link |
| 2025-06-16 | On Immutable Memory Systems for Artificial Agents: A Blockchain-Indexed Automata-Theoretic Framework Using ECDH-Keyed Merkle Chains | Craig Steven Wright et.al. | 2506.13246 | null |
| 2025-06-16 | A Game-Theoretic Negotiation Framework for Cross-Cultural Consensus in LLMs | Guoxi Zhang et.al. | 2506.13245 | null |
| 2025-06-16 | Mixed-variable policy-based optimization | Jonathan Viquerat et.al. | 2506.13240 | null |
| 2025-06-16 | Research on Optimal Control Problem Based on Reinforcement Learning under Knightian Uncertainty | Ziyu Li et.al. | 2506.13207 | null |
| 2025-06-19 | Screen Hijack: Visual Poisoning of VLM Agents in Mobile Environments | Xuan Wang et.al. | 2506.13205 | null |
| 2025-06-16 | Querying Large Automotive Software Models: Agentic vs. Direct LLM Approaches | Lukasz Mazur et.al. | 2506.13171 | null |
| 2025-06-16 | Efficient Algorithms for Logistic Contextual Slate Bandits with Bandit Feedback | Tanmay Goyal et.al. | 2506.13163 | null |
| 2025-06-16 | Dynamic Preference Multi-Objective Reinforcement Learning for Internet Network Management | DongNyeong Heo et.al. | 2506.13153 | null |
| 2025-06-16 | AlphaEvolve: A coding agent for scientific and algorithmic discovery | Alexander Novikov et.al. | 2506.13131 | null |
| 2025-06-16 | Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning | Stella C. Dong et.al. | 2506.13113 | null |
| 2025-06-16 | Leveraging In-Context Learning for Language Model Agents | Shivanshu Gupta et.al. | 2506.13109 | null |
| 2025-06-17 | Towards the Autonomous Optimization of Urban Logistics: Training Generative AI with Scientific Tools via Agentic Digital Twins and Model Context Protocol | Haowen Xu et.al. | 2506.13068 | link |
| 2025-06-16 | MotiveBench: How Far Are We From Human-Like Motivational Reasoning in Large Language Models? | Xixian Yong et.al. | 2506.13065 | null |
| 2025-06-16 | PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue | George Shaikovski et.al. | 2506.13063 | null |
| 2025-06-16 | MAGIC: Multi-Agent Argumentation and Grammar Integrated Critiquer | Joaquin Jordan et.al. | 2506.13037 | null |
| 2025-06-15 | Discovering Coordinated Processes From Social Online Networks | Anna Kalenkova et.al. | 2506.12988 | link |
| 2025-06-15 | On Hierarchies of Fairness Notions in Cake Cutting: From Proportionality to Super Envy-Freeness | Arnav Mehra et.al. | 2506.12950 | null |
| 2025-06-15 | Scaling Test-time Compute for LLM Agents | King Zhu et.al. | 2506.12928 | null |
| 2025-06-15 | Sectoral Coupling in Linguistic State Space | Sebastian Dumbrava et.al. | 2506.12927 | null |
| 2025-06-15 | Distributed Composite Optimization with Sub-Weibull Noises | Zhan Yu et.al. | 2506.12901 | null |
| 2025-06-15 | Homeostatic Coupling for Prosocial Behavior | Naoto Yoshida et.al. | 2506.12894 | null |
| 2025-06-15 | Exploring the Potential of Metacognitive Support Agents for Human-AI Co-Creation | Frederic Gmeiner et.al. | 2506.12879 | null |
| 2025-06-15 | WereWolf-Plus: An Update of Werewolf Game setting Based on DSGBench | Xinyuan Xia et.al. | 2506.12841 | null |
| 2025-06-15 | Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models | Tung Minh Luu et.al. | 2506.12822 | null |
| 2025-06-15 | PDCNet: a benchmark and general deep learning framework for activity prediction of peptide-drug conjugates | Yun Liu et.al. | 2506.12821 | null |
| 2025-06-15 | Mastering Da Vinci Code: A Comparative Study of Transformer, LLM, and PPO-based Agents | LeCheng Zhang et.al. | 2506.12801 | null |
| 2025-06-15 | Resilient-native and Intelligent NextG Systems | Mehdi Bennis et.al. | 2506.12795 | null |
| 2025-06-15 | Revealing the Challenges of Sim-to-Real Transfer in Model-Based Reinforcement Learning via Latent Space Modeling | Zhilin Lin et.al. | 2506.12735 | null |
| 2025-06-15 | Multimodal Large Language Models-Enabled UAV Swarm: Towards Efficient and Intelligent Autonomous Aerial Systems | Yuqi Ping et.al. | 2506.12710 | null |
| 2025-06-15 | SoK: The Privacy Paradox of Large Language Models: Advancements, Privacy Risks, and Mitigation | Yashothara Shanmugarasa et.al. | 2506.12699 | null |
| 2025-06-15 | SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation | Xiaofeng Shi et.al. | 2506.12689 | null |
| 2025-06-14 | LIFELONG SOTOPIA: Evaluating Social Intelligence of Language Agents Over Lifelong Social Interactions | Hitesh Goel et.al. | 2506.12666 | null |
| 2025-06-14 | Behavioral Generative Agents for Energy Operations | Cong Chen et.al. | 2506.12664 | null |
| 2025-06-14 | Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics | Jiarui Liu et.al. | 2506.12657 | null |
| 2025-06-14 | Mapping Neural Signals to Agent Performance, A Step Towards Reinforcement Learning from Neural Feedback | Julia Santaniello et.al. | 2506.12636 | null |
| 2025-06-14 | Towards Building General Purpose Embedding Models for Industry 4.0 Agents | Christodoulos Constantinides et.al. | 2506.12607 | null |
| 2025-06-17 | The Rise of AI Companions: How Human-Chatbot Relationships Influence Well-Being | Yutong Zhang et.al. | 2506.12605 | null |
| 2025-06-14 | Trust-MARL: Trust-Based Multi-Agent Reinforcement Learning Framework for Cooperative On-Ramp Merging Control in Heterogeneous Traffic Flow | Jie Pan et.al. | 2506.12600 | null |
| 2025-06-14 | Moment Restrictions for Nonlinear Panel Data Models with Feedback | Stéphane Bonhomme et.al. | 2506.12569 | null |
| 2025-06-17 | AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task Solving | Wentao Zhang et.al. | 2506.12508 | link |
| 2025-06-18 | Wasserstein-Barycenter Consensus for Cooperative Multi-Agent Reinforcement Learning | Ali Baheri et.al. | 2506.12497 | null |
| 2025-06-14 | Tiered Agentic Oversight: A Hierarchical Multi-Agent System for AI Safety in Healthcare | Yubin Kim et.al. | 2506.12482 | null |
| 2025-06-14 | Generalizable Trajectory Prediction via Inverse Reinforcement Learning with Mamba-Graph Architecture | Wenyun Li et.al. | 2506.12474 | null |
| 2025-06-14 | Levels of Autonomy for AI Agents | K. J. Kevin Feng et.al. | 2506.12469 | null |
| 2025-06-14 | Adding links wisely: how an influencer seeks for leadership in opinion dynamics? | Lingfei Wang et.al. | 2506.12463 | null |
| 2025-06-14 | Topology-Assisted Spatio-Temporal Pattern Disentangling for Scalable MARL in Large-scale Autonomous Traffic Control | Rongpeng Li et.al. | 2506.12453 | null |
| 2025-06-14 | Plan Your Travel and Travel with Your Plan: Wide-Horizon Planning and Evaluation via LLM | Dongjie Yang et.al. | 2506.12421 | null |
| 2025-06-14 | Ghost Policies: A New Paradigm for Understanding and Learning from Failure in Deep Reinforcement Learning | Xabier Olaz et.al. | 2506.12366 | null |
| 2025-06-17 | Sharp Tools: How Developers Wield Agentic AI in Real Software Engineering Tasks | Aayush Kumar et.al. | 2506.12347 | null |
| 2025-06-14 | SheetMind: An End-to-End LLM-Powered Multi-Agent Framework for Spreadsheet Automation | Ruiyan Zhu et.al. | 2506.12339 | link |
| 2025-06-14 | Artificial Intelligence in Team Dynamics: Who Gets Replaced and Why? | Xienan Cheng et.al. | 2506.12337 | null |
| 2025-06-14 | IndoorWorld: Integrating Physical Task Solving and Social Simulation in A Heterogeneous Multi-Agent Environment | Dekun Wu et.al. | 2506.12331 | null |
| 2025-06-14 | Similar Formation Control of Multi-Agent Systems over Directed Acyclic Graphs via Matrix-Weighted Laplacian | Zhipeng Fan et.al. | 2506.12297 | null |
| 2025-06-13 | Cloud Infrastructure Management in the Age of AI Agents | Zhenning Yang et.al. | 2506.12270 | null |
| 2025-06-13 | The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs | Avinash Baidya et.al. | 2506.12266 | null |
| 2025-06-13 | Reversing the Paradigm: Building AI-First Systems with Human Guidance | Cosimo Spera et.al. | 2506.12245 | null |
| 2025-06-13 | Privacy Reasoning in Ambiguous Contexts | Ren Yi et.al. | 2506.12241 | null |
| 2025-06-13 | A Fast, Reliable, and Secure Programming Language for LLM Agents with Code Actions | Stephen Mell et.al. | 2506.12202 | null |
| 2025-06-13 | PRO-V: An Efficient Program Generation Multi-Agent System for Automatic RTL Verification | Yujie Zhao et.al. | 2506.12200 | link |
| 2025-06-13 | OSI Stack Redesign for Quantum Networks: Requirements, Technologies, Challenges, and Future Directions | Shakil Ahmed et.al. | 2506.12195 | null |
| 2025-06-13 | Because we have LLMs, we Can and Should Pursue Agentic Interpretability | Been Kim et.al. | 2506.12152 | null |
| 2025-06-13 | Eliciting Reasoning in Language Models with Cognitive Tools | Brown Ebouky et.al. | 2506.12115 | null |
| 2025-06-13 | EconGym: A Scalable AI Testbed with Diverse Economic Tasks | Qirui Mi et.al. | 2506.12110 | null |
| 2025-06-13 | DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents | Hao Li et.al. | 2506.12104 | link |
| 2025-06-12 | "I Hadn't Thought About That": Creators of Human-like AI Weigh in on Ethics And Neurodivergence | Naba Rizvi et.al. | 2506.12098 | null |
| 2025-06-12 | DoublyAware: Dual Planning and Policy Awareness for Temporal Difference Learning in Humanoid Locomotion | Khang Nguyen et.al. | 2506.12095 | null |
| 2025-06-12 | Military AI Cyber Agents (MAICAs) Constitute a Global Threat to Critical Infrastructure | Timothy Dubber et.al. | 2506.12094 | null |
| 2025-06-13 | Affogato: Learning Open-Vocabulary Affordance Grounding with Automated Data Generation at Scale | Junha Lee et.al. | 2506.12009 | null |
| 2025-06-13 | Upgrade or Switch: Do We Need a New Registry Architecture for the Internet of AI Agents? | Ramesh Raskar et.al. | 2506.12003 | null |
| 2025-06-13 | Self-Regulating Cars: Automating Traffic Control in Free Flow Road Networks | Ankit Bhardwaj et.al. | 2506.11973 | null |
| 2025-06-13 | Visual Pre-Training on Unlabeled Images using Reinforcement Learning | Dibya Ghosh et.al. | 2506.11967 | null |
| 2025-06-13 | Automated Treatment Planning for Interstitial HDR Brachytherapy for Locally Advanced Cervical Cancer using Deep Reinforcement Learning | Mohammadamin Moradi et.al. | 2506.11957 | null |
| 2025-06-13 | Secure API-Driven Research Automation to Accelerate Scientific Discovery | Tyler J. Skluzacek et.al. | 2506.11950 | null |
| 2025-06-13 | Breaking Habits: On the Role of the Advantage Function in Learning Causal State Representations | Miguel Suau et.al. | 2506.11912 | null |
| 2025-06-13 | Palpation Alters Auditory Pain Expressions with Gender-Specific Variations in Robopatients | Chapa Sirithunge et.al. | 2506.11906 | null |
| 2025-06-13 | An Explainable AI Framework for Dynamic Resource Management in Vehicular Network Slicing | Haochen Sun et.al. | 2506.11882 | null |
| 2025-06-13 | Your Ride, Your Rules: Psychology and Cognition Enabled Automated Driving Systems | Zhipeng Bao et.al. | 2506.11842 | null |
| 2025-06-13 | Mean Field Games without Rational Expectations | Benjamin Moll et.al. | 2506.11838 | null |
| 2025-06-13 | The Space Between Us: A Methodological Framework for Researching Bonding and Proxemics in Situated Group-Agent Interactions | Ana Müller et.al. | 2506.11829 | null |
| 2025-06-13 | Revealing Political Bias in LLMs through Structured Multi-Agent Debate | Aishwarya Bandaru et.al. | 2506.11825 | link |
| 2025-06-13 | PE-MA: Parameter-Efficient Co-Evolution of Multi-Agent Systems | Yingfan Deng et.al. | 2506.11803 | null |
| 2025-06-13 | Solving Inverse Problems in Stochastic Self-Organising Systems through Invariant Representations | Elias Najarro et.al. | 2506.11796 | link |
| 2025-06-13 | ALEA IACTA EST: A Declarative Domain-Specific Language for Manually Performable Random Experiments | Baltasar Trancón y Widemann et.al. | 2506.11794 | null |
| 2025-06-13 | SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks | Hwiwon Lee et.al. | 2506.11791 | link |
| 2025-06-16 | AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home Environments | Zikang Leng et.al. | 2506.11773 | null |
| 2025-06-13 | Convergence to equilibrium for a class of exchange economies | R. S. MacKay et.al. | 2506.11770 | null |
| 2025-06-13 | DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents | Mingxuan Du et.al. | 2506.11763 | null |
| 2025-06-13 | Bias and Identifiability in the Bounded Confidence Model | Claudio Borile et.al. | 2506.11751 | null |
| 2025-06-13 | Interaction, Process, Infrastructure: A Unified Architecture for Human-Agent Collaboration | Yun Wang et.al. | 2506.11718 | null |
| 2025-06-13 | Generalised Rate Control Approach For Stream Processing Applications | Ziren Xiao et.al. | 2506.11710 | null |
| 2025-06-13 | Growing with Experience: Growing Neural Networks in Deep Reinforcement Learning | Lukas Fehring et.al. | 2506.11706 | null |
| 2025-06-17 | A Hybrid Multi-Agent Prompting Approach for Simplifying Complex Sentences | Pratibha Zunjare et.al. | 2506.11681 | null |
| 2025-06-13 | Robot Context Protocol (RCP): A Runtime-Agnostic Interface for Agent-Aware Robot Control | Lambert Lee et.al. | 2506.11650 | null |
| 2025-06-13 | High Probability Convergence of Distributed Clipped Stochastic Gradient Descent with Heavy-tailed Noise | Yuchen Yang et.al. | 2506.11647 | null |
| 2025-06-13 | LoRA-Gen: Specializing Large Language Model via Online LoRA Generation | Yicheng Xiao et.al. | 2506.11638 | null |
| 2025-06-13 | "If we misunderstand the client, we misspend 100 hours": Exploring conversational AI and response types for information elicitation | Daniel Hove Paludan et.al. | 2506.11610 | null |
| 2025-06-13 | Learn to Preserve Personality: Federated Foundation Models in Recommendations | Zhiwei Li et.al. | 2506.11563 | null |
| 2025-06-13 | AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction | Syeda Kisaa Fatima et.al. | 2506.11475 | null |
| 2025-06-13 | Linear-quadratic stochastic nonzero-sum differential games between graphon teams | De-xuan Xu et.al. | 2506.11468 | null |
| 2025-06-13 | Resolve Highway Conflict in Multi-Autonomous Vehicle Controls with Local State Attention | Xuan Duy Ta et.al. | 2506.11445 | null |
| 2025-06-13 | ReVeal: Self-Evolving Code Agents via Iterative Generation-Verification | Yiyang Jin et.al. | 2506.11442 | null |
| 2025-06-13 | Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards | Jeff Da et.al. | 2506.11425 | null |
| 2025-06-13 | FocalAD: Local Motion Planning for End-to-End Autonomous Driving | Bin Sun et.al. | 2506.11419 | null |
| 2025-06-13 | Complexity guarantees for risk-neutral generalized Nash equilibrium problems | Haochen Tao et.al. | 2506.11409 | null |
| 2025-06-13 | Large Language Model-Powered Conversational Agent Delivering Problem-Solving Therapy (PST) for Family Caregivers: Enhancing Empathy and Therapeutic Alliance Using In-Context Learning | Liying Wang et.al. | 2506.11376 | null |
| 2025-06-12 | From Replication to Redesign: Exploring Pairwise Comparisons for LLM-Based Peer Review | Yaohui Zhang et.al. | 2506.11343 | null |
| 2025-06-12 | A Hybrid Adaptive Nash Equilibrium Solver for Distributed Multi-Agent Systems with Game-Theoretic Jump Triggering | Qiuyu Miao et.al. | 2506.11304 | null |
| 2025-06-12 | TARDIS STRIDE: A Spatio-Temporal Road Image Dataset for Exploration and Autonomy | Héctor Carrión et.al. | 2506.11302 | link |
| 2025-06-12 | Shapley Machine: A Game-Theoretic Framework for N-Agent Ad Hoc Teamwork | Jianhong Wang et.al. | 2506.11285 | link |
| 2025-06-12 | Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation | Benjamin Elder et.al. | 2506.11266 | null |
| 2025-06-12 | Sensor Model Identification via Simultaneous Model Selection and State Variable Determination | Christian Brommer et.al. | 2506.11263 | null |
| 2025-06-12 | LLM-as-a-Judge for Reference-less Automatic Code Validation and Refinement for Natural Language to Bash in IT Automation | Ngoc Phuoc An Vo et.al. | 2506.11237 | null |
| 2025-06-12 | Beyond Formal Semantics for Capabilities and Skills: Model Context Protocol in Manufacturing | Luis Miguel Vieira da Silva et.al. | 2506.11180 | null |
| 2025-06-12 | Collapsing Sequence-Level Data-Policy Coverage via Poisoning Attack in Offline Reinforcement Learning | Xue Zhou et.al. | 2506.11172 | null |
| 2025-06-11 | ADAgent: LLM Agent for Alzheimer's Disease Analysis with Collaborative Coordinator | Wenlong Hou et.al. | 2506.11150 | null |
| 2025-06-11 | Autonomous Computer Vision Development with Agentic AI | Jin Kim et.al. | 2506.11140 | link |
| 2025-06-10 | GUIRoboTron-Speech: Towards Automated GUI Agents Based on Speech Instructions | Wenkang Han et.al. | 2506.11127 | null |
| 2025-06-12 | AutoMind: Adaptive Knowledgeable Agent for Automated Data Science | Yixin Ou et.al. | 2506.10974 | link |
| 2025-06-12 | Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop | Justin Kerr et.al. | 2506.10968 | null |
| 2025-06-12 | SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks | Lianghong Guo et.al. | 2506.10954 | link |
| 2025-06-12 | Build the web for agents, not agents for the web | Xing Han Lù et.al. | 2506.10953 | null |
| 2025-06-14 | Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors | Chen Yueh-Han et.al. | 2506.10949 | link |
| 2025-06-12 | Execution Guided Line-by-Line Code Generation | Boaz Lavon et.al. | 2506.10948 | link |
| 2025-06-12 | Dynamic Epistemic Friction in Dialogue | Timothy Obiso et.al. | 2506.10934 | null |
| 2025-06-12 | Agentic Semantic Control for Autonomous Wireless Space Networks: Extending Space-O-RAN with MCP-Driven Distributed Intelligence | Eduardo Baena et.al. | 2506.10925 | null |
| 2025-06-12 | Prediction and control of geometry-induced nematic order in growing multicellular systems | Lukas Hupe et.al. | 2506.10867 | null |
| 2025-06-12 | CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training | Alireza Salemi et.al. | 2506.10844 | link |
| 2025-06-12 | Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches | Andrea Moglia et.al. | 2506.10825 | null |
| 2025-06-15 | VideoDeepResearch: Long Video Understanding With Agentic Tool Using | Huaying Yuan et.al. | 2506.10821 | link |
| 2025-06-13 | Joint Beamforming with Extremely Large Scale RIS: A Sequential Multi-Agent A2C Approach | Zhi Chai et.al. | 2506.10815 | null |
| 2025-06-12 | OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems | Xiaozhe Li et.al. | 2506.10764 | link |
| 2025-06-12 | Integrating Large Language Models into Text Animation: An Intelligent Editing System with Inline and Chat Interaction | Bao Zhang et.al. | 2506.10762 | null |
| 2025-06-12 | Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding | Yuhang Zhang et.al. | 2506.10756 | null |
| 2025-06-12 | Neural at ArchEHR-QA 2025: Agentic Prompt Optimization for Evidence-Grounded Clinical Question Answering | Sai Prasanna Teja Reddy Bogireddy et.al. | 2506.10751 | null |
| 2025-06-12 | Cursed Equilibria and Knightian Uncertainty in a Trading Game | Jurek Preker et.al. | 2506.10663 | null |
| 2025-06-12 | SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis | Sergio Burdisso et.al. | 2506.10622 | link |
| 2025-06-12 | AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation | Haoyuan Shi et.al. | 2506.10540 | null |
| 2025-06-12 | Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models | Sangmin Song et.al. | 2506.10504 | null |
| 2025-06-12 | BugGen: A Self-Correcting Multi-Agent LLM Pipeline for Realistic RTL Bug Synthesis | Surya Jasper et.al. | 2506.10501 | null |
| 2025-06-16 | Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity Applications | Felix Härer et.al. | 2506.10467 | link |
| 2025-06-12 | Are We Generalizing from the Exception? An In-the-Wild Study on Group-Sensitive Conversation Design in Human-Agent Interactions | Ana Müller et.al. | 2506.10462 | null |
| 2025-06-12 | Equitable Mechanism Design for Facility Location | Toby Walsh et.al. | 2506.10460 | null |
| 2025-06-12 | Multi-dimensional Autoscaling of Processing Services: A Comparison of Agent-based Methods | Boris Sedlak et.al. | 2506.10420 | null |
| 2025-06-12 | Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges | Jintao Liang et.al. | 2506.10408 | null |
| 2025-06-12 | EQA-RM: A Generative Embodied Reward Model with Test-time Scaling | Yuhang Chen et.al. | 2506.10389 | null |
| 2025-06-12 | Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills | Yuquan Xie et.al. | 2506.10387 | null |
| 2025-06-12 | NeuroPAL: Punctuated Anytime Learning with Neuroevolution for Macromanagement in Starcraft: Brood War | Jim O'Connor et.al. | 2506.10384 | null |
| 2025-06-12 | Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts | Zaijing Li et.al. | 2506.10357 | null |
| 2025-06-12 | Provably Learning from Language Feedback | Wanqiao Xu et.al. | 2506.10341 | null |
| 2025-06-12 | Seeding an Uncertain Technology | Eric Gao et.al. | 2506.10340 | null |
| 2025-06-13 | A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon | Cameron Angliss et.al. | 2506.10326 | link |
| 2025-06-12 | Minimizing False Positives in Static Bug Detection via LLM-Enhanced Path Feasibility Analysis | Xueying Du et.al. | 2506.10322 | null |
| 2025-06-12 | WGSR-Bench: Wargame-based Game-theoretic Strategic Reasoning Benchmark for Large Language Models | Qiyue Yin et.al. | 2506.10264 | null |
| 2025-06-12 | Enhancing Ultrasound Molecular Imaging: Toward Real-Time RPCA-Based Filtering to Differentiate Bound and Free Microbubbles | Hoda S. Hashemi et.al. | 2506.10257 | null |
| 2025-06-15 | Extended Creativity: A Conceptual Framework for Understanding Human-AI Creative Relations | Andrea Gaggioli et.al. | 2506.10249 | null |
| 2025-06-11 | Towards Responsible AI: Advances in Safety, Fairness, and Accountability of Autonomous Systems | Filip Cano et.al. | 2506.10192 | null |
| 2025-06-11 | AURA: A Multi-Agent Intelligence Framework for Knowledge-Enhanced Cyber Threat Attribution | Nanda Rani et.al. | 2506.10175 | null |
| 2025-06-11 | A Navigation Framework Utilizing Vision-Language Models | Yicheng Duan et.al. | 2506.10172 | link |
| 2025-06-14 | Disclosure Audits for LLM Agents | Saswat Das et.al. | 2506.10171 | null |
| 2025-06-11 | Exploring EEG Responses during Observation of Actions Performed by Human Actor and Humanoid Robot | Anh T. Nguyen et.al. | 2506.10170 | null |
| 2025-06-11 | Rethinking Brain Tumor Segmentation from the Frequency Domain Perspective | Minye Shao et.al. | 2506.10142 | link |
| 2025-06-11 | Provable Sim-to-Real Transfer via Offline Domain Randomization | Arnaud Fickinger et.al. | 2506.10133 | null |
| 2025-06-11 | Chat-of-Thought: Collaborative Multi-Agent System for Generating Domain Specific Information | Christodoulos Constantinides et.al. | 2506.10086 | null |
| 2025-06-11 | Cybernetic Marionette: Channeling Collective Agency Through a Wearable Robot in a Live Dancer-Robot Duet | Anup Sathya et.al. | 2506.10079 | null |
| 2025-06-11 | A quantum semantic framework for natural language processing | Christopher J. Agostino et.al. | 2506.10077 | null |
| 2025-06-11 | Patient-Specific Deep Reinforcement Learning for Automatic Replanning in Head-and-Neck Cancer Proton Therapy | Malvern Madondo et.al. | 2506.10073 | null |
| 2025-06-11 | Cooling a Qubit using n Others | Jake Xuereb et.al. | 2506.10059 | link |
| 2025-06-17 | TaskCraft: Automated Generation of Agentic Tasks | Dingfeng Shi et.al. | 2506.10055 | link |
| 2025-06-11 | Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling | Tim Z. Xiao et.al. | 2506.09998 | null |
| 2025-06-11 | SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance | Wentao Ge et.al. | 2506.09968 | null |
| 2025-06-11 | The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability | Jiachen Hu et.al. | 2506.09940 | null |
| 2025-06-11 | On the Linear Programming Model for Dynamic Stochastic Matching and Its Application on Pricing | Junlin Chen et.al. | 2506.09924 | null |
| 2025-06-11 | PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants | Zheng Zhao et.al. | 2506.09902 | link |
| 2025-06-11 | "What are my options?": Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended) | Noel Brindise et.al. | 2506.09901 | null |
| 2025-06-11 | OctoNav: Towards Generalist Embodied Navigation | Chen Gao et.al. | 2506.09839 | null |
| 2025-06-11 | Automatic Treatment Planning using Reinforcement Learning for High-dose-rate Prostate Brachytherapy | Tonghe Wang et.al. | 2506.09805 | null |
| 2025-06-11 | Delegations as Adaptive Representation Patterns: Rethinking Influence in Liquid Democracy | Davide Grossi et.al. | 2506.09789 | null |
| 2025-06-11 | Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era | Shuo Jiang et.al. | 2506.09755 | null |
| 2025-06-11 | Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints | Xiangkai Zhang et.al. | 2506.09748 | null |
| 2025-06-11 | Feature Engineering for Agents: An Adaptive Cognitive Architecture for Interpretable ML Monitoring | Gusseppe Bravo-Rocca et.al. | 2506.09742 | null |
| 2025-06-11 | Patterns of Patterns III | Joseph Corneli et.al. | 2506.09696 | null |
| 2025-06-11 | Intent Factored Generation: Unleashing the Diversity in Your Language Model | Eltayeb Ahmed et.al. | 2506.09659 | null |
| 2025-06-11 | Application-Driven Value Alignment in Agentic AI Systems: Survey and Perspectives | Wei Zeng et.al. | 2506.09656 | null |
| 2025-06-11 | DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy | Kaixuan Xu et.al. | 2506.09655 | null |
| 2025-06-11 | Effective Red-Teaming of Policy-Adherent Agents | Itay Nakash et.al. | 2506.09600 | null |
| 2025-06-11 | VAULT: A Mobile Mapping System for ROS 2-based Autonomous Robots | Miguel Á. González-Santamarta et.al. | 2506.09583 | null |
| 2025-06-11 | MOORL: A Framework for Integrating Offline-Online Reinforcement Learning | Gaurav Chaudhary et.al. | 2506.09574 | null |
| 2025-06-11 | ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning | Yu Sun et.al. | 2506.09513 | link |
| 2025-06-11 | Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design | Andreas Schlaginhaufen et.al. | 2506.09508 | null |
| 2025-06-11 | A Unified Theory of Compositionality, Modularity, and Interpretability in Markov Decision Processes | Thomas J. Ringstrom et.al. | 2506.09499 | null |
| 2025-06-11 | Adv-BMT: Bidirectional Motion Transformer for Safety-Critical Traffic Scenario Generation | Yuxin Liu et.al. | 2506.09485 | null |
| 2025-06-11 | Optimizing Cooperative Multi-Object Tracking using Graph Signal Processing | Maria Damanaki et.al. | 2506.09469 | null |
| 2025-06-11 | Generalization Error Analysis for Attack-Free and Byzantine-Resilient Decentralized Learning with Data Heterogeneity | Haoxiang Ye et.al. | 2506.09438 | null |
| 2025-06-11 | When Is Diversity Rewarded in Cooperative Multi-Agent Learning? | Michael Amir et.al. | 2506.09434 | null |
| 2025-06-11 | A Call for Collaborative Intelligence: Why Human-Agent Systems Should Precede AI Autonomy | Henry Peng Zou et.al. | 2506.09420 | link |
| 2025-06-11 | Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models | Zongjie Li et.al. | 2506.09396 | null |
| 2025-06-15 | LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization | Jiaqi Tang et.al. | 2506.09373 | null |
| 2025-06-11 | ContextBuddy: AI-Enhanced Contextual Insights for Security Alert Investigation (Applied to Intrusion Detection) | Ronal Singh et.al. | 2506.09365 | null |
| 2025-06-11 | Intelligent System of Emergent Knowledge: A Coordination Fabric for Billions of Minds | Moshi Wei et.al. | 2506.09335 | null |
| 2025-06-11 | Multi-Agent Language Models: Advancing Cooperation, Coordination, and Adaptation | Arjun Vaithilingam Sudhakar et.al. | 2506.09331 | null |
| 2025-06-10 | UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench | Boxi Yu et.al. | 2506.09289 | link |
| 2025-06-10 | Improved Approximate EFX Guarantees for Multigraphs | Alireza Kaviani et.al. | 2506.09288 | null |
| 2025-06-10 | Learning The Minimum Action Distance | Lorenzo Steccanella et.al. | 2506.09276 | null |
| 2025-06-10 | Uncertainty Prioritized Experience Replay | Rodrigo Carrasco-Davis et.al. | 2506.09270 | null |
| 2025-06-10 | Agent-based Condition Monitoring Assistance with Multimodal Industrial Database Retrieval Augmented Generation | Karl Löwenmark et.al. | 2506.09247 | null |
| 2025-06-10 | Robust Noise Attenuation via Adaptive Pooling of Transformer Outputs | Greyson Brothers et.al. | 2506.09215 | null |
| 2025-06-10 | Optimal Task Offloading with Firm Deadlines for Mobile Edge Computing Systems | Khai Doan et.al. | 2506.09180 | null |
| 2025-06-10 | Robot-Gated Interactive Imitation Learning with Adaptive Intervention Mechanism | Haoyuan Cai et.al. | 2506.09176 | link |
| 2025-06-10 | MultiNet: An Open-Source Software Toolkit & Benchmark Suite for the Evaluation and Adaptation of Multimodal Action Models | Pranav Guruprasad et.al. | 2506.09172 | null |
| 2025-06-10 | Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search | Samuel Holt et.al. | 2506.09171 | null |
| 2025-06-10 | FAIRTOPIA: Envisioning Multi-Agent Guardianship for Disrupting Unfair AI Pipelines | Athena Vakali et.al. | 2506.09107 | null |
| 2025-06-10 | FinHEAR: Human Expertise and Adaptive Risk-Aware Temporal Reasoning for Financial Decision-Making | Jiaxiang Chen et.al. | 2506.09080 | null |
| 2025-06-08 | BG-HOP: A Bimanual Generative Hand-Object Prior | Sriram Krishna et.al. | 2506.09068 | link |
| 2025-06-10 | ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering | Yuki Imajuku et.al. | 2506.09050 | link |
| 2025-06-10 | VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning | Li Kang et.al. | 2506.09049 | null |
| 2025-06-10 | Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation | Xiaowen Ma et.al. | 2506.09046 | null |
| 2025-06-10 | The Decoupled Risk Landscape in Performative Prediction | Javier Sanguino et.al. | 2506.09044 | null |
| 2025-06-10 | Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Scheduling System | Yuan Guo et.al. | 2506.08972 | null |
| 2025-06-10 | Towards Robust Deep Reinforcement Learning against Environmental State Perturbation | Chenxu Wang et.al. | 2506.08961 | null |
| 2025-06-10 | What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities | Wendong Bu et.al. | 2506.08933 | null |
| 2025-06-10 | Enhancing generalizability of model discovery across parameter space with multi-experiment equation learning (ME-EQL) | Maria-Veronica Ciocanel et.al. | 2506.08916 | link |
| 2025-06-10 | Intention-Conditioned Flow Occupancy Models | Chongyi Zheng et.al. | 2506.08902 | link |
| 2025-06-10 | Pairwise similarity method for majority domination problem | N. I. Shushko et.al. | 2506.08886 | null |
| 2025-06-10 | Deploying SICNav in the Field: Safe and Interactive Crowd Navigation using MPC and Bilevel Optimization | Sepehr Samavi et.al. | 2506.08851 | null |
| 2025-06-10 | Agile Reinforcement Learning for Real-Time Task Scheduling in Edge Computing | Amin Avan et.al. | 2506.08850 | link |
| 2025-06-11 | Design Patterns for Securing LLM Agents against Prompt Injections | Luca Beurer-Kellner et.al. | 2506.08837 | null |
| 2025-06-10 | Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents | Irene Testini et.al. | 2506.08800 | null |
| 2025-06-10 | Improved LLM Agents for Financial Document Question Answering | Nelvin Tan et.al. | 2506.08726 | null |
| 2025-06-10 | PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly | Liang Ma et.al. | 2506.08708 | null |
| 2025-06-10 | Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs | Šimon Sedláček et.al. | 2506.08633 | null |
| 2025-06-10 | Modular Recurrence in Contextual MDPs for Universal Morphology Control | Laurens Engwegen et.al. | 2506.08630 | null |
| 2025-06-10 | Geometric Hyperscanning under Active Inference | Nicolas Hinrichs et.al. | 2506.08599 | null |
| 2025-06-10 | HGFormer: A Hierarchical Graph Transformer Framework for Two-Stage Colonel Blotto Games via Reinforcement Learning | Yang Lv et.al. | 2506.08580 | null |
| 2025-06-10 | Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations | Yibo Cui et.al. | 2506.08566 | null |
| 2025-06-10 | FEDTAIL: Federated Long-Tailed Domain Generalization with Sharpness-Guided Gradient Matching | Sunny Gupta et.al. | 2506.08518 | null |
| 2025-06-12 | MasHost Builds It All: Autonomous Multi-Agent System Directed by Reinforcement Learning | Kuo Yang et.al. | 2506.08507 | null |
| 2025-06-10 | Learning to Lead: Incentivizing Strategic Agents in the Dark | Yuchen Wu et.al. | 2506.08438 | null |
| 2025-06-10 | Attention-based Learning for 3D Informative Path Planning | Rui Zhao et.al. | 2506.08434 | null |
| 2025-06-12 | CAF-I: A Collaborative Multi-Agent Framework for Enhanced Irony Detection with Large Language Models | Ziqi. Liu et.al. | 2506.08430 | null |
| 2025-06-10 | Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy | Utkarsh Pratiush et.al. | 2506.08423 | link |
| 2025-06-11 | TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration | Weiya Li et.al. | 2506.08403 | link |
| 2025-06-10 | Reinforce LLM Reasoning through Multi-Agent Reflection | Yurun Yuan et.al. | 2506.08379 | null |
| 2025-06-10 | Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models | Wentao Shi et.al. | 2506.08352 | link |
| 2025-06-11 | Your Agent Can Defend Itself against Backdoor Attacks | Li Changjiang et.al. | 2506.08336 | null |
| 2025-06-10 | ORFS-agent: Tool-Using Agents for Chip Design Optimization | Amur Ghose et.al. | 2506.08332 | null |
| 2025-06-10 | Understanding Software Engineering Agents Through the Lens of Traceability: An Empirical Study | Ira Ceka et.al. | 2506.08311 | null |
| 2025-06-11 | HiBerNAC: Hierarchical Brain-emulated Robotic Neural Agent Collective for Disentangling Complex Manipulation | Hongjun Wu et.al. | 2506.08296 | null |
| 2025-06-09 | From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information? | Zhanke Zhou et.al. | 2506.08295 | link |
| 2025-06-09 | From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium | Xie Yi et.al. | 2506.08292 | link |
| 2025-06-09 | Scaling Laws of Motion Forecasting and Planning -- A Technical Report | Mustafa Baniodeh et.al. | 2506.08228 | null |
| 2025-06-09 | Interpreting Agent Behaviors in Reinforcement-Learning-Based Cyber-Battle Simulation Platforms | Jared Claypoole et.al. | 2506.08192 | null |
| 2025-06-09 | Anomaly, Class Division, and Decoupling in Wealth Dynamics | Jaeseok Hur et.al. | 2506.08175 | null |
| 2025-06-09 | Ego-centric Learning of Communicative World Models for Autonomous Driving | Hang Wang et.al. | 2506.08149 | null |
| 2025-06-09 | EconWebArena: Benchmarking Autonomous Agents on Economic Tasks in Realistic Web Environments | Zefang Liu et.al. | 2506.08136 | null |
| 2025-06-09 | SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents | Subhrangshu Nandi et.al. | 2506.08119 | null |
| 2025-06-09 | Cognitive Weave: Synthesizing Abstracted Knowledge with a Spatio-Temporal Resonance Graph | Akash Vishwakarma et.al. | 2506.08098 | link |
| 2025-06-09 | Towards AI-assisted Neutrino Flavor Theory Design | Jason Benjamin Baretz et.al. | 2506.08080 | link |
| 2025-06-08 | UAVs Meet Agentic AI: A Multidomain Survey of Autonomous Aerial Intelligence and Agentic UAVs | Ranjan Sapkota et.al. | 2506.08045 | null |
| 2025-06-09 | GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior | Penghao Wu et.al. | 2506.08012 | null |
| 2025-06-09 | Dreamland: Controllable World Creation with Simulator and Generative Models | Sicheng Mo et.al. | 2506.08006 | null |
| 2025-06-09 | Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System | Fan Yang et.al. | 2506.07997 | null |
| 2025-06-09 | Victor Barres et.al. | 2506.07982 | link | |
| 2025-06-09 | Realistic Urban Traffic Generator using Decentralized Federated Learning for the SUMO simulator | Alberto Bazán-Guillén et.al. | 2506.07980 | null |
| 2025-06-10 | Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction | Junhong Shen et.al. | 2506.07976 | link |
| 2025-06-09 | HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization | Hongzheng Chen et.al. | 2506.07972 | link |
| 2025-06-09 | Diffusion of Responsibility in Collective Decision Making | Pavel Naumov et.al. | 2506.07935 | null |
| 2025-06-09 | LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement | Dimitris Panagopoulos et.al. | 2506.07915 | null |
| 2025-06-09 | A distributed motion planning approach to cooperative underwater acoustic source tracking and pursuit | Andrea Tiranti et.al. | 2506.07877 | null |
| 2025-06-09 | Simulating nationwide coupled disease and fear spread in an agent-based model | Joy Kitson et.al. | 2506.07842 | null |
| 2025-06-09 | Control strategies and trends to equilibrium for kinetic models of opinion dynamics driven by social activity | Andrea Bondesan et.al. | 2506.07840 | null |
| 2025-06-09 | Decentralizing Multi-Agent Reinforcement Learning with Temporal Causal Information | Jan Corazza et.al. | 2506.07829 | null |
| 2025-06-11 | A Proposal to Extend the Common Model of Cognition with Metacognition | John Laird et.al. | 2506.07807 | null |
| 2025-06-13 | Agent Semantics, Semantic Spacetime, and Graphical Reasoning | Mark Burgess et.al. | 2506.07756 | null |
| 2025-06-09 | Deep Equivariant Multi-Agent Control Barrier Functions | Nikolaos Bousias et.al. | 2506.07755 | null |
| 2025-06-09 | Delay Optimization in Remote ID-Based UAV Communication via BLE and Wi-Fi Switching | Yian Zhu et.al. | 2506.07715 | null |
| 2025-06-09 | QUITE: A Query Rewrite System Beyond Rules with LLM Agents | Yuyang Song et.al. | 2506.07675 | null |
| 2025-06-09 | MCPWorld: A Unified Benchmarking Testbed for API, GUI, and Hybrid Computer Use Agents | Yunhe Yan et.al. | 2506.07672 | null |
| 2025-06-09 | SWE-Dev: Building Software Engineering Agents with Training and Inference Scaling | Haoran Wang et.al. | 2506.07636 | null |
| 2025-06-09 | Blending Participatory Design and Artificial Awareness for Trustworthy Autonomous Vehicles | Ana Tanevska et.al. | 2506.07633 | null |
| 2025-06-09 | MalGEN: A Generative Agent Framework for Modeling Malicious Software in Cybersecurity | Bikash Saha et.al. | 2506.07586 | null |
| 2025-06-09 | Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models | Ramakrishna Appicharla et.al. | 2506.07583 | null |
| 2025-06-11 | SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems | Peiran Li et.al. | 2506.07564 | null |
| 2025-06-12 | CheMatAgent: Enhancing LLMs for Chemistry and Materials Science through Tree-Search Based Tool Learning | Mengsong Wu et.al. | 2506.07551 | link |
| 2025-06-09 | Curriculum Learning With Counterfactual Group Relative Policy Advantage For Multi-Agent Reinforcement Learning | Weiqiang Jin et.al. | 2506.07548 | link |
| 2025-06-09 | Fractional Collisions: A Framework for Risk Estimation of Counterfactual Conflicts using Autonomous Driving Behavior Simulations | Sreeja Roy-Singh et.al. | 2506.07540 | null |
| 2025-06-09 | Coordinating Search-Informed Reasoning and Reasoning-Guided Search in Claim Verification | Qisheng Hu et.al. | 2506.07528 | null |
| 2025-06-09 | IntenTest: Stress Testing for Intent Integrity in API-Calling LLM Agents | Shiwei Feng et.al. | 2506.07524 | null |
| 2025-06-09 | Taking Flight with Dialogue: Enabling Natural Language Control for PX4-based Drone Agent | Shoon Kit Lim et.al. | 2506.07509 | link |
| 2025-06-09 | Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models | Mickel Liu et.al. | 2506.07468 | link |
| 2025-06-09 | Efficient Generation of Diverse Cooperative Agents with World Models | Yi Loo et.al. | 2506.07450 | null |
| 2025-06-09 | Generate Realistic Test Scenes for V2X Communication Systems | An Guo et.al. | 2506.07419 | null |
| 2025-06-11 | MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models | Philip R. Liu et.al. | 2506.07400 | link |
| 2025-06-09 | G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems | Guibin Zhang et.al. | 2506.07398 | link |
| 2025-06-09 | From Static to Adaptive Defense: Federated Multi-Agent Deep Reinforcement Learning-Driven Moving Target Defense Against DoS Attacks in UAV Swarm Networks | Yuyang Zhou et.al. | 2506.07392 | link |
| 2025-06-09 | Shapley-Coop: Credit Assignment for Emergent Cooperation in Self-Interested LLM Agents | Yun Hua et.al. | 2506.07388 | null |
| 2025-06-09 | Extended Version of "Distributed Adaptive Resilient Consensus Control for Uncertain Nonlinear Multiagent Systems Against Deception Attacks" | Mengze Yu et.al. | 2506.07374 | null |
| 2025-06-09 | Decentralized Optimization on Compact Submanifolds by Quantized Riemannian Gradient Tracking | Jun Chen et.al. | 2506.07351 | null |
| 2025-06-09 | MapBERT: Bitwise Masked Modeling for Real-Time Semantic Mapping Generation | Yijie Deng et.al. | 2506.07350 | null |
| 2025-06-09 | Distributed Risk-Sensitive Safety Filters for Uncertain Discrete-Time Systems | Armin Lederer et.al. | 2506.07347 | null |
| 2025-06-09 | Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation | Yijie Deng et.al. | 2506.07338 | null |
| 2025-06-09 | Digital Twin-based Smart Manufacturing: Dynamic Line Reconfiguration for Disturbance Handling | Bo Fu et.al. | 2506.07332 | null |
| 2025-06-08 | SCGAgent: Recreating the Benefits of Reasoning Models for Secure Code Generation with Agentic Workflows | Rebecca Saul et.al. | 2506.07313 | null |
| 2025-06-08 | Multi-Step Guided Diffusion for Image Restoration on Edge Devices: Toward Lightweight Perception in Embodied AI | Aditya Chakravarty et.al. | 2506.07286 | null |
| 2025-06-08 | Secondary Stakeholders in AI: Fighting for, Brokering, and Navigating Agency | Leah Hope Ajmani et.al. | 2506.07281 | null |
| 2025-06-08 | A Cramér-von Mises Approach to Incentivizing Truthful Data Sharing | Alex Clinton et.al. | 2506.07272 | null |
| 2025-06-08 | Question Answering under Temporal Conflict: Evaluating and Organizing Evolving Knowledge with LLMs | Atahan Özer et.al. | 2506.07270 | null |
| 2025-06-08 | Learn as Individuals, Evolve as a Team: Multi-agent LLMs Adaptation in Embodied Environments | Xinran Li et.al. | 2506.07232 | null |
| 2025-06-08 | LLM-Enhanced Rapid-Reflex Async-Reflect Embodied Agent for Real-Time Decision-Making in Dynamically Changing Environments | Yangqing Zheng et.al. | 2506.07223 | null |
| 2025-06-08 | BIMgent: Towards Autonomous Building Modeling via Computer-use Agents | Zihan Deng et.al. | 2506.07217 | null |
| 2025-06-08 | Adaptive Consensus with Exponential Decay | Woocheol Choi et.al. | 2506.07203 | null |
| 2025-06-08 | Efficient RL-based Cache Vulnerability Exploration by Penalizing Useless Agent Actions | Kanato Nakanishi et.al. | 2506.07200 | null |
| 2025-06-08 | Exploring Effective Strategies for Building a Customised GPT Agent for Coding Classroom Dialogues | Luwei Bai et.al. | 2506.07194 | null |
| 2025-06-08 | Value-Set Iteration: Computing Optimal Correlated Equilibria in Infinite-Horizon Multi-Player Stochastic Games | Jiarui Gan et.al. | 2506.07186 | null |
| 2025-06-12 | Delegation with Costly Inspection | Mohammad T. Hajiaghayi et.al. | 2506.07162 | null |
| 2025-06-08 | Mind the Web: The Security of Web Use Agents | Avishag Shapira et.al. | 2506.07153 | null |
| 2025-06-08 | BRIGHT+: Upgrading the BRIGHT Benchmark with MARCUS, a Multi-Agent RAG Clean-Up Suite | Liyang Chen et.al. | 2506.07116 | null |
| 2025-06-08 | Theorem-of-Thought: A Multi-Agent Framework for Abductive, Deductive, and Inductive Reasoning in Language Models | Samir Abdaljalil et.al. | 2506.07106 | null |
| 2025-06-08 | Decentralized Optimization with Amplified Privacy via Efficient Communication | Wei Huo et.al. | 2506.07102 | null |
| 2025-06-08 | On the Generalization of Data-Assisted Control in port-Hamiltonian Systems (DAC-pH) | Mostafa Eslami et.al. | 2506.07079 | null |
| 2025-06-08 | A Layered Self-Supervised Knowledge Distillation Framework for Efficient Multimodal Learning on the Edge | Tarique Dahri et.al. | 2506.07055 | null |
| 2025-06-08 | QForce-RL: Quantized FPGA-Optimized Reinforcement Learning Compute Engine | Anushka Jha et.al. | 2506.07046 | null |
| 2025-06-08 | Accelerating Two-Dimensional Materials Research via a Universal Interatomic Potential and Large Language Model Agent | Haidi Wang et.al. | 2506.07043 | null |
| 2025-06-08 | MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks | Sanjoy Chowdhury et.al. | 2506.07016 | null |
| 2025-06-08 | Deep RL Needs Deep Behavior Analysis: Exploring Implicit Planning by Model-Free Agents in Open-Ended Environments | Riley Simmons-Edler et.al. | 2506.06981 | null |
| 2025-06-08 | Near Optimal Non-asymptotic Sample Complexity of 1-Identification | Zitian Li et.al. | 2506.06978 | null |
| 2025-06-08 | Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning | Subhojyoti Mukherjee et.al. | 2506.06964 | null |
| 2025-06-08 | Deontically Constrained Policy Improvement in Reinforcement Learning Agents | Alena Makarova et.al. | 2506.06959 | null |
| 2025-06-08 | Position: Simulating Society Requires Simulating Thought | Chance Jiajie Li et.al. | 2506.06958 | null |
| 2025-06-07 | An Agentic Framework for Autonomous Metamaterial Modeling and Inverse Design | Darui Lu et.al. | 2506.06935 | null |
| 2025-06-07 | Boosting LLM Reasoning via Spontaneous Self-Correction | Xutong Zhao et.al. | 2506.06923 | null |
| 2025-06-07 | Multimodal Spatial Language Maps for Robot Navigation and Manipulation | Chenguang Huang et.al. | 2506.06862 | null |
| 2025-06-07 | DONUT: A Decoder-Only Model for Trajectory Prediction | Markus Knoche et.al. | 2506.06854 | null |
| 2025-06-07 | United Minds or Isolated Agents? Exploring Coordination of LLMs under Cognitive Load Theory | HaoYang Shang et.al. | 2506.06843 | null |
| 2025-06-07 | AI-Generated Compromises for Coalition Formation | Eyal Briman et.al. | 2506.06837 | null |
| 2025-06-07 | Is Optimal Transport Necessary for Inverse Reinforcement Learning? | Zixuan Dong et.al. | 2506.06793 | null |
| 2025-06-07 | Learning What Matters Now: A Dual-Critic Context-Aware RL Framework for Priority-Driven Information Gain | Dimitris Panagopoulos et.al. | 2506.06786 | null |
| 2025-06-07 | AI PsyRoom: Artificial Intelligence Platform for Segmented Yearning and Reactive Outcome Optimization Method | Yigui Feng et.al. | 2506.06740 | null |
| 2025-06-07 | WorldLLM: Improving LLMs' world modeling using curiosity-driven theory-making | Guillaume Levy et.al. | 2506.06725 | null |
| 2025-06-07 | Contextual Experience Replay for Self-Improvement of Language Agents | Yitao Liu et.al. | 2506.06698 | null |
| 2025-06-07 | Self-Adapting Improvement Loops for Robotic Learning | Calvin Luo et.al. | 2506.06658 | null |
| 2025-06-07 | Active Test-time Vision-Language Navigation | Heeju Ko et.al. | 2506.06630 | null |
| 2025-06-06 | AI Simulation by Digital Twins: Systematic Survey, Reference Framework, and Mapping to a Standardized Architecture | Xiaoran Liu et.al. | 2506.06580 | null |
| 2025-06-11 | Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce | Yijia Shao et.al. | 2506.06576 | null |
| 2025-06-12 | The Optimization Paradox in Clinical AI Multi-Agent Systems | Suhana Bedi et.al. | 2506.06574 | link |
| 2025-06-06 | Enhancing Robot Safety via MLLM-Based Semantic Interpretation of Failure Data | Aryaman Gupta et.al. | 2506.06570 | null |
| 2025-06-06 | Adapting Under Fire: Multi-Agent Reinforcement Learning for Adversarial Drift in Network Security | Emilia Rivas et.al. | 2506.06565 | null |
| 2025-06-06 | KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes | Eugenie Lai et.al. | 2506.06541 | link |
| 2025-06-06 | ScriptDoctor: Automatic Generation of PuzzleScript Games via Large Language Models and Tree Search | Sam Earle et.al. | 2506.06524 | null |
| 2025-06-06 | Improving LLM-Powered EDA Assistants with RAFT | Luyao Shi et.al. | 2506.06500 | null |
| 2025-06-06 | Fake Friends and Sponsored Ads: The Risks of Advertising in Conversational Search | Jacob Erickson et.al. | 2506.06447 | null |
| 2025-06-06 | Improving choice model specification using reinforcement learning | Gabriel Nova et.al. | 2506.06410 | null |
| 2025-06-04 | CPS-Guard: Framework for Dependability Assurance of AI- and LLM-Based Cyber-Physical Systems | Trisanth Srinivasan et.al. | 2506.06381 | null |
| 2025-06-06 | PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time | Weizhi Zhang et.al. | 2506.06254 | null |
| 2025-06-06 | Longer Lists Yield Better Matchings | Yuri Faenza et.al. | 2506.06217 | null |
| 2025-06-06 | Can Theoretical Physics Research Benefit from Language Agents? | Sirui Lu et.al. | 2506.06214 | null |
| 2025-06-06 | A Theoretical Study of (Hyper) Self-Attention through the Lens of Interactions: Representation, Training, Generalization | Muhammed Ustaomeroglu et.al. | 2506.06179 | null |
| 2025-06-06 | Does It Run and Is That Enough? Revisiting Text-to-Chart Generation with a Multi-Agent Approach | James Ford et.al. | 2506.06175 | null |
| 2025-06-06 | The Lock-in Hypothesis: Stagnation by Algorithm | Tianyi Alex Qiu et.al. | 2506.06166 | null |
| 2025-06-06 | (AI peers) are people learning from the same standpoint: Perception of AI characters in a Collaborative Science Investigation | Eunhye Grace Ko et.al. | 2506.06165 | null |
| 2025-06-06 | Personalized Large Language Models Can Increase the Belief Accuracy of Social Networks | Adiba Mahbub Proma et.al. | 2506.06153 | null |
| 2025-06-06 | CCLSTM: Coupled Convolutional Long-Short Term Memory Network for Occupancy Flow Forecasting | Peter Lengyel et.al. | 2506.06128 | null |
| 2025-06-06 | Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library | Weixun Wang et.al. | 2506.06122 | null |
| 2025-06-06 | VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning | Zikang Wang et.al. | 2506.06097 | null |
| 2025-06-06 | On-board Mission Replanning for Adaptive Cooperative Multi-Robot Systems | Elim Kwan et.al. | 2506.06094 | null |
| 2025-06-06 | Self driving algorithm for an active four wheel drive racecar | Gergely Bari et.al. | 2506.06077 | null |
| 2025-06-06 | Conversational Interfaces for Parametric Conceptual Architectural Design: Integrating Mixed Reality with LLM-driven Interaction | Ruochen Ji et.al. | 2506.06066 | null |
| 2025-06-06 | Modeling human reputation-seeking behavior in a spatio-temporally complex public good provision game | Edward Hughes et.al. | 2506.06032 | null |
| 2025-06-06 | When to Trust Context: Self-Reflective Debates for Context Reliability | Zeqi Zhou et.al. | 2506.06020 | null |
| 2025-06-06 | AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search | Yu Li et.al. | 2506.06017 | null |
| 2025-06-06 | Propose or Vote: A simple Democratic Procedure | Hans Gersbach et.al. | 2506.05998 | null |
| 2025-06-06 | Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning | Yuheng Lei et.al. | 2506.05985 | link |
| 2025-06-06 | MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks | Zonglin Wu et.al. | 2506.05982 | link |
| 2025-06-10 | CrimeMind: Simulating Urban Crime with Multi-Modal LLM Agents | Qingbin Zeng et.al. | 2506.05981 | null |
| 2025-06-06 | Quantum Checkers: The Development and Analysis of a Quantum Combinatorial Game | Marien Raat et.al. | 2506.05962 | null |
| 2025-06-06 | Learning Deterministic Policies with Policy Gradients in Constrained Markov Decision Processes | Alessandro Montenegro et.al. | 2506.05953 | null |
| 2025-06-06 | Policy Optimization for Continuous-time Linear-Quadratic Graphon Mean Field Games | Philipp Plank et.al. | 2506.05894 | null |
| 2025-06-06 | CodeContests+: High-Quality Test Case Generation for Competitive Programming | Zihan Wang et.al. | 2506.05817 | null |
| 2025-06-06 | MAPLE: Multi-Agent Adaptive Planning with Long-Term Memory for Table Reasoning | Ye Bai et.al. | 2506.05813 | null |
| 2025-06-06 | Trajectory Entropy: Modeling Game State Stability from Multimodality Trajectory Prediction | Yesheng Zhang et.al. | 2506.05810 | null |
| 2025-06-06 | To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt | Zhilong Wang et.al. | 2506.05739 | null |
| 2025-06-06 | Hybrid Stabilization Protocol for Cross-Chain Digital Assets Using Adaptor Signatures and AI-Driven Arbitrage | Shengwei You et.al. | 2506.05708 | null |
| 2025-06-06 | Multi-Project Contracts | Tal Alon et.al. | 2506.05705 | null |
| 2025-06-06 | Action-Adaptive Continual Learning: Enabling Policy Generalization under Dynamic Action Spaces | Chaofan Pan et.al. | 2506.05702 | null |
| 2025-06-06 | Ordering-disordering dynamics of the voter model under random external bias | Roni Muslim et.al. | 2506.05669 | null |
| 2025-06-06 | A Modular Haptic Display with Reconfigurable Signals for Personalized Information Transfer | Antonio Alvarez Valdivia et.al. | 2506.05648 | null |
| 2025-06-06 | Diffusive Spreading Across Dynamic Mitochondrial Network Architectures | Keaton B. Holt et.al. | 2506.05643 | null |
| 2025-06-09 | Toward Greater Autonomy in Materials Discovery Agents: Unifying Planning, Physics, and Scientists | Lianhao Zhou et.al. | 2506.05616 | null |
| 2025-06-05 | Beating the Logarithmic Barrier for the Subadditive Maximin Share Problem | Masoud Seddighin et.al. | 2506.05613 | null |
| 2025-06-05 | OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation | Ziyi Wang et.al. | 2506.05606 | null |
| 2025-06-05 | Stochastic maximum principle for optimal control problem of non exchangeable mean field systems | Idris Kharroubi et.al. | 2506.05595 | null |
| 2025-06-05 | Collaborative Learning in Agentic Systems: A Collective AI is Greater Than the Sum of Its Parts | Saptarshi Nath et.al. | 2506.05577 | link |
| 2025-06-05 | Applying Informer for Option Pricing: A Transformer-Based Approach | Feliks Bańka et.al. | 2506.05565 | null |
| 2025-06-05 | Improving LLMs with a knowledge from databases | Petr Máša et.al. | 2506.05560 | null |
| 2025-06-05 | Agentomics-ML: Autonomous Machine Learning Experimentation Agent for Genomic and Transcriptomic Data | Vlastimil Martinek et.al. | 2506.05542 | link |
| 2025-06-05 | SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms | Arnesh Batra et.al. | 2506.05538 | link |
| 2025-06-05 | Quantum circuits as a game: A reinforcement learning agent for quantum compilation and its application to reconfigurable neutral atom arrays | Kouhei Nakaji et.al. | 2506.05536 | null |
| 2025-06-05 | Avoiding Death through Fear Intrinsic Conditioning | Rodney Sanchez et.al. | 2506.05529 | null |
| 2025-06-05 | Sequence Modeling for N-Agent Ad Hoc Teamwork | Caroline Wang et.al. | 2506.05527 | null |
| 2025-06-05 | Towards Data Systems That Are Business Semantic-Centric and AI Agents-Assisted | Cecil Pang et.al. | 2506.05520 | null |
| 2025-06-05 | Speech Neurophysiology in Realistic Contexts: Big Hype or Big Leap? | Giovanni M. Di Liberto et.al. | 2506.05494 | null |
| 2025-06-05 | A MARL-based Approach for Easing MAS Organization Engineering | Julien Soulé et.al. | 2506.05437 | null |
| 2025-06-05 | Robustness Evaluation for Video Models with Reinforcement Learning | Ashwin Ramesh Babu et.al. | 2506.05431 | null |
| 2025-06-05 | Mixture-of-Experts Meets In-Context Reinforcement Learning | Wenhao Wu et.al. | 2506.05426 | null |
| 2025-06-05 | Constructive Symbolic Reinforcement Learning via Intuitionistic Logic and Goal-Chaining Inference | Andrei T. Patrascu et.al. | 2506.05422 | null |
| 2025-06-03 | Rational Superautotrophic Diplomacy (SupraAD); A Conceptual Framework for Alignment Based on Interdisciplinary Findings on the Fundamentals of Cognition | Andrea Morris et.al. | 2506.05389 | null |
| 2025-06-05 | Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games | Niv Eckhaus et.al. | 2506.05309 | link |
| 2025-06-05 | ProRefine: Inference-time Prompt Refinement with Textual Feedback | Deepak Pandita et.al. | 2506.05305 | null |
| 2025-06-05 | Control Tax: The Price of Keeping AI in Check | Mikhail Terekhov et.al. | 2506.05296 | null |
| 2025-06-05 | A Smooth Sea Never Made a Skilled |
Arnav Kumar Jain et.al. | 2506.05294 | link |
| 2025-06-05 | Tight analyses of first-order methods with error feedback | Daniel Berg Thomsen et.al. | 2506.05271 | link |
| 2025-06-06 | Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams | Mohammed Almutairi et.al. | 2506.05265 | null |
| 2025-06-05 | Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning | Dravyansh Sharma et.al. | 2506.05252 | null |
| 2025-06-05 | Towards Language-Augmented Multi-Agent Deep Reinforcement Learning | Maxime Toquebiau et.al. | 2506.05236 | null |
| 2025-06-05 | A Framework for Ethical Judgment of Smart City Applications | Weichen Shi et.al. | 2506.05172 | null |
| 2025-06-05 | An emergence-oriented approach to cyclic pursuit | Zhaozhan Yao et.al. | 2506.05157 | null |
| 2025-06-05 | Truly Self-Improving Agents Require Intrinsic Metacognitive Learning | Tennison Liu et.al. | 2506.05109 | null |
| 2025-06-05 | LLM-Guided Scenario-based GUI Testing | Shengcheng Yu et.al. | 2506.05079 | null |
| 2025-06-05 | Hierarchical Language Models for Semantic Navigation and Manipulation in an Aerial-Ground Robotic System | Haokun Liu et.al. | 2506.05020 | null |
| 2025-06-05 | ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development | Zhenran Xu et.al. | 2506.05010 | link |
| 2025-06-05 | QiMeng: Fully Automated Hardware and Software Design for Processor Chip | Rui Zhang et.al. | 2506.05007 | null |
| 2025-06-05 | Agentic AI for Intent-Based Industrial Automation | Marcos Lima Romero et.al. | 2506.04980 | link |
| 2025-06-05 | Optimization for Semantic-Aware Resource Allocation under CPT-based Utilities | Symeon Vaidanis et.al. | 2506.04952 | null |
| 2025-06-05 | Goal-Oriented Semantic Resource Allocation with Cumulative Prospect Theoretic Agents | Symeon Vaidanis et.al. | 2506.04947 | null |
| 2025-06-05 | No Trade Under Verifiable Information | Spyros Galanis et.al. | 2506.04944 | null |
| 2025-06-05 | Energentic Intelligence: From Self-Sustaining Systems to Enduring Artificial Life | Atahan Karagoz et.al. | 2506.04916 | null |
| 2025-06-05 | Efficient Path Planning and Task Allocation Algorithm for Boolean Specifications | Ioana Hustiu et.al. | 2506.04881 | link |
| 2025-06-05 | LLMs for sensory-motor control: Combining in-context and iterative learning | Jônata Tyska Carvalho et.al. | 2506.04867 | link |
| 2025-06-05 | Towards a Multi-Agent Simulation of Cyber-attackers and Cyber-defenders Battles | Julien Soulé et.al. | 2506.04849 | null |
| 2025-06-05 | Oversight Structures for Agentic AI in Public-Sector Organizations | Chris Schmitz et.al. | 2506.04836 | null |
| 2025-06-05 | Safe Planning and Policy Optimization via World Model Learning | Artem Latyshev et.al. | 2506.04828 | null |
| 2025-06-05 | Distributionally Robust Auction Design with Deferred Inspection | Halil I. Bayrak et.al. | 2506.04767 | null |
| 2025-06-05 | SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMs | Shuhan Xu et.al. | 2506.04743 | null |
| 2025-06-05 | Empowering Economic Simulation for Massively Multiplayer Online Games through Generative Agent-Based Modeling | Bihan Xu et.al. | 2506.04699 | null |
| 2025-06-05 | Gen-n-Val: Agentic Image Data Generation and Validation | Jing-En Huang et.al. | 2506.04676 | null |
| 2025-06-05 | E-bike agents: Large Language Model-Driven E-Bike Accident Analysis and Severity Prediction | Zhichao Yang et.al. | 2506.04654 | null |
| 2025-06-05 | Agents of Change: Self-Evolving LLM Agents for Strategic Planning | Nikolas Belle et.al. | 2506.04651 | null |
| 2025-06-05 | Flex-TravelPlanner: A Benchmark for Flexible Planning with Language Agents | Juhyun Oh et.al. | 2506.04649 | link |
| 2025-06-05 | CHANCERY: Evaluating corporate governance reasoning capabilities in language models | Lucas Irwin et.al. | 2506.04636 | null |
| 2025-06-05 | Composing Agents to Minimize Worst-case Risk | Guruprerana Shabadi et.al. | 2506.04632 | null |
| 2025-06-05 | Enhancing Efficiency and Propulsion in Bio-mimetic Robotic Fish through End-to-End Deep Reinforcement Learning | Xinyu Cui et.al. | 2506.04627 | null |
| 2025-06-05 | Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning | Haochen Zhang et.al. | 2506.04626 | null |
| 2025-06-05 | Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning | Zhiyuan Ma et.al. | 2506.04625 | null |
| 2025-06-05 | Subjective Perspectives within Learned Representations Predict High-Impact Innovation | Likun Cao et.al. | 2506.04616 | null |
| 2025-06-05 | SmartAvatar: Text- and Image-Guided Human Avatar Generation with VLM AI Agents | Alexander Huang-Menders et.al. | 2506.04606 | null |
| 2025-06-05 | Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning | Ziqi Jia et.al. | 2506.04595 | null |
| 2025-06-05 | Demonstrations of Integrity Attacks in Multi-Agent Systems | Can Zheng et.al. | 2506.04572 | null |
| 2025-06-05 | OpenAg: Democratizing Agricultural Intelligence | Srikanth Thudumu et.al. | 2506.04571 | null |
| 2025-06-05 | From Standalone LLMs to Integrated Intelligence: A Survey of Compound Al Systems | Jiayi Chen et.al. | 2506.04565 | null |
| 2025-06-04 | SGN-CIRL: Scene Graph-based Navigation with Curriculum, Imitation, and Reinforcement Learning | Nikita Oskolkov et.al. | 2506.04505 | null |
| 2025-06-04 | CogMath: Assessing LLMs' Authentic Mathematical Ability from a Human Cognitive Perspective | Jiayu Liu et.al. | 2506.04481 | null |
| 2025-06-04 | MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale | Ran Xu et.al. | 2506.04405 | null |
| 2025-06-04 | Unsupervised Meta-Testing with Conditional Neural Processes for Hybrid Meta-Reinforcement Learning | Suzan Ece Ada et.al. | 2506.04399 | null |
| 2025-06-04 | Building a Few-Shot Cross-Domain Multilingual NLU Model for Customer Care | Saurabh Kumar et.al. | 2506.04389 | null |
| 2025-06-04 | Replay Can Provably Increase Forgetting | Yasaman Mahdaviyeh et.al. | 2506.04377 | null |
| 2025-06-04 | WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning | Delong Chen et.al. | 2506.04363 | null |
| 2025-06-04 | The Cost of Dynamic Reasoning: Demystifying AI Agents and Test-Time Scaling from an AI Infrastructure Perspective | Jiin Kim et.al. | 2506.04301 | null |
| 2025-06-04 | AUTOCT: Automating Interpretable Clinical Trial Prediction with LLM Agents | Fengze Liu et.al. | 2506.04293 | null |
| 2025-06-04 | Automated Skill Discovery for Language Agents through Exploration and Iterative Feedback | Yongjin Yang et.al. | 2506.04287 | null |
| 2025-06-04 | Autonomous Collaborative Scheduling of Time-dependent UAVs, Workers and Vehicles for Crowdsensing in Disaster Response | Lei Han et.al. | 2506.04276 | null |
| 2025-06-03 | CORA: Coalitional Rational Advantage Decomposition for Multi-Agent Policy Gradients | Mengda Ji et.al. | 2506.04265 | null |
| 2025-06-04 | OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis | Junting Chen et.al. | 2506.04217 | link |
| 2025-06-04 | Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs | Alex DeWeese et.al. | 2506.04215 | null |
| 2025-06-06 | TracLLM: A Generic Framework for Attributing Long Context LLMs | Yanting Wang et.al. | 2506.04202 | link |
| 2025-06-04 | MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures | Elena Zamaraeva et.al. | 2506.04195 | null |
| 2025-06-04 | SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models | Yuhao Wu et.al. | 2506.04180 | null |
| 2025-06-04 | A primal-dual price-optimization method for computing equilibrium prices in mean-field games models | Xu Wang et.al. | 2506.04169 | link |
| 2025-06-04 | Image Editing As Programs with Diffusion Models | Yujia Hu et.al. | 2506.04158 | null |
| 2025-06-05 | macOSWorld: A Multilingual Interactive Benchmark for GUI Agents | Pei Yang et.al. | 2506.04135 | link |
| 2025-06-04 | TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems | Shaina Raza et.al. | 2506.04133 | null |
| 2025-06-04 | CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues | Disha Sheshanarayana et.al. | 2506.04131 | null |
| 2025-06-04 | TextAtari: 100K Frames Game Playing with Language Agents | Wenhao Li et.al. | 2506.04098 | link |
| 2025-06-04 | AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment | Anastasiia Ivanova et.al. | 2506.04089 | link |
| 2025-06-04 | Optimal Transport-based Domain Alignment as a Preprocessing Step for Federated Learning | Luiz Manella Pereira et.al. | 2506.04071 | null |
| 2025-06-04 | AI Agents for Conversational Patient Triage: Preliminary Simulation-Based Evaluation with Real-World EHR Data | Sina Rashidian et.al. | 2506.04032 | null |
| 2025-06-04 | AgentMisalignment: Measuring the Propensity for Misaligned Behaviour in LLM-Based Agents | Akshat Naik et.al. | 2506.04018 | null |
| 2025-06-04 | Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning | Junqi Gao et.al. | 2506.03939 | link |
| 2025-06-04 | HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models | Zhaolu Kang et.al. | 2506.03922 | link |
| 2025-06-04 | Causal Explanations Over Time: Articulated Reasoning for Interactive Environments | Sebastian Rödling et.al. | 2506.03915 | null |
| 2025-06-04 | Jet-Feedback on kpc scales: a review | Dipanjan Mukherjee et.al. | 2506.03888 | null |
| 2025-06-04 | PulseReddit: A Novel Reddit Dataset for Benchmarking MAS in High-Frequency Cryptocurrency Trading | Qiuhan Han et.al. | 2506.03861 | null |
| 2025-06-04 | AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance | Dhaval Patel et.al. | 2506.03828 | link |
| 2025-06-04 | Learning Equilibria in Matching Games with Bandit Feedback | Andreas Athanasopoulos et.al. | 2506.03802 | null |
| 2025-06-04 | From Theory to Practice: Real-World Use Cases on Trustworthy LLM-Driven Process Modeling, Prediction and Automation | Peter Pfeiffer et.al. | 2506.03801 | null |
| 2025-06-04 | Misalignment or misuse? The AGI alignment tradeoff | Max Hellrigel-Holderbaum et.al. | 2506.03755 | null |
| 2025-06-04 | A Retrieval-Augmented Multi-Agent Framework for Psychiatry Diagnosis | Mengxi Xiao et.al. | 2506.03750 | link |
| 2025-06-04 | AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives | Aniruddh Sikdar et.al. | 2506.03709 | null |
| 2025-06-04 | Stability Notions for Hospital Residents with Sizes | Haricharan Balasundaram et.al. | 2506.03638 | null |
| 2025-06-04 | Training Cross-Morphology Embodied AI Agents: From Practical Challenges to Theoretical Foundations | Shaoshan Liu et.al. | 2506.03613 | link |
| 2025-06-04 | Orak: A Foundational Benchmark for Training and Evaluating LLM Agents on Diverse Video Games | Dongmin Park et.al. | 2506.03610 | null |
| 2025-06-08 | Beamforming and Resource Allocation for Delay Optimization in RIS-Assisted OFDM Systems | Yu Ma et.al. | 2506.03586 | null |
| 2025-06-05 | Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving | Li Zeqiao et.al. | 2506.03568 | link |
| 2025-06-04 | From Virtual Agents to Robot Teams: A Multi-Robot Framework Evaluation in High-Stakes Healthcare Context | Yuanchen Bai et.al. | 2506.03546 | null |
| 2025-06-04 | CogniPair: From LLM Chatbots to Conscious AI Agents -- GNWT-Based Multi-Agent Digital Twins for Social Pairing -- Dating & Hiring Applications | Wanghao Ye et.al. | 2506.03543 | null |
| 2025-06-04 | Debate, Reflect, and Distill: Multi-Agent Feedback with Tree-Structured Preference Optimization for Efficient Language Model Enhancement | Xiaofeng Zhou et.al. | 2506.03541 | null |
| 2025-06-04 | Go-Browse: Training Web Agents with Structured Exploration | Apurva Gandhi et.al. | 2506.03533 | null |
| 2025-06-04 | GA-S |
Yunyao Zhang et.al. | 2506.03532 | link |
| 2025-06-04 | How Far Are We from Predicting Missing Modalities with Foundation Models? | Guanzhou Ke et.al. | 2506.03530 | link |
| 2025-06-04 | Correlated equilibrium implementation: Navigating toward social optima with learning dynamics | Soumen Banerjee et.al. | 2506.03528 | null |
| 2025-06-04 | Path Generation and Evaluation in Video Games: A Nonparametric Statistical Approach | Daniel Campa et.al. | 2506.03522 | null |
| 2025-06-04 | VChatter: Exploring Generative Conversational Agents for Simulating Exposure Therapy to Reduce Social Anxiety | Han Zhang et.al. | 2506.03520 | null |
| 2025-06-04 | SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models | Arnab Debnath et.al. | 2506.03516 | null |
| 2025-06-04 | Computational Architects of Society: Quantum Machine Learning for Social Rule Genesis | Shan Shan et.al. | 2506.03503 | null |
| 2025-06-04 | CORE: Constraint-Aware One-Step Reinforcement Learning for Simulation-Guided Neural Network Accelerator Design | Yifeng Xiao et.al. | 2506.03474 | null |
| 2025-06-03 | The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks | Walter Mayor et.al. | 2506.03404 | null |
| 2025-06-03 | Impact of Rankings and Personalized Recommendations in Marketplaces | Omar Besbes et.al. | 2506.03369 | null |
| 2025-06-03 | A Differential Perspective on Distributional Reinforcement Learning | Juan Sebastian Rojas et.al. | 2506.03333 | null |
| 2025-06-03 | Helpful Agent Meets Deceptive Judge: Understanding Vulnerabilities in Agentic Workflows | Yifei Ming et.al. | 2506.03332 | null |
| 2025-06-03 | The Future of Continual Learning in the Era of Foundation Models: Three Key Directions | Jack Bell et.al. | 2506.03320 | null |
| 2025-06-03 | FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes | Christodoulos Constantinides et.al. | 2506.03278 | link |
| 2025-06-03 | NetPress: Dynamically Generated LLM Benchmarks for Network Applications | Yajie Zhou et.al. | 2506.03231 | link |
| 2025-06-03 | Multiple-Frequencies Population-Based Training | Waël Doulazmi et.al. | 2506.03225 | null |
| 2025-06-02 | Q-ARDNS-Multi: A Multi-Agent Quantum Reinforcement Learning Framework with Meta-Cognitive Adaptation for Complex 3D Environments | Umberto Gonçalves de Sousa et.al. | 2506.03205 | null |
| 2025-06-03 | GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents | Qianhui Wu et.al. | 2506.03143 | null |
| 2025-06-03 | Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning | Yinjie Wang et.al. | 2506.03136 | link |
| 2025-06-03 | Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff | Sophie Greenwood et.al. | 2506.03102 | null |
| 2025-06-03 | EgoVLM: Policy Optimization for Egocentric Video Understanding | Ashwin Vinod et.al. | 2506.03097 | link |
| 2025-06-03 | DPO Learning with LLMs-Judge Signal for Computer Use Agents | Man Luo et.al. | 2506.03095 | null |
| 2025-06-03 | Provable Reinforcement Learning from Human Feedback with an Unknown Link Function | Qining Zhang et.al. | 2506.03066 | null |
| 2025-06-03 | MAEBE: Multi-Agent Emergent Behavior Framework | Sinem Erisken et.al. | 2506.03053 | null |
| 2025-06-03 | EDEN: Entorhinal Driven Egocentric Navigation Toward Robotic Deployment | Mikolaj Walczak et.al. | 2506.03046 | null |
| 2025-06-06 | Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective | Jintian Shao et.al. | 2506.03038 | null |
| 2025-06-03 | TestAgent: An Adaptive and Intelligent Expert for Human Assessment | Junhao Yu et.al. | 2506.03032 | null |
| 2025-06-03 | Coding Agents with Multimodal Browsing are Generalist Problem Solvers | Aditya Bharat Soni et.al. | 2506.03011 | null |
| 2025-06-03 | DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models | Jiarui Wang et.al. | 2506.03007 | null |
| 2025-06-03 | A Multi-Agent Framework for Mitigating Dialect Biases in Privacy Policy Question-Answering Systems | Đorđe Klisura et.al. | 2506.02998 | null |
| 2025-06-03 | Mapping Student-AI Interaction Dynamics in Multi-Agent Learning Environments: Supporting Personalised Learning and Reducing Performance Gaps | Zhanxin Hao et.al. | 2506.02993 | null |
| 2025-06-03 | Mitigating Manipulation and Enhancing Persuasion: A Reflective Multi-Agent Approach for Legal Argument Generation | Li Zhang et.al. | 2506.02992 | null |
| 2025-06-03 | Adaptive Graph Pruning for Multi-Agent Communication | Boyi Li et.al. | 2506.02951 | null |
| 2025-06-03 | Abstract Counterfactuals for Language Model Agents | Edoardo Pona et.al. | 2506.02946 | null |
| 2025-06-08 | Hallucination to Consensus: Multi-Agent LLMs for End-to-End Test Generation with Accurate Oracles | Qinghua Xu et.al. | 2506.02943 | null |
| 2025-06-03 | ThinkTank: A Framework for Generalizing Domain-Specific AI Agent Systems into Universal Collaborative Intelligence Platforms | Praneet Sai Madhu Surabhi et.al. | 2506.02931 | link |
| 2025-06-03 | Large Processor Chip Model | Kaiyan Chang et.al. | 2506.02929 | null |
| 2025-06-03 | The Limits of Predicting Agents from Behaviour | Alexis Bellot et.al. | 2506.02923 | null |
| 2025-06-03 | Text-guided Generation of Efficient Personalized Inspection Plans | Xingpeng Sun et.al. | 2506.02917 | null |
| 2025-06-03 | A Continual Offline Reinforcement Learning Benchmark for Navigation Tasks | Anthony Kobanda et.al. | 2506.02883 | null |
| 2025-06-03 | It's the Thought that Counts: Evaluating the Attempts of Frontier LLMs to Persuade on Harmful Topics | Matthew Kowal et.al. | 2506.02873 | null |
| 2025-06-03 | Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights | Mathieu Andreux et.al. | 2506.02865 | null |
| 2025-06-03 | CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech | Helin Wang et.al. | 2506.02863 | null |
| 2025-06-03 | ATAG: AI-Agent Application Threat Assessment with Attack Graphs | Parth Atulbhai Gandhi et.al. | 2506.02859 | null |
| 2025-06-03 | Ensemble-MIX: Enhancing Sample Efficiency in Multi-Agent RL Using Ensemble Methods | Tom Danino et.al. | 2506.02841 | null |
| 2025-06-03 | On dual-rate consensus under transmission delays | David Umsonst et.al. | 2506.02840 | null |
| 2025-06-03 | DeepShop: A Benchmark for Deep Research Shopping Agents | Yougang Lyu et.al. | 2506.02839 | null |
| 2025-06-03 | TaxAgent: How Large Language Model Designs Fiscal Policy | Jizhou Wang et.al. | 2506.02838 | null |
| 2025-06-03 | Solving the Pod Repositioning Problem with Deep Reinforced Adaptive Large Neighborhood Search | Lin Xie et.al. | 2506.02746 | null |
| 2025-06-03 | Why do AI agents communicate in human language? | Pengcheng Zhou et.al. | 2506.02739 | null |
| 2025-06-03 | Benchmarking and Advancing Large Language Models for Local Life Services | Xiaochong Lan et.al. | 2506.02720 | null |
| 2025-06-03 | Heterogeneous Group-Based Reinforcement Learning for LLM-based Multi-Agent Systems | Guanzhong Chen et.al. | 2506.02718 | null |
| 2025-06-04 | MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching | Liang Yue et.al. | 2506.02689 | null |
| 2025-06-03 | Decompose, Plan in Parallel, and Merge: A Novel Paradigm for Large Language Models based Planning with Multiple Constraints | Zhengdong Lu et.al. | 2506.02683 | null |
| 2025-06-03 | Bounded confidence dynamics generates opinion cascades on a growing scale-free network | David Hernandez et.al. | 2506.02669 | null |
| 2025-06-03 | FAuNO: Semi-Asynchronous Federated Reinforcement Learning Framework for Task Offloading in Edge Systems | Frederico Metelo et.al. | 2506.02668 | null |
| 2025-06-04 | Non-exchangeable evolutionary and mean field games and their applications | H. Yoshioka et.al. | 2506.02644 | null |
| 2025-06-03 | Compositional Learning for Modular Multi-Agent Self-Organizing Networks | Qi Liao et.al. | 2506.02616 | null |
| 2025-06-04 | Multi Layered Autonomy and AI Ecologies in Robotic Art Installations | Baoyang Chen et.al. | 2506.02606 | null |
| 2025-06-03 | Computational adversarial risk analysis for general security games | Jose Manuel Camacho et.al. | 2506.02603 | null |
| 2025-06-03 | A Hybrid Approach to Indoor Social Navigation: Integrating Reactive Local Planning and Proactive Global Planning | Arnab Debnath et.al. | 2506.02593 | null |
| 2025-06-03 | CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at Scale | Zhun Wang et.al. | 2506.02548 | link |
| 2025-06-03 | Attention Knows Whom to Trust: Attention-based Trust Management for LLM Multi-Agent Systems | Pengfei He et.al. | 2506.02546 | null |
| 2025-06-03 | VerificAgent: Integrating Expert Knowledge and Fact-Checked Memory for Robust Domain-Specific Task Planning | Thong Q. Nguyen et.al. | 2506.02539 | null |
| 2025-06-03 | Think Twice, Act Once: A Co-Evolution Framework of LLM and RL for Large-Scale Decision Making | Xu Wan et.al. | 2506.02522 | null |
| 2025-06-03 | To Embody or Not: The Effect Of Embodiment On User Perception Of LLM-based Conversational Agents | Kyra Wang et.al. | 2506.02514 | link |
| 2025-06-03 | AURA: Agentic Upskilling via Reinforced Abstractions | Alvin Zhu et.al. | 2506.02507 | null |
| 2025-06-03 | VPI-Bench: Visual Prompt Injection Attacks for Computer-Use Agents | Tri Cao et.al. | 2506.02456 | link |
| 2025-06-03 | Multimodal DeepResearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework | Zhaorui Yang et.al. | 2506.02454 | null |
| 2025-06-03 | From Anger to Joy: How Nationality Personas Shape Emotion Attribution in Large Language Models | Mahammed Kamruzzaman et.al. | 2506.02431 | null |
| 2025-06-04 | Comparative Analysis of AI Agent Architectures for Entity Relationship Classification | Maryam Berijanian et.al. | 2506.02426 | link |
| 2025-06-03 | VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments | Zelai Xu et.al. | 2506.02387 | null |
| 2025-06-03 | Multi-agent Markov Entanglement | Shuze Chen et.al. | 2506.02385 | null |
| 2025-06-03 | Evaluating LLM Agent Adherence to Hierarchical Safety Principles: A Lightweight Benchmark for Probing Foundational Controllability Components | Ram Potham et.al. | 2506.02357 | null |
| 2025-06-03 | DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization | Jeonghun Kang et.al. | 2506.02351 | null |
| 2025-06-02 | LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback | Thai Hoang et.al. | 2506.02298 | null |
| 2025-06-02 | Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction | Samuel Li et.al. | 2506.02265 | null |
| 2025-06-02 | Composable Building Blocks for Controllable and Transparent Interactive AI Systems | Sebe Vanbrabant et.al. | 2506.02262 | null |
| 2025-06-02 | Stochastically Dominant Peer Prediction | Yichi Zhang et.al. | 2506.02259 | null |
| 2025-06-02 | Optimal Coordination of Flexible DERs in Local Energy and Flexibility Markets to Ensure Social Equity | Niloofar Pourghaderi et.al. | 2506.02179 | null |
| 2025-06-02 | Reflection-Based Memory For Web navigation Agents | Ruhana Azam et.al. | 2506.02158 | null |
| 2025-06-02 | Small Language Models are the Future of Agentic AI | Peter Belcak et.al. | 2506.02153 | null |
| 2025-06-04 | The Unified Cognitive Consciousness Theory for Language Models: Anchoring Semantics, Thresholds of Activation, and Emergent Reasoning | Edward Y. Chang et.al. | 2506.02139 | null |
| 2025-06-02 | Descriptive History Representations: Learning Representations by Answering Questions | Guy Tennenholtz et.al. | 2506.02125 | null |
| 2025-06-02 | Enhancing Interpretability of Quantum-Assisted Blockchain Clustering via AI Agent-Based Qualitative Analysis | Yun-Cheng Tsai et.al. | 2506.02068 | null |
| 2025-06-01 | The Measurement Imbalance in Agentic AI Evaluation Undermines Industry Productivity Claims | Kiana Jafari Meimandi et.al. | 2506.02064 | null |
| 2025-06-01 | Will Agents Replace Us? Perceptions of Autonomous Multi-Agent AI | Nikola Balic et.al. | 2506.02055 | link |
| 2025-06-01 | Phenotypic Profile-Informed Generation of Drug-Like Molecules via Dual-Channel Variational Autoencoders | Hui Liu et.al. | 2506.02051 | null |
| 2025-06-01 | Decoupled Hierarchical Reinforcement Learning with State Abstraction for Discrete Grids | Qingyu Xiao et.al. | 2506.02050 | null |
| 2025-06-01 | EvoGit: Decentralized Code Evolution via Git-Based Multi-Agent Collaboration | Beichen Huang et.al. | 2506.02049 | link |
| 2025-06-01 | Improving LLM Agents with Reinforcement Learning on Cryptographic CTF Challenges | Lajos Muzsai et.al. | 2506.02048 | null |
| 2025-05-31 | Beyond the Protocol: Unveiling Attack Vectors in the Model Context Protocol Ecosystem | Hao Song et.al. | 2506.02040 | link |
| 2025-06-02 | WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks | Atsuyuki Miyai et.al. | 2506.01952 | null |
| 2025-06-02 | Should Decision-Makers Reveal Classifiers in Online Strategic Classification? | Han Shao et.al. | 2506.01936 | null |
| 2025-06-02 | Online Competitive Information Gathering for Partially Observable Trajectory Games | Mel Krusniak et.al. | 2506.01927 | null |
| 2025-06-02 | COALESCE: Economic and Security Dynamics of Skill-Based Task Outsourcing Among Team of Autonomous LLM Agents | Manish Bhatt et.al. | 2506.01900 | null |
| 2025-06-02 | WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue | Yaoyao Qian et.al. | 2506.01881 | link |
| 2025-06-02 | Pearl: Automatic Code Optimization Using Deep Reinforcement Learning | Djamel Rassem Lamouri et.al. | 2506.01880 | null |
| 2025-06-02 | CONFETTI: Conversational Function-Calling Evaluation Through Turn-Level Interactions | Tamer Alkhouli et.al. | 2506.01859 | null |
| 2025-06-02 | Beyond Static Responses: Multi-Agent LLM Systems as a New Paradigm for Social Science Research | Jennifer Haase et.al. | 2506.01839 | null |
| 2025-06-02 | The Ultimate Test of Superintelligent AI Agents: Can an AI Balance Care and Control in Asymmetric Relationships? | Djallel Bouneffouf et.al. | 2506.01813 | null |
| 2025-06-02 | A Study on the MCP x A2A Framework for Enhancing Interoperability of LLM-based Autonomous Agents | Cheonsu Jeong et.al. | 2506.01804 | null |
| 2025-06-02 | Enhancing Customer Service Chatbots with Context-Aware NLU through Selective Attention and Multi-task Learning | Subhadip Nandi et.al. | 2506.01781 | null |
| 2025-06-02 | Thinking in Character: Advancing Role-Playing Agents with Role-Aware Reasoning | Yihong Tang et.al. | 2506.01748 | null |
| 2025-06-02 | Self-Challenging Language Model Agents | Yifei Zhou et.al. | 2506.01716 | null |
| 2025-06-02 | A Descriptive and Normative Theory of Human Beliefs in RLHF | Sylee Dandekar et.al. | 2506.01692 | null |
| 2025-06-02 | Geometry Meets Incentives: Sample-Efficient Incentivized Exploration with Linear Contexts | Benjamin Schiffer et.al. | 2506.01685 | null |
| 2025-06-02 | A Hierarchical Bin Packing Framework with Dual Manipulators via Heuristic Search and Deep Reinforcement Learning | Beomjoon Lee et.al. | 2506.01628 | null |
| 2025-06-02 | Social Cooperation in Conversational AI Agents | Mustafa Mert Çelikok et.al. | 2506.01624 | null |
| 2025-06-02 | MAGIK: Mapping to Analogous Goals via Imagination-enabled Knowledge Transfer | Ajsal Shereef Palattuparambil et.al. | 2506.01623 | null |
| 2025-06-02 | General agents need world models | Jonathan Richens et.al. | 2506.01622 | null |
| 2025-06-02 | MLA-Trust: Benchmarking Trustworthiness of Multimodal LLM Agents in GUI Environments | Xiao Yang et.al. | 2506.01616 | null |
| 2025-06-02 | Trajectory First: A Curriculum for Discovering Diverse Policies | Cornelius V. Braun et.al. | 2506.01568 | null |
| 2025-06-02 | EvolveNav: Self-Improving Embodied Reasoning for LLM-Based Vision-Language Navigation | Bingqian Lin et.al. | 2506.01551 | null |
| 2025-06-03 | LAMARL: LLM-Aided Multi-Agent Reinforcement Learning for Cooperative Policy Generation | Guobin Zhu et.al. | 2506.01538 | null |
| 2025-06-03 | Quantum Agents | Eldar Sultanow et.al. | 2506.01536 | null |
| 2025-06-03 | STORM-BORN: A Challenging Mathematical Derivations Dataset Curated via a Human-in-the-Loop Multi-Agent Framework | Wenhao Liu et.al. | 2506.01531 | link |
| 2025-06-02 | FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents | Bobo Li et.al. | 2506.01520 | null |
| 2025-06-02 | PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization | Zouying Cao et.al. | 2506.01475 | null |
| 2025-06-02 | Agentic AI and Multiagentic: Are We Reinventing the Wheel? | V. Botti et.al. | 2506.01463 | null |
| 2025-06-02 | Agentic Episodic Control | Xidong Yang et.al. | 2506.01442 | null |
| 2025-06-02 | Distinguishing Autonomous AI Agents from Collaborative Agentic Systems: A Comprehensive Framework for Understanding Modern Intelligent Architectures | Prashik Buddhaghosh Bansod et.al. | 2506.01438 | null |
| 2025-06-02 | FinRobot: Generative Business Process AI Agents for Enterprise Resource Planning in Finance | Hongyang Yang et.al. | 2506.01423 | null |
| 2025-06-02 | SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation | Rafael Flor-Rodríguez et.al. | 2506.01418 | link |
| 2025-06-02 | Sparse Imagination for Efficient Visual World Model Planning | Junha Chun et.al. | 2506.01392 | null |
| 2025-06-02 | AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning | Zhong Zhang et.al. | 2506.01391 | link |
| 2025-06-02 | Follow the Flow: Fine-grained Flowchart Attribution with Neurosymbolic Agents | Manan Suri et.al. | 2506.01344 | null |
| 2025-06-02 | Enhancing Interpretable Image Classification Through LLM Agents and Conditional Concept Bottleneck Models | Yiwen Jiang et.al. | 2506.01334 | null |
| 2025-06-02 | An Empirical Study of Group Conformity in Multi-Agent Systems | Min Choi et.al. | 2506.01332 | null |
| 2025-06-02 | ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding | Yiyang Zhou et.al. | 2506.01300 | null |
| 2025-06-02 | RAISE: Reasoning Agent for Interactive SQL Exploration | Fernando Granado et.al. | 2506.01273 | null |
| 2025-06-02 | CleanS2S: Single-file Framework for Proactive Speech-to-Speech Interaction | Yudong Lu et.al. | 2506.01268 | null |
| 2025-06-02 | Comprehensive Vulnerability Analysis is Necessary for Trustworthy LLM-MAS | Pengfei He et.al. | 2506.01245 | null |
| 2025-06-01 | A mean field game model with non-local spatial interactions and resources accumulation | Daria Ghilli et.al. | 2506.01200 | null |
| 2025-06-04 | Test Automation for Interactive Scenarios via Promptable Traffic Simulation | Augusto Mondelli et.al. | 2506.01199 | null |
| 2025-06-01 | Near-feasible Fair Allocations in Two-sided Markets | Javier Cembrano et.al. | 2506.01178 | null |
| 2025-06-01 | GraphPad: Inference-Time 3D Scene Graph Updates for Embodied Question Answering | Muhammad Qasim Ali et.al. | 2506.01174 | null |
| 2025-06-01 | Towards Fusion of Neural Audio Codec-based Representations with Spectral for Heart Murmur Classification via Bandit-based Cross-Attention Mechanism | Orchid Chetia Phukan et.al. | 2506.01148 | null |
| 2025-06-01 | DeepVerse: 4D Autoregressive Video Generation as a World Model | Junyi Chen et.al. | 2506.01103 | null |
| 2025-06-01 | Modular Speaker Architecture: A Framework for Sustaining Responsibility and Contextual Integrity in Multi-Agent AI Communication | Khe-Han Toh et.al. | 2506.01095 | null |
| 2025-06-01 | The Coming Crisis of Multi-Agent Misalignment: AI Alignment Must Be a Dynamic and Social Process | Florian Carichon et.al. | 2506.01080 | null |
| 2025-06-01 | SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models | Thinh Pham et.al. | 2506.01062 | null |
| 2025-06-04 | MCP-Zero: Proactive Toolchain Construction for LLM Agents from Scratch | Xiang Fei et.al. | 2506.01056 | null |
| 2025-06-01 | Simple Prompt Injection Attacks Can Leak Personal Data Observed by LLM Agents During Task Execution | Meysam Alizadeh et.al. | 2506.01055 | null |
| 2025-06-01 | Robust and Safe Multi-Agent Reinforcement Learning Framework with Communication for Autonomous Vehicles | Keshawn Smith et.al. | 2506.00982 | null |
| 2025-06-01 | HMPC-assisted Adversarial Inverse Reinforcement Learning for Smart Home Energy Management | Jiadong He et.al. | 2506.00898 | null |
| 2025-06-01 | Toward a Theory of Agents as Tool-Use Decision-Makers | Hongru Wang et.al. | 2506.00886 | null |
| 2025-06-01 | CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching | Leying Zhang et.al. | 2506.00885 | null |
| 2025-06-01 | Can AI Master Econometrics? Evidence from Econometrics AI Agent on Expert-Level Tasks | Qiang Chen et.al. | 2506.00856 | null |
| 2025-06-01 | Federated Deep Reinforcement Learning-Driven O-RAN for Automatic Multirobot Reconfiguration | Faisal Ahmed et.al. | 2506.00822 | null |
| 2025-06-01 | Action Dependency Graphs for Globally Optimal Coordinated Reinforcement Learning | Jianglin Ding et.al. | 2506.00797 | null |
| 2025-06-01 | Predicting Empirical AI Research Outcomes with Language Models | Jiaxin Wen et.al. | 2506.00794 | null |
| 2025-06-01 | CO-OPERA: A Human-AI Collaborative Playwriting Tool to Support Creative Storytelling for Interdisciplinary Drama Education | Xuejiao Ma et.al. | 2506.00791 | link |
| 2025-06-01 | CoP: Agentic Red-teaming for Large Language Models using Composition of Principles | Chen Xiong et.al. | 2506.00781 | null |
| 2025-05-31 | Alignment Revisited: Are Large Language Models Consistent in Stated and Revealed Preferences? | Zhuojun Gu et.al. | 2506.00751 | null |
| 2025-05-31 | DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments | Chiyu Zhang et.al. | 2506.00739 | link |
| 2025-05-31 | Adaptive Plane Reformatting for 4D Flow MRI using Deep Reinforcement Learning | Javier Bisbal et.al. | 2506.00727 | null |
| 2025-05-31 | Browser Fingerprinting Using WebAssembly | Mordechai Guri et.al. | 2506.00719 | null |
| 2025-05-31 | An LLM Agent for Functional Bug Detection in Network Protocols | Mingwei Zheng et.al. | 2506.00714 | link |
| 2025-05-31 | Adaptive Traffic-Following Scheme for Orderly Distributed Control of Multi-Vehicle Systems | Anahita Jain et.al. | 2506.00703 | null |
| 2025-06-04 | Optimizing Sensory Neurons: Nonlinear Attention Mechanisms for Accelerated Convergence in Permutation-Invariant Neural Networks for Reinforcement Learning | Junaid Muzaffar et.al. | 2506.00691 | null |
| 2025-05-31 | AgentAuditor: Human-Level Safety and Security Evaluation for LLM Agents | Hanjun Luo et.al. | 2506.00641 | null |
| 2025-05-31 | Social Construction of Urban Space: Understanding Neighborhood Boundaries Using Rental Listings | Adam Visokay et.al. | 2506.00634 | null |
| 2025-05-31 | The Disparate Effects of Partial Information in Bayesian Strategic Learning | Srikanth Avasarala et.al. | 2506.00627 | null |
| 2025-06-04 | RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents | Jingyi Yang et.al. | 2506.00618 | null |
| 2025-05-31 | PAKTON: A Multi-Agent Framework for Question Answering in Long Legal Agreements | Petros Raptopoulos et.al. | 2506.00608 | link |
| 2025-05-31 | Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn | Hongyao Tang et.al. | 2506.00592 | null |
| 2025-05-31 | Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs | Yufa Zhou et.al. | 2506.00577 | link |
| 2025-05-31 | ORAN-GUIDE: RAG-Driven Prompt Learning for LLM-Augmented Reinforcement Learning in O-RAN Network Slicing | Fatemeh Lotfi et.al. | 2506.00576 | null |
| 2025-05-31 | Prompt-Tuned LLM-Augmented DRL for Dynamic O-RAN Network Slicing | Fatemeh Lotfi et.al. | 2506.00574 | null |
| 2025-05-31 | MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning | Peng Xia et.al. | 2506.00555 | null |
| 2025-05-31 | Two-Sided Manipulation Games in Stable Matching Markets | Hadi Hosseini et.al. | 2506.00554 | null |
| 2025-05-31 | AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation | Ming Wang et.al. | 2506.00551 | link |
| 2025-05-31 | Towards Multi-dimensional Evaluation of LLM Summarization across Domains and Languages | Hyangsuk Min et.al. | 2506.00549 | null |
| 2025-05-31 | Flying Co-Stereo: Enabling Long-Range Aerial Dense Mapping via Collaborative Stereo Vision of Dynamic-Baseline | Zhaoying Wang et.al. | 2506.00546 | null |
| 2025-06-04 | ARIA: Training Language Agents with Intention-Driven Reward Aggregation | Ruihan Yang et.al. | 2506.00539 | null |
| 2025-05-31 | Temac: Multi-Agent Collaboration for Automated Web GUI Testing | Chenxu Liu et.al. | 2506.00520 | null |
| 2025-05-31 | Goal-Aware Identification and Rectification of Misinformation in Multi-Agent Systems | Zherui Li et.al. | 2506.00509 | null |
| 2025-05-31 | Reinforcement Learning for Hanabi | Nina Cohen et.al. | 2506.00458 | null |
| 2025-05-31 | RLAE: Reinforcement Learning-Assisted Ensemble for LLMs | Yuqian Fu et.al. | 2506.00439 | null |
| 2025-05-31 | Enabling Chatbots with Eyes and Ears: An Immersive Multimodal Conversation System for Dynamic Interactions | Jihyoung Jang et.al. | 2506.00421 | null |
| 2025-05-31 | World Models for Cognitive Agents: Transforming Edge Intelligence in Future Networks | Changyuan Zhao et.al. | 2506.00417 | null |
| 2025-05-31 | LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks | Yi Yang et.al. | 2506.00411 | null |
| 2025-05-31 | Sensor Fusion Methods for Gaussian Mixture Models | Ishan Paranjape et.al. | 2506.00383 | null |
| 2025-05-31 | Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents | Xiao Yu et.al. | 2506.00320 | null |
| 2025-05-30 | Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model | Oliver Mortensen et.al. | 2506.00286 | null |
| 2025-05-30 | MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility | Yexiao He et.al. | 2506.00235 | null |
| 2025-05-30 | Sorrel: A simple and flexible framework for multi-agent reinforcement learning | Rebekah A. Gelpí et.al. | 2506.00228 | link |
| 2025-05-30 | REIC: RAG-Enhanced Intent Classification at Scale | Ziji Zhang et.al. | 2506.00210 | null |
| 2025-05-30 | When GPT Spills the Tea: Comprehensive Assessment of Knowledge File Leakage in GPTs | Xinyue Shen et.al. | 2506.00197 | null |
| 2025-05-30 | Breakpoint: Scalable evaluation of system-level reasoning in LLM code agents | Kaivalya Hariharan et.al. | 2506.00172 | null |
| 2025-06-03 | A novel sensitivity analysis method for agent-based models stratifies in-silico tumor spheroid simulations | Edward H. Rohr et.al. | 2506.00168 | null |
| 2025-05-30 | Werewolf: A Straightforward Game Framework with TTS for Improved User Engagement | Qihui Fan et.al. | 2506.00160 | null |
| 2025-05-30 | MRDust: Wireless Implant Data Uplink & Localization via Magnetic Resonance Image Modulation | Biqi Rebekah Zhao et.al. | 2506.00143 | null |
| 2025-05-30 | Autonomous Behavior and Whole-Brain Dynamics Emerge in Embodied Zebrafish Agents with Model-based Intrinsic Motivation | Reece Keller et.al. | 2506.00138 | null |
| 2025-05-30 | A Reinforcement Learning-Based Telematic Routing Protocol for the Internet of Underwater Things | Mohammadhossein Homaei et.al. | 2506.00133 | null |
| 2025-05-30 | Adapting Offline Reinforcement Learning with Online Delays | Simon Sinong Zhan et.al. | 2506.00131 | null |
| 2025-05-30 | Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents | Yaxin Luo et.al. | 2505.24878 | link |
| 2025-05-30 | Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks | Tajamul Ashraf et.al. | 2505.24876 | link |
| 2025-05-30 | VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software | Brandon Man et.al. | 2505.24838 | link |
| 2025-05-30 | Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation | Yucheng Zhou et.al. | 2505.24787 | link |
| 2025-06-02 | EXP-Bench: Can AI Conduct AI Research Experiments? | Patrick Tser Jern Kon et.al. | 2505.24785 | link |
| 2025-05-30 | Emergent Dynamics of Active Systems on Curved Environments | Euan D. Mackay et.al. | 2505.24730 | null |
| 2025-05-30 | CoRet: Improved Retriever for Code Editing | Fabio Fehr et.al. | 2505.24715 | null |
| 2025-05-30 | Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting | Wei Chen et.al. | 2505.24710 | link |
| 2025-05-30 | Towards a unified user modeling language for engineering human centered AI systems | Aaron Conrardy et.al. | 2505.24697 | null |
| 2025-05-30 | Multiple LLM Agents Debate for Equitable Cultural Alignment | Dayeon Ki et.al. | 2505.24671 | link |
| 2025-05-30 | Black-box Adversarial Attacks on CNN-based SLAM Algorithms | Maria Rafaela Gkeka et.al. | 2505.24654 | null |
| 2025-05-30 | Online Budget-Feasible Mechanism Design with Predictions | Georgios Amanatidis et.al. | 2505.24624 | null |
| 2025-05-30 | Distributed Intelligence in the Computing Continuum with Active Inference | Victor Casamayor Pujol et.al. | 2505.24618 | null |
| 2025-05-30 | When Harry Meets Superman: The Role of The Interlocutor in Persona-Based Dialogue Generation | Daniela Occhipinti et.al. | 2505.24613 | null |
| 2025-06-02 | AutoChemSchematic AI: A Closed-Loop, Physics-Aware Agentic Framework for Auto-Generating Chemical Process and Instrumentation Diagrams | Sakhinana Sagar Srinivas et.al. | 2505.24584 | null |
| 2025-05-30 | NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization | Hyuntak Kim et.al. | 2505.24575 | null |
| 2025-05-30 | CREFT: Sequential Multi-Agent LLM for Character Relation Extraction | Ye Eun Chun et.al. | 2505.24553 | null |
| 2025-05-30 | Melding the Serverless Control Plane with the Conventional Cluster Manager for Speed and Compatibility | Leonid Kondrashov et.al. | 2505.24551 | null |
| 2025-05-30 | Online Fair Division with Additional Information | Tzeh Yuan Neoh et.al. | 2505.24503 | null |
| 2025-05-30 | RMoA: Optimizing Mixture-of-Agents through Diversity Maximization and Residual Compensation | Zhentao Xie et.al. | 2505.24442 | link |
| 2025-05-30 | P: A Universal Measure of Predictive Intelligence | David Gamez et.al. | 2505.24426 | null |
| 2025-05-30 | Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer | Yilun Kong et.al. | 2505.24378 | link |
| 2025-05-30 | Unifying Language Agent Algorithms with Graph-based Orchestration Engine for Reproducible Agent Research | Qianqian Zhang et.al. | 2505.24354 | null |
| 2025-05-30 | Context-Aware Sentiment Forecasting via LLM-based Multi-Perspective Role-Playing Agents | Fanhang Man et.al. | 2505.24331 | link |
| 2025-05-30 | Online Fair Allocations with Binary Valuations and Beyond | Yuanyuan Wang et.al. | 2505.24321 | null |
| 2025-05-30 | ROAD: Responsibility-Oriented Reward Design for Reinforcement Learning in Autonomous Driving | Yongming Chen et.al. | 2505.24317 | null |
| 2025-05-30 | R3DM: Enabling Role Discovery and Diversity Through Dynamics Models in Multi-agent Reinforcement Learning | Harsh Goel et.al. | 2505.24265 | link |
| 2025-05-30 | Effects of Theory of Mind and Prosocial Beliefs on Steering Human-Aligned Behaviors of LLMs in Ultimatum Games | Neemesh Yadav et.al. | 2505.24255 | link |
| 2025-05-30 | Rethinking Continual Learning with Progressive Neural Collapse | Zheng Wang et.al. | 2505.24254 | null |
| 2025-05-30 | Proactive Guidance of Multi-Turn Conversation in Industrial Search | Xiaoyu Li et.al. | 2505.24251 | null |
| 2025-05-30 | An Adversary-Resistant Multi-Agent LLM System via Credibility Scoring | Sana Ebrahimi et.al. | 2505.24239 | null |
| 2025-05-30 | SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems | Xu He et.al. | 2505.24201 | null |
| 2025-05-30 | Learning Gentle Humanoid Locomotion and End-Effector Stabilization Control | Yitang Li et.al. | 2505.24198 | link |
| 2025-05-30 | Learning API Functionality from Demonstrations for Tool-based Agents | Bhrij Patel et.al. | 2505.24197 | null |
| 2025-05-30 | Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control | Zijie Xu et.al. | 2505.24161 | null |
| 2025-05-30 | Don't Just Follow MLLM Plans: Robust and Efficient Planning for Open-world Agents | Seungjoon Lee et.al. | 2505.24157 | null |
| 2025-05-30 | Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction | Chenyou Fan et.al. | 2505.24156 | null |
| 2025-05-30 | Biological Pathway Guided Gene Selection Through Collaborative Reinforcement Learning | Ehtesamul Azim et.al. | 2505.24155 | link |
| 2025-05-30 | Distributed Neural Policy Gradient Algorithm for Global Convergence of Networked Multi-Agent Reinforcement Learning | Pengcheng Dai et.al. | 2505.24113 | null |
| 2025-05-30 | Deception in Oligopoly Games via Adaptive Nash Seeking Systems | Michael Tang et.al. | 2505.24112 | null |
| 2025-05-29 | mRAG: Elucidating the Design Space of Multi-modal Retrieval-Augmented Generation | Chan-Wei Hu et.al. | 2505.24073 | null |
| 2025-05-29 | Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning | Jiashun Liu et.al. | 2505.24061 | null |
| 2025-05-29 | LLM Agents Should Employ Security Principles | Kaiyuan Zhang et.al. | 2505.24019 | null |
| 2025-05-29 | ConversAR: Exploring Embodied LLM-Powered Group Conversations in Augmented Reality for Second Language Learners | Jad Bendarkawi et.al. | 2505.24000 | null |
| 2025-05-29 | Multi-RAG: A Multimodal Retrieval-Augmented Generation System for Adaptive Video Understanding | Mingyang Mao et.al. | 2505.23990 | null |
| 2025-05-29 | Rules, agents and order | Amalia Puente et.al. | 2505.23985 | null |
| 2025-05-29 | Information Structure in Mappings: An Approach to Learning, Representation, and Generalisation | Henry Conklin et.al. | 2505.23960 | null |
| 2025-05-29 | Estimating Misreporting in the Presence of Genuine Modification: A Causal Perspective | Dylan Zapzalka et.al. | 2505.23954 | null |
| 2025-05-29 | Enhancing LLM-Based Code Generation with Complexity Metrics: A Feedback-Driven Approach | Melika Sepidband et.al. | 2505.23953 | null |
| 2025-05-29 | InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback | Boyuan Chen et.al. | 2505.23950 | null |
| 2025-05-29 | Lessons Learned: A Multi-Agent Framework for Code LLMs to Learn and Improve | Yuanzhe Liu et.al. | 2505.23946 | null |
| 2025-05-29 | ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents | Feiteng Fang et.al. | 2505.23923 | null |
| 2025-05-29 | OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation | Mengkang Hu et.al. | 2505.23885 | link |
| 2025-05-29 | Combining Deep Architectures for Information Gain estimation and Reinforcement Learning for multiagent field exploration | Emanuele Masiero et.al. | 2505.23865 | null |
| 2025-05-29 | DATD3: Depthwise Attention Twin Delayed Deep Deterministic Policy Gradient For Model Free Reinforcement Learning Under Output Feedback Control | Wuhao Wang et.al. | 2505.23857 | null |
| 2025-05-29 | Large Language Model-Based Agents for Automated Research Reproducibility: An Exploratory Study in Alzheimer's Disease | Nic Dobbins et.al. | 2505.23852 | null |
| 2025-05-28 | Seven Security Challenges That Must be Solved in Cross-domain Multi-agent LLM Systems | Ronny Ko et.al. | 2505.23847 | null |
| 2025-05-28 | Scalable, Symbiotic, AI and Non-AI Agent Based Parallel Discrete Event Simulations | Atanu Barai et.al. | 2505.23846 | null |
| 2025-05-28 | GeneBreaker: Jailbreak Attacks against DNA Language Models with Pathogenicity Guidance | Zaixi Zhang et.al. | 2505.23839 | link |
| 2025-05-28 | CoMaPOI: A Collaborative Multi-Agent Framework for Next POI Prediction Bridging the Gap Between Trajectory and Language | Lin Zhong et.al. | 2505.23837 | null |
| 2025-05-28 | Large Language Models Often Know When They Are Being Evaluated | Joe Needham et.al. | 2505.23836 | null |
| 2025-05-28 | Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective | Qingchuan Ma et.al. | 2505.23833 | link |
| 2025-05-28 | Privacy-Preserving Inconsistency Measurement | Carl Corea et.al. | 2505.23825 | null |
| 2025-05-27 | Aligning LLMs by Predicting Preferences from User Writing Samples | Stéphane Aroca-Ouellette et.al. | 2505.23815 | null |
| 2025-05-29 | From Chat Logs to Collective Insights: Aggregative Question Answering | Wentao Zhang et.al. | 2505.23765 | null |
| 2025-05-29 | ZeroGUI: Automating Online GUI Learning at Zero Human Cost | Chenyu Yang et.al. | 2505.23762 | link |
| 2025-05-29 | ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks | Akashah Shabbir et.al. | 2505.23752 | link |
| 2025-05-29 | ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering | Zexi Liu et.al. | 2505.23723 | link |
| 2025-05-29 | COBRA: Contextual Bandit Algorithm for Ensuring Truthful Strategic Agents | Arun Verma et.al. | 2505.23720 | null |
| 2025-05-29 | From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems | Zeinab Nezami et.al. | 2505.23710 | null |
| 2025-05-29 | Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics | Ran Zhang et.al. | 2505.23695 | link |
| 2025-05-29 | ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork | Caroline Wang et.al. | 2505.23686 | link |
| 2025-05-31 | GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents | Manish Shetty et.al. | 2505.23671 | link |
| 2025-05-29 | Initial Luminally Deposited FGF4 Critically Influences Blastocyst Patterning | Michael A. Ramirez-Sierra et.al. | 2505.23650 | null |
| 2025-05-29 | Securing AI Agents with Information-Flow Control | Manuel Costa et.al. | 2505.23643 | link |
| 2025-05-29 | MCP Safety Training: Learning to Refuse Falsely Benign MCP Exploits using Improved Preference Alignment | John Halloran et.al. | 2505.23634 | null |
| 2025-06-02 | MAPLE: A Mobile Agent with Persistent Finite State Machines for Structured Task Reasoning | Linqiang Guo et.al. | 2505.23596 | null |
| 2025-05-29 | SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents | Kunlun Zhu et.al. | 2505.23559 | link |
| 2025-05-29 | Going from a Representative Agent to Counterfactuals in Combinatorial Choice | Yanqiu Ruan et.al. | 2505.23546 | null |
| 2025-05-29 | TRAP: Targeted Redirecting of Agentic Preferences | Hangoo Kang et.al. | 2505.23518 | null |
| 2025-05-29 | PhysicsNeRF: Physics-Guided 3D Reconstruction from Sparse Views | Mohamed Rayan Barhdadi et.al. | 2505.23481 | link |
| 2025-05-29 | Socratic-PRMBench: Benchmarking Process Reward Models with Systematic Reasoning Patterns | Xiang Li et.al. | 2505.23474 | null |
| 2025-05-29 | On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment | Safwan Labbi et.al. | 2505.23459 | null |
| 2025-05-29 | Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents | Zhejian Yang et.al. | 2505.23450 | null |
| 2025-06-01 | Emergent Risk Awareness in Rational Agents under Resource Constraints | Daniel Jarne Ornia et.al. | 2505.23436 | null |
| 2025-05-29 | From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents | Tobias Lindenbauer et.al. | 2505.23422 | link |
| 2025-06-01 | SWE-bench Goes Live! | Linghao Zhang et.al. | 2505.23419 | link |
| 2025-05-29 | Agent Interpolation for Knowledge | Marta Bílková et.al. | 2505.23401 | null |
| 2025-05-29 | GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning | Jusheng Zhang et.al. | 2505.23399 | null |
| 2025-05-29 | Grower-in-the-Loop Interactive Reinforcement Learning for Greenhouse Climate Control | Maxiu Xiao et.al. | 2505.23355 | null |
| 2025-05-29 | Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems | Xu Shen et.al. | 2505.23352 | link |
| 2025-06-02 | ScEdit: Script-based Assessment of Knowledge Editing | Xinye Li et.al. | 2505.23291 | link |
| 2025-05-29 | Wireless Agentic AI with Retrieval-Augmented Multimodal Semantic Perception | Guangyuan Liu et.al. | 2505.23275 | null |
| 2025-05-29 | Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion | Chunlong Xie et.al. | 2505.23266 | null |
| 2025-05-29 | Achieving Equitability with Subsidy | Yuanyuan Wang et.al. | 2505.23251 | null |
| 2025-05-29 | Context-Aware Semantic Communication for the Wireless Networks | Guangyuan Liu et.al. | 2505.23249 | null |
| 2025-05-29 | OSS-UAgent: An Agent-based Usability Evaluation Framework for Open Source Software | Lingkai Meng et.al. | 2505.23239 | link |
| 2025-05-29 | TrackVLA: Embodied Visual Tracking in the Wild | Shaoan Wang et.al. | 2505.23189 | null |
| 2025-05-29 | Cross-Task Experiential Learning on LLM-based Multi-Agent Collaboration | Yilong Li et.al. | 2505.23187 | null |
| 2025-05-29 | Conceptual Framework Toward Embodied Collective Adaptive Intelligence | Fan Wang et.al. | 2505.23153 | null |
| 2025-05-29 | Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners | Michal Nauman et.al. | 2505.23150 | null |
| 2025-05-29 | PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents | Haoyu Chen et.al. | 2505.23130 | null |
| 2025-05-29 | Learning to Incentivize in Repeated Principal-Agent Problems with Adversarial Agent Arrivals | Junyan Liu et.al. | 2505.23124 | null |
| 2025-05-29 | A Constructed Response: Designing and Choreographing Robot Arm Movements in Collaborative Dance Improvisation | Xiaoyu Chang et.al. | 2505.23090 | null |
| 2025-05-29 | Second Opinion Matters: Towards Adaptive Clinical AI via the Consensus of Expert Model Ensemble | Amit Kumthekar et.al. | 2505.23075 | null |
| 2025-05-29 | CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents | Zhen Xiang et.al. | 2505.23055 | link |
| 2025-05-29 | AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models | Jinchuan Zhang et.al. | 2505.23020 | link |
| 2025-06-01 | Stairway to Success: Zero-Shot Floor-Aware Object-Goal Navigation via LLM-Driven Coarse-to-Fine Exploration | Zeying Gong et.al. | 2505.23019 | null |
| 2025-05-29 | A Practical Approach for Building Production-Grade Conversational Agents with Workflow Graphs | Chiwan Park et.al. | 2505.23006 | null |
| 2025-05-29 | LLM Agents for Bargaining with Utility-based Feedback | Jihwan Oh et.al. | 2505.22998 | null |
| 2025-05-29 | Verify-in-the-Graph: Entity Disambiguation Enhancement for Complex Claim Verification with Interactive Graph Representation | Hoang Pham et.al. | 2505.22993 | null |
| 2025-05-29 | MenTeR: A fully-automated Multi-agenT workflow for end-to-end RF/Analog Circuits Netlist Design | Pin-Han Chen et.al. | 2505.22990 | null |
| 2025-05-29 | Free Lunch for User Experience: Crowdsourcing Agents for Scalable User Studies | Siyang Liu et.al. | 2505.22981 | null |
| 2025-05-29 | Learning Recommender Mechanisms for Bayesian Stochastic Games | Bengisu Guresti et.al. | 2505.22979 | null |
| 2025-05-29 | MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary Programming | Chengqi Zheng et.al. | 2505.22967 | null |
| 2025-05-29 | ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind | Peixuan Han et.al. | 2505.22961 | link |
| 2025-05-29 | Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness | Yongjin Yang et.al. | 2505.22960 | null |
| 2025-05-29 | Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents | Jenny Zhang et.al. | 2505.22954 | link |
| 2025-05-28 | WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning | Yuchen Zhuang et.al. | 2505.22942 | null |
| 2025-05-28 | A Smart-Contract to Resolve Multiple Equilibrium in Intermediated Trade | Daniel Aronoff et.al. | 2505.22940 | null |
| 2025-05-28 | On the Resolution of Stochastic MPECs over Networks: Distributed Implicit Zeroth-Order Gradient Tracking Methods | Mohammadjavad Ebrahimi et.al. | 2505.22916 | null |
| 2025-05-28 | Learning to Charge More: A Theoretical Study of Collusion by Q-Learning Agents | Cristian Chica et.al. | 2505.22909 | null |
| 2025-05-28 | Conversational Alignment with Artificial Intelligence in Context | Rachel Katharine Sterken et.al. | 2505.22907 | null |
| 2025-05-30 | Causal-PIK: Causality-based Physical Reasoning with a Physics-Informed Kernel | Carlota Parés-Morlans et.al. | 2505.22861 | null |
| 2025-05-28 | Operationalizing CaMeL: Strengthening LLM Defenses for Enterprise Deployment | Krti Tallam et.al. | 2505.22852 | null |
| 2025-05-28 | RocqStar: Leveraging Similarity-driven Retrieval and Agentic Systems for Rocq generation | Nikita Khramov et.al. | 2505.22846 | null |
| 2025-05-28 | A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems | Jonghan Lim et.al. | 2505.22814 | null |
| 2025-05-28 | First Steps Towards Overhearing LLM Agents: A Case Study With Dungeons & Dragons Gameplay | Andrew Zhu et.al. | 2505.22809 | link |
| 2025-05-28 | Dynamic Task Adaptation for Multi-Robot Manufacturing Systems with Large Language Models | Jonghan Lim et.al. | 2505.22804 | null |
| 2025-05-28 | Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games | Antonio Ocello et.al. | 2505.22781 | null |
| 2025-05-28 | MEDAL: A Framework for Benchmarking LLMs as Multilingual Open-Domain Chatbots and Dialogue Evaluators | John Mendonça et.al. | 2505.22777 | link |
| 2025-05-28 | Calibrated Value-Aware Model Learning with Stochastic Environment Models | Claas Voelcker et.al. | 2505.22772 | null |
| 2025-05-28 | Enhancing Lifelong Multi-Agent Path-finding by Using Artificial Potential Fields | Arseniy Pertzovsky et.al. | 2505.22753 | null |
| 2025-05-28 | HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer | Qi Cai et.al. | 2505.22705 | link |
| 2025-05-28 | Design and testing of an agent chatbot supporting decision making with public transport data | Luca Fantin et.al. | 2505.22698 | null |
| 2025-05-28 | When Does Neuroevolution Outcompete Reinforcement Learning in Transfer Learning Tasks? | Eleni Nisioti et.al. | 2505.22696 | link |
| 2025-05-28 | LLM-ODDR: A Large Language Model Framework for Joint Order Dispatching and Driver Repositioning | Tengfei Lyu et.al. | 2505.22695 | null |
| 2025-05-28 | 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model | Wenbo Hu et.al. | 2505.22657 | null |
| 2025-05-28 | Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents | Michael Kirchhof et.al. | 2505.22655 | null |
| 2025-05-28 | WebDancer: Towards Autonomous Information Seeking Agency | Jialong Wu et.al. | 2505.22648 | link |
| 2025-06-01 | FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control | Younggyo Seo et.al. | 2505.22642 | null |
| 2025-05-28 | LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents | Rui Li et.al. | 2505.22634 | null |
| 2025-05-28 | HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined in HDDL with OpenAI Gym | Ngoc La et.al. | 2505.22597 | link |
| 2025-05-28 | GitGoodBench: A Novel Benchmark For Evaluating Agentic Performance On Git | Tobias Lindenbauer et.al. | 2505.22583 | link |
| 2025-05-30 | Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems | Hoang Pham et.al. | 2505.22571 | null |
| 2025-05-28 | Universal Visuo-Tactile Video Understanding for Embodied Interaction | Yifan Xie et.al. | 2505.22566 | null |
| 2025-05-28 | Training RL Agents for Multi-Objective Network Defense Tasks | Andres Molina-Markham et.al. | 2505.22531 | null |
| 2025-05-28 | AI instructional agent improves student's perceived learner control and learning outcome: empirical evidence from a randomized controlled trial | Fei Qin et.al. | 2505.22526 | null |
| 2025-05-28 | From Strangers to Assistants: Fast Desire Alignment for Embodied Agent-User Adaptation | Yuanfei Wang et.al. | 2505.22503 | null |
| 2025-05-28 | EvolveSearch: An Iterative Self-Evolving Search Agent | Dingchu Zhang et.al. | 2505.22501 | null |
| 2025-05-28 | Human-Centered Human-AI Collaboration (HCHAC) | Qi Gao et.al. | 2505.22477 | null |
| 2025-05-29 | Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems | Jiaxi Yang et.al. | 2505.22467 | null |
| 2025-05-28 | AI Mathematician: Towards Fully Automated Frontier Mathematical Research | Yuanhang Liu et.al. | 2505.22451 | null |
| 2025-05-28 | COSMOS: A Data-Driven Probabilistic Time Series simulator for Chemical Plumes across Spatial Scales | Arunava Nag et.al. | 2505.22436 | link |
| 2025-05-28 | Exact Algorithms and Lower Bounds for Forming Coalitions of Constrained Maximum Size | Foivos Fioravantes et.al. | 2505.22384 | null |
| 2025-05-28 | AgentDNS: A Root Domain Naming System for LLM Agents | Enfang Cui et.al. | 2505.22368 | null |
| 2025-05-28 | From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications | Feibo Jiang et.al. | 2505.22311 | null |
| 2025-05-28 | Voice CMS: updating the knowledge base of a digital assistant through conversation | Grzegorz Wolny et.al. | 2505.22303 | null |
| 2025-05-29 | YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction | Mingzhuang Wang et.al. | 2505.22250 | null |
| 2025-05-28 | Efficient Leave-one-out Approximation in LLM Multi-agent Debate Based on Introspection | Yue Cui et.al. | 2505.22192 | null |
| 2025-05-28 | MONSTR: Model-Oriented Neutron Strain Tomographic Reconstruction | Mohammad Samin Nur Chowdhury et.al. | 2505.22187 | null |
| 2025-05-28 | Online Fair Division for Personalized |
Georgios Amanatidis et.al. | 2505.22174 | null |
| 2025-05-28 | Oryx: a Performant and Scalable Algorithm for Many-Agent Coordination in Offline MARL | Claude Formanek et.al. | 2505.22151 | null |
| 2025-05-28 | Lifted Forward Planning in Relational Factored Markov Decision Processes with Concurrent Actions | Florian Andreas Marwitz et.al. | 2505.22147 | null |
| 2025-05-28 | Sentiment Simulation using Generative AI Agents | Melrose Tia et.al. | 2505.22125 | null |
| 2025-05-30 | VIRAL: Vision-grounded Integration for Reward design And Learning | Valentin Cuzin-Rambaud et.al. | 2505.22092 | link |
| 2025-05-28 | AudioGenie: A Training-Free Multi-Agent Framework for Diverse Multimodality-to-Multiaudio Generation | Yan Rong et.al. | 2505.22053 | null |
| 2025-05-28 | Reinforced Reasoning for Embodied Planning | Di Wu et.al. | 2505.22050 | null |
| 2025-05-28 | VulBinLLM: LLM-powered Vulnerability Detection for Stripped Binaries | Nasir Hussain et.al. | 2505.22010 | null |
| 2025-05-28 | Efficiently Enhancing General Agents With Hierarchical-categorical Memory | Changze Qiao et.al. | 2505.22006 | null |
| 2025-05-28 | Reward-Independent Messaging for Decentralized Multi-Agent Reinforcement Learning | Naoto Yoshida et.al. | 2505.21985 | null |
| 2025-05-28 | Pearl: A Multimodal Culturally-Aware Arabic Instruction Dataset | Fakhraddin Alwajih et.al. | 2505.21979 | null |
| 2025-05-29 | DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation | Tianjun Gu et.al. | 2505.21969 | link |
| 2025-05-28 | MapStory: LLM-Powered Text-Driven Map Animation Prototyping with Human-in-the-Loop Editing | Aditya Gunturu et.al. | 2505.21966 | null |
| 2025-05-28 | UI-Evol: Automatic Knowledge Evolving for Computer Use Agents | Ziyun Zhang et.al. | 2505.21964 | null |
| 2025-05-28 | LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents | Taro Yano et.al. | 2505.21963 | null |
| 2025-05-28 | Properties of zero-determinant strategies in multichannel games | Masahiko Ueda et.al. | 2505.21952 | null |
| 2025-06-01 | RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments | Zeyi Liao et.al. | 2505.21936 | link |
| 2025-05-28 | Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference | Yue Zhu et.al. | 2505.21919 | null |
| 2025-05-31 | Modeling and Optimizing User Preferences in AI Copilots: A Comprehensive Survey and Taxonomy | Saleh Afzoon et.al. | 2505.21907 | null |
| 2025-05-28 | Co-Saving: Resource Aware Multi-Agent Collaboration for Software Development | Rennai Qiu et.al. | 2505.21898 | null |
| 2025-05-28 | Incorporating LLMs for Large-Scale Urban Complex Mobility Simulation | Yu-Lun Song et.al. | 2505.21880 | null |
| 2025-06-02 | GETReason: Enhancing Image Context Extraction through Hierarchical Multi-Agent Reasoning | Shikhhar Siingh et.al. | 2505.21863 | null |
| 2025-05-27 | AI Agent Governance: A Field Guide | Jam Kraprayoon et.al. | 2505.21808 | null |
| 2025-05-27 | Events and their Localisation are Relative to a Lab | V. Vilasini et.al. | 2505.21797 | null |
| 2025-05-27 | Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation | Tharindu Kumarage et.al. | 2505.21784 | null |
| 2025-05-27 | BehaviorSFT: Behavioral Token Conditioning for Clinical Agents Across the Proactivity Spectrum | Yubin Kim et.al. | 2505.21757 | null |
| 2025-05-27 | AI-Supported Platform for System Monitoring and Decision-Making in Nuclear Waste Management with Large Language Models | Dongjune Chang et.al. | 2505.21741 | null |
| 2025-05-27 | Deep Reinforcement Learning Agents are not even close to Human Intelligence | Quentin Delfosse et.al. | 2505.21731 | null |
| 2025-05-27 | On Reconfigurable Bisimulation, with an Application to the Distributed Synthesis Problem | Yehia Abd Alrahman et.al. | 2505.21672 | null |
| 2025-05-27 | Classifying and Clustering Trading Agents | Mateusz Wilinski et.al. | 2505.21662 | link |
| 2025-05-27 | PreGenie: An Agentic Framework for High-quality Visual Presentation Generation | Xiaojie Xu et.al. | 2505.21660 | null |
| 2025-05-27 | Herd Behavior: Investigating Peer Influence in LLM-based Multi-Agent Systems | Young-Min Cho et.al. | 2505.21588 | null |
| 2025-05-27 | AITEE -- Agentic Tutor for Electrical Engineering | Christopher Knievel et.al. | 2505.21582 | link |
| 2025-05-27 | RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving | Huacan Wang et.al. | 2505.21577 | link |
| 2025-05-27 | ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools | Zhucong Li et.al. | 2505.21569 | null |
| 2025-05-26 | Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design Framework | Julien Soulé et.al. | 2505.21559 | null |
| 2025-05-26 | Fermionic operatorial model of a system with competitive and cooperative interactions | M. Gorgone et.al. | 2505.21554 | null |
| 2025-05-27 | Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making | Yihan Wang et.al. | 2505.21503 | null |
| 2025-05-27 | AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery | Haowei Wang et.al. | 2505.21499 | link |
| 2025-05-27 | Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers | Wei Pang et.al. | 2505.21497 | link |
| 2025-05-27 | UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents | Han Xiao et.al. | 2505.21496 | link |
| 2025-05-27 | Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming | Yang Yang et.al. | 2505.21486 | null |
| 2025-05-27 | Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration | Zijun Liu et.al. | 2505.21471 | link |
| 2025-05-27 | Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO | Muzhi Zhu et.al. | 2505.21457 | null |
| 2025-05-27 | Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks | Francesco Cozzi et.al. | 2505.21426 | link |
| 2025-05-27 | GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation | Naizhu Jin et.al. | 2505.21425 | null |
| 2025-05-27 | Autonomous Multi-Modal LLM Agents for Treatment Planning in Focused Ultrasound Ablation Surgery | Lina Zhao et.al. | 2505.21418 | null |
| 2025-05-27 | A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment | Brett Bissey et.al. | 2505.21414 | null |
| 2025-05-27 | MRSD: Multi-Resolution Skill Discovery for HRL Agents | Shashank Sharma et.al. | 2505.21410 | null |
| 2025-05-27 | Breaking co-existence: zealotry vs. nonlinear social impact | Christopher R. Kitching et.al. | 2505.21407 | null |
| 2025-05-27 | AutoJudger: An Agent-Driven Framework for Efficient Benchmarking of MLLMs | Xuanwen Ding et.al. | 2505.21389 | link |
| 2025-05-27 | Distributed equilibrium seeking in aggregative games: linear convergence under singular perturbations lens | Guido Carnevale et.al. | 2505.21386 | null |
| 2025-05-27 | Evaluating LLM Adaptation to Sociodemographic Factors: User Profile vs. Dialogue History | Qishuai Zhong et.al. | 2505.21362 | link |
| 2025-05-28 | PEDANTIC: A Dataset for the Automatic Examination of Definiteness in Patent Claims | Valentin Knappich et.al. | 2505.21342 | null |
| 2025-05-27 | Large Language Models Miss the Multi-Agent Mark | Emanuele La Malfa et.al. | 2505.21298 | null |
| 2025-05-27 | Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework | Saman Marandi et.al. | 2505.21291 | null |
| 2025-05-27 | PACT: A Contract-Theoretic Framework for Pricing Agentic AI Services Powered by Large Language Models | Ya-Ting Yang et.al. | 2505.21286 | null |
| 2025-05-27 | XBOUND: Exploring the Capability Boundaries of Device-Control Agents through Trajectory Tree Exploration | Shaoqing Zhang et.al. | 2505.21279 | null |
| 2025-05-27 | Data-Driven Cellular Mobility Management via Bayesian Optimization and Reinforcement Learning | Mohamed Benzaghta et.al. | 2505.21249 | null |
| 2025-05-27 | Breaking the Performance Ceiling in Complex Reinforcement Learning requires Inference Strategies | Felix Chalumeau et.al. | 2505.21236 | null |
| 2025-05-27 | Quantum AIXI: Universal Intelligence via Quantum Information | Elija Perrier et.al. | 2505.21170 | null |
| 2025-05-27 | GGBond: Growing Graph-Based AI-Agent Society for Socially-Aware Recommender Simulation | Hailin Zhong et.al. | 2505.21154 | null |
| 2025-05-27 | IKMo: Image-Keyframed Motion Generation with Trajectory-Pose Conditioned Motion Diffusion Model | Yang Zhao et.al. | 2505.21146 | null |
| 2025-05-27 | Creativity in LLM-based Multi-Agent Systems: A Survey | Yi-Cheng Lin et.al. | 2505.21116 | null |
| 2025-05-27 | Simulating Ethics: Using LLM Debate Panels to Model Deliberation on Medical Dilemmas | Hazem Zohny et.al. | 2505.21112 | null |
| 2025-05-27 | CXXCrafter: An LLM-Based Agent for Automated C/C++ Open Source Software Building | Zhengmin Yu et.al. | 2505.21069 | null |
| 2025-05-27 | Agent-Environment Alignment via Automated Interface Generation | Kaiming Liu et.al. | 2505.21055 | null |
| 2025-05-27 | RefAV: Towards Planning-Centric Scenario Mining | Cainan Davidson et.al. | 2505.20981 | link |
| 2025-05-27 | Identifying Super Spreaders in Multilayer Networks | Michał Czuba et.al. | 2505.20980 | null |
| 2025-05-28 | Towards Conversational Development Environments: Using Theory-of-Mind and Multi-Agent Architectures for Requirements Refinement | Keheliya Gallaba et.al. | 2505.20973 | null |
| 2025-05-27 | Semantic Communication meets System 2 ML: How Abstraction, Compositionality and Emergent Languages Shape Intelligence | Mehdi Bennis et.al. | 2505.20964 | null |
| 2025-05-27 | Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective | Yang Zhang et.al. | 2505.20922 | link |
| 2025-05-27 | Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | Pingrui Zhang et.al. | 2505.20897 | link |
| 2025-05-27 | Reinforcement Learning-based Sequential Route Recommendation for System-Optimal Traffic Assignment | Leizhen Wang et.al. | 2505.20889 | null |
| 2025-05-27 | MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems | Kai Chen et.al. | 2505.20824 | link |
| 2025-05-27 | MT-Mol:Multi Agent System with Tool-based Reasoning for Molecular Optimization | Hyomin Kim et.al. | 2505.20820 | null |
| 2025-05-27 | Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective | Krishna Singh Rajput et.al. | 2505.20816 | null |
| 2025-05-27 | Can Agents Fix Agent Issues? | Alfin Wijaya Rahardja et.al. | 2505.20749 | null |
| 2025-05-27 | RRO: LLM Agent Optimization Through Rising Reward Trajectories | Zilong Wang et.al. | 2505.20737 | null |
| 2025-05-27 | E2E Process Automation Leveraging Generative AI and IDP-Based Automation Agent: A Case Study on Corporate Expense Processing | Cheonsu Jeong et.al. | 2505.20733 | null |
| 2025-05-27 | SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution | Hanlin Wang et.al. | 2505.20732 | link |
| 2025-05-27 | ManiTaskGen: A Comprehensive Task Generator for Benchmarking and Improving Vision-Language Agents on Embodied Decision-Making | Liu Dai et.al. | 2505.20726 | null |
| 2025-05-27 | A reinforcement learning agent for maintenance of deteriorating systems with increasingly imperfect repairs | Alberto Pliego Marugán et.al. | 2505.20725 | null |
| 2025-05-28 | VLM Can Be a Good Assistant: Enhancing Embodied Visual Tracking with Self-Improving Vision-Language Models | Kui Wu et.al. | 2505.20718 | null |
| 2025-05-27 | Hierarchical Instruction-aware Embodied Visual Tracking | Kui Wu et.al. | 2505.20710 | null |
| 2025-05-27 | Berk-Nash Rationalizability | Ignacio Esponda et.al. | 2505.20708 | null |
| 2025-05-27 | GIFARC: Synthetic Dataset for Leveraging Human-Intuitive Analogies to Elevate AI Reasoning | Woochang Sim et.al. | 2505.20672 | null |
| 2025-05-27 | LLM-Guided Reinforcement Learning: Addressing Training Bottlenecks through Policy Modulation | Heng Tan et.al. | 2505.20671 | null |
| 2025-05-27 | MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning | Zikang Guo et.al. | 2505.20670 | null |
| 2025-05-30 | AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage | Xuanle Zhao et.al. | 2505.20662 | link |
| 2025-05-27 | BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism | Qinzhuo Wu et.al. | 2505.20660 | null |
| 2025-05-27 | An Optimisation Framework for Unsupervised Environment Design | Nathan Monette et.al. | 2505.20659 | null |
| 2025-05-27 | CoderAgent: Simulating Student Behavior for Personalized Programming Learning with Large Language Models | Yi Zhan et.al. | 2505.20642 | null |
| 2025-05-27 | IndustryEQA: Pushing the Frontiers of Embodied Question Answering in Industrial Scenarios | Yifan Li et.al. | 2505.20640 | null |
| 2025-05-27 | Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration | Sibo Xiao et.al. | 2505.20625 | null |
| 2025-05-29 | The challenge of hidden gifts in multi-agent reinforcement learning | Dane Malenfant et.al. | 2505.20579 | null |
| 2025-05-26 | Synergising Hierarchical Data Centers and Power Networks: A Privacy-Preserving Approach | Junhong Liu et.al. | 2505.20575 | null |
| 2025-05-26 | xChemAgents: Agentic AI for Explainable Quantum Chemistry | Can Polat et.al. | 2505.20574 | link |
| 2025-05-26 | Byzantine-Resilient Distributed P2P Energy Trading via Spatial-Temporal Anomaly Detection | Junhong Liu et.al. | 2505.20567 | null |
| 2025-05-26 | Learning a Pessimistic Reward Model in RLHF | Yinglun Xu et.al. | 2505.20556 | null |
| 2025-05-28 | Trade among moral agents with information asymmetries | José Ignacio Rivero-Wildemauwe et.al. | 2505.20551 | null |
| 2025-05-26 | Project Riley: Multimodal Multi-Agent LLM Collaboration with Emotional Reasoning and Voting | Ana Rita Ortigoso et.al. | 2505.20521 | null |
| 2025-05-26 | CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic | Yuxuan Sun et.al. | 2505.20510 | null |
| 2025-05-26 | Reconceptualizing Smart Microscopy: From Data Collection to Knowledge Creation by Multi-Agent Integration | P. S. Kesavan et.al. | 2505.20466 | null |
| 2025-05-26 | OSVI-WM: One-Shot Visual Imitation for Unseen Tasks using World-Model-Guided Trajectory Generation | Raktim Gautam Goswami et.al. | 2505.20425 | null |
| 2025-05-26 | RetroMotion: Retrocausal Motion Forecasting Models are Instructable | Royden Wagner et.al. | 2505.20414 | link |
| 2025-05-26 | SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents | Ibragim Badertdinov et.al. | 2505.20411 | link |
| 2025-05-26 | Algorithmic Control Improves Residential Building Energy and EV Management when PV Capacity is High but Battery Capacity is Low | Lennart Ullner et.al. | 2505.20377 | null |
| 2025-05-26 | VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection | Zeyi Huang et.al. | 2505.20289 | null |
| 2025-05-26 | Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution | Jiahao Qiu et.al. | 2505.20286 | link |
| 2025-05-27 | MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability | Weiqi Wu et.al. | 2505.20285 | link |
| 2025-05-26 | OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction | Haonan Zhang et.al. | 2505.20277 | link |
| 2025-05-26 | Ten Principles of AI Agent Economics | Ke Yang et.al. | 2505.20273 | null |
| 2025-05-26 | syftr: Pareto-Optimal Generative AI | Alexander Conway et.al. | 2505.20266 | link |
| 2025-05-26 | On Path to Multimodal Historical Reasoning: HistBench and HistAgent | Jiahao Qiu et.al. | 2505.20246 | link |
| 2025-05-26 | Shutdownable Agents through POST-Agency | Elliott Thornley et.al. | 2505.20203 | null |
| 2025-05-26 | THiNK: Can Large Language Models Think-aloud? | Yongan Yu et.al. | 2505.20184 | link |
| 2025-05-26 | The Problem of Algorithmic Collisions: Mitigating Unforeseen Risks in a Connected World | Maurice Chiodo et.al. | 2505.20181 | null |
| 2025-05-27 | MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents | Ziming Wei et.al. | 2505.20148 | link |
| 2025-05-26 | Agentic 3D Scene Generation with Spatially Contextualized VLMs | Xinhang Liu et.al. | 2505.20129 | null |
| 2025-05-26 | Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers | Zhengliang Shi et.al. | 2505.20128 | link |
| 2025-05-26 | Agentic AI Process Observability: Discovering Behavioral Variability | Fabiana Fournier et.al. | 2505.20127 | null |
| 2025-05-26 | Agents Require Metacognitive and Strategic Reasoning to Succeed in the Coming Labor Markets | Simpson Zhang et.al. | 2505.20120 | null |
| 2025-05-27 | TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent | Dominik Meier et.al. | 2505.20118 | link |
| 2025-05-26 | MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning | Thang Nguyen et.al. | 2505.20096 | null |
| 2025-05-26 | SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale | Qi Li et.al. | 2505.20094 | null |
| 2025-05-26 | REARANK: Reasoning Re-ranking Agent via Reinforcement Learning | Le Zhang et.al. | 2505.20046 | link |
| 2025-05-26 | Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking | Yihan Chen et.al. | 2505.20023 | null |
| 2025-05-26 | WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback | Minda Hu et.al. | 2505.20013 | null |
| 2025-05-26 | The Many Challenges of Human-Like Agents in Virtual Game Environments | Maciej Świechowski et.al. | 2505.20011 | null |
| 2025-05-26 | Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents | Tao Wu et.al. | 2505.19997 | null |
| 2025-05-26 | The residual maximin share | Uriel Feige et.al. | 2505.19961 | null |
| 2025-05-26 | MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research | Hui Chen et.al. | 2505.19955 | link |
| 2025-05-26 | Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval | Rong-Cheng Tu et.al. | 2505.19952 | null |
| 2025-05-26 | Signed Angle Rigid Graphs for Network Localization and Formation Control | Jinpeng Huang et.al. | 2505.19945 | null |
| 2025-05-26 | Subtle Risks, Critical Failures: A Framework for Diagnosing Physical Safety of LLMs for Embodied Decision Making | Yejin Son et.al. | 2505.19933 | null |
| 2025-05-27 | Evaluating AI cyber capabilities with crowdsourced elicitation | Artem Petrov et.al. | 2505.19915 | null |
| 2025-05-26 | EMAC+: Embodied Multimodal Agent for Collaborative Planning with VLM+LLM | Shuang Ao et.al. | 2505.19905 | null |
| 2025-05-26 | ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows | Qiushi Sun et.al. | 2505.19897 | null |
| 2025-05-26 | Large Language Models as Autonomous Spacecraft Operators in Kerbal Space Program | Alejandro Carrasco et.al. | 2505.19896 | link |
| 2025-05-26 | Deep Active Inference Agents for Delayed and Long-Horizon Environments | Yavar Taheri Yeganeh et.al. | 2505.19867 | link |
| 2025-05-26 | DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning | Leander Diaz-Bone et.al. | 2505.19850 | link |
| 2025-05-26 | Multi-Agent Reinforcement Learning in Cybersecurity: From Fundamentals to Applications | Christoph R. Landolt et.al. | 2505.19837 | null |
| 2025-05-26 | SecVulEval: Benchmarking LLMs for Real-World C/C++ Vulnerability Detection | Md Basim Uddin Ahmed et.al. | 2505.19828 | link |
| 2025-05-26 | Integrating emotional intelligence, memory architecture, and gestures to achieve empathetic humanoid robot interaction in an educational setting | Fuze Sun et.al. | 2505.19803 | null |
| 2025-05-26 | Opinion dynamics for an increasing population of agents. A symmetric continuous agent model | Ioannis Markou et.al. | 2505.19791 | null |
| 2025-05-26 | TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning | Yuhui Chen et.al. | 2505.19769 | null |
| 2025-05-26 | T^2Agent A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search | Xing Cui et.al. | 2505.19768 | null |
| 2025-05-26 | RFTF: Reinforcement Fine-tuning for Embodied Agents with Temporal Feedback | Junyang Shu et.al. | 2505.19767 | null |
| 2025-05-26 | Agentic Predictor: Performance Prediction for Agentic Workflows via Multi-View Encoding | Patara Trirat et.al. | 2505.19764 | link |
| 2025-05-26 | Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning | Zican Hu et.al. | 2505.19761 | link |
| 2025-05-26 | NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering | Ruisheng Cao et.al. | 2505.19754 | null |
| 2025-05-26 | ReChisel: Effective Automatic Chisel Code Generation by LLM with Reflection | Juxin Niu et.al. | 2505.19734 | link |
| 2025-05-26 | Extremum Flow Matching for Offline Goal Conditioned Reinforcement Learning | Quentin Rouxel et.al. | 2505.19717 | null |
| 2025-05-28 | JEDI: Latent End-to-end Diffusion Mitigates Agent-Human Performance Asymmetry in Model-Based Reinforcement Learning | Jing Yu Lim et.al. | 2505.19698 | null |
| 2025-05-26 | Large Language Models for Planning: A Comprehensive and Systematic Survey | Pengfei Cao et.al. | 2505.19683 | link |
| 2025-05-26 | FieldWorkArena: Agentic AI Benchmark for Real Field Work Tasks | Atsunori Moteki et.al. | 2505.19662 | null |
| 2025-05-26 | Select, Read, and Write: A Multi-Agent Framework of Full-Text-based Related Work Generation | Xiaochuan Liu et.al. | 2505.19647 | link |
| 2025-05-26 | Adaptive Episode Length Adjustment for Multi-agent Reinforcement Learning | Byunghyun Yoo et.al. | 2505.19637 | null |
| 2025-05-26 | DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue | Yichun Feng et.al. | 2505.19630 | link |
| 2025-05-28 | AgentRecBench: Benchmarking LLM Agent-based Personalized Recommender Systems | Yu Shang et.al. | 2505.19623 | null |
| 2025-05-26 | Multi-Agent Collaboration via Evolving Orchestration | Yufan Dang et.al. | 2505.19591 | null |
| 2025-05-26 | LLM-Agent-Controller: A Universal Multi-Agent Large Language Model System as a Control Engineer | Rasoul Zahedifar et.al. | 2505.19567 | null |
| 2025-05-26 | AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and Healthcare | Ying Xiao et.al. | 2505.19562 | link |
| 2025-05-26 | Towards Multi-Granularity Memory Association and Selection for Long-Term Conversational Agents | Derong Xu et.al. | 2505.19549 | link |
| 2025-05-26 | DoctorRAG: Medical RAG Fusing Knowledge with Patient Analogy through Textual Gradients | Yuxing Lu et.al. | 2505.19538 | null |
| 2025-05-26 | Fox in the Henhouse: Supply-Chain Backdoor Attacks Against Reinforcement Learning | Shijie Liu et.al. | 2505.19532 | null |
| 2025-05-26 | Benchmarking and Enhancing LLM Agents in Localizing Linux Kernel Bugs | Zhenhao Zhou et.al. | 2505.19489 | link |
| 2025-05-26 | VLMLight: Traffic Signal Control via Vision-Language Meta-Control and Dual-Branch Reasoning | Maonan Wang et.al. | 2505.19486 | null |
| 2025-05-26 | Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs | Hao Kang et.al. | 2505.19481 | link |
| 2025-05-26 | Judging with Many Minds: Do More Perspectives Mean Less Prejudice? | Chiyu Ma et.al. | 2505.19477 | link |
| 2025-05-26 | Improving Recommendation Fairness without Sensitive Attributes Using Multi-Persona LLMs | Haoran Xin et.al. | 2505.19473 | null |
| 2025-05-26 | Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI | Ranjan Sapkota et.al. | 2505.19443 | null |
| 2025-05-26 | Task Memory Engine: Spatial Memory for Robust Multi-Step LLM Agents | Ye Ye et.al. | 2505.19436 | link |
| 2025-05-26 | Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression | Peijie Dong et.al. | 2505.19433 | link |
| 2025-05-26 | Frictional Agent Alignment Framework: Slow Down and Don't Break Things | Abhijnan Nath et.al. | 2505.19428 | link |
| 2025-05-26 | Fusion Intelligence for Digital Twinning AI Data Centers: A Synergistic GenAI-PhyAI Approach | Ruihang Wang et.al. | 2505.19409 | null |
| 2025-05-26 | CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems | Yan Wen et.al. | 2505.19405 | null |
| 2025-05-27 | DiffVLA: Vision-Language Guided Diffusion Planning for Autonomous Driving | Anqing Jiang et.al. | 2505.19381 | null |
| 2025-05-26 | Belief Attribution as Mental Explanation: The Role of Accuracy, Informativity, and Causality | Lance Ying et.al. | 2505.19376 | null |
| 2025-05-27 | Prompting Decision Transformers for Zero-Shot Reach-Avoid Policies | Kevin Li et.al. | 2505.19337 | null |
| 2025-05-25 | What do Blind and Low-Vision People Really Want from Assistive Smart Devices? Comparison of the Literature with a Focus Study | Bhanuka Gamage et.al. | 2505.19325 | null |
| 2025-05-25 | Making Teams and Influencing Agents: Efficiently Coordinating Decision Trees for Interpretable Multi-Agent Reinforcement Learning | Rex Chen et.al. | 2505.19316 | null |
| 2025-05-25 | Retrieval-Augmented Generation for Service Discovery: Chunking Strategies and Benchmarking | Robin D. Pesl et.al. | 2505.19310 | null |
| 2025-05-25 | A Novel Zero-Trust Identity Framework for Agentic AI: Decentralized Authentication and Fine-Grained Access Control | Ken Huang et.al. | 2505.19301 | null |
| 2025-05-25 | A likelihood-based Bayesian inference framework for the calibration of and selection between stochastic velocity-jump models | Arianna Ceccarelli et.al. | 2505.19292 | null |
| 2025-05-25 | A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning | Yuzheng Hu et.al. | 2505.19281 | link |
| 2025-05-25 | A General Theory of Risk Sharing | Vasily Melnikov et.al. | 2505.19276 | null |
| 2025-05-25 | Agentic Information Theory: Ergodicity and Intrinsic Semantics of Information Processes | James P. Crutchfield et.al. | 2505.19275 | null |
| 2025-05-25 | ALRPHFS: Adversarially Learned Risk Patterns with Hierarchical Fast & Slow Reasoning for Robust Agent Defense | Shiyu Xiang et.al. | 2505.19260 | null |
| 2025-05-25 | DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research | João Coelho et.al. | 2505.19253 | null |
| 2025-05-25 | Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees | Sourav Ganguly et.al. | 2505.19238 | null |
| 2025-05-25 | Sensorimotor features of self-awareness in multimodal large language models | Iñaki Dellibarda Varela et.al. | 2505.19237 | null |
| 2025-05-25 | GUARDIAN: Safeguarding LLM Multi-Agent Collaborations with Temporal Graph Modeling | Jialong Zhou et.al. | 2505.19234 | null |
| 2025-05-25 | Numerical Analysis of Damage Evolution in Open Hole CFRP Laminates Modified with Electrospun Self Healing Diels Alder Interleaves | Marianna Chantzi et.al. | 2505.19232 | null |
| 2025-05-25 | Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding | Shiyue Wang et.al. | 2505.19219 | null |
| 2025-05-25 | Omni-Perception: Omnidirectional Collision Avoidance for Legged Locomotion in Dynamic Environments | Zifan Wang et.al. | 2505.19214 | null |
| 2025-05-25 | When Ethics and Payoffs Diverge: LLM Agents in Morally Charged Social Dilemmas | Steffen Backmann et.al. | 2505.19212 | link |
| 2025-05-25 | SpeakStream: Streaming Text-to-Speech with Interleaved Data | Richard He Bai et.al. | 2505.19206 | null |
| 2025-05-25 | OptiMindTune: A Multi-Agent Framework for Intelligent Hyperparameter Optimization | Meher Bhaskar Madiraju et.al. | 2505.19205 | link |
| 2025-05-27 | Structuring the Unstructured: A Multi-Agent System for Extracting and Querying Financial KPIs and Guidance | Chanyeol Choi et.al. | 2505.19197 | null |
| 2025-05-27 | When Two LLMs Debate, Both Think They'll Win | Pradyumna Shyama Prasad et.al. | 2505.19184 | null |
| 2025-05-25 | Investigating Pedagogical Teacher and Student LLM Agents: Genetic Adaptation Meets Retrieval Augmented Generation Across Learning Style | Debdeep Sanyal et.al. | 2505.19173 | null |
| 2025-05-25 | Amplifying Human Creativity and Problem Solving with AI Through Generative Collective Intelligence | Thomas P. Kehler et.al. | 2505.19167 | null |
| 2025-05-25 | The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework | Feiran Liu et.al. | 2505.19139 | null |
| 2025-05-25 | Incentivizing High-Quality Human Annotations with Golden Questions | Shang Liu et.al. | 2505.19134 | null |
| 2025-05-25 | Agentic Visualization: Extracting Agent-based Design Patterns from Visualization Systems | Vaishali Dhanoa et.al. | 2505.19101 | null |
| 2025-05-25 | ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World | Runliang Niu et.al. | 2505.19095 | link |
| 2025-05-25 | A Systematic Classification of Vulnerabilities in MoveEVM Smart Contracts (MWC) | Selçuk Topal et.al. | 2505.19047 | null |
| 2025-05-25 | SANNet: A Semantic-Aware Agentic AI Networking Framework for Multi-Agent Cross-Layer Coordination | Yong Xiao et.al. | 2505.18946 | null |
| 2025-05-25 | MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems | Xuanming Zhang et.al. | 2505.18943 | link |
| 2025-05-24 | Beyond Domain Randomization: Event-Inspired Perception for Visually Robust Adversarial Imitation from Videos | Andrea Ramazzina et.al. | 2505.18899 | link |
| 2025-05-24 | Security Concerns for Large Language Models: A Survey | Miles Q. Li et.al. | 2505.18889 | null |
| 2025-05-24 | Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach | Yuchen Wu et.al. | 2505.18882 | null |
| 2025-05-24 | SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes | Dicong Qiu et.al. | 2505.18881 | null |
| 2025-05-24 | CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions | Kung-Hsiang Huang et.al. | 2505.18878 | link |
| 2025-05-24 | Guided by Guardrails: Control Barrier Functions as Safety Instructors for Robotic Learning | Maeva Guerrier et.al. | 2505.18858 | null |
| 2025-05-24 | Multi-Party Conversational Agents: A Survey | Sagar Sapkota et.al. | 2505.18845 | null |
| 2025-05-24 | Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning | Jinzheng Li et.al. | 2505.18831 | null |
| 2025-05-24 | LiteCUA: Computer as MCP Server for Computer-Use Agent on AIOS | Kai Mei et.al. | 2505.18829 | link |
| 2025-05-24 | Agent-Based Decentralized Energy Management of EV Charging Station with Solar Photovoltaics via Multi-Agent Reinforcement Learning | Jiarong Fan et.al. | 2505.18750 | null |
| 2025-05-27 | Peijie Yu et.al. | 2505.18746 | link | |
| 2025-05-24 | Reward-Driven Interaction: Enhancing Proactive Dialogue Agents through User Satisfaction Prediction | Wei Shen et.al. | 2505.18731 | null |
| 2025-05-24 | AI-Researcher: Autonomous Scientific Innovation | Jiabin Tang et.al. | 2505.18705 | link |
| 2025-05-24 | LLM-QFL: Distilling Large Language Model for Quantum Federated Learning | Dev Gurung et.al. | 2505.18656 | link |
| 2025-05-24 | SEW: Self-Evolving Agentic Workflows for Automated Code Generation | Siwei Liu et.al. | 2505.18646 | link |
| 2025-05-24 | DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation | Zhihao Jia et.al. | 2505.18630 | null |
| 2025-05-24 | A representation theorem for events within lattice structures of state-spaces | Alex A. T. Rathke et.al. | 2505.18615 | null |
| 2025-05-27 | Debate-to-Detect: Reformulating Misinformation Detection as a Real-World Debate with Large Language Models | Chen Han et.al. | 2505.18596 | null |
| 2025-05-24 | MisoDICE: Multi-Agent Imitation from Unlabeled Mixed-Quality Demonstrations | The Viet Bui et.al. | 2505.18595 | null |
| 2025-05-24 | Bayesian Meta-Reinforcement Learning with Laplace Variational Recurrent Networks | Joery A. de Vries et.al. | 2505.18591 | link |
| 2025-05-24 | Removal of Hallucination on Hallucination: Debate-Augmented RAG | Wentao Hu et.al. | 2505.18581 | link |
| 2025-05-24 | MASTER: Multi-Agent Security Through Exploration of Roles and Topological Structures -- A Comprehensive Framework | Yifan Zhu et.al. | 2505.18572 | null |
| 2025-05-24 | Benchmarking Poisoning Attacks against Retrieval-Augmented Generation | Baolei Zhang et.al. | 2505.18543 | null |
| 2025-05-24 | MRGAgents: A Multi-Agent Framework for Improved Medical Report Generation with Med-LVLMs | Pengyu Wang et.al. | 2505.18530 | null |
| 2025-05-24 | Grounding Bodily Awareness in Visual Representations for Efficient Policy Learning | Junlin Wang et.al. | 2505.18487 | link |
| 2025-05-24 | Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services | Guoheng Sun et.al. | 2505.18471 | null |
| 2025-05-27 | A Survey of LLM |
Xuanhe Zhou et.al. | 2505.18458 | link |
| 2025-05-24 | EdgeAgentX: A Novel Framework for Agentic AI at the Edge in Military Communication Networks | Abir Ray et.al. | 2505.18457 | null |
| 2025-05-24 | A numerical demonstration of dynamic stall control | Sarasija Sudharsan et.al. | 2505.18449 | null |
| 2025-05-24 | Finite-Time Global Optimality Convergence in Deep Neural Actor-Critic Methods for Decentralized Multi-Agent Reinforcement Learning | Zhiyao Zhang et.al. | 2505.18433 | null |
| 2025-05-23 | Reinforcement Learning for Ballbot Navigation in Uneven Terrain | Achkan Salehi et.al. | 2505.18417 | link |
| 2025-05-23 | DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding | Yue Jiang et.al. | 2505.18411 | link |
| 2025-05-23 | An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems | Fangqiao Tian et.al. | 2505.18397 | null |
| 2025-05-23 | Dynamic Risk Assessments for Offensive Cybersecurity Agents | Boyi Wei et.al. | 2505.18384 | link |
| 2025-05-23 | Hard Negative Mining for Domain-Specific Retrieval in Enterprise Systems | Hansa Meghwani et.al. | 2505.18366 | null |
| 2025-05-23 | Persona Alchemy: Designing, Evaluating, and Implementing Psychologically-Grounded LLM Agents for Diverse Stakeholder Representation | Sola Kim et.al. | 2505.18351 | null |
| 2025-05-23 | The Cell Must Go On: Agar.io for Continual Reinforcement Learning | Mohamed A. Mohamed et.al. | 2505.18347 | link |
| 2025-05-23 | Diffusion Self-Weighted Guidance for Offline Reinforcement Learning | Augusto Tagle et.al. | 2505.18345 | null |
| 2025-05-23 | CrashAgent: Crash Scenario Generation via Multi-modal Reasoning | Miao Li et.al. | 2505.18341 | null |
| 2025-05-23 | Towards Natural Language Communication for Cooperative Autonomous Driving via Self-Play | Jiaxun Cui et.al. | 2505.18334 | null |
| 2025-05-23 | Single-agent or Multi-agent Systems? Why Not Both? | Mingyan Gao et.al. | 2505.18286 | null |
| 2025-05-23 | Collaborative Memory: Multi-User Memory Sharing in LLM Agents with Dynamic Access Control | Alireza Rezazadeh et.al. | 2505.18279 | null |
| 2025-05-23 | BEDI: A Comprehensive Benchmark for Evaluating Embodied Agents on UAVs | Mingning Guo et.al. | 2505.18229 | link |
| 2025-05-23 | Implementing Agents in JavaScript | Timotheus Kampik et.al. | 2505.18228 | null |
| 2025-05-23 | IDA-Bench: Evaluating LLMs on Interactive Guided Data Analysis | Hanyu Li et.al. | 2505.18223 | link |
| 2025-05-23 | CoMet: Metaphor-Driven Covert Communication for Multi-Agent Language Games | Shuhang Xu et.al. | 2505.18218 | link |
| 2025-05-23 | LA-RCS: LLM-Agent-Based Robot Control System | TaekHyun Park et.al. | 2505.18214 | null |
| 2025-05-23 | Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find | Owen Bianchi et.al. | 2505.18148 | null |
| 2025-05-23 | Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading | Mohamed Swailem et.al. | 2505.18145 | null |
| 2025-05-23 | Gaming Tool Preferences in Agentic LLMs | Kazem Faghih et.al. | 2505.18135 | link |
| 2025-05-23 | ProgRM: Build Better GUI Agents with Progress Rewards | Danyang Zhang et.al. | 2505.18121 | null |
| 2025-05-23 | Facility Location with Public Locations and Private Doubly-Peaked Costs | Richard Cole et.al. | 2505.18114 | null |
| 2025-05-23 | ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework | Lisheng Huang et.al. | 2505.18105 | link |
| 2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098 | null |
| 2025-05-23 | Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding | Xiaoyi Zhang et.al. | 2505.18079 | null |
| 2025-05-23 | Linear Mixture Distributionally Robust Markov Decision Processes | Zhishuai Liu et.al. | 2505.18044 | null |
| 2025-05-27 | Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective | Jintian Shao et.al. | 2505.17997 | null |
| 2025-05-23 | Survival Games: Human-LLM Strategic Showdowns under Severe Resource Scarcity | Zhihong Chen et.al. | 2505.17937 | link |
| 2025-05-23 | Formalizing Embeddedness Failures in Universal Artificial Intelligence | Cole Wyeth et.al. | 2505.17882 | null |
| 2025-05-23 | Best Group Identification in Multi-Objective Bandits | Mohammad Shahverdikondori et.al. | 2505.17869 | null |
| 2025-05-23 | DesignX: Human-Competitive Algorithm Designer for Black-Box Optimization | Hongshu Guo et.al. | 2505.17866 | null |
| 2025-05-23 | Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | Ziwei Zhou et.al. | 2505.17862 | link |
| 2025-05-23 | Superplatforms Have to Attack AI Agents | Jianghao Lin et.al. | 2505.17861 | null |
| 2025-05-23 | Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning | Nicolas Castanet et.al. | 2505.17830 | null |
| 2025-05-23 | Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models | Xuchen Pan et.al. | 2505.17826 | link |
| 2025-05-23 | Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour | Bálint Gyevnár et.al. | 2505.17801 | null |
| 2025-05-23 | DialogXpert: Driving Intelligent and Emotion-Aware Conversations through Online Value-Based Reinforcement Learning with LLM Priors | Tazeek Bin Abdur Rakib et.al. | 2505.17795 | null |
| 2025-05-23 | The Real Barrier to LLM Agent Usability is Agentic ROI | Weiwen Liu et.al. | 2505.17767 | null |
| 2025-05-23 | HRSim: An agent-based simulation platform for high-capacity ride-sharing services | Wang Chen et.al. | 2505.17758 | link |
| 2025-05-23 | Feasible Action Space Reduction for Quantifying Causal Responsibility in Continuous Spatial Interactions | Ashwin George et.al. | 2505.17739 | link |
| 2025-05-23 | Automating Safety Enhancement for LLM-based Agents with Synthetic Risk Scenarios | Xueyang Zhou et.al. | 2505.17735 | null |
| 2025-05-23 | URB -- Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles | Ahmet Onur Akman et.al. | 2505.17734 | null |
| 2025-05-23 | Get Experience from Practice: LLM Agents with Record & Replay | Erhu Feng et.al. | 2505.17716 | null |
| 2025-05-23 | Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek | Xueyang Li et.al. | 2505.17702 | null |
| 2025-05-23 | Star-like thermoresponsive microgels: a new class of soft nanocolloids | Elisa Ballin et.al. | 2505.17700 | null |
| 2025-05-23 | Rethinking Agent Design: From Top-Down Workflows to Bottom-Up Skill Evolution | Jiawei Du et.al. | 2505.17673 | null |
| 2025-05-23 | Simulating Macroeconomic Expectations using LLM Agents | Jianhao Lin et.al. | 2505.17648 | null |
| 2025-05-23 | HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning | Chuhao Zhou et.al. | 2505.17645 | null |
| 2025-05-27 | TransBench: Breaking Barriers for Transferable Graphical User Interface Agents in Dynamic Digital Environments | Yuheng Lu et.al. | 2505.17629 | link |
| 2025-05-23 | CAS-IQA: Teaching Vision-Language Models for Synthetic Angiography Quality Assessment | Bo Wang et.al. | 2505.17619 | null |
| 2025-05-23 | Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments | Qingyu Lu et.al. | 2505.17616 | link |
| 2025-05-23 | Distilling LLM Agent into Small Models with Retrieval and Code Tools | Minki Kang et.al. | 2505.17612 | link |
| 2025-05-23 | Learning Equilibria from Data: Provably Efficient Multi-Agent Imitation Learning | Till Freihaut et.al. | 2505.17610 | null |
| 2025-05-23 | Controlled Agentic Planning & Reasoning for Mechanism Synthesis | João Pedro Gandarela et.al. | 2505.17607 | null |
| 2025-05-23 | AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model | Tijmen de Haan et.al. | 2505.17592 | null |
| 2025-05-23 | USTBench: Benchmarking and Dissecting Spatiotemporal Reasoning of LLMs as Urban Agents | Siqi Lai et.al. | 2505.17572 | null |
| 2025-05-26 | Novobo: Supporting Teachers' Peer Learning of Instructional Gestures by Teaching a Mentee AI-Agent Together | Jiaqi Jiang et.al. | 2505.17557 | null |
| 2025-05-23 | Probe by Gaming: A Game-based Benchmark for Assessing Conceptual Knowledge in LLMs | Shuhang Xu et.al. | 2505.17512 | null |
| 2025-05-23 | Multi-agent Systems for Misinformation Lifecycle : Detection, Correction And Source Identification | Aditya Gautam et.al. | 2505.17511 | null |
| 2025-05-23 | The Discovery Engine: A Framework for AI-Driven Synthesis and Navigation of Scientific Knowledge Landscapes | Vladimir Baulin et.al. | 2505.17500 | null |
| 2025-05-23 | PD |
Dezheng Bao et.al. | 2505.17492 | null |
| 2025-05-23 | MARCO: Meta-Reflection with Cross-Referencing for Code Reasoning | Yusheng Zhao et.al. | 2505.17481 | null |
| 2025-05-23 | Hydra: Structured Cross-Source Enhanced Large Language Model Reasoning | Xingyu Tan et.al. | 2505.17464 | null |
| 2025-05-23 | LLM-BSCVM: An LLM-Based Blockchain Smart Contract Vulnerability Management Framework | Yanli Jin et.al. | 2505.17416 | link |
| 2025-05-23 | Emergence of Anti-chemotactic Flocking in Active Biomimetic Colloids | Joseph D. Lopes et.al. | 2505.17394 | null |
| 2025-05-23 | Curriculum Guided Reinforcement Learning for Efficient Multi Hop Retrieval Augmented Generation | Yuelyu Ji et.al. | 2505.17391 | null |
| 2025-05-23 | Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition | Zichen Wang et.al. | 2505.17379 | null |
| 2025-05-22 | A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety | Ankita Kushwaha et.al. | 2505.17342 | null |
| 2025-05-22 | Partner Modelling Emerges in Recurrent Agents (But Only When It Matters) | Ruaridh Mon-Williams et.al. | 2505.17323 | null |
| 2025-05-22 | Control of Renewable Energy Communities using AI and Real-World Data | Tiago Fonseca et.al. | 2505.17321 | null |
| 2025-05-22 | Search Wisely: Mitigating Sub-optimal Agentic Searches By Reducing Uncertainty | Peilin Wu et.al. | 2505.17281 | null |
| 2025-05-22 | ConvoyNext: A Scalable Testbed Platform for Cooperative Autonomous Vehicle Systems | Hossein Maghsoumi et.al. | 2505.17275 | link |
| 2025-05-22 | Navigating Polytopes with Safety: A Control Barrier Function Approach | Tamas G. Molnar et.al. | 2505.17270 | link |
| 2025-05-22 | Backdoors in DRL: Four Environments Focusing on In-distribution Triggers | Chace Ashcraft et.al. | 2505.17248 | null |
| 2025-05-22 | Personalizing Student-Agent Interactions Using Log-Contextualized Retrieval Augmented Generation (RAG) | Clayton Cohn et.al. | 2505.17238 | null |
| 2025-05-22 | ExeSQL: Self-Taught Text-to-SQL Models with Execution-Driven Bootstrapping for SQL Dialects | Jipeng Zhang et.al. | 2505.17231 | null |
| 2025-05-22 | RetroChat: Designing for the Preservation of Past Digital Experiences | Suifang Zhou et.al. | 2505.17208 | null |
| 2025-05-22 | LengthLogD: A Length-Stratified Ensemble Framework for Enhanced Peptide Lipophilicity Prediction via Multi-Scale Feature Integration | Shuang Wu et.al. | 2505.17198 | null |
| 2025-05-22 | Can Large Language Models Design Biological Weapons? Evaluating Moremi Bio | Gertrude Hattoh et.al. | 2505.17154 | null |
| 2025-05-22 | LLM-Powered Agents for Navigating Venice's Historical Cadastre | Tristan Karch et.al. | 2505.17148 | null |
| 2025-05-22 | RAP: Runtime-Adaptive Pruning for LLM Inference | Huanrong Liu et.al. | 2505.17138 | null |
| 2025-05-21 | Swarm Intelligence Enhanced Reasoning: A Density-Driven Framework for LLM-Based Multi-Agent Optimization | Ying Zhu et.al. | 2505.17115 | null |
| 2025-05-21 | CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution | Minghao Shao et.al. | 2505.17107 | link |
| 2025-05-21 | P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark | Tao Sun et.al. | 2505.17104 | link |
| 2025-05-20 | Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization | Yihong Wu et.al. | 2505.17086 | null |
| 2025-05-22 | SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding | Haoning Wu et.al. | 2505.17012 | link |
| 2025-05-22 | X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs | Rui Ye et.al. | 2505.16997 | link |
| 2025-05-22 | MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems | Rui Ye et.al. | 2505.16988 | link |
| 2025-05-22 | T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning | Amartya Chakraborty et.al. | 2505.16986 | null |
| 2025-05-22 | Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | Adib Bazgir et.al. | 2505.16982 | null |
| 2025-05-22 | Know the Ropes: A Heuristic Strategy for LLM-based Multi-Agent System Design | Zhenkun Li et.al. | 2505.16979 | null |
| 2025-05-22 | SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | Yaxin Du et.al. | 2505.16975 | link |
| 2025-05-22 | Modeling Inequality in Complex Networks of Strategic Agents using Iterative Game-Theoretic Transactions | Mayank Kejriwal et.al. | 2505.16966 | null |
| 2025-05-22 | Cracking Aegis: An Adversarial LLM-based Game for Raising Awareness of Vulnerabilities in Privacy Protection | Jiaying Fu et.al. | 2505.16954 | null |
| 2025-05-22 | A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization | Shengyu Feng et.al. | 2505.16952 | null |
| 2025-05-22 | AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios | Yunjia Qi et.al. | 2505.16944 | link |
| 2025-05-25 | NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification | NovelSeek Team et.al. | 2505.16938 | link |
| 2025-05-22 | Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning | Bosung Kim et.al. | 2505.16928 | null |
| 2025-05-22 | Risk-Averse Reinforcement Learning with Itakura-Saito Loss | Igor Udovichenko et.al. | 2505.16925 | null |
| 2025-05-22 | RealEngine: Simulating Autonomous Driving in Realistic Context | Junzhe Jiang et.al. | 2505.16902 | link |
| 2025-05-22 | Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks | Hongyuan Tao et.al. | 2505.16901 | null |
| 2025-05-22 | Identifying, Evaluating, and Mitigating Risks of AI Thought Partnerships | Kerem Oktar et.al. | 2505.16899 | null |
| 2025-05-22 | Hydrogen peroxide electrogeneration from O2 electroreduction: a review focusing on carbon electrocatalysts and environmental applications | Aline B. Trench et.al. | 2505.16887 | null |
| 2025-05-22 | Strategically Linked Decisions in Long-Term Planning and Reinforcement Learning | Alihan Hüyük et.al. | 2505.16833 | null |
| 2025-05-22 | From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization | Haonian Ji et.al. | 2505.16832 | link |
| 2025-05-22 | GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent | Bin Xie et.al. | 2505.16827 | link |
| 2025-05-22 | LLM-Based Emulation of the Radio Resource Control Layer: Towards AI-Native RAN Protocols | Ziming liu et.al. | 2505.16821 | null |
| 2025-05-22 | A modular framework for automated evaluation of procedural content generation in serious games with deep reinforcement learning agents | Eleftherios Kalafatis et.al. | 2505.16801 | null |
| 2025-05-22 | Fuzzy Information Evolution with Three-Way Decision in Social Network Group Decision-Making | Qianlei Jia et.al. | 2505.16781 | null |
| 2025-05-22 | Sequential Monte Carlo for Policy Optimization in Continuous POMDPs | Hany Abdulsamad et.al. | 2505.16732 | null |
| 2025-05-22 | MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models | Xuanqi Gao et.al. | 2505.16700 | null |
| 2025-05-22 | CoNav: Collaborative Cross-Modal Reasoning for Embodied Navigation | Haihong Hao et.al. | 2505.16663 | link |
| 2025-05-22 | O |
Jianbiao Mei et.al. | 2505.16582 | link |
| 2025-05-22 | How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning | Max Weltevrede et.al. | 2505.16581 | null |
| 2025-05-22 | Large Language Model-Empowered Interactive Load Forecasting | Yu Zuo et.al. | 2505.16577 | null |
| 2025-05-22 | EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions | Spencer Hong et.al. | 2505.16576 | link |
| 2025-05-22 | Is Your LLM-Based Multi-Agent a Reliable Real-World Planner? Exploring Fraud Detection in Travel Planning | Junchi Yao et.al. | 2505.16557 | null |
| 2025-05-22 | Psychology-driven LLM Agents for Explainable Panic Prediction on Social Media during Sudden Disaster Events | Mengzhu Liu et.al. | 2505.16455 | null |
| 2025-05-22 | Beyond Static Testbeds: An Interaction-Centric Agent Simulation Platform for Dynamic Recommender Systems | Song Jin et.al. | 2505.16429 | null |
| 2025-05-22 | Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach | Xiaoran Yin et.al. | 2505.16422 | null |
| 2025-05-22 | WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning | Zhepei Wei et.al. | 2505.16421 | link |
| 2025-05-22 | VL-SAFE: Vision-Language Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving | Yansong Qu et.al. | 2505.16377 | null |
| 2025-05-22 | Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance | Taeyoon Kwon et.al. | 2505.16348 | null |
| 2025-05-22 | Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions | Marc Brooks et.al. | 2505.16311 | null |
| 2025-05-22 | No Black Boxes: Interpretable and Interactable Predictive Healthcare with Knowledge-Enhanced Agentic Causal Discovery | Xiaoxue Han et.al. | 2505.16288 | null |
| 2025-05-22 | ARPO:End-to-End Policy Optimization for GUI Agents with Experience Replay | Fanbin Lu et.al. | 2505.16282 | link |
| 2025-05-22 | HiMATE: A Hierarchical Multi-Agent Framework for Machine Translation Evaluation | Shijie Zhang et.al. | 2505.16281 | null |
| 2025-05-22 | Spatio-temporal agent-based modelling of malaria | Camelia R. Walker et.al. | 2505.16240 | link |
| 2025-05-22 | CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering | Yuren Mao et.al. | 2505.16229 | null |
| 2025-05-22 | Velocity Completion Task and Method for Event-based Player Positional Data in Soccer | Rikuhei Umemoto et.al. | 2505.16199 | null |
| 2025-05-22 | Fairness and Efficiency in Human-Agent Teams: An Iterative Algorithm Design Approach | Mai Lee Chang et.al. | 2505.16171 | null |
| 2025-05-22 | LLM-Powered AI Agent Systems and Their Applications in Industry | Guannan Liang et.al. | 2505.16120 | null |
| 2025-05-22 | BioDSA-1K: Benchmarking Data Science Agents for Biomedical Research | Zifeng Wang et.al. | 2505.16100 | null |
| 2025-05-24 | Reinforcement Learning for Stock Transactions | Ziyi Zhou et.al. | 2505.16099 | null |
| 2025-05-22 | Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development | Ming Shen et.al. | 2505.16086 | null |
| 2025-05-21 | A Distributed Local Energy Market Clearing Framework Using a Two-Loop ADMM Method | Milad Kabirifar et.al. | 2505.16070 | null |
| 2025-05-21 | How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior | Zidi Xiong et.al. | 2505.16067 | link |
| 2025-05-21 | Bayesian adaptive randomization in the I-SPY2.2 sequential multiple assignment randomized trial | Peter Norwood et.al. | 2505.16047 | null |
| 2025-05-21 | Towards improved pest management of the soybean aphid | Urvashi Verma et.al. | 2505.16013 | null |
| 2025-05-21 | Position: Agentic Systems Constitute a Key Component of Next-Generation Intelligent Image Processing | Jinjin Gu et.al. | 2505.16007 | null |
| 2025-05-21 | MAPS: A Multilingual Benchmark for Global Agent Performance and Security | Omer Hofman et.al. | 2505.15935 | null |
| 2025-05-21 | ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation | Tony Montes et.al. | 2505.15928 | link |
| 2025-05-21 | Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition | Dong Won Lee et.al. | 2505.15922 | null |
| 2025-05-21 | Text-to-Pipeline: Bridging Natural Language and Data Preparation Pipelines | Yuhang Ge et.al. | 2505.15874 | null |
| 2025-05-23 | InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation | Yunjia Xi et.al. | 2505.15872 | null |
| 2025-05-21 | AutoData: A Multi-Agent System for Open Web Data Collection | Tianyi Ma et.al. | 2505.15859 | link |
| 2025-05-21 | Large Language Model-Powered Agent for C to Rust Code Translation | HoHyun Sim et.al. | 2505.15858 | null |
| 2025-05-21 | Simulating Prosocial Behavior and Social Contagion in LLM Agents under Institutional Interventions | Yujia Zhou et.al. | 2505.15857 | link |
| 2025-05-22 | GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents | Yuqi Zhou et.al. | 2505.15810 | link |
| 2025-05-21 | The Agentic Economy | David M. Rothschild et.al. | 2505.15799 | null |
| 2025-05-22 | HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving | Zhiwen Chen et.al. | 2505.15793 | null |
| 2025-05-21 | Solving General-Utility Markov Decision Processes in the Single-Trial Regime with Online Planning | Pedro P. Santos et.al. | 2505.15782 | null |
| 2025-05-21 | Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses | Xiaoxue Yang et.al. | 2505.15738 | link |
| 2025-05-21 | DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning | Gaurav Srivastava et.al. | 2505.15734 | null |
| 2025-05-21 | Quantum Dots as Functional Nanosystems for Enhanced Biomedical Applications | Pronama Biswas et.al. | 2505.15705 | null |
| 2025-05-21 | HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning | Xiaodong Mei et.al. | 2505.15703 | null |
| 2025-05-21 | Average Reward Reinforcement Learning for Omega-Regular and Mean-Payoff Objectives | Milad Kazemi et.al. | 2505.15693 | null |
| 2025-05-21 | From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems | Xiuchao Sui et.al. | 2505.15685 | link |
| 2025-05-21 | Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model | Ke Hu et.al. | 2505.15670 | null |
| 2025-05-21 | Improved power methods for computing eigenvalues of dual quaternion Hermitian matrices | Yongjun Chen et.al. | 2505.15584 | null |
| 2025-05-21 | The equilibrium price of bubble assets | Charles Bertucci et.al. | 2505.15578 | null |
| 2025-05-21 | Temporal Spectrum Cartography in Low-Altitude Economy Networks: A Generative AI Framework with Multi-Agent Learning | Changyuan Zhao et.al. | 2505.15571 | null |
| 2025-05-21 | Riemannian EXTRA: Communication-efficient decentralized optimization over compact submanifolds with data heterogeneity | Jiayuan Wu et.al. | 2505.15537 | null |
| 2025-05-21 | Collaborative Problem-Solving in an Optimization Game | Isidora Jeknic et.al. | 2505.15490 | link |
| 2025-05-21 | Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL | Xintong Zhang et.al. | 2505.15436 | null |
| 2025-05-21 | X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic System | Peng Wang et.al. | 2505.15372 | link |
| 2025-05-21 | Multiple Weaks Win Single Strong: Large Language Models Ensemble Weak Reinforcement Learning Agents into a Supreme One | Yiwen Song et.al. | 2505.15306 | null |
| 2025-05-22 | AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving | Kangan Qian et.al. | 2505.15298 | null |
| 2025-05-21 | Agent-based Liquidity Risk Modelling for Financial Markets | Perukrishnen Vytelingum et.al. | 2505.15296 | null |
| 2025-05-21 | LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models | Qianyue Hao et.al. | 2505.15293 | null |
| 2025-05-21 | Web-Shepherd: Advancing PRMs for Reinforcing Web Agents | Hyungjoo Chae et.al. | 2505.15277 | link |
| 2025-05-21 | AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection | Jiatao Li et.al. | 2505.15261 | null |
| 2025-05-24 | ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search | Hyunseok Lee et.al. | 2505.15259 | null |
| 2025-05-21 | Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets | Idriss Malek et.al. | 2505.15251 | null |
| 2025-05-21 | BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems | Andy K. Zhang et.al. | 2505.15216 | null |
| 2025-05-21 | ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection | Jeonghye Kim et.al. | 2505.15182 | null |
| 2025-05-21 | R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint Optimization | Yuante Li et.al. | 2505.15155 | link |
| 2025-05-21 | lmgame-Bench: How Good are LLMs at Playing Games? | Lanxiang Hu et.al. | 2505.15146 | link |
| 2025-05-21 | Multicrossmodal Automated Agent for Integrating Diverse Materials Science Data | Adib Bazgir et.al. | 2505.15132 | null |
| 2025-05-21 | On Discounted Infinite-Time Mean Field Games | Zeyu Yang et.al. | 2505.15131 | null |
| 2025-05-21 | An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents | Bowen Jin et.al. | 2505.15117 | link |
| 2025-05-21 | A Risk Taxonomy for Evaluating AI-Powered Psychotherapy Agents | Ian Steenstra et.al. | 2505.15108 | null |
| 2025-05-21 | StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization | Ziliang Wang et.al. | 2505.15107 | null |
| 2025-05-21 | Nek Minit: Harnessing Pragmatic Metacognitive Prompting for Explainable Sarcasm Detection of Australian and Indian English | Ishmanbir Singh et.al. | 2505.15095 | null |
| 2025-05-21 | Agentic Feature Augmentation: Unifying Selection and Generation with Teaming, Planning, and Memories | Nanxu Gong et.al. | 2505.15076 | null |
| 2025-05-21 | ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges | Cheng Qian et.al. | 2505.15068 | link |
| 2025-05-21 | UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking | Sarfraz Ahmad et.al. | 2505.15063 | link |
| 2025-05-21 | AsynFusion: Towards Asynchronous Latent Consistency Models for Decoupled Whole-Body Audio-Driven Avatars | Tianbao Zhang et.al. | 2505.15058 | null |
| 2025-05-21 | PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration | Yingming Pu et.al. | 2505.15047 | link |
| 2025-05-21 | Toward Task Capable Active Matter: Learning to Avoid Clogging in Confined Collectives via Collisions | Kehinde O. Aina et.al. | 2505.15033 | null |
| 2025-05-21 | COSMIC: Enabling Full-Stack Co-Design and Optimization of Distributed Machine Learning Systems | Aditi Raju et.al. | 2505.15020 | null |
| 2025-05-21 | HAVA: Hybrid Approach to Value-Alignment through Reward Weighing for Reinforcement Learning | Kryspin Varys et.al. | 2505.15011 | link |
| 2025-05-21 | Meta-Design Matters: A Self-Design Multi-Agent System | Zixuan Ke et.al. | 2505.14996 | null |
| 2025-05-20 | JARVIS: A Multi-Agent Code Assistant for High-Quality EDA Script Generation | Ghasem Pasandi et.al. | 2505.14978 | null |
| 2025-05-20 | MedBrowseComp: Benchmarking Medical Deep Research and Computer Use | Shan Chen et.al. | 2505.14963 | null |
| 2025-05-20 | Characteristic scales and adaptation in higher-order contagions | Giulio Burgio et.al. | 2505.14930 | link |
| 2025-05-20 | Think, Reflect, Create: Metacognitive Learning for Zero-Shot Robotic Planning with LLMs | Wenjie Lin et.al. | 2505.14899 | null |
| 2025-05-20 | On the Day They Experience: Awakening Self-Sovereign Experiential AI Agents | Botao Amber Hu et.al. | 2505.14893 | null |
| 2025-05-20 | Strategic Planning and Rationalizing on Trees Make LLMs Better Debaters | Danqing Wang et.al. | 2505.14886 | null |
| 2025-05-20 | Unremarkable to Remarkable AI Agent: Exploring Boundaries of Agent Intervention for Adults With and Without Cognitive Impairment | Mai Lee Chang et.al. | 2505.14872 | null |
| 2025-05-20 | MAATS: A Multi-Agent Automated Translation System Based on MQM Evaluation | Xi Wang et.al. | 2505.14848 | link |
| 2025-05-20 | Beyond Symmetry in Repeated Games with Restarts | Henry Fleischmann et.al. | 2505.14847 | null |
| 2025-05-20 | Cooperative Bargaining Games Without Utilities: Mediated Solutions from Direction Oracles | Kushagra Gupta et.al. | 2505.14817 | link |
| 2025-05-20 | Integrating Field of View in Human-Aware Collaborative Planning | Ya-Chuan Hsu et.al. | 2505.14805 | null |
| 2025-05-20 | Chih-Yu Chang et.al. | 2505.14756 | link | |
| 2025-05-20 | R&D-Agent: Automating Data-Driven AI Solution Building Through LLM-Powered Automated Research, Development, and Evolution | Xu Yang et.al. | 2505.14738 | link |
| 2025-05-20 | The Evolution of Alpha in Finance Harnessing Human Insight and LLM Agents | Mohammad Rubyet Islam et.al. | 2505.14727 | null |
| 2025-05-20 | NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search | Sunhao Dai et.al. | 2505.14680 | null |
| 2025-05-20 | ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions | Bufang Yang et.al. | 2505.14668 | null |
| 2025-05-20 | AI Agents in the Electricity Market Game with Cryptocurrency Transactions: A Post-Terminator Analysis | Microsoft Copilot et.al. | 2505.14612 | null |
| 2025-05-20 | Agent Context Protocols Enhance Collective Inference | Devansh Bhardwaj et.al. | 2505.14569 | null |
| 2025-05-20 | Multi-agent Reinforcement Learning vs. Fixed-Time Control for Traffic Signal Optimization: A Simulation Study | Saahil Mahato et.al. | 2505.14544 | link |
| 2025-05-20 | A Logic of General Attention Using Edge-Conditioned Event Models (Extended Version) | Gaia Belardinelli et.al. | 2505.14539 | null |
| 2025-05-20 | Energy-Efficient Deep Reinforcement Learning with Spiking Transformers | Mohammad Irfan Uddin et.al. | 2505.14533 | null |
| 2025-05-22 | BACON: A fully explainable AI model with graded logic for decision making problems | Haishi Bai et.al. | 2505.14510 | null |
| 2025-05-20 | Design and Evaluation of a Microservices Cloud Framework for Online Travel Platforms | Biman Barua et.al. | 2505.14508 | null |
| 2025-05-20 | Security of Distributed Gradient Descent Against Byzantine Agents | Sribalaji C. Anand et.al. | 2505.14473 | null |
| 2025-05-20 | Interpretable Reinforcement Learning for Load Balancing using Kolmogorov-Arnold Networks | Kamal Singh et.al. | 2505.14459 | null |
| 2025-05-21 | Robustness Evaluation of Graph-based News Detection Using Network Structural Information | Xianghua Zeng et.al. | 2505.14453 | null |
| 2025-05-23 | Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-Powered Mobile GUI Agents | Pengzhou Cheng et.al. | 2505.14418 | null |
| 2025-05-20 | Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation | Peter Baile Chen et.al. | 2505.14398 | null |
| 2025-05-20 | Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds | Gaël Gendron et.al. | 2505.14396 | link |
| 2025-05-20 | Information-optimal measurement: From fixed sampling protocols to adaptive spectroscopy | J. Schroeder et.al. | 2505.14364 | null |
| 2025-05-20 | DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning | Ziwei Zheng et.al. | 2505.14362 | link |
| 2025-05-20 | PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech Dialogs | Sho Inoue et.al. | 2505.14356 | link |
| 2025-05-20 | Empowering LLMs in Task-Oriented Dialogues: A Domain-Independent Multi-Agent Framework and Fine-Tuning Strategy | Zihao Feng et.al. | 2505.14299 | null |
| 2025-05-20 | EVA: Red-Teaming GUI Agents via Evolving Indirect Prompt Injection | Yijie Lu et.al. | 2505.14289 | null |
| 2025-05-20 | Visual Agentic Reinforcement Fine-Tuning | Ziyu Liu et.al. | 2505.14246 | link |
| 2025-05-20 | Safety Devolution in AI Agents | Cheng Yu et.al. | 2505.14215 | null |
| 2025-05-20 | Embedded Mean Field Reinforcement Learning for Perimeter-defense Game | Li Wang et.al. | 2505.14209 | null |
| 2025-05-20 | DSMentor: Enhancing Data Science Agents with Curriculum Learning and Online Knowledge Accumulation | He Wang et.al. | 2505.14163 | null |
| 2025-05-20 | MM-Agent: LLM as Agents for Real-world Mathematical Modeling Problem | Fan Liu et.al. | 2505.14148 | link |
| 2025-05-20 | s3: You Don't Need That Much Data to Train a Search Agent via RL | Pengcheng Jiang et.al. | 2505.14146 | link |
| 2025-05-20 | Building a Stable Planner: An Extended Finite State Machine Based Planning Module for Mobile GUI Agent | Fanglin Mo et.al. | 2505.14141 | null |
| 2025-05-20 | MAS-KCL: Knowledge component graph structure learning with large language model-based agentic workflow | Yuan-Hao Jiang et.al. | 2505.14126 | null |
| 2025-05-20 | A novel approach to process TRISO nuclear fuel using plasma-aided chemistry | Tobias Chemnitz et.al. | 2505.14108 | null |
| 2025-05-20 | Beyond Chains: Bridging Large Language Models and Knowledge Bases in Complex Question Answering | Yihua Zhu et.al. | 2505.14099 | null |
| 2025-05-20 | Personalized and Resilient Distributed Learning Through Opinion Dynamics | Luca Ballotta et.al. | 2505.14081 | null |
| 2025-05-22 | BAR: A Backward Reasoning based Agent for Complex Minecraft Tasks | Weihong Du et.al. | 2505.14079 | link |
| 2025-05-22 | Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning | Wenlin Zhang et.al. | 2505.14069 | link |
| 2025-05-20 | Exploring Temporal Graphs with Frequent and Regular Edges | Duncan Adamson et.al. | 2505.14046 | null |
| 2025-05-20 | Divide by Question, Conquer by Agent: SPLIT-RAG with Question-Driven Graph Partitioning | Ruiyi Yang et.al. | 2505.13994 | null |
| 2025-05-20 | CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring | Jiamin Su et.al. | 2505.13965 | null |
| 2025-05-20 | MultiDrive: A Co-Simulation Framework Bridging 2D and 3D Driving Simulation for AV Software Validation | Marc Kaufeld et.al. | 2505.13959 | link |
| 2025-05-20 | Memory-Centric Embodied Question Answer | Mingliang Zhai et.al. | 2505.13948 | null |
| 2025-05-20 | MLZero: A Multi-Agent System for End-to-end Machine Learning Automation | Haoyang Fang et.al. | 2505.13941 | link |
| 2025-05-20 | DrugPilot: LLM-based Parameterized Reasoning Agent for Drug Discovery | Kun Li et.al. | 2505.13940 | link |
| 2025-05-21 | CLEVER: A Curated Benchmark for Formally Verified Code Generation | Amitayush Thakur et.al. | 2505.13938 | link |
| 2025-05-20 | Efficient Agent Training for Computer Use | Yanheng He et.al. | 2505.13909 | link |
| 2025-05-21 | Mobile-Agent-V: A Video-Guided Approach for Effortless and Efficient Operational Knowledge Injection in Mobile Automation | Junyang Wang et.al. | 2505.13887 | null |
| 2025-05-22 | PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks | Guobin Shen et.al. | 2505.13862 | link |
| 2025-05-20 | A Challenge to Build Neuro-Symbolic Video Agents | Sahil Shah et.al. | 2505.13851 | link |
| 2025-05-20 | Toward Real-World Cooperative and Competitive Soccer with Quadrupedal Robot Teams | Zhi Su et.al. | 2505.13834 | null |
| 2025-05-20 | Online Resource Sharing: Better Robust Guarantees via Randomized Strategies | David X. Lin et.al. | 2505.13824 | link |
| 2025-05-20 | Structured Agent Distillation for Large Language Model | Jun Liu et.al. | 2505.13820 | null |
| 2025-05-20 | RAG/LLM Augmented Switching Driven Polymorphic Metaheuristic Framework | Faramarz Safi Esfahani et.al. | 2505.13808 | null |
| 2025-05-19 | Model Cards for AI Teammates: Comparing Human-AI Team Familiarization Methods for High-Stakes Environments | Ryan Bowers et.al. | 2505.13773 | link |
| 2025-05-19 | Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis | Ruiquan Huang et.al. | 2505.13768 | null |
| 2025-05-21 | Simulation Agent: A Framework for Integrating Simulation and Large Language Models for Enhanced Decision-Making | Jacob Kleiman et.al. | 2505.13761 | null |
| 2025-05-19 | Benchmarking MOEAs for solving continuous multi-objective RL problems | Carlos Hernández et.al. | 2505.13726 | link |
| 2025-05-19 | Revenue-Optimal Efficient Mechanism Design with General Type Spaces | Siddharth Prasad et.al. | 2505.13687 | null |
| 2025-05-19 | MAFA: A multi-agent framework for annotation | Mahmood Hegazy et.al. | 2505.13668 | null |
| 2025-05-19 | Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents | Karina Zainullina et.al. | 2505.13652 | null |
| 2025-05-19 | Non-Obvious Manipulability in Additively Separable and Fractional Hedonic Games | Diodato Ferraioli et.al. | 2505.13642 | null |
| 2025-05-19 | Incentivizing Truthful Language Models via Peer Elicitation Games | Baiting Chen et.al. | 2505.13636 | link |
| 2025-05-19 | Q |
Yousouf Taghzouti et.al. | 2505.13572 | null |
| 2025-05-19 | Learning Dynamics of RNNs in Closed-Loop Environments | Yoav Ger et.al. | 2505.13567 | link |
| 2025-05-19 | Counter-Inferential Behavior in Natural and Artificial Cognitive Systems | Serge Dolgikh et.al. | 2505.13551 | null |
| 2025-05-19 | Prompt Stability Matters: Evaluating and Optimizing Auto-Generated Prompt in General-Purpose Systems | Ke Chen et.al. | 2505.13546 | null |
| 2025-05-19 | Origin-Destination Pattern Effects on Large-Scale Mixed Traffic Control via Multi-Agent Reinforcement Learning | Muyang Fan et.al. | 2505.13543 | link |
| 2025-05-18 | LLM-Based User Simulation for Low-Knowledge Shilling Attacks on Recommender Systems | Shengkang Gu et.al. | 2505.13528 | null |
| 2025-05-18 | ACPs: Agent Collaboration Protocols for the Internet of Agents | Jun Liu et.al. | 2505.13523 | null |
| 2025-05-17 | HALO: Hierarchical Autonomous Logic-Oriented Orchestration for Multi-Agent LLM Systems | Zhipeng Hou et.al. | 2505.13516 | link |
| 2025-05-16 | Can AI Freelancers Compete? Benchmarking Earnings, Reliability, and Task Success at Scale | David Noever et.al. | 2505.13511 | null |
| 2025-05-16 | An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents | Ayesha Amjad et.al. | 2505.13504 | null |
| 2025-05-19 | G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | Liang Chen et.al. | 2505.13426 | link |
| 2025-05-20 | A Dataless Reinforcement Learning Approach to Rounding Hyperplane Optimization for Max-Cut | Gabriel Malikal et.al. | 2505.13405 | null |
| 2025-05-19 | Robin: A multi-agent system for automating scientific discovery | Ali Essam Ghareeb et.al. | 2505.13400 | null |
| 2025-05-19 | Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges | Hongru Wang et.al. | 2505.13328 | null |
| 2025-05-19 | Synthesis of Communication Policies for Multi-Agent Systems Robust to Communication Restrictions | Saleh Soudijani et.al. | 2505.13311 | null |
| 2025-05-19 | TimeSeriesGym: A Scalable Benchmark for (Time Series) Machine Learning Engineering Agents | Yifu Cai et.al. | 2505.13291 | link |
| 2025-05-19 | Hybrid Voting-Based Task Assignment in Modular Construction Scenarios | Daniel Weiner et.al. | 2505.13278 | null |
| 2025-05-19 | From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery | Tianshi Zheng et.al. | 2505.13259 | link |
| 2025-05-19 | Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability | Jingyi Ren et.al. | 2505.13258 | link |
| 2025-05-19 | Composing Dextrous Grasping and In-hand Manipulation via Scoring with a Reinforcement Learning Critic | Lennart Röstel et.al. | 2505.13253 | null |
| 2025-05-19 | Agentic Publications: An LLM-Driven Framework for Interactive Scientific Publishing, Supplementing Traditional Papers with AI-Powered Knowledge Systems | Roberto Pugliese et.al. | 2505.13246 | null |
| 2025-05-19 | Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis | Tianbao Xie et.al. | 2505.13227 | null |
| 2025-05-19 | Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities | Lili Zhang et.al. | 2505.13195 | null |
| 2025-05-19 | When a Reinforcement Learning Agent Encounters Unknown Unknowns | Juntian Zhu et.al. | 2505.13188 | null |
| 2025-05-19 | Information Science Principles of Machine Learning: A Causal Chain Meta-Framework Based on Formalized Information Mapping | Jianfeng Xu et.al. | 2505.13182 | null |
| 2025-05-19 | Fixing 7,400 Bugs for 1$: Cheap Crash-Site Program Repair | Han Zheng et.al. | 2505.13103 | null |
| 2025-05-19 | The Hidden Dangers of Browsing AI Agents | Mykyta Mudryi et.al. | 2505.13076 | null |
| 2025-05-19 | CAIM: Development and Evaluation of a Cognitive AI Memory Framework for Long-Term Interaction with Intelligent Agents | Rebecca Westhäußer et.al. | 2505.13044 | null |
| 2025-05-19 | Adversarial Reasoning for Repair Based on Inferred Program Intent | He Ye et.al. | 2505.13008 | null |
| 2025-05-20 | From Assistants to Adversaries: Exploring the Security Risks of Mobile LLM Agents | Liangxuan Wu et.al. | 2505.12981 | null |
| 2025-05-19 | Improved Approximation Ratio for Strategyproof Facility Location on a Cycle | Krzysztof Rogowski et.al. | 2505.12943 | null |
| 2025-05-20 | Leveraging LLM Inconsistency to Boost Pass@k Performance | Uri Dalal et.al. | 2505.12938 | null |
| 2025-05-19 | The Traitors: Deception and Trust in Multi-Agent Language Model Simulations | Pedro M. P. Curvo et.al. | 2505.12923 | link |
| 2025-05-19 | PyFCG: Fluid Construction Grammar in Python | Paul Van Eecke et.al. | 2505.12920 | null |
| 2025-05-19 | Power Allocation for Delay Optimization in Device-to-Device Networks: A Graph Reinforcement Learning Approach | Hao Fang et.al. | 2505.12902 | null |
| 2025-05-19 | From Grunts to Grammar: Emergent Language from Cooperative Foraging | Maytus Piriyajitakonkij et.al. | 2505.12872 | null |
| 2025-05-19 | GEM: Gaussian Embedding Modeling for Out-of-Distribution Detection in GUI Agents | Zheng Wu et.al. | 2505.12842 | link |
| 2025-05-19 | Reasoning BO: Enhancing Bayesian Optimization with Long-Context Reasoning Power of LLMs | Zhuo Yang et.al. | 2505.12833 | null |
| 2025-05-19 | Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning | Wei-Chen Liao et.al. | 2505.12811 | null |
| 2025-05-19 | Mixture Policy based Multi-Hop Reasoning over N-tuple Temporal Knowledge Graphs | Zhongni Hou et.al. | 2505.12788 | null |
| 2025-05-19 | Forewarned is Forearmed: A Survey on Large Language Model-based Agents in Autonomous Cyberattacks | Minrui Xu et.al. | 2505.12786 | null |
| 2025-05-19 | Your Offline Policy is Not Trustworthy: Bilevel Reinforcement Learning for Sequential Portfolio Optimization | Haochen Yuan et.al. | 2505.12759 | null |
| 2025-05-19 | Confidence-Regulated Generative Diffusion Models for Reliable AI Agent Migration in Vehicular Metaverses | Yingkai Kang et.al. | 2505.12710 | null |
| 2025-05-19 | PLAICraft: Large-Scale Time-Aligned Vision-Speech-Action Dataset for Embodied AI | Yingchen He et.al. | 2505.12707 | null |
| 2025-05-19 | AutoMat: Enabling Automated Crystal Structure Reconstruction from Microscopy via Agentic Tool Use | Yaotian Yang et.al. | 2505.12650 | link |
| 2025-05-19 | Two out of Three (ToT): using self-consistency to make robust predictions | Jung Hoon Lee et.al. | 2505.12642 | null |
| 2025-05-19 | Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents | Yunseok Jang et.al. | 2505.12632 | null |
| 2025-05-19 | Dual-Agent Reinforcement Learning for Automated Feature Generation | Wanfu Gao et.al. | 2505.12628 | link |
| 2025-05-19 | Lightweight and Effective Preference Construction in PIBT for Large-Scale Multi-Agent Pathfinding | Keisuke Okumura et.al. | 2505.12623 | null |
| 2025-05-19 | HIL: Hybrid Imitation Learning of Diverse Parkour Skills from Videos | Jiashun Wang et.al. | 2505.12619 | null |
| 2025-05-19 | Action-Dependent Optimality-Preserving Reward Shaping | Grant C. Forbes et.al. | 2505.12611 | null |
| 2025-05-19 | The Hamiltonian of Poly-matrix Zero-sum Games | Toshihiro Ota et.al. | 2505.12609 | link |
| 2025-05-19 | Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis | Yifan Hu et.al. | 2505.12597 | link |
| 2025-05-19 | AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection | Tiankai Yang et.al. | 2505.12594 | link |
| 2025-05-18 | A Survey of Attacks on Large Language Models | Wenrui Xu et.al. | 2505.12567 | null |
| 2025-05-18 | ESC-Judge: A Framework for Comparing Emotional Support Conversational Agents | Navid Madani et.al. | 2505.12531 | null |
| 2025-05-18 | InnateCoder: Learning Programmatic Options with Foundation Models | Rubens O. Moraes et.al. | 2505.12508 | link |
| 2025-05-18 | Optimal Task and Motion Planning for Autonomous Systems Using Petri Nets | Zhou He et.al. | 2505.12503 | null |
| 2025-05-18 | ALAS: A Stateful Multi-LLM Agent Framework for Disruption-Aware Planning | Edward Y. Chang et.al. | 2505.12501 | null |
| 2025-05-18 | UIShift: Enhancing VLM-based GUI Agents through Self-supervised Reinforcement Learning | Longxi Gao et.al. | 2505.12493 | null |
| 2025-05-18 | Proposal for Improving Google A2A Protocol: Safeguarding Sensitive Data in Multi-Agent Systems | Yedidel Louck et.al. | 2505.12490 | null |
| 2025-05-18 | Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems | Haochun Wang et.al. | 2505.12467 | null |
| 2025-05-18 | Resolving Latency and Inventory Risk in Market Making with Reinforcement Learning | Junzhe Jiang et.al. | 2505.12465 | null |
| 2025-05-18 | BadNAVer: Exploring Jailbreak Attacks On Vision-and-Language Navigation | Wenqi Lyu et.al. | 2505.12443 | null |
| 2025-05-20 | IP Leakage Attacks Targeting LLM-Based Multi-Agent Systems | Liwen Wang et.al. | 2505.12442 | null |
| 2025-05-18 | Learning to Play Like Humans: A Framework for LLM Adaptation in Interactive Fiction Games | Jinming Zhang et.al. | 2505.12439 | null |
| 2025-05-20 | Steady-State Strategy Synthesis for Swarms of Autonomous Agents | Martin Jonáš et.al. | 2505.12406 | null |
| 2025-05-18 | Automated Profile Inference with Language Model Agents | Yuntao Du et.al. | 2505.12402 | link |
| 2025-05-18 | MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks | Yinghao Zhu et.al. | 2505.12371 | link |
| 2025-05-18 | Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning | Xinbin Yuan et.al. | 2505.12370 | link |
| 2025-05-18 | A universal policy wrapper with guarantees | Anton Bolychev et.al. | 2505.12354 | null |
| 2025-05-18 | Enhancing User-Oriented Proactivity in Open-Domain Dialogues with Critic Guidance | Yufeng Wang et.al. | 2505.12334 | null |
| 2025-05-18 | Robust Planning for Autonomous Driving via Mixed Adversarial Diffusion Predictions | Albert Zhao et.al. | 2505.12327 | null |
| 2025-05-18 | BeliefNest: A Joint Action Simulator for Embodied Agents with Theory of Mind | Rikunari Sagara et.al. | 2505.12321 | link |
| 2025-05-18 | Scene-Adaptive Motion Planning with Explicit Mixture of Experts and Interaction-Oriented Optimization | Hongbiao Zhu et.al. | 2505.12311 | null |
| 2025-05-18 | Enhance Mobile Agents Thinking Process Via Iterative Preference Learning | Kun Huang et.al. | 2505.12299 | null |
| 2025-05-18 | LAMeTA: Intent-Aware Agentic Network Optimization via a Large AI Model-Empowered Two-Stage Approach | Yinqiu Liu et.al. | 2505.12247 | null |
| 2025-05-18 | Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents | Shuo Han et.al. | 2505.12204 | null |
| 2025-05-20 | LLM-DSE: Searching Accelerator Parameters with LLM Agents | Hanyu Wang et.al. | 2505.12188 | link |
| 2025-05-17 | LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs | Omar Choukrani et.al. | 2505.12135 | link |
| 2025-05-17 | Towards Sustainability in 6G Network Slicing with Energy-Saving and Optimization Methods | Rodrigo Moreira et.al. | 2505.12132 | null |
| 2025-05-17 | Scalable Time-Tagged Data Acquisition for Entanglement Distribution in Quantum Networks | Abderrahim Amlou et.al. | 2505.12102 | null |
| 2025-05-17 | Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents | Tiannuo Yang et.al. | 2505.12065 | link |
| 2025-05-17 | AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research | Renqi Chen et.al. | 2505.12039 | null |
| 2025-05-17 | Incentivize Contribution and Learn Parameters Too: Federated Learning with Strategic Data Owners | Drashthi Doshi et.al. | 2505.12010 | null |
| 2025-05-17 | SOCIA: An End-to-End Agentic Framework for Automated Cyber-Physical-Social Simulator Generation | Yuncheng Hua et.al. | 2505.12006 | null |
| 2025-05-17 | Interactional Fairness in LLM Multi-Agent Systems: An Evaluation Framework | Ruta Binkyte et.al. | 2505.12001 | null |
| 2025-05-17 | Task Scheduling in Space-Air-Ground Uniformly Integrated Networks with Ripple Effects | Chuan Huang et.al. | 2505.11974 | null |
| 2025-05-17 | MARVEL: Multi-Agent RTL Vulnerability Extraction using Large Language Models | Luca Collini et.al. | 2505.11963 | null |
| 2025-05-17 | CrafText Benchmark: Advancing Instruction Following in Complex Multimodal Open-Ended World | Zoya Volovikova et.al. | 2505.11962 | null |
| 2025-05-17 | LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners | Junhao Zheng et.al. | 2505.11942 | link |
| 2025-05-17 | Modèles de Substitution pour les Modèles à base d'Agents : Enjeux, Méthodes et Applications | Paul Saves et.al. | 2505.11912 | link |
| 2025-05-17 | Benchmarking LLMs in an Embodied Environment for Blue Team Threat Hunting | Xiaoqun Liu et.al. | 2505.11901 | null |
| 2025-05-17 | Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents | Weikai Xu et.al. | 2505.11891 | null |
| 2025-05-17 | AR Secretary Agent: Real-time Memory Augmentation via LLM-powered Augmented Reality Glasses | Raphaël A. El Haddad et.al. | 2505.11888 | null |
| 2025-05-20 | Aux-Think: Exploring Reasoning Strategies for Data-Efficient Vision-Language Navigation | Shuo Wang et.al. | 2505.11886 | null |
| 2025-05-17 | Position Paper: Bounded Alignment: What (Not) To Expect From AGI Agents | Ali A. Minai et.al. | 2505.11866 | null |
| 2025-05-17 | Learning Pareto-Optimal Rewards from Noisy Preferences: A Framework for Multi-Objective Inverse Reinforcement Learning | Kalyan Cherukuri et.al. | 2505.11864 | null |
| 2025-05-17 | RVTBench: A Benchmark for Visual Reasoning Tasks | Yiqing Shen et.al. | 2505.11838 | link |
| 2025-05-17 | Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment | Siliang Zeng et.al. | 2505.11821 | null |
| 2025-05-17 | BELLE: A Bi-Level Multi-Agent Reasoning Framework for Multi-Hop Question Answering | Taolin Zhang et.al. | 2505.11811 | null |
| 2025-05-17 | Retrospex: Language Agent Meets Offline Reinforcement Learning Critic | Yufei Xiang et.al. | 2505.11807 | link |
| 2025-05-17 | Robustness of Incentive Mechanisms Against System Misspecification in Congestion Games | Chih-Yuan Chiu et.al. | 2505.11791 | null |
| 2025-05-17 | OMAC: A Broad Optimization Framework for LLM-Based Multi-Agent Collaboration | Shijun Li et.al. | 2505.11765 | link |
| 2025-05-16 | REMOR: Automated Peer Review Generation with LLM Reasoning and Multi-Objective Reinforcement Learning | Pawin Taechoyotin et.al. | 2505.11718 | null |
| 2025-05-16 | EnvInjection: Environmental Prompt Injection Attack to Multi-modal Web Agents | Xilong Wang et.al. | 2505.11717 | null |
| 2025-05-16 | Unveiling the Black Box: A Multi-Layer Framework for Explaining Reinforcement Learning-Based Cyber Agents | Diksha Goel et.al. | 2505.11708 | null |
| 2025-05-16 | Forensics of Error Rates of Quantum Hardware | Rupshali Roy et.al. | 2505.11706 | null |
| 2025-05-16 | Ambiguity Resolution in Text-to-Structured Data Mapping | Zhibo Hu et.al. | 2505.11679 | null |
| 2025-05-16 | Terminators: Terms of Service Parsing and Auditing Agents | Maruf Ahmed Mridul et.al. | 2505.11672 | null |
| 2025-05-16 | Learning from Less: Guiding Deep Reinforcement Learning with Differentiable Symbolic Planning | Zihan Ye et.al. | 2505.11661 | null |
| 2025-05-16 | PeerGuard: Defending Multi-Agent Systems Against Backdoor Attacks Through Mutual Reasoning | Falong Fan et.al. | 2505.11642 | link |
| 2025-05-20 | Talk to Your Slides: Language-Driven Agents for Efficient Slide Editing | Kyudan Jung et.al. | 2505.11604 | link |
| 2025-05-16 | Continuous Optimization for Feature Selection with Permutation-Invariant Embedding and Policy-Guided Search | Rui Liu et.al. | 2505.11601 | null |
| 2025-05-16 | LLM Agents Are Hypersensitive to Nudges | Manuel Cherep et.al. | 2505.11584 | null |
| 2025-05-16 | Toward Adaptive Categories: Dimensional Governance for Agentic AI | Zeynep Engin et.al. | 2505.11579 | null |
| 2025-05-15 | Assessing Collective Reasoning in Multi-Agent LLMs via Hidden Profile Tasks | Yuxuan Li et.al. | 2505.11556 | null |
| 2025-05-14 | TARGET: Benchmarking Table Retrieval for Generative Tasks | Xingyu Ji et.al. | 2505.11545 | null |
| 2025-05-16 | Automatic Reward Shaping from Confounded Offline Data | Mingxuan Li et.al. | 2505.11478 | null |
| 2025-05-16 | Signal attenuation enables scalable decentralized multi-agent reinforcement learning over networks | Wesley A Suttle et.al. | 2505.11461 | null |
| 2025-05-16 | Robust Equilibria in Shared Resource Allocation via Strengthening Border's Theorem | David X. Lin et.al. | 2505.11431 | null |
| 2025-05-16 | Can AI automatically analyze public opinion? A LLM agents-based agentic pipeline for timely public opinion analysis | Jing Liu et.al. | 2505.11401 | null |
| 2025-05-16 | Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation | Zihan Wang et.al. | 2505.11383 | link |
| 2025-05-16 | GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents | Lingxiao Diao et.al. | 2505.11368 | null |
| 2025-05-16 | Long-Term Average Impulse Control with Mean Field Interactions | K. L. Helmes et.al. | 2505.11345 | null |
| 2025-05-16 | Explaining Strategic Decisions in Multi-Agent Reinforcement Learning for Aerial Combat Tactics | Ardian Selmonaj et.al. | 2505.11311 | null |
| 2025-05-16 | Diffusion Learning with Partial Agent Participation and Local Updates | Elsa Rizk et.al. | 2505.11307 | null |
| 2025-05-16 | Meta-World+: An Improved, Standardized, RL Benchmark | Reginald McLean et.al. | 2505.11289 | link |
| 2025-05-16 | TAIJI: MCP-based Multi-Modal Data Analytics on Data Lakes | Chao Zhang et.al. | 2505.11270 | null |
| 2025-05-19 | Massive-STEPS: Massive Semantic Trajectories for Understanding POI Check-ins -- Dataset and Benchmarks | Wilson Wongso et.al. | 2505.11239 | link |
| 2025-05-16 | Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation | Donghoon Lee et.al. | 2505.11221 | link |
| 2025-05-16 | From Intent Discovery to Recognition with Topic Modeling and Synthetic Data | Aaron Rodrigues et.al. | 2505.11176 | null |
| 2025-05-19 | Real-Time Verification of Embodied Reasoning for Generative Skill Acquisition | Bo Yue et.al. | 2505.11175 | null |
| 2025-05-16 | MPMA: Preference Manipulation Attack Against Model Context Protocol | Zihan Wang et.al. | 2505.11154 | null |
| 2025-05-16 | Bi-directional Recurrence Improves Transformer in Partially Observable Markov Decision Processes | Ashok Arora et.al. | 2505.11153 | null |
| 2025-05-16 | Reinforcement Learning for AMR Charging Decisions: The Impact of Reward and Action Space Design | Janik Bischoff et.al. | 2505.11136 | null |
| 2025-05-16 | Scalability of Reinforcement Learning Methods for Dispatching in Semiconductor Frontend Fabs: A Comparison of Open-Source Models with Real Industry Datasets | Patrick Stöckermann et.al. | 2505.11135 | null |
| 2025-05-16 | Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity | Chan-Jan Hsu et.al. | 2505.11107 | null |
| 2025-05-16 | Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors | Lang Feng et.al. | 2505.11100 | null |
| 2025-05-16 | LLM-Enhanced Symbolic Control for Safety-Critical Applications | Amir |