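arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.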

2604.25917 2026-04-29 cs.AI cs.CL cs.LG

Recursive Multi-Agent Systems

Xiyuan Yang, Jiaru Zou, Rui Pan, Ruizhong Qiu, Pan Lu, Shizhe Diao, Jindong Jiang, Hanghang Tong, Tong Zhang, Markus J. Buehler, Jingrui He, James Zou

Comments 36 Pages. Project Website: https://recursivemas.github.io

Abstract (English)

Recursive or looped language models have recently emerged as a new scaling axis by iteratively refining the same model computation over latent states to deepen reasoning. We extend this scaling principle from a single model to multi-agent systems and ask: can agent collaboration itself be scaled through recursion? To this end, we introduce RecursiveMAS, a recursive multi-agent framework that casts the entire system as a unified latent-space recursive computation. RecursiveMAS connects heterogeneous agents into a collaboration loop through the lightweight RecursiveLink module, enabling in-distribution latent thought generation and cross-agent latent state transfer. To optimize the framework, we develop an inner-outer loop learning algorithm for iterative whole-system co-optimization through shared gradient-based credit assignment across recursion rounds. Theoretical analyses of runtime complexity and learning dynamics establish that RecursiveMAS is more efficient than standard text-based MAS and maintains stable gradients during recursive training. Empirically, we instantiate RecursiveMAS under 4 representative agent collaboration patterns and evaluate it across 9 benchmarks spanning mathematics, science, medicine, search, and code generation. Compared with advanced single/multi-agent and recursive computation baselines, RecursiveMAS consistently delivers an average accuracy improvement of 8.3%, together with a 1.2$\times$-2.4$\times$ end-to-end inference speedup and a 34.6%-75.6% reduction in token usage. Code and data are available at https://recursivemas.github.io.

2604.25914 2026-04-29 cs.CL

DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios

Jinxiang Meng, Shaoping Huang, Fangyu Lei, Jingyu Guo, Haoxiang Liu, Jiahao Su, Sihan Wang, Yao Wang, Enrui Wang, Ye Yang, Hongze Chai, Jinming Lv, Anbang Yu, Huangjing Zhang, Yitong Zhang, Yiming Huang, Zeyao Ma, Shizhu He, Jun Zhao, Kang Liu

2604.25913 2026-04-29 cs.GT cs.CR

Credit Limits beyond Full Collateralization in Decentralized Micropayments: Incentive Conditions

Chien-Chih Chen, Wojciech Golab

Comments 12 pages, 3 tables

2604.25906 2026-04-29 cs.IR

Make Any Collection Navigable: Methods for Constructing and Evaluating Hypergraph of Text

Dean E. Alvarez, ChengXiang Zhai

2604.25905 2026-04-29 cs.CL

A paradox of AI fluency

Christopher Potts, Moritz Sudhof

2604.25904 2026-04-29 cs.LG math.DS stat.ML

Teacher Forcing as Generalized Bayes: Optimization Geometry Mismatch in Switching Surrogates for Chaotic Dynamics

Andre Herz, Daniel Durstewitz, Georgia Koppe

Comments Presented at the Workshop on Optimization and Post-Bayesian Inference in Machine Learning, AISTATS 2026

2604.25903 2026-04-29 cs.SE cs.LG

Carbon-Taxed Transformers: A Green Compression Pipeline for Overgrown Language Models

Ajmain Inqiad Alam, Palash Roy, Chanchal K. Roy, Banani Roy, Kevin A. Schneider

DOI: 10.1145/3797075
Journal ref: Proceedings of ACM Software Engineering 3, FSE, Article FSE047, 2026

Abstract (English)

The accelerating adoption of Large Language Models (LLMs) in software engineering (SE) has brought with it a silent crisis: unsustainable computational cost. While these models demonstrate remarkable capabilities in different SE tasks, they are unmanageably large, slow to deploy, memory-intensive, and carbon-heavy. This reality threatens not only the scalability and accessibility of AI-powered SE, but also its long-term environmental sustainability. The research challenge is clear: we must go beyond accuracy and address efficiency and environmental cost as first-class design constraints. To meet this challenge, we introduce Carbon-Taxed Transformers (CTT), a systematic, principled multi-architecture compression pipeline whose ordering is inspired by economic carbon taxation. Drawing from the economic concept of carbon pricing, CTT operationalizes a computational carbon tax that penalizes architectural inefficiencies and rewards deployment-ready compression. We evaluate CTT across three core SE tasks (code clone detection, code summarization, and code generation) with models spanning encoder-only, encoder-decoder, and decoder-only architectures. Our results show that, at inference, CTT delivers (1) up to 49x memory reduction; (2) time reductions of up to 8-10x for clone detection, up to 3x for summarization, and 4-7x for generation; and (3) up to 81% reduction in CO2 emissions, while (4) retaining around 98% accuracy on clone detection, around 89% on summarization, and up to 91% (textual metrics) and 68% (pass@1) on generation. Two ablation studies show that pipeline ordering and individual component contributions are both essential, providing empirical justification for CTT's design and effectiveness. This work establishes a viable path toward responsible AI in SE through aggressive yet performance-preserving compression.

2604.25902 2026-04-29 cs.CL cs.AI cs.LG

Toward a Functional Geometric Algebra for Natural Language Semantics

James Pustejovsky

Comments 43 pages. Keywords: geometric algebra, Clifford algebra, compositional semantics, natural language semantics, type coercion, multivector representations, graded type system, Generative Lexicon, neural language models, distributional semantics

2604.25898 2026-04-29 cs.LG cs.AI

TSN-Affinity: Similarity-Driven Parameter Reuse for Continual Offline Reinforcement Learning

Dominik Żurek, Kamil Faber, Marcin Pietron, Paweł Gajewski, Roberto Corizzo

2604.25897 2026-04-29 cs.RO cs.LG cs.SY eess.SY

Variational Neural Belief Parameterizations for Robust Dexterous Grasping under Multimodal Uncertainty

Clinton Enwerem, Shreya Kalyanaraman, John S. Baras, Calin Belta

Comments 11 pages, 10 figures

2604.25895 2026-04-29 cs.CY cs.AI cs.CL

Three Models of RLHF Annotation: Extension, Evidence, and Authority

Steve Coyne

Comments 17 pages. Accepted to ACM FAccT '26, June 25-28, Montreal

2604.25891 2026-04-29 cs.LG cs.AI cs.CR

Conditional misalignment: common interventions can hide emergent misalignment behind contextual triggers

Jan Dubiński, Jan Betley, Anna Sztyber-Betley, Daniel Tan, Owain Evans

2604.25889 2026-04-29 cs.CV

Robust Deepfake Detection: Mitigating Spatial Attention Drift via Calibrated Complementary Ensembles

Minh-Khoa Le-Phan, Minh-Hoang Le, Trong-Le Do, Minh-Triet Tran

Comments 4th place (out of 94 teams) in the NTIRE 2026 Robust Deepfake Detection Challenge

2604.25887 2026-04-29 cs.CV cs.AI cs.RO cs.SY eess.SY

No Pedestrian Left Behind: Real-Time Detection and Tracking of Vulnerable Road Users for Adaptive Traffic Signal Control

Anas Gamal Aly, Hala ElAarag

Comments © Anas Gamal Aly and Hala ElAarag, 2026. This is the authors' version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record will be published in Proceedings of the 2026 ACM Southeast Conference (ACMSE 2026)

2604.25885 2026-04-29 hep-ph cs.LG hep-ex

Explainable AI for Jet Tagging: A Comparative Study of GNNExplainer, GNNShap, and GradCAM for Jet Tagging in the Lund Jet Plane

Pahal D. Patel, Sanmay Ganguly

Comments 25 pages, 9 figures. Comments are welcome

2604.25884 2026-04-29 quant-ph cs.CV

QCalEval: Benchmarking Vision-Language Models for Quantum Calibration Plot Understanding

Shuxiang Cao, Zijian Zhang, Abhishek Agarwal, Grace Bratrud, Niyaz R. Beysengulov, Daniel C. Cole, Alejandro Gómez Frieiro, Elena O. Glen, Hao Hsu, Gang Huang, Raymond Jow, Greshma Shaji, Tom Lubowe, Ligeng Zhu, Luis Mantilla Calderón, Nicola Pancotti, Joel Pendleton, Brandon Severin, Charles Etienne Staub, Sara Sussman, Antti Vepsäläinen, Neel Rajeshbhai Vora, Yilun Xu, Varinia Bernales, Daniel Bowring, Elica Kyoseva, Ivan Rungger, Giulia Semeghini, Sam Stanwyck, Timothy Costa, Alán Aspuru-Guzik, Krysta Svore

2604.25880 2026-04-29 cs.SE

From Threads to Trajectories: A Multi-LLM Pipeline for Community Knowledge Extraction from GitHub Issue Discussions

Nazia Shehnaz Joynab, Soneya Binta Hossain

2604.25878 2026-04-29 cs.CR

Prime-Field PINI: Machine-Checked Composition Theorems for Post-Quantum NTT Masking

Ray Iskander, Khaled Kirah

Comments 17 pages, 1 Figure

2604.25872 2026-04-29 cs.LG cs.AI stat.ML

When Errors Can Be Beneficial: A Categorization of Imperfect Rewards for Policy Gradient

Shuning Shang, Hubert Strauss, Stanley Wei, Sanjeev Arora, Noam Razin

Comments Code available at https://github.com/princeton-pli/imperfect-rewards

2604.25870 2026-04-29 cs.IT math.IT

Twisted and Twisted Linearized Reed-Solomon Codes, LCD and ACD MDS constructions

Sanjit Bhowmick, Kuntal Deka, Edgar Martínez-Moro

2604.25868 2026-04-29 cs.NI cs.IT math.IT

Decoding Delay Guarantees of Space Regulated Multiple Access Random Wireless Networks using Successive Interference Cancellation

Kevin Zagalo, Jean-Marie Gorce, François Baccelli

2604.25866 2026-04-29 cs.CL

From Syntax to Emotion: A Mechanistic Analysis of Emotion Inference in LLMs

Bangzhao Shu, Arinjay Singh, Mai ElSherief

Comments 18 pages including appendix

2604.25862 2026-04-29 cs.SE cs.AI

RESTestBench: A Benchmark for Evaluating the Effectiveness of LLM-Generated REST API Test Cases from NL Requirements

Leon Kogler, Stefan Hangler, Maximilian Ehrhart, Benedikt Dornauer, Roland Wuersching, Peter Schrammel

Comments Accepted for EASE 2026

2604.25857 2026-04-29 cs.NI

Slice Agent: Identifying and Isolating Slices in Shared Open Radio Unit

Felipe Arnholda, Flavio Rocha, Lucio Prade, Cristiano Bonato Both

Comments 40 pages, 13 figures, 4 tables

2604.25852 2026-04-29 math.NA cs.NA

Efficient boundary elements for the Smoluchowski diffusion equation

Ignacio Labarca-Figueroa, Heiko Gimperlein

Comments 23 pages, 17 figures

2604.25849 2026-04-29 cs.AI

ADEMA: A Knowledge-State Orchestration Architecture for Long-Horizon Knowledge Synthesis with LLM Agents

Zhou Hanlin, Chan Huah Yong

2604.25848 2026-04-29 cs.AI

Semi-Markov Reinforcement Learning for City-Scale EV Ride-Hailing with Feasibility-Guaranteed Actions

An Nguyen, Hoang Nguyen, Phuong Le, Hung Pham, Cuong Do, Laurent El Ghaoui

Comments 13 pages, 9 figures. Submitted to Neurocomputing

2604.25847 2026-04-29 math.OC cs.AI cs.LG

From Soliloquy to Agora: Memory-Enhanced LLM Agents with Decentralized Debate for Optimization Modeling

Jianghao Lin, Zi Ling, Chenyu Zhou, Tianyi Xu, Ruoqing Jiang, Zizhuo Wang, Dongdong Ge

Comments Working Paper

2604.25846 2026-04-29 cs.CR cs.AI

Towards Agentic Investigation of Security Alerts

Even Eilertsen, Vasileios Mavroeidis, Gudmund Grov

Comments 10 pages, 3 figures, 4 tables. Accepted at the 2025 IEEE International Conference on Big Data (BigData)

2604.25841 2026-04-29 cs.DS

Tight Bounds for some W[1]-hard Problems Parameterized by Multi-clique-width

Benjamin Bergougnoux, Vera Chekan, Stefan Kratsch

Comments Conference version to appear at International Workshop on Graph-Theoretic Concepts in Computer Science (WG 2026)