arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.06170 2026-04-08 cs.CL

Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework

Komal Kumar, Aman Chadha, Salman Khan, Fahad Shahbaz Khan, Hisham Cholakkal

Comments 19 pages, 7 figures, 8 tables, ACL main (Oral)

详情

英文摘要

The rapid growth of scientific literature has made it increasingly difficult for researchers to efficiently discover, evaluate, and synthesize relevant work. Recent advances in multi-agent large language models (LLMs) have demonstrated strong potential for understanding user intent and are being trained to utilize various tools. In this paper, we introduce Paper Circle, a multi-agent research discovery and analysis system designed to reduce the effort required to find, assess, organize, and understand academic literature. The system comprises two complementary pipelines: (1) a Discovery Pipeline that integrates offline and online retrieval from multiple sources, multi-criteria scoring, diversity-aware ranking, and structured outputs; and (2) an Analysis Pipeline that transforms individual papers into structured knowledge graphs with typed nodes such as concepts, methods, experiments, and figures, enabling graph-aware question answering and coverage verification. Both pipelines are implemented within a coder LLM-based multi-agent orchestration framework and produce fully reproducible, synchronized outputs including JSON, CSV, BibTeX, Markdown, and HTML at each agent step. This paper describes the system architecture, agent roles, retrieval and scoring methods, knowledge graph schema, and evaluation interfaces that together form the Paper Circle research workflow. We benchmark Paper Circle on both paper retrieval and paper review generation, reporting hit rate, MRR, and Recall at K. Results show consistent improvements with stronger agent models. We have publicly released the website at https://papercircle.vercel.app/ and the code at https://github.com/MAXNORM8650/papercircle.

URL PDF HTML ☆

赞 0 踩 0

2604.06169 2026-04-08 cs.LG cs.AI cs.CL stat.ML

In-Place Test-Time Training

Guhao Feng, Shengjie Luo, Kai Hua, Ge Zhang, Di He, Wenhao Huang, Tianle Cai

Comments ICLR 2026 Oral Presentation; Code is released at https://github.com/ByteDance-Seed/In-Place-TTT

2604.06167 2026-04-08 cs.LG math.AT

Topological Characterization of Churn Flow and Unsupervised Correction to the Wu Flow-Regime Map in Small-Diameter Vertical Pipes

Brady Koenig, Sushovan Majhi, Atish Mitra, Abigail Stein, Burt Todd

2604.06163 2026-04-08 cs.IR

Data, Not Model: Explaining Bias toward LLM Texts in Neural Retrievers

Wei Huang, Keping Bi, Yinqiong Cai, Wei Chen, Jiafeng Guo, Xueqi Cheng

2604.06160 2026-04-08 cs.CV cs.LG

The Character Error Vector: Decomposable errors for page-level OCR evaluation

Jonathan Bourne, Mwiza Simbeye, Joseph Nockels

Comments 6643 words, 5 figures, 15 tables

2604.06159 2026-04-08 cs.LG

Target Policy Optimization

Jean Kaddour

2604.06158 2026-04-08 math.OC cs.SY eess.SY

Distributionally Robust Regret Optimal LQR with Common Stage-Law Ambiguity

Lukas-Benedikt Fiechtner, Jose Blanchet

2604.06156 2026-04-08 cs.CV cs.AI cs.CL

MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control

Yuchi Wang, Haiyang Yu, Weikang Bian, Jiefeng Long, Xiao Liang, Chao Feng, Hongsheng Li

2604.06154 2026-04-08 cs.CL

Exclusive Unlearning

Mutsumi Sasaki, Kouta Nakayama, Yusuke Miyao, Yohei Oseki, Masaru Isonuma

2604.06150 2026-04-08 cs.RO

Delta6: A Low-Cost, 6-DOF Force-Sensing Flexible End-Effector

Yue Feng, Weicheng Huang, Chen Qiu, Huixu Dong, I-Ming Chen

Comments This work has been submitted to the IEEE for possible publication

2604.06148 2026-04-08 cs.CR cs.AI cs.MA

Who Governs the Machine? A Machine Identity Governance Taxonomy (MIGT) for AI Systems Operating Across Enterprise and Geopolitical Boundaries

Andrew Kurtz, Klaudia Krawiecka

Comments 75 pages (excl. references), 2 tables. Addresses policy makers, regulators, and practitioners at the intersection of AI governance, cybersecurity, and geopolitical risk

2604.06140 2026-04-08 eess.SY cs.SY

On the Convergence of an Opinion-Action Coevolution Model with Bounded Confidence

Chen Song, Angela Fontan, Rong Su, Julien M. Hendrickx, Vladimir Cvetkovic, Karl H. Johansson

Comments This work has been accepted for presentation at the 24th European Control Conference (ECC 2026)

2604.06138 2026-04-08 cs.SD cs.AI

Generating Synthetic Doctor-Patient Conversations for Long-form Audio Summarization

Yanis Labrak, David Grünert, Séverin Baroudi, Jiyun Chun, Pawel Cyrta, Sergio Burdisso, Ahmed Hassoon, David Liu, Adam Rothschild, Reed Van Deusen, Petr Motlicek, Andrew Perrault, Ricard Marxer, Thomas Schaaf

Comments Submitted for review at Interspeech 2026

2604.06135 2026-04-08 quant-ph cs.AI cs.LG

Shot-Based Quantum Encoding: A Data-Loading Paradigm for Quantum Neural Networks

Basil Kyriacou, Viktoria Patapovich, Maniraman Periyasamy, Alexey Melnikov

Comments 6 pages, 2 figures, 0 tables

2604.06134 2026-04-08 cs.HC

MAESTRO: Adapting GUIs and Guiding Navigation with User Preferences in Conversational Agents with GUIs

Sangwook Lee, Sang Won Lee, Adnan Abbas, Young-Ho Kim, Yan Chen

Comments 10 pages, 5 figures

2604.06133 2026-04-08 cs.RO

Learning-Guided Force-Feedback Model Predictive Control with Obstacle Avoidance for Robotic Deburring

Krzysztof Wojciechowski, Ege Gursoy, Arthur Haffemayer, Sebastien Kleff, Vincent Bonnet, Florent Lamiraux, Nicolas Mansard

Comments Accepted to ICRA 2026

2604.06131 2026-04-08 cs.HC

Understanding Educators' Perceptions of AI-generated Non-consensual Intimate Imagery

Tongxin Li, Katelyn M Reyes, Liezeil Jimenez, Katie S Nam, Donghee Yvette Wohn

2604.06130 2026-04-08 math.NA cs.NA quant-ph

QAFE$^2$: Quantum Accelerated Multiscale Finite Element Analysis

Yiren Wang, Michael Ortiz, Fehmi Cirak

2604.06129 2026-04-08 cs.CV cs.AI

PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer

David Picard, Nicolas Dufour, Lucas Degeorge, Arijit Ghosh, Davide Allegro, Tom Ravaud, Yohann Perron, Corentin Sautier, Zeynep Sonat Baltaci, Fei Meng, Syrine Kalleli, Marta López-Rauhut, Thibaut Loiseau, Ségolène Albouy, Raphael Baena, Elliot Vincent, Loic Landrieu

Comments Accepted to CVPR Findings 2026

2604.06126 2026-04-08 cs.LG cs.AI

Gym-Anything: Turn any Software into an Agent Environment

Pranjal Aggarwal, Graham Neubig, Sean Welleck

详情

英文摘要

Computer-use agents hold the promise of assisting in a wide range of digital economic activities. However, current research has largely focused on short-horizon tasks over a limited set of software with limited economic value, such as basic e-commerce and OS-configuration tasks. A key reason is that creating environments for complex software requires significant time and human effort, and therefore does not scale. To address this, we introduce Gym-Anything, a framework for converting any software into an interactive computer-use environment. We frame environment creation itself as a multi-agent task: a coding agent writes setup scripts, downloads real-world data, and configures the software, while producing evidence of correct setup. An independent audit agent then verifies evidence for the environment setup against a quality checklist. Using a taxonomy of economically valuable occupations grounded in U.S. GDP data, we apply this pipeline to 200 software applications with broad occupational coverage. The result is CUA-World, a collection of over 10K long-horizon tasks spanning domains from medical science and astronomy to engineering and enterprise systems, each configured with realistic data along with train and test splits. CUA-World also includes CUA-World-Long, a challenging long-horizon benchmark with tasks often requiring over 500 steps, far exceeding existing benchmarks. Distilling successful trajectories from the training split into a 2B vision-language model outperforms models 2$\times$ its size. We also apply the same auditing principle at test time: a separate VLM reviews completed trajectories and provides feedback on what remains, improving Gemini-3-Flash on CUA-World-Long from 11.5% to 14.0%. We release all code, infrastructure, and benchmark data to facilitate future research in realistic computer-use agents.

URL PDF HTML ☆

赞 0 踩 0

2604.06125 2026-04-08 cs.IT math.IT

Multilevel Coset Codes on Lattices

Leopold Bertholet, Chloe Makdad, Stephen Mackes, Daniel Chew, Matthew Robinson

2604.06124 2026-04-08 cs.CV cs.AI

Lightweight Multimodal Adaptation of Vision Language Models for Species Recognition and Habitat Context Interpretation in Drone Thermal Imagery

Hao Chen, Fang Qiu, Fangchao Dong, Defei Yang, Eve Bohnett, Li An

2604.06123 2026-04-08 stat.CO cs.LG econ.EM stat.ME

A Large-Scale Empirical Comparison of Meta-Learners and Causal Forests for Heterogeneous Treatment Effect Estimation in Marketing Uplift Modeling

Aman Singh

Comments 6 pages

2604.06117 2026-04-08 math.DS cs.SY eess.SY

On Permanence of Conservative Replicator Dynamics with Four Strategies

Haoyu Yin, Xudong Chen, Bruno Sinopoli

2604.06115 2026-04-08 math.NA cs.NA

A Neural-Enhanced Weak Galerkin Method for Second-Order Elliptic Problems with Low-Regularity Solutions

Chunmei Wang

Comments 12 pages

2604.06113 2026-04-08 cs.CV

SEM-ROVER: Semantic Voxel-Guided Diffusion for Large-Scale Driving Scene Generation

Hiba Dahmani, Nathan Piasco, Moussab Bennehar, Luis Roldão, Dzmitry Tsishkou, Laurent Caraffa, Jean-Philippe Tarel, Roland Brémond

2604.06109 2026-04-08 cs.LG cs.DS

Learning $\mathsf{AC}^0$ Under Graphical Models

Gautam Chandrasekaran, Jason Gaitonde, Ankur Moitra, Arsen Vasilyan

Comments 57 pages

2604.06107 2026-04-08 cs.AI math.HO math.LO

Artificial Intelligence and the Structure of Mathematics

Maissam Barkeshli, Michael R. Douglas, Michael H. Freedman

Comments 45 pages

2604.06102 2026-04-08 cs.HC

UI Placement as a Critical Design Factor for Augmented Reality During Locomotion

Pavel Manakhov, Hans Gellersen

Comments 4 pages, 2 figures

2604.06101 2026-04-08 cs.CR

Towards Securing IIoT: An Innovative Privacy-Preserving Anomaly Detector Based on Federated Learning

Samira Kamali Poorazad, Chafika Benzaïd, Tarik Taleb