arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.01452 2026-04-03 cs.AI

A Multi-Agent Human-LLM Collaborative Framework for Closed-Loop Scientific Literature Summarization

Maxwell J. Jacobson, Daniel Xie, Jackson Shen, Adil Wazeer, Haiyan Wang, Xinghang Zhang, Yexiang Xue

详情

英文摘要

Scientific discovery is slowed by fragmented literature that requires excessive human effort to gather, analyze, and understand. AI tools, including autonomous summarization and question answering, have been developed to aid in understanding scientific literature. However, these tools lack the structured, multi-step approach necessary for extracting deep insights from scientific literature. Large Language Models (LLMs) offer new possibilities for literature analysis, but remain unreliable due to hallucinations and incomplete extraction. We introduce Elhuyar, a multi-agent, human-in-the-loop system that integrates LLMs, structured AI, and human scientists to extract, analyze, and iteratively refine insights from scientific literature. The framework distributes tasks among specialized agents for filtering papers, extracting data, fitting models, and summarizing findings, with human oversight ensuring reliability. The system generates structured reports with extracted data, visualizations, model equations, and text summaries, enabling deeper inquiry through iterative refinement. Deployed in materials science, it analyzed literature on tungsten under helium-ion irradiation, showing experimentally correlated exponential helium bubble growth with irradiation dose and temperature, offering insight for plasma-facing materials (PFMs) in fusion reactors. This demonstrates how AI-assisted literature review can uncover scientific patterns and accelerate discovery.

URL PDF HTML ☆

赞 0 踩 0

2604.01434 2026-04-03 cs.AI

Leveraging the Value of Information in POMDP Planning

Zakariya Laouar, Qi Heng Ho, Zachary Sunberg

2604.01430 2026-04-03 cs.LG

Improving Latent Generalization Using Test-time Compute

Arslan Chaudhry, Sridhar Thiagarajan, Andrew Lampinen

2604.01425 2026-04-03 cs.CL

The power of context: Random Forest classification of near synonyms. A case study in Modern Hindi

Jacek Bąkowski

2604.01421 2026-04-03 cs.CV

EgoFlow: Gradient-Guided Flow Matching for Egocentric 6DoF Object Motion Generation

Abhishek Saroha, Huajian Zeng, Xingxing Zuo, Daniel Cremers, Xi Wang

Comments CVPR 2026: https://abhi-rf.github.io/egoflow/

2604.01418 2026-04-03 cs.CL

Cost-Efficient Estimation of General Abilities Across Benchmarks

Michael Krumdick, Adam Wiemerslage, Seth Ebner, Charles Lovering, Chris Tanner

2604.01414 2026-04-03 cs.RO

Learning When to See and When to Feel: Adaptive Vision-Torque Fusion for Contact-Aware Manipulation

Jiuzhou Lei, Chang Liu, Yu She, Xiao Liang, Minghui Zheng

2604.01411 2026-04-03 cs.LG cs.CL stat.ML

Test-Time Scaling Makes Overtraining Compute-Optimal

Nicholas Roberts, Sungjun Cho, Zhiqi Gao, Tzu-Heng Huang, Albert Wu, Gabriel Orlanski, Avi Trost, Kelly Buchanan, Aws Albarghouthi, Frederic Sala

2604.01398 2026-04-03 cs.LG

Benchmark Problems and Benchmark Datasets for the evaluation of Machine and Deep Learning methods on Photoplethysmography signals: the D4 report from the QUMPHY project

Urs Hackstein, Jordi Alastruey, Philip Aston, Ciaran Bench, Peter H. Charlton, Loic Coquelin, Nando Hegemann, Vaidotas Marozas, Mohammad Moulaeifard, Manasi Nandi, Andrius Petrenas, Oskar Pfeffer, Mantas Rinkevicius, Andrius Solosenko, Nils Strodthoff, Sara Vardanega

Comments 28 pages

2604.01390 2026-04-03 cs.RO

A soft and lightweight fabric-based pneumatic interface for multimodal fingertip tactile feedback

Rui Chen, Daniele Leonardis, Antonio Frisoli

2604.01388 2026-04-03 cs.CV

LESV: Language Embedded Sparse Voxel Fusion for Open-Vocabulary 3D Scene Understanding

Fusang Wang, Nathan Piasco, Moussab Bennehar, Luis Roldão, Dzmitry Tsishkou, Fabien Moutarde

2604.01378 2026-04-03 cs.LG math.OC

Residuals-based Offline Reinforcement Learning

Qing Zhu, Xian Yu

2604.01371 2026-04-03 cs.CV cs.AI cs.RO eess.IV

AffordTissue: Dense Affordance Prediction for Tool-Action Specific Tissue Interaction

Aiza Maksutova, Lalithkumar Seenivasan, Hao Ding, Jiru Xu, Chenhao Yu, Chenyan Jing, Yiqing Shen, Mathias Unberath

2604.01366 2026-04-03 cs.AI

CogBias: Measuring and Mitigating Cognitive Bias in Large Language Models

Fan Huang, Songheng Zhang, Haewoon Kwak, Jisun An

2604.01363 2026-04-03 cs.AI econ.GN q-fin.EC

Crashing Waves vs. Rising Tides: Preliminary Findings on AI Automation from Thousands of Worker Evaluations of Labor Market Tasks

Matthias Mertens, Adam Kuzee, Brittany S. Harris, Harry Lyu, Wensu Li, Jonathan Rosenfeld, Meiri Anto, Martin Fleming, Neil Thompson

2604.01361 2026-04-03 cs.CV

IGLOSS: Image Generation for Lidar Open-vocabulary Semantic Segmentation

Nermin Samet, Gilles Puy, Renaud Marlet

2604.01359 2026-04-03 cs.AI

Semantic Modeling for World-Centered Architectures

Andrei Mantsivoda, Darya Gavrilina

Comments 15 pages, 1 figure, MathAI conference

2604.01354 2026-04-03 cs.CL

Open-Domain Safety Policy Construction

Di Wu, Siyue Liu, Zixiang Ji, Ya-Liang Chang, Zhe-Yu Liu, Andrew Pleffer, Kai-Wei Chang

Comments EACL 2026 (Findings)

2604.01352 2026-04-03 cs.RO

Open-loop POMDP Simplification and Safe Skipping of Replanning with Formal Performance Guarantees

Da Kong, Vadim Indelman

Comments 18 pages, 5 figures. Accepted to WAFR 2026

2604.01350 2026-04-03 cs.CL cs.AI cs.CR

No Attacker Needed: Unintentional Cross-User Contamination in Shared-State LLM Agents

Tiankai Yang, Jiate Li, Yi Nian, Shen Dong, Ruiyao Xu, Ryan Rossi, Kaize Ding, Yue Zhao

2604.01345 2026-04-03 cs.LG

Malliavin Calculus for Counterfactual Gradient Estimation in Adaptive Inverse Reinforcement Learning

Vikram Krishnamurthy, Luke Snow

2604.01344 2026-04-03 cs.AI

IDEA2: Expert-in-the-loop competency question elicitation for collaborative ontology engineering

Elliott Watkiss-Leek, Reham Alharbi, Harry Rostron, Andrew Ng, Ewan Johnson, Andrew Mitchell, Terry R. Payne, Valentina Tamma, Jacopo de Berardinis

2604.01339 2026-04-03 cs.CV cs.AI cs.LG stat.ME stat.ML

Regularizing Attention Scores with Bootstrapping

Neo Christopher Chung, Maxim Laletin

2604.01337 2026-04-03 cs.LG cs.CV

SECURE: Stable Early Collision Understanding via Robust Embeddings in Autonomous Driving

Wenjing Wang, Wenxuan Wang, Songning Lai

Comments 13 pages, 2 figures

2604.01330 2026-04-03 cs.SD cs.AI cs.CR cs.LG cs.NE

Evolutionary Multi-Objective Fusion of Deepfake Speech Detectors

Vojtěch Staněk, Martin Perešíni, Lukáš Sekanina, Anton Firc, Kamil Malinka

Comments Accepted to WCCI CEC 2026

2604.01329 2026-04-03 cs.LG

Model Merging via Data-Free Covariance Estimation

Marawan Gamal Abdel Hameed, Derek Tam, Pascal Jr Tikeng Notsawo, Colin Raffel, Guillaume Rabusseau

2604.01325 2026-04-03 cs.AI stat.ME

The Digital Twin Counterfactual Framework: A Validation Architecture for Simulated Potential Outcomes

Olav Laudy

2604.01322 2026-04-03 cs.CV

Human Pose Estimation in Trampoline Gymnastics: Improving Performance Using a New Synthetic Dataset

Léa Drolet-Roy, Victor Nogues, Sylvain Gaudet, Eve Charbonneau, Mickaël Begon, Lama Séoud

2604.01318 2026-04-03 cs.CV

ViTs for Action Classification in Videos: An Approach to Risky Tackle Detection in American Football Practice Videos

Syed Ahsan Masud Zaidi, William Hsu, Scott Dietrich

Comments 15 pages, 4 figures. Accepted to ICPR 2026 (28th International Conference on Pattern Recognition)

2604.01312 2026-04-03 cs.CL cs.AI

Preference learning in shades of gray: Interpretable and bias-aware reward modeling for human preferences

Simona-Vasilica Oprea, Adela Bâra