arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.00060 2026-05-04 cs.AI cs.SY eess.SY

TADI: Tool-Augmented Drilling Intelligence via Agentic LLM Orchestration over Heterogeneous Wellsite Data

Rong Lu

详情

DOI: 10.20944/preprints202604.1820.v1

英文摘要

We present TADI (Tool-Augmented Drilling Intelligence), an agentic AI system that transforms drilling operational data into evidence-based analytical intelligence. Applied to the Equinor Volve Field dataset, TADI integrates 1,759 daily drilling reports, selected WITSML real-time objects, 15,634 production records, formation tops, and perforations into a dual-store architecture: DuckDB for structured queries over 12 tables with 65,447 rows, and ChromaDB for semantic search over 36,709 embedded documents. Twelve domain-specialized tools, orchestrated by a large language model via iterative function calling, support multi-step evidence gathering that cross-references structured drilling measurements with daily report narratives. The system parses all 1,759 DDR XML files with zero errors, handles three incompatible well naming conventions, and is backed by 95 automated tests plus a 130-question stress-question taxonomy spanning six operational categories. We formalize the agent's behavior as a sequential tool-selection problem and propose the Evidence Grounding Score (EGS) as a simple grounding-compliance proxy based on measurements, attributed DDR quotations, and required answer sections. The complete 6,084-line, framework-free implementation is reproducible given the public Volve download and an API key, and the case studies and qualitative ablation analysis suggest that domain-specialized tool design, rather than model scale alone, is the primary driver of analytical quality in technical operations.

URL PDF HTML ☆

赞 0 踩 0

2605.00059 2026-05-04 cs.RO cs.AI

Dynamic-TD3: A Novel Algorithm for UAV Path Planning with Dynamic Obstacle Trajectory Prediction

Wentao Chen, Jingtang Chen, Mingjian Fu, Tiantian Li, Youfeng Su, Wenxi Liu, Yuanlong Yu

Comments 6 pages, 5 figures

2605.00056 2026-05-04 cs.LG cs.AI physics.data-an physics.geo-ph stat.AP stat.ML

Smart Ensemble Learning Framework for Predicting Groundwater Heavy Metal Pollution

T. Ansah-Narh, G. Y. Afrifa, J. B. Tandoh, K. Asare, M. Addi, K. E. Yorke, D. M. A. Akpoley, K. Aidoo, S. K. Fosuhene

Comments 53 pages, 16 figures, accepted for publication in Earth Systems and Environment (2026)

2605.00052 2026-05-04 cs.CV

Two-View Accumulation as the Primary Training Lever for Hybrid-Capture Gaussian Splatting: A Variance-Decomposition View of When Gradient Surgery Helps

Sungjun Cho

2605.00051 2026-05-04 cs.CV cs.LG

Learning from the Unseen: Generative Data Augmentation for Geometric-Semantic Accident Anticipation

Yanchen Guan, Haicheng Liao, Chengyue Wang, Xingcheng Liu, Jiaxun Zhang, Keqiang Li, Zhenning Li

2605.00050 2026-05-04 cs.LG cs.CV

Learning physically grounded traffic accident reconstruction from public accident reports

Yanchen Guan, Haicheng Liao, Chengyue Wang, Zhenning Li

2605.00022 2026-05-04 cs.CL cs.AI cs.SD

Putting HUMANS first: Efficient LAM Evaluation with Human Preference Alignment

Woody Haosheng Gan, William Held, Diyi Yang

Comments Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics

2605.00020 2026-05-04 cs.LG cs.AI cs.IT eess.SP math.IT

AirFM-DDA: Air-Interface Foundation Model in the Delay-Doppler-Angle Domain for AI-Native 6G

Kejia Bian, Meixia Tao, Jianhua Mo, Zhiyong Chen, Leyan Chen

Comments 16 pages

2605.00018 2026-05-04 cs.LG eess.SP

What Physics do Data-Driven MoCap-to-Radar Models Learn?

Kevin Chen, Kenneth W. Parker, Anish Arora

2605.00011 2026-05-04 cs.LG cs.AI cs.DC

FedACT: Concurrent Federated Intelligence across Heterogeneous Data Sources

Md Sirajul Islam, Isabelle G Chapman, N I Md Ashafuddula, Xu Yuan, Li Chen, Nian-Feng Tzeng, Klara Nahrstedt

2605.00005 2026-05-04 cs.LG cs.AI cs.DC cs.NI

Cloud Is Closer Than It Appears: Revisiting the Tradeoffs of Distributed Real-Time Inference

Pragya Sharma, Hang Qiu, Mani Srivastava

详情

DOI: 10.1109/ICCCN65249.2025.11133852

英文摘要

The increasing deployment of deep neural networks (DNNs) in cyber-physical systems (CPS) enhances perception fidelity, but imposes substantial computational demands on execution platforms, posing challenges to real-time control deadlines. Traditional distributed CPS architectures typically favor on-device inference to avoid network variability and contention-induced delays on remote platforms. However, this design choice places significant energy and computational demands on the local hardware. In this work, we revisit the assumption that cloud-based inference is intrinsically unsuitable for latency-sensitive control tasks. We demonstrate that, when provisioned with high-throughput compute resources, cloud platforms can effectively amortize network and queueing delays, enabling them to match or surpass on-device performance for real-time decision-making. Specifically, we develop a formal analytical model that characterizes distributed inference latency as a function of the sensing frequency, platform throughput, network delay, and task-specific safety constraints. We instantiate this model in the context of emergency braking for autonomous driving and validate it through extensive simulations using real-time vehicular dynamics. Our empirical results identify concrete conditions under which cloud-based inference adheres to safety margins more reliably than its on-device counterpart. These findings challenge prevailing design strategies and suggest that the cloud is not merely a feasible option, but often the preferred inference location for distributed CPS architectures. In this light, the cloud is not as distant as traditionally perceived; in fact, it is closer than it appears.

URL PDF HTML ☆

赞 0 踩 0

2604.28158 2026-05-04 cs.AI

Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists

Yujun Wu, Dongxu Zhang, Xinchen Li, Jinhang Xu, Yiling Duan, Yumou Liu, Jiabao Pan, Qiyuan Zhu, Xuanhe Zhou, Jingxuan Wei, Siyuan Li, Jintao Chen, Conghui He, Cheng Tan

Comments 25 pages, 5 figures, 8 tables

2604.28075 2026-05-04 cs.CL cs.AI

Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling

Ansar Aynetdinov, Patrick Haller, Alan Akbik

2604.28031 2026-05-04 cs.CL cs.AI

Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation

Garvin Kruthof

2604.27906 2026-05-04 cs.AI cs.CL

From Unstructured Recall to Schema-Grounded Memory: Reliable AI Memory via Iterative, Schema-Aware Extraction

Alex Petrov, Alexander Gusak, Denis Mukha, Dima Korolev

Comments 33 pages, 7 figures

2604.27807 2026-05-04 cs.AI cs.DC

Autonomous Systems Dependability in the era of AI: Design Challenges in Safety, Security, Reliability and Certification

Behnaz Ranjbar, Kirankumar Raveendiran, Sudeep Pasricha, Samarjit Chakraborty, Cecilia Carbonelli, Akash Kumar

2604.27792 2026-05-04 cs.RO

MotuBrain: An Advanced World Action Model for Robot Control

MotuBrain Team, Chendong Xiang, Fan Bao, Haitian Liu, Hengkai Tan, Hongzhe Bi, James Li, Jiabao Liu, Jingrui Pang, Kiro Jing, Louis Liu, Mengchen Cai, Rongxu Cui, Ruowen Zhao, Runqing Wang, Shuhe Huang, Yao Feng, Yinze Rong, Zeyuan Wang, Jun Zhu

2604.27454 2026-05-04 cs.CL

Exploring Applications of Transfer-State Large Language Models: Cognitive Profiling and Socratic AI Tutoring

Minori Noguchi

Comments 29 pages, 5 figures, 7 tables, including appendices

2604.27345 2026-05-04 cs.CL

LLMs Capture Emotion Labels, Not Emotion Uncertainty: Distributional Analysis and Calibration of Human-LLM Judgment Gaps

Keito Inoshita, Xiaokang Zhou, Akira Kawai, Katsutoshi Yada

2604.27077 2026-05-04 cs.LG cs.AI stat.ML

Learning Rate Transfer in Normalized Transformers

Boris Shigida, Boris Hanin, Andrey Gromov

2604.26848 2026-05-04 cs.RO

STARRY: Spatial-Temporal Action-Centric World Modeling for Robotic Manipulation

Yuxuan Tian, Yurun Jin, Bin Yu, Yukun Shi, Hao Wu, Chi Harold Liu, Kai Chen, Cong Huang

Comments 19 pages

2604.26258 2026-05-04 cs.CL cs.LG

FlowBot: Inducing LLM Workflows with Bilevel Optimization and Textual Gradients

Hongyeon Yu, Young-Bum Kim, Yoon Kim

2604.26181 2026-05-04 cs.LG

SWAN: World-Aware Adaptive Multimodal Networks for Runtime Variations

Jason Wu, Shir-Kang Scott Jin, Yuyang Yuan, Maggie Wigness, Lance M. Kaplan, Hang Qiu, Mani Srivastava

2604.26173 2026-05-04 cs.LG cs.AI cs.CL

Entropy Centroids as Intrinsic Rewards for Test-Time Scaling

Wenshuo Zhao, Qi Zhu, Xingshan Zeng, Fei Mi, Lifeng Shang, Yi R., Fung

Comments Under Review, 39 pages

详情

英文摘要

An effective way to scale up test-time compute of large language models is to sample multiple responses and then select the best one, as in Grok Heavy and Gemini Deep Think. Existing selection methods often rely on external reward models, which requires training a strong reward model and introduces additional computation overhead. As an alternative, previous approaches have explored intrinsic signals, such as confidence and entropy, but these signals are noisy with naive aggregation. In this work, we observe that high-entropy tokens tend to cluster into consecutive groups during inference, providing a more stable notion of model uncertainty than individual tokens. Together, these clusters reveal temporal patterns of model uncertainty throughout the inference process. Motivated by this observation, we propose to use the temporal structure of uncertainty as an intrinsic reward. To this end, we first formalize the basic unit of segment-level uncertainty as the High Entropy Phase (HEP), a variable-length segment that begins at a high-entropy token and ends when consecutive low-entropy tokens appear. We then define the Entropy Centroid, inspired by the concept of the center of mass in physics, as the weighted average position of all HEPs along the trajectory. Intuitively, a lower centroid indicates early exploration followed by confident generation, which we find often corresponds to higher response quality. Based on this insight, we propose the Lowest Centroid method, which selects the response with the lowest entropy centroid among multiple candidates. Experiments on mathematics, code generation, logical reasoning, and agentic tasks, across model scales ranging from 14B to 480B, show that Lowest Centroid consistently outperforms existing baselines and delivers stable gains as model size increases. Code is available at https://github.com/hkust-nlp/entropy-centroid.

URL PDF HTML ☆

赞 0 踩 0

2604.25766 2026-05-04 cs.RO

Sensitivity-Based Tube NMPC for Cooperative Aerial Structures Under Parametric Uncertainty

Giuseppe Silano, Quentin Sablé, Marco Tognon, Luigi Iannelli, Antonio Franchi

Comments Accepted to the 2026 International Conference on Unmanned Aircraft Systems, ICUAS 2026

2604.25120 2026-05-04 cs.CL

SCOPE:Planning for Hybrid Querying over Clinical Trial Data

Suparno Roy Chowdhury, Manan Roy Choudhury, Tejas Anvekar, Muhammad Ali Khan, Kaneez Zahra Rubab Khakwani, Mohamad Bassam Sonbol, Irbaz Bin Riaz, Vivek Gupta

2604.23073 2026-05-04 cs.LG cs.RO

RL Token: Bootstrapping Online RL with Vision-Language-Action Models

Charles Xu, Jost Tobias Springenberg, Michael Equi, Ali Amin, Adnan Esmail, Sergey Levine, Liyiming Ke

2604.22271 2026-05-04 cs.LG

How LLMs Detect and Correct Their Own Errors: The Role of Internal Confidence Signals

Dharshan Kumaran, Viorica Patraucean, Simon Osindero, Petar Veličković, Nathaniel Daw

2604.22189 2026-05-04 cs.RO

Energy-Efficient Multi-Robot Coverage Path Planning of Non-Convex Regions of Interests

Sourav Raxit, Jose Fuentes, Paulo Padrao, Abdullah Al Redwan Newaz, Md Tamjidul Hoque, Mark Kulp, Leonardo Bobadilla

Comments Accepted in " Robotics and Automation Letters (RAL)"

2604.22082 2026-05-04 cs.LG cs.AI

Removing Sandbagging in LLMs by Training with Weak Supervision

Emil Ryd, Henning Bartsch, Julian Stastny, Joe Benton, Vivek Hebbar