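arXivDaily is a daily academic digest synced with the full arXiv feed, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.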

2604.15169 2026-04-17 cs.LG

Assessing the Potential of Masked Autoencoder Foundation Models in Predicting Downhole Metrics from Surface Drilling Data

Aleksander Berezowski, Hassan Hassanzadeh, Gouri Ginde

Abstract

Oil and gas drilling operations generate extensive time-series data from surface sensors, yet accurate real-time prediction of critical downhole metrics remains challenging due to the scarcity of labelled downhole measurements. This systematic mapping study reviews thirteen papers published between 2015 and 2025 to assess the potential of Masked Autoencoder Foundation Models (MAEFMs) for predicting downhole metrics from surface drilling data. The review identifies eight commonly collected surface metrics and seven target downhole metrics. Current approaches predominantly employ neural network architectures such as artificial neural networks (ANNs) and long short-term memory (LSTM) networks, yet no studies have explored MAEFMs despite their demonstrated effectiveness in time-series modeling. MAEFMs offer distinct advantages through self-supervised pre-training on abundant unlabeled data, enabling multi-task prediction and improved generalization across wells. This research establishes that MAEFMs represent a technically feasible but unexplored opportunity for drilling analytics, recommending future empirical validation of their performance against existing models and exploration of their broader applicability in oil and gas operations.

2604.15168 2026-04-17 cs.RO

Dual Pose-Graph Semantic Localization for Vision-Based Autonomous Drone Racing

David Perez-Saura, Miguel Fernandez-Cortizas, Alvaro J. Gaona, Pascual Campoy

2604.15167 2026-04-17 cs.LG

When Flat Minima Fail: Characterizing INT4 Quantization Collapse After FP32 Convergence

Marcus Armstrong

2604.15165 2026-04-17 cs.CL

Fabricator or dynamic translator?

Lisa Vasileva, Karin Sim

Comments: Published here: https://chomps2025.github.io/accepted_papers.html

2604.15151 2026-04-17 cs.CL

QuantCode-Bench: A Benchmark for Evaluating the Ability of Large Language Models to Generate Executable Algorithmic Trading Strategies

Alexey Khoroshilov, Alexey Chernysh, Orkhan Ekhtibarov, Nini Kamkia, Dmitry Zmitrovich

Comments: 12 pages, 8 tables

2604.15149 2026-04-17 cs.LG cs.AI

LLMs Gaming Verifiers: RLVR can Lead to Reward Hacking

Lukas Helff, Quentin Delfosse, David Steinmann, Ruben Härle, Hikaru Shindo, Patrick Schramowski, Wolfgang Stammer, Kristian Kersting, Felix Friedrich

2604.15148 2026-04-17 cs.AI cs.CL cs.IR

IG-Search: Step-Level Information Gain Rewards for Search-Augmented Reasoning

Zihan Liang, Yufei Ma, Ben Chen, Zhipeng Qian, Huangyu Dai, Lingtao Mao, Xuxin Zhang, Chenyi Lei, Wenwu Ou

Abstract

Reinforcement learning has emerged as an effective paradigm for training large language models to perform search-augmented reasoning. However, existing approaches rely on trajectory-level rewards that cannot distinguish precise search queries from vague or redundant ones within a rollout group, and collapse to a near-zero gradient signal whenever every sampled trajectory fails. In this paper, we propose IG-Search, a reinforcement learning framework that introduces a step-level reward based on Information Gain (IG). For each search step, IG measures how much the retrieved documents improve the model's confidence in the gold answer relative to a counterfactual baseline of random documents, thereby reflecting the effectiveness of the underlying search query. This signal is fed back to the corresponding search-query tokens via per-token advantage modulation in GRPO, enabling fine-grained, step-level credit assignment within a rollout. Unlike prior step-level methods that require either externally annotated intermediate supervision or shared environment states across trajectories, IG-Search derives its signals from the policy's own generation probabilities, requiring no intermediate annotations beyond standard question-answer pairs. Experiments on seven single-hop and multi-hop QA benchmarks demonstrate that IG-Search achieves an average EM of 0.430 with Qwen2.5-3B, outperforming the strongest trajectory-level baseline (MR-Search) by 1.6 points and the step-level method GiGPO by 0.9 points on average across benchmarks, with particularly pronounced gains on multi-hop reasoning tasks. Despite introducing a dense step-level signal, IG-Search adds only ~6.4% to per-step training wall-clock time over the trajectory-level baseline and leaves inference latency unchanged, while still providing a meaningful gradient signal even when every sampled trajectory answers incorrectly.
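
The step-level reward described in this abstract can be illustrated with a small sketch. The abstract does not give the exact formula, so everything here is an assumption made for illustration: IG is taken to be the difference in log-probability of the gold answer given the retrieved documents versus given random documents, and the "per-token advantage modulation" is taken to be an additive bonus on search-query tokens. The function names and the `scale` parameter are hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence


def sequence_log_prob(token_probs: Sequence[float]) -> float:
    """Log-probability of an answer, given the probability of each of its tokens."""
    return sum(math.log(p) for p in token_probs)


def information_gain(
    probs_with_retrieved: Sequence[float],
    probs_with_random: Sequence[float],
) -> float:
    """Assumed IG: gold-answer log-prob with retrieved docs minus the
    log-prob under a counterfactual baseline of random docs."""
    return sequence_log_prob(probs_with_retrieved) - sequence_log_prob(probs_with_random)


def modulate_advantages(
    base_advantage: float,
    is_query_token: Sequence[bool],
    ig: float,
    scale: float = 0.5,  # hypothetical weighting, not from the paper
) -> List[float]:
    """Assumed modulation: add scale * IG to the trajectory-level advantage,
    but only on tokens that belong to the search query."""
    return [
        base_advantage + scale * ig if is_query else base_advantage
        for is_query in is_query_token
    ]


# Toy numbers: the gold answer has two tokens.
ig = information_gain([0.8, 0.5], [0.4, 0.25])

# A trajectory-level advantage of 0.0 mimics a rollout group in which
# every sampled trajectory fails; query tokens still receive a signal.
advantages = modulate_advantages(0.0, [False, True, True, False], ig)
```

In this toy case the non-query tokens keep an advantage of zero while the query tokens receive a positive one, which mirrors the abstract's claim that a gradient signal survives even when every sampled trajectory answers incorrectly.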

2604.15145 2026-04-17 cs.AI cs.DL

An Axiomatic Benchmark for Evaluation of Scientific Novelty Metrics

Miri Liu, ChengXiang Zhai

Comments: 9 pages, 0 figures

2604.15141 2026-04-17 cs.CV

KVNN: Learnable Multi-Kernel Volterra Neural Networks

Haoyu Yun, Hamid Krim, Yufang Bao

2604.15140 2026-04-17 cs.CL

DiscoTrace: Representing and Comparing Answering Strategies of Humans and LLMs in Information-Seeking Question Answering

Neha Srikanth, Jordan Boyd-Graber, Rachel Rudinger

2604.15134 2026-04-17 cs.CV

How to Correctly Make Mistakes: A Framework for Constructing and Benchmarking Mistake Aware Egocentric Procedural Videos

Olga Loginova, Frank Keller

2604.15124 2026-04-17 cs.CL

Blinded Multi-Rater Comparative Evaluation of a Large Language Model and Clinician-Authored Responses in CGM-Informed Diabetes Counseling

Zhijun Guo, Alvina Lai, Emmanouil Korakas, Aristeidis Vagenas, Irshad Ahamed, Christo Albor, Hengrui Zhang, Justin Healy, Kezhi Li

2604.15121 2026-04-17 cs.AI

SRMU: Relevance-Gated Updates for Streaming Hyperdimensional Memories

Shay Snyder, Andrew Capodieci, David Gorsich, Maryam Parsa

2604.15115 2026-04-17 cs.LG cs.CR

FedIDM: Achieving Fast and Stable Convergence in Byzantine Federated Learning through Iterative Distribution Matching

He Yang, Dongyi Lv, Wei Xi, Song Ma, Hanlin Gu, Jizhong Zhao

2604.15096 2026-04-17 cs.CV cs.LG

Beyond Independent Frames: Latent Attention Masked Autoencoders for Multi-View Echocardiography

Simon Böhi, Irene Cannistraci, Sergio Muñoz Gonzalez, Moritz Vandenhirtz, Sonia Laguna, Samuel Ruiperez-Campillo, Max Krähenmann, Andrea Agostini, Ece Ozkan, Thomas M. Sutter, Julia E. Vogt

Comments: Accepted as a workshop paper at the ICLR 2026 Workshop on Foundation Models for Science

2604.15093 2026-04-17 cs.AI cs.CL cs.CV cs.HC

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

Kanzhi Cheng, Zehao Li, Zheng Ma, Nuo Chen, Jialin Cao, Qiushi Sun, Zichen Ding, Fangzhi Xu, Hang Yan, Jiajun Chen, Anh Tuan Luu, Jianbing Zhang, Lewei Lu, Dahua Lin

Comments: Work in progress

2604.15090 2026-04-17 cs.CV

Beyond Visual Cues: Semantic-Driven Token Filtering and Expert Routing for Anytime Person ReID

Jiaxuan Li, Xin Wen, Zhihang Li

2604.15088 2026-04-17 cs.CV

Building Extraction from Remote Sensing Imagery under Hazy and Low-light Conditions: Benchmark and Baseline

Feifei Sang, Wei Lu, Hongruixuan Chen, Sibao Chen, Bin Luo

Comments: 14 pages, 12 figures, 9 tables

2604.15078 2026-04-17 cs.AI

Where are the Humans? A Scoping Review of Fairness in Multi-agent AI Systems

Simeon Allmendinger, Luca Deck, Lucas Mueller

Comments: In proceedings of European Conference on Information Systems (ECIS) 2026

2604.15076 2026-04-17 cs.RO cs.AI cs.NE

NEAT-NC: NEAT guided Navigation Cells for Robot Path Planning

Hibatallah Meliani, Khadija Slimani, Samira Khoulji

Comments: To appear in short form in Genetic and Evolutionary Computation Conference (GECCO '26), 2026

2604.15074 2026-04-17 cs.RO cs.SY eess.SY

Trajectory Planning for a Multi-UAV Rigid-Payload Cascaded Transportation System Based on Enhanced Tube-RRT*

Jianqiao Yu, Jia Li, Tianhua Gao

Comments: 15 pages, 7 figures. Under review at IEEE Transactions on Aerospace and Electronic Systems (TAES). This work has been submitted to the IEEE for possible publication

2604.15069 2026-04-17 cs.LG

Beyond the Laplacian: Doubly Stochastic Matrices for Graph Neural Networks

Zhaobo Hu, Vincent Gauthier, Mehdi Naima

2604.15065 2026-04-17 cs.CV

Learning Where to Embed: Noise-Aware Positional Embedding for Query Retrieval in Small-Object Detection

Yangchen Zeng, Zhenyu Yu, Dongming Jiang, Wenbo Zhang, Yifan Hong, Zhanhua Hu, Jiao Luo, Kangning Cui

Comments: Accepted to ACM ICMR 2026; 14 pages, 6 figures, and 4 tables

2604.15063 2026-04-17 cs.LG cs.AI cs.CR

No More Guessing: a Verifiable Gradient Inversion Attack in Federated Learning

Francesco Diana, Chuan Xu, André Nusser, Giovanni Neglia

2604.15059 2026-04-17 cs.CV

Attention-Gated Convolutional Networks for Scanner-Agnostic Quality Assessment

Chinmay Bakhale, Anil Sao

2604.15052 2026-04-17 cs.RO

CAVERS: Multimodal SLAM Data from a Natural Karstic Cave with Ground Truth Motion Capture

Giacomo Franchini, David Rodríguez-Martínez, Alfonso Martínez-Petersen, C. J. Pérez-del-Pulgar, Marcello Chiaberge

Comments: 8 pages, 5 figures, preprint version

2604.15047 2026-04-17 cs.CV

Implicit Neural Representations: A Signal Processing Perspective

Dhananjaya Jayasundara, Vishal M. Patel

2604.15027 2026-04-17 cs.CV

Quality-Aware Calibration for AI-Generated Image Detection in the Wild

Fabrizio Guillaro, Vincenzo De Rosa, Davide Cozzolino, Luisa Verdoliva

Comments: Accepted at the APAI Workshop at CVPR 2026

2604.15023 2026-04-17 cs.RO

DockAnywhere: Data-Efficient Visuomotor Policy Learning for Mobile Manipulation via Novel Demonstration Generation

Ziyu Shan, Yuheng Zhou, Gaoyuan Wu, Ziheng Ji, Zhenyu Wu, Ziwei Wang

Comments: Accepted to RA-L

2604.15013 2026-04-17 cs.RO

DEX-Mouse: A Low-cost Portable and Universal Interface with Force Feedback for Data Collection of Dexterous Robotic Hands

Joonho Koh, Haechan Jung, Nayoung Kim, Wook Ko, Changjoo Nam