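arXivDaily: a daily academic digest synced with the full arXiv feed, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and related fields.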

2604.06620 2026-04-09 cs.LG

PD-SOVNet: A Physics-Driven Second-Order Vibration Operator Network for Estimating Wheel Polygonal Roughness from Axle-Box Vibrations

Xiancheng Wang, Lin Wang, Rui Wang, Zhibo Zhang, Minghang Zhao, Xiaoheng Zhang, Zhongyue Tan, Kaitai Mao

Abstract

Quantitative estimation of wheel polygonal roughness from axle-box vibration signals is a challenging yet practically relevant problem for rail-vehicle condition monitoring. Existing studies have largely focused on detection, identification, or severity classification, while continuous regression of multi-order roughness spectra remains less explored, especially under real operational data and unseen-wheel conditions. To address this problem, this paper presents PD-SOVNet, a physics-guided gray-box framework that combines shared second-order vibration kernels, a $4\times4$ MIMO coupling module, an adaptive physical correction branch, and a Mamba-based temporal branch for estimating the 1st--40th-order wheel roughness spectrum from axle-box vibrations. The proposed design embeds modal-response priors into the model while retaining data-driven flexibility for sample-dependent correction and residual temporal dynamics. Experiments on three real-world datasets, including operational data and real fault data, show that the proposed method provides competitive prediction accuracy and relatively stable cross-wheel performance under the current data protocol, with its most noticeable advantage observed on the more challenging Dataset III. Noise injection experiments further indicate that the Mamba temporal branch helps mitigate performance degradation under perturbed inputs. These results suggest that structured physical priors can be beneficial for stabilizing roughness regression in practical rail-vehicle monitoring scenarios, although further validation under broader operating conditions and stricter comparison protocols is still needed.
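
The abstract above does not spell out the form of its "second-order vibration kernels". As a rough illustration only, the sketch below assumes the textbook impulse response of an underdamped, unit-mass second-order system, h(t) = exp(-ζ ω_n t) sin(ω_d t) / ω_d with ω_d = ω_n sqrt(1 - ζ²), sampled into a finite filter and applied by causal convolution. The function names, parameters, and the fixed-kernel setup are assumptions for illustration, not the authors' implementation.

```python
import math


def second_order_kernel(natural_freq_hz, damping_ratio, sample_rate_hz, num_taps):
    """Sampled impulse response of an underdamped, unit-mass second-order system.

    Assumed form (not taken from the paper):
        h(t) = exp(-zeta * w_n * t) * sin(w_d * t) / w_d
    """
    if not 0.0 <= damping_ratio < 1.0:
        raise ValueError("damping_ratio must be in [0, 1) for an underdamped mode")
    omega_n = 2.0 * math.pi * natural_freq_hz                   # natural frequency, rad/s
    omega_d = omega_n * math.sqrt(1.0 - damping_ratio ** 2)     # damped frequency, rad/s
    kernel = []
    for k in range(num_taps):
        t = k / sample_rate_hz
        decay = math.exp(-damping_ratio * omega_n * t)
        kernel.append(decay * math.sin(omega_d * t) / omega_d)
    return kernel


def convolve(signal, kernel):
    """Causal FIR convolution; the output has the same length as the input."""
    out = []
    for n in range(len(signal)):
        acc = 0.0
        for k in range(min(n + 1, len(kernel))):
            acc += kernel[k] * signal[n - k]
        out.append(acc)
    return out


if __name__ == "__main__":
    # Hypothetical values: a 50 Hz mode with 5% damping, sampled at 1 kHz.
    h = second_order_kernel(50.0, 0.05, 1000.0, 64)
    impulse = [1.0] + [0.0] * 63
    print(convolve(impulse, h)[:4])
```

In a learned model of this kind, the natural frequency and damping ratio would plausibly be trainable parameters shared across channels; here they are fixed constants to keep the sketch self-contained.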

2604.06614 2026-04-09 cs.CV cs.LG

Holistic Optimal Label Selection for Robust Prompt Learning under Partial Labels

Yaqi Zhao, Haoliang Sun, Yating Wang, Yongshun Gong, Yilong Yin

2604.06610 2026-04-09 cs.LG cs.AI

TwinLoop: Simulation-in-the-Loop Digital Twins for Online Multi-Agent Reinforcement Learning

Nan Zhang, Zishuo Wang, Shuyu Huang, Georgios Diamantopoulos, Nikos Tziritas, Panagiotis Oikonomou, Georgios Theodoropoulos

Comments 6 pages, 6 figures

2604.06603 2026-04-09 cs.CL cs.AI

Scientific Knowledge-driven Decoding Constraints Improving the Reliability of LLMs

Maotian Ma, Zheni Zeng, Zhenghao Liu, Yukun Yan

2604.06598 2026-04-09 cs.RO cs.SY eess.SY

Train-Small Deploy-Large: Leveraging Diffusion-Based Multi-Robot Planning

Siddharth Singh, Soumee Guha, Qing Chang, Scott Acton

2604.06589 2026-04-09 cs.RO

BiDexGrasp: Coordinated Bimanual Dexterous Grasps across Object Geometries and Sizes

Mu Lin, Yi-Lin Wei, Jiaxuan Chen, Yuhao Lin, Shuoyu Chen, Jiangran Lyu, Jiayi Chen, Yansong Tang, He Wang, Wei-Shi Zheng

Comments Project Page: https://frenkielm.github.io/BiDexGrasp.github.io/

2604.06583 2026-04-09 cs.CV

VAMAE: Vessel-Aware Masked Autoencoders for OCT Angiography

Ilerioluwakiiye Abolade, Prince Mireku, Kelechi Chibundu, Peace Ododo, Emmanuel Idoko, Promise Omoigui, Solomon Odelola

Comments 8 pages, 5 figures. Accepted at ICPR 2026

2604.06576 2026-04-09 cs.CV eess.IV

LiftFormer: Lifting and Frame Theory Based Monocular Depth Estimation Using Depth and Edge Oriented Subspace Representation

Shuai Li, Huibin Bai, Yanbo Gao, Chong Lv, Hui Yuan, Chuankun Li, Wei Hua, Tian Xie

Comments Accepted by IEEE Transactions on Multimedia

2604.06573 2026-04-09 cs.CL

Scoring Edit Impact in Grammatical Error Correction via Embedded Association Graphs

Qiyuan Xiao, Xiaoman Wang, Yunshi Lan

2604.06571 2026-04-09 cs.CL cs.AI cs.IR cs.LG

LLM-based Schema-Guided Extraction and Validation of Missing-Person Intelligence from Heterogeneous Data Sources

Joshua Castillo, Ravi Mukkamala

Comments 9 pages, 6 figures. Accepted at International Conference on Intelligent Digitization of Systems and Services (IDSS 2026)

Abstract

Missing-person and child-safety investigations rely on heterogeneous case documents, including structured forms, bulletin-style posters, and narrative web profiles. Variations in layout, terminology, and data quality impede rapid triage, large-scale analysis, and search-planning workflows. This paper introduces the Guardian Parser Pack, an AI-driven parsing and normalization pipeline that transforms multi-source investigative documents into a unified, schema-compliant representation suitable for operational review and downstream spatial modeling. The proposed system integrates (i) multi-engine PDF text extraction with Optical Character Recognition (OCR) fallback, (ii) rule-based source identification with source-specific parsers, (iii) schema-first harmonization and validation, and (iv) an optional Large Language Model (LLM)-assisted extraction pathway incorporating validator-guided repair and shared geocoding services. We present the system architecture, key implementation decisions, and output design, and evaluate performance using both gold-aligned extraction metrics and corpus-level operational indicators. On a manually aligned subset of 75 cases, the LLM-assisted pathway achieved substantially higher extraction quality than the deterministic comparator (F1 = 0.8664 vs. 0.2578), while across 517 parsed records per pathway it also improved aggregate key-field completeness (96.97% vs. 93.23%). The deterministic pathway remained much faster (mean runtime 0.03 s/record vs. 3.95 s/record for the LLM pathway). In the evaluated run, all LLM outputs passed initial schema validation, so validator-guided repair functioned as a built-in safeguard rather than a contributor to the observed gains. These results support controlled use of probabilistic AI within a schema-first, auditable pipeline for high-stakes investigative settings.
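
The "schema-first validation" and "validator-guided repair" steps named in the abstract above can be pictured with a minimal sketch. Everything in it is an assumption made for illustration: the three-field schema, the `validate` and `extract_with_repair` helpers, and the idea that the extractor is a callable receiving the previous validation errors as feedback. It is not the Guardian Parser Pack's actual interface.

```python
# Hypothetical schema: field name -> expected Python type.
SCHEMA = {"name": str, "age": int, "last_seen_city": str}


def validate(record):
    """Return a list of human-readable schema violations (empty if valid)."""
    errors = []
    for field, expected in SCHEMA.items():
        if field not in record:
            errors.append(f"missing field: {field}")
        elif not isinstance(record[field], expected):
            errors.append(f"wrong type for {field}: expected {expected.__name__}")
    return errors


def extract_with_repair(text, extractor, max_attempts=3):
    """Call `extractor(text, errors)` until its output passes validation.

    `extractor` stands in for any extraction pathway (for example an LLM call);
    the errors from the previous attempt are passed back so it can repair them.
    Returns the valid record and the number of attempts used.
    """
    errors = []
    for attempt in range(1, max_attempts + 1):
        record = extractor(text, errors)
        errors = validate(record)
        if not errors:
            return record, attempt
    raise ValueError(f"record still invalid after {max_attempts} attempts: {errors}")
```

When the first extraction already satisfies the schema, the loop returns after one attempt and the repair path is never exercised, which is consistent with the abstract's remark that repair acted as a safeguard in the evaluated run.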

2604.06562 2026-04-09 cs.AI

On Emotion-Sensitive Decision Making of Small Language Model Agents

Jiaju Lin, Xingjian Du, Qingyun Wu, Ellen Wenting Zou, Jindong Wang

2604.06552 2026-04-09 cs.CL

To Lie or Not to Lie? Investigating The Biased Spread of Global Lies by LLMs

Zohaib Khan, Mustafa Dogan, Ifeoma Okoh, Pouya Sadeghi, Siddhartha Shrestha, Sergius Justus Nyah, Mahmoud O. Mokhiamar, Michael J. Ryan, Tarek Naous

Comments Accepted at ACL 2026 Main Conference

2604.06551 2026-04-09 cs.CL

CCD-CBT: Multi-Agent Therapeutic Interaction for CBT Guided by Cognitive Conceptualization Diagram

Chang Liu, Changsheng Ma, Yongfeng Tao, Bin Hu, Minqiang Yang

2604.06543 2026-04-09 cs.CL cs.LG

The Illusion of Stochasticity in LLMs

Xiangming Gu, Soham De, Michalis Titsias, Larisa Markeeva, Petar Veličković, Razvan Pascanu

Comments Under review

2604.06542 2026-04-09 cs.CL

Does a Global Perspective Help Prune Sparse MoEs Elegantly?

Zeliang Zhang, Nikhil Ghosh, Jiani Liu, Bin Yu, Xiaodong Liu

2604.06537 2026-04-09 cs.LG

Time-Series Classification with Multivariate Statistical Dependence Features

Yao Sun, Bo Hu, Jose Principe

2604.06515 2026-04-09 cs.LG cs.AI

Efficient Quantization of Mixture-of-Experts with Theoretical Generalization Guarantees

Mohammed Nowaz Rabbani Chowdhury, Kaoutar El Maghraoui, Hsinyu Tsai, Naigang Wang, Geoffrey W. Burr, Liu Liu, Meng Wang

2604.06507 2026-04-09 cs.CL

Fine-tuning Whisper for Pashto ASR: strategies and scale

Hanif Rahman

2604.06505 2026-04-09 cs.CL cs.AI

MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts

Weiyue Li, Ruizhi Qian, Yi Li, Yongce Li, Yunfan Long, Jiahui Cai, Yan Luo, Mengyu Wang

2604.06502 2026-04-09 cs.LG

VLMShield: Efficient and Robust Defense of Vision-Language Models against Malicious Prompts

Peigui Qi, Kunsheng Tang, Yanpu Yu, Jialin Wu, Yide Song, Wenbo Zhou, Zhicong Huang, Cheng Hong, Weiming Zhang, Nenghai Yu

2604.06501 2026-04-09 cs.LG cs.CL

Transformer See, Transformer Do: Copying as an Intermediate Step in Learning Analogical Reasoning

Philipp Hellwig, Willem Zuidema, Claire E. Stevenson, Martha Lewis

2604.06495 2026-04-09 cs.LG cs.AI

Improving Robustness In Sparse Autoencoders via Masked Regularization

Vivek Narayanaswamy, Kowshik Thopalli, Bhavya Kailkhura, Wesam Sakla

Comments 4 pages, 1 figure

2604.06494 2026-04-09 cs.CV cs.GR

DesigNet: Learning to Draw Vector Graphics as Designers Do

Tomas Guija-Valiente, Iago Suárez

2604.06492 2026-04-09 cs.LG cs.CR stat.ML

Optimal Rates for Pure $\varepsilon$-Differentially Private Stochastic Convex Optimization with Heavy Tails

Andrew Lowy

2604.06491 2026-04-09 cs.LG cs.AI cs.CE

Discrete Flow Matching Policy Optimization

Maojiang Su, Po-Chung Hsieh, Weimin Wu, Mingcheng Lu, Jiunhau Chen, Jerry Yao-Chieh Hu, Han Liu

2604.06487 2026-04-09 cs.CL

Closing the Speech-Text Gap with Limited Audio for Effective Domain Adaptation in LLM-Based ASR

Thibault Bañeras-Roux, Sergio Burdisso, Esaú Villatoro-Tello, Dairazalia Sánchez-Cortés, Shiran Liu, Severin Baroudi, Shashi Kumar, Hasindri Watawana, Manjunath K E, Kadri Hacioglu, Petr Motlicek, Andreas Stolcke

Comments Submitted to Interspeech

2604.06483 2026-04-09 cs.LG cs.AI

Distributed Interpretability and Control for Large Language Models

Dev Arpan Desai, Shaoyi Huang, Zining Zhu

2604.06481 2026-04-09 cs.CV cs.AI cs.CR

Hybrid ResNet-1D-BiGRU with Multi-Head Attention for Cyberattack Detection in Industrial IoT Environments

Afrah Gueriani, Hamza Kheddar, Ahmed Cherif Mazari

2604.06475 2026-04-09 cs.LG cs.NA math.NA

AE-ViT: Stable Long-Horizon Parametric Partial Differential Equations Modeling

Iva Mikuš, Boris Muha, Domagoj Vlah

Comments 16 pages, 7 figures

2604.06474 2026-04-09 cs.CL

DataSTORM: Deep Research on Large-Scale Databases using Exploratory Data Analysis and Data Storytelling

Shicheng Liu, Yucheng Jiang, Sajid Farook, Camila Nicollier Sanchez, David Fernando Castro Pena, Monica S. Lam