arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2411.08027 2026-04-27 cs.LG cs.AI cs.CV cs.RO

LLMPhy: Parameter-Identifiable Physical Reasoning Combining Large Language Models and Physics Engines

Anoop Cherian, Radu Corcodel, Siddarth Jain, Diego Romeres

Comments Accepted at AISTATS 2026

详情

英文摘要

Most learning-based approaches to complex physical reasoning sidestep the crucial problem of parameter identification (e.g., mass, friction) that governs scene dynamics, despite its importance in real-world applications such as collision avoidance and robotic manipulation. In this paper, we present LLMPhy, a black-box optimization framework that integrates large language models (LLMs) with physics simulators for physical reasoning. The core insight of LLMPhy is to bridge the textbook physical knowledge embedded in LLMs with the world models implemented in modern physics engines, enabling the construction of digital twins of input scenes via latent parameter estimation. Specifically, LLMPhy decomposes digital twin construction into two subproblems: (i) a continuous problem of estimating physical parameters and (ii) a discrete problem of estimating scene layout. For each subproblem, LLMPhy iteratively prompts the LLM to generate computer programs encoding parameter estimates, executes them in the physics engine to reconstruct the scene, and uses the resulting reconstruction error as feedback to refine the LLM's predictions. As existing physical reasoning benchmarks rarely account for parameter identifiability, we introduce three new datasets designed to evaluate physical reasoning in zero-shot settings. Our results show that LLMPhy achieves state-of-the-art performance on our tasks, recovers physical parameters more accurately, and converges more reliably than prior black-box methods. See the LLMPhy project page for details: https://www.merl.com/research/highlights/LLMPhy

URL PDF HTML ☆

赞 0 踩 0

2411.07378 2026-04-27 cs.AI

Data-Driven Analysis of AI in Medical Device Software in China: Trends of Deep Learning and Traditional AI Based on Regulatory Data

Yu Han, Aaron Ceross, Sarim Ather, Jeroen H. M. Bergmann

2411.04680 2026-04-27 cs.LG cs.CR

Privacy Leakage via Output Label Space and Differentially Private Continual Learning

Marlon Tobaben, Talal Alrawajfeh, Marcus Klasson, Mikko Heikkilä, Arno Solin, Antti Honkela

Comments 52 pages, 16 figures

2411.03715 2026-04-27 cs.SD eess.AS

MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models

Wen-Chin Huang, Erica Cooper, Tomoki Toda

Comments Accepted to Transactions on Audio, Speech and Language Processing

2410.21548 2026-04-27 cs.CL cs.IT cs.LG math.IT

MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression

Noel Elias, Homa Esfahanizadeh, Kaan Kale, Sriram Vishwanath, Muriel Medard

2410.13713 2026-04-27 cs.LG

CrystalX: High-accuracy Crystal Structure Analysis Using Deep Learning

Kaipeng Zheng, Weiran Huang, Wanli Ouyang, Han-Sen Zhong, Yuqiang Li

2408.16286 2026-04-27 cs.LG math.OC

Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form

Toshinori Kitamura, Tadashi Kozuno, Wataru Kumagai, Kenta Hoshino, Yohei Hosoe, Kazumi Kasaura, Masashi Hamaya, Paavo Parmas, Yutaka Matsuo

Comments This manuscript contains a technical error; the main result does not hold (see also arXiv:2604.21177 for a formal invalidation)

2405.10138 2026-04-27 cs.CL

PL-MTEB: Polish Massive Text Embedding Benchmark

Rafał Poświata, Sławomir Dadas, Michał Perełkiewicz

Comments Accepted for ACL 2026 Findings

2405.00577 2026-04-27 cs.LG eess.SP q-bio.NC

Discovering robust biomarkers of psychiatric disorders from resting-state functional MRI via graph neural networks: A systematic review

Yi Hao Chan, Deepank Girish, Sukrit Gupta, Jing Xia, Chockalingam Kasi, Yinan He, Conghao Wang, Jagath C. Rajapakse

详情

DOI: 10.1016/j.neuroimage.2025.121422

英文摘要

Graph neural networks (GNN) have emerged as a popular tool for modelling functional magnetic resonance imaging (fMRI) datasets. Many recent studies have reported significant improvements in disorder classification performance via more sophisticated GNN designs and highlighted salient features that could be potential biomarkers of the disorder. However, existing methods of evaluating their robustness are often limited to cross-referencing with existing literature, which is a subjective and inconsistent process. In this review, we provide an overview of how GNN and model explainability techniques (specifically, feature attributors) have been applied to fMRI datasets for disorder prediction tasks, with an emphasis on evaluating the robustness of potential biomarkers produced for psychiatric disorders. Then, 65 studies using GNNs that reported potential fMRI biomarkers for psychiatric disorders (attention-deficit hyperactivity disorder, autism spectrum disorder, major depressive disorder, schizophrenia) published before 9 October 2024 were identified from 2 online databases (Scopus, PubMed). We found that while most studies have performant models, salient features highlighted in these studies (as determined by feature attribution scores) vary greatly across studies on the same disorder. Reproducibility of biomarkers is only limited to a small subset at the level of regions and few transdiagnostic biomarkers were identified. To address these issues, we suggest establishing new standards that are based on objective evaluation metrics to determine the robustness of these potential biomarkers. We further highlight gaps in the existing literature and put together a prediction-attribution-evaluation framework that could set the foundations for future research on discovering robust biomarkers of psychiatric disorders via GNNs.

URL PDF HTML ☆

赞 0 踩 0

2305.00931 2026-04-27 cs.AI cs.HC cs.LG

Explanation through Reward Model Reconciliation using POMDP Tree Search

Benjamin D. Kraske, Anshu Saksena, Anna L. Buczak, Zachary N. Sunberg

2211.16327 2026-04-27 cs.AI cs.LG

On the Power of Foundation Models

Yang Yuan

Comments ICML'23. This version polished paper with the help of LLM, fixed a few notational issues

2210.05513 2026-04-27 cs.CV

ViFiCon: Vision and Wireless Association Via Self-Supervised Contrastive Learning

Nicholas Meegan, Hansi Liu, Bryan Bo Cao, Abrar Alali, Kristin Dana, Marco Gruteser, Shubham Jain, Ashwin Ashok

Comments 8 pages, 6 figures, 6 tables

2112.02604 2026-04-27 cs.CV cs.AI

PSI: A Benchmark for Human Interpretation and Response in Traffic Interactions

Taotao Jing, Tina Chen, Renran Tian, Yaobin Chen, Joshua Domeyer, Heishiro Toyoda, Rini Sherony, Zhengming Ding

Comments Published in NeurIPS 2025 datasets and benchmarks track

2604.22746 2026-04-27 math.OC cs.LG

Relaxation-Informed Training of Neural Network Surrogate Models

Calvin Tsay

Comments 35 pages, 5 figures

2604.22736 2026-04-27 cs.LO cs.AI

An Undecidability Proof for the Plan Existence Problem

Antonis Achilleos

2604.22695 2026-04-27 eess.SP cs.LG

Time-Localized Parametric Decomposition of Respiratory Airflow for Sub-Breath Analysis

Victoria Ribeiro Rodrigues, Paul W. Davenport, Nicholas J. Napoli

Comments Submitted to IEEE Journal of Biomedical and Health Informatics (under review). 18 pages, 7 figures, 5 tables

2604.22679 2026-04-27 cs.CY cs.AI

How Supply Chain Dependencies Complicate Bias Measurement and Accountability Attribution in AI Hiring Applications

Gauri Sharma, Maryam Molamohammadi

详情

英文摘要

The increasing adoption of AI systems in hiring has raised concerns about algorithmic bias and accountability, prompting regulatory responses including the EU AI Act, NYC Local Law 144, and Colorado's AI Act. While existing research examines bias through technical or regulatory lenses, both perspectives overlook a fundamental challenge: modern AI hiring systems operate within complex supply chains where responsibility fragments across data vendors, model developers, platform providers, and deploying organizations. This paper investigates how these dependency chains complicate bias evaluation and accountability attribution. Drawing on literature review and regulatory analysis, we demonstrate that fragmented responsibilities create two critical problems. First, bias emerges from component interactions rather than isolated elements, yet proprietary configurations prevent integrated evaluation. A resume parser may function without bias independently but contribute to discrimination when integrated with specific ranking algorithms and filtering thresholds. Second, information asymmetries mean deploying organizations bear legal responsibility without technical visibility into vendor-supplied algorithms, while vendors control implementations without meaningful disclosure requirements. Each stakeholder may believe they are compliant; nevertheless, the integrated system may produce biased outcomes. Analysis of implementation ambiguities reveals these challenges in practice. We propose multi-layered interventions including system-level audits, vendor guidelines, continuous monitoring mechanisms, and documentation across dependency chains. Our findings reveal that effective governance requires coordinated action across technical, organizational, and regulatory domains to establish meaningful accountability in distributed development environments.

URL PDF HTML ☆

赞 0 踩 0

2604.22661 2026-04-27 cs.IR cs.CL

Can QPP Choose the Right Query Variant? Evaluating Query Variant Selection for RAG Pipelines

Negar Arabzadeh, Andrew Drozdov, Michael Bendersky, Matei Zaharia

2604.18820 2026-04-27 stat.ML cs.LG eess.SP math.OC stat.AP

Sparse Network Inference under Imperfect Detection and its Application to Ecological Networks

Aoran Zhang, Tianyao Wei, Maria J. Guerrero, César A. Uribe

Comments 13 pages, 4 figures

2604.18655 2026-04-27 cs.DC cs.AI cs.CL

Unlocking the Edge deployment and ondevice acceleration of multi-LoRA enabled one-for-all foundational LLM

Sravanth Kodavanti, Sowmya Vajrala, Srinivas Miriyala, Utsav Tiwari, Uttam Kumar, Utkarsh Kumar Mahawar, Achal Pratap Singh, Arya D, Narendra Mutyala, Vikram Nelvoy Rajendiran, Sharan Kumar Allur, Euntaik Lee, Dohyoung Kim, HyeonSu Lee, Gyusung Cho, JungBae Kim

Comments Accepted at ACL 2026

2603.18941 2026-04-27 stat.ML cs.LG

Unified Taxonomy for Multivariate Time Series Anomaly Detection using Deep Learning

Bruna Alves, Armando J. Pinho, Sónia Gouveia

2601.17060 2026-04-27 cs.CY cs.AI

Initial results of the Digital Consciousness Model

Derek Shiller, Laura Duffy, Arvo Muñoz Morán, Adrià Moret, Chris Percy, Hayley Clatterbuck

Comments v1.1 Revised section 4.2 details and acknowledgments

2512.09003 2026-04-27 q-bio.QM cs.AI

Digital Modeling of Spatial Pathway Activity from Histology Reveals Tumor Microenvironment Heterogeneity

Ling Liao, Changhuei Yang, Maxim Artyomov, Mark Watson, Adam Kepecs, Haowen Zhou, Alexey Sergushichev, Richard Cote

Comments The paper was withdrawn because the original submission was an early draft manuscript and not the final version for publication

2510.19020 2026-04-27 stat.ML cs.LG

Calibrated Principal Component Regression

Yixuan Florence Wu, Yilun Zhu, Lei Cao, Naichen Shi

2506.09520 2026-04-27 q-bio.NC cs.AI cs.RO

How attention simplifies mental representations for planning

Jason da Silva Castanheira, Nicholas Shea, Stephen M. Fleming

2604.22649 2026-04-27 cs.NE cs.CV

Structure-Guided Diffusion Model for EEG-Based Visual Cognition Reconstruction

Yongxiang Lian, Yueyang Cang, Pingge Hu, Yuchen He, Li Shi

详情

英文摘要

Objective: Decoding visual information from electroencephalography (EEG) is an important problem in neuroscience and brain-computer interface (BCI) research. Existing methods are largely restricted to natural images and categorical representations, with limited capacity to capture structural features and to differentiate objective perception from subjective cognition. We propose a Structure-Guided Diffusion Model (SGDM) that incorporates explicit structural information for EEG-based visual reconstruction. Approach: SGDM is evaluated on the Kilogram abstract visual object dataset and the THINGS natural image dataset using a two-stage generative mechanism. The framework combines a structurally supervised variational autoencoder with a spatiotemporal EEG encoder aligned to a visual embedding space via contrastive learning. Structural information is integrated into a diffusion model through ControlNet to guide image generation from EEG features. Results: SGDM outperforms existing methods on both abstract and natural image datasets. Reconstructed images achieve higher fidelity in low-level visual features and semantic representations, indicating improved decoding accuracy and strong generalization across diverse visual domains. Spatiotemporal analysis of EEG signals further reveals hierarchical structural encoding patterns, consistent with the neural dynamics of visual cognition. Significance: These findings validate the effectiveness of SGDM in capturing explicit structural geometry and generating images with high fidelity to individual cognitive representations. By enabling decoding of complex visual content from EEG signals, the framework extends neural decoding beyond low-dimensional or categorical outputs. This supports BCIs with increased degrees of freedom for intention decoding and more flexible brain-to-machine communication.

URL PDF HTML ☆

赞 0 踩 0

2604.22640 2026-04-27 cs.SE cs.LG

Quality-Driven Selective Mutation for Deep Learning

Zaheed Ahmed, Emmanuel Charleson Dapaah, Philip Makedonski, Jens Grabowski

2604.22639 2026-04-27 cs.CR cs.LG

Adversarial Malware Generation in Linux ELF Binaries via Semantic-Preserving Transformations

Lukáš Hrdonka, Martin Jureček

2604.22636 2026-04-27 stat.ML cs.LG stat.AP

CLVAE: A Variational Autoencoder for Long-Term Customer Revenue Forecasting

Jeffrey Näf, Riana Valera Mbelson, Markus Meierer

2604.22633 2026-04-27 stat.ML cs.LG

Mixed Membership sub-Gaussian Models

Huan Qing

Comments 30 pages, 6 figures, 2 tables