arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

Siyuan Yan, Xieji Li, Dan Mo, Philipp Tschandl, Yiwen Jiang, Zhonghua Wang, Ming Hu, Lie Ju, Cristina Vico-Alonso, Yizhen Zheng, Jiahe Liu, Juexiao Zhou, Camilla Chello, Jen G. Cheung, Julien Anriot, Luc Thomas, Clare Primiero, Gin Tan, Aik Beng Ng, Simon See, Xiaoying Tang, Albert Ip, Xiaoyang Liao, Adrian Bowling, Martin Haskett, Shuang Zhao, Monika Janda, H. Peter Soyer, Victoria Mar, Harald Kittler, Zongyuan Ge

Comments reports

2602.10622 2026-02-12 cs.CL

How Do Decoder-Only LLMs Perceive Users? Rethinking Attention Masking for User Representation Learning

Jiahao Yuan, Yike Xu, Jinyong Wen, Baokun Wang, Yang Chen, Xiaotong Lin, Wuliang Huang, Ziyi Gao, Xing Fu, Yu Cheng, Weiqiang Wang

Comments 13 pages, 4 figures

2602.10614 2026-02-12 cs.LG

Pupillometry and Brain Dynamics for Cognitive Load in Working Memory

Nusaibah Farrukh, Malavika Pradeep, Akshay Sasi, Rahul Venugopal, Elizabeth Sherly

Comments 6 Pages, 3 Figures, 5 Tables, Code Available at: https://github.com/NusaibahFarrukh/PupillometryBrainDynamics

2602.10611 2026-02-12 cs.LG physics.comp-ph stat.ML

On the Role of Consistency Between Physics and Data in Physics-Informed Neural Networks

Nicolás Becerra-Zuniga, Lucas Lacasa, Eusebio Valero, Gonzalo Rubio

Comments 24 pages, 7 Figures, 3 Tables

详情

英文摘要

Physics-informed neural networks (PINNs) have gained significant attention as a surrogate modeling strategy for partial differential equations (PDEs), particularly in regimes where labeled data are scarce and physical constraints can be leveraged to regularize the learning process. In practice, however, PINNs are frequently trained using experimental or numerical data that are not fully consistent with the governing equations due to measurement noise, discretization errors, or modeling assumptions. The implications of such data-to-PDE inconsistencies on the accuracy and convergence of PINNs remain insufficiently understood. In this work, we systematically analyze how data inconsistency fundamentally limits the attainable accuracy of PINNs. We introduce the concept of a consistency barrier, defined as an intrinsic lower bound on the error that arises from mismatches between the fidelity of the data and the exact enforcement of the PDE residual. To isolate and quantify this effect, we consider the 1D viscous Burgers equation with a manufactured analytical solution, which enables full control over data fidelity and residual errors. PINNs are trained using datasets of progressively increasing numerical accuracy, as well as perfectly consistent analytical data. Results show that while the inclusion of the PDE residual allows PINNs to partially mitigate low-fidelity data and recover the dominant physical structure, the training process ultimately saturates at an error level dictated by the data inconsistency. When high-fidelity numerical data are employed, PINN solutions become indistinguishable from those trained on analytical data, indicating that the consistency barrier is effectively removed. These findings clarify the interplay between data quality and physics enforcement in PINNs providing practical guidance for the construction and interpretation of physics-informed surrogate models.

URL PDF HTML ☆

赞 0 踩 0

2602.10610 2026-02-12 cs.RO

Pitch Angle Control of a Magnetically Actuated Capsule Robot with Nonlinear FEA-based MPC and EKF Multisensory Fusion

Chongxun Wang, Zikang Shen, Apoorav Rathore, Akanimoh Udombeh, Harrison Teng, Fangzhou Xia

Comments This version is submitted for review at IEEE/ASME Transactions on Mechatronics

2602.10607 2026-02-12 cs.LG cs.AI

Hierarchical Zero-Order Optimization for Deep Neural Networks

Sansheng Cao, Zhengyu Ma, Yonghong Tian

Comments Corresponding author: Zhengyu Ma (mazhy@pcl.ac.cn)

2602.10602 2026-02-12 cs.LG

Learning Mixture Density via Natural Gradient Expectation Maximization

Yutao Chen, Jasmine Bayrooti, Steven Morad

2602.10598 2026-02-12 cs.AI cs.LG

Neuro-symbolic Action Masking for Deep Reinforcement Learning

Shuai Han, Mehdi Dastani, Shihan Wang

2602.10595 2026-02-12 cs.LG

Roughness-Informed Federated Learning

Mohammad Partohaghighi, Roummel Marcia, Bruce J. West, YangQuan Chen

Comments This manuscript is under review in IEEE TPAMI journal

2602.10593 2026-02-12 cs.CV

Fast Person Detection Using YOLOX With AI Accelerator For Train Station Safety

Mas Nurul Achmadiah, Novendra Setyawan, Achmad Arif Bryantono, Chi-Chia Sun, Wen-Kai Kuo

Comments 6 pages, 8 figures, 2 tables. Presented at 2024 International Electronics Symposium (IES). IEEE DOI: 10.1109/IES63037.2024.10665874

Journal ref 2024 International Electronics Symposium (IES), pp. 504-509, 2024

2602.10588 2026-02-12 cs.LG stat.ML

TRACE: Theoretical Risk Attribution under Covariate-shift Effects

Hosein Anjidani, S. Yahya S. R. Tehrani, Mohammad Mahdi Mojahedian, Mohammad Hossein Yassaee

2602.10586 2026-02-12 cs.CV eess.IV

Enhancing Underwater Images via Adaptive Semantic-aware Codebook Learning

Bosen Lin, Feng Gao, Yanwei Yu, Junyu Dong, Qian Du

Comments Accepted for publication in IEEE TGRS 2026

2602.10585 2026-02-12 cs.LG cs.AI

Neural Additive Experts: Context-Gated Experts for Controllable Model Additivity

Guangzhi Xiong, Sanchit Sinha, Aidong Zhang

Comments AISTATS 2026

2602.10584 2026-02-12 cs.LG

When Gradient Clipping Becomes a Control Mechanism for Differential Privacy in Deep Learning

Mohammad Partohaghighi, Roummel Marcia, Bruce J. West, YangQuan Chen

Comments This manuscript is under review in the Engineering Applications of Artificial Intelligence journal

2602.10583 2026-02-12 cs.AI

Flow of Spans: Generalizing Language Models to Dynamic Span-Vocabulary via GFlowNets

Bo Xue, Yunchong Song, Fanghao Shao, Xuekai Zhu, Lin Chen, Luoyi Fu, Xinbing Wang, Zhouhan Lin

Comments Published as a conference paper at ICLR 2026

2602.10576 2026-02-12 cs.LG cs.AI

LLM-Based Scientific Equation Discovery via Physics-Informed Token-Regularized Policy Optimization

Boxiao Wang, Kai Li, Tianyi Liu, Chen Li, Junzhe Wang, Yifan Zhang, Jian Cheng

2602.10575 2026-02-12 cs.CV cs.AI cs.CY

MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning

Chenhao Zhang, Yazhe Niu, Hongsheng Li

Comments 14 pages, 4 figures, 11 tables; Code: https://github.com/MING-ZCH/MetaphorStar, Model & Dataset: https://huggingface.co/collections/MING-ZCH/metaphorstar

2602.10568 2026-02-12 cs.LG

Gauss-Newton Unlearning for the LLM Era

Lev McKinney, Anvith Thudi, Juhan Bae, Tara Rezaei, Nicolas Papernot, Sheila A. McIlraith, Roger Grosse

Comments 18 pages

2602.10565 2026-02-12 cs.LG math.OC

Online Min-Max Optimization: From Individual Regrets to Cumulative Saddle Points

Abhijeet Vyas, Brian Bullins

2602.10561 2026-02-12 cs.RO

Morphogenetic Assembly and Adaptive Control for Heterogeneous Modular Robots

Chongxi Meng, Da Zhao, Yifei Zhao, Minghao Zeng, Yanmin Zhou, Zhipeng Wang, Bin He

Comments Accepted by ICRA 2026

2602.10560 2026-02-12 cs.CL cs.AI

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Leheng Sheng, Yongtao Zhang, Wenchang Ma, Yaorui Shi, Ting Huang, Xiang Wang, An Zhang, Ke Shen, Tat-Seng Chua

Comments 26 pages

2602.10553 2026-02-12 cs.LG cs.AI

Contrastive Learning for Multi Label ECG Classification with Jaccard Score Based Sigmoid Loss

Junichiro Takahashi, Masataka Sato, Satoshi Kodeta, Norihiko Takeda

2602.10549 2026-02-12 cs.CV cs.AI

Enhancing Weakly Supervised Multimodal Video Anomaly Detection through Text Guidance

Shengyang Sun, Jiashen Hua, Junyi Feng, Xiaojin Gong

Comments Accepted by IEEE Transactions on Multimedia

2602.10547 2026-02-12 cs.RO

ReSPEC: A Framework for Online Multispectral Sensor Reconfiguration in Dynamic Environments

Yanchen Liu, Yuang Fan, Minghui Zhao, Xiaofan Jiang

Comments 8 pages, 4 figures. This work has been submitted to the IEEE for possible publication

2602.10546 2026-02-12 cs.CV cs.AI

RealHD: A High-Quality Dataset for Robust Detection of State-of-the-Art AI-Generated Images

Hanzhe Yu, Yun Ye, Jintao Rong, Qi Xuan, Chen Ma

Comments Published in the Proceedings of the 33rd ACM International Conference on Multimedia (ACM MM 2025)

Journal ref Proceedings of the 33rd ACM International Conference on Multimedia (ACM MM 2025), 2025, pp. 11394--11403

详情

DOI: 10.1145/3746027.3755207

英文摘要

The rapid advancement of generative AI has raised concerns about the authenticity of digital images, as highly realistic fake images can now be generated at low cost, potentially increasing societal risks. In response, several datasets have been established to train detection models aimed at distinguishing AI-generated images from real ones. However, existing datasets suffer from limited generalization, low image quality, overly simple prompts, and insufficient image diversity. To address these limitations, we propose a high-quality, large-scale dataset comprising over 730,000 images across multiple categories, including both real and AI-generated images. The generated images are synthesized via state-of-the-art methods, including text-to-image generation (guided by over 10,000 carefully designed prompts), image inpainting, image refinement, and face swapping. Each generated image is annotated with its generation method and category. Inpainting images further include binary masks to indicate inpainted regions, providing rich metadata for analysis. Compared to existing datasets, detection models trained on our dataset demonstrate superior generalization capabilities. Our dataset not only serves as a strong benchmark for evaluating detection methods but also contributes to advancing the robustness of AI-generated image detection techniques. Building upon this, we propose a lightweight detection method based on image noise entropy, which transforms the original image into an entropy tensor of Non-Local Means (NLM) noise before classification. Extensive experiments demonstrate that models trained on our dataset achieve strong generalization, and our method delivers competitive performance, establishing a solid baseline for future research. The dataset and source code are publicly available at https://real-hd.github.io.

URL PDF HTML ☆

赞 0 踩 0

2602.10545 2026-02-12 cs.LG cs.AI stat.ML

$μ$pscaling small models: Principled warm starts and hyperparameter transfer

Yuxin Ma, Nan Chen, Mateo Díaz, Soufiane Hayou, Dmitriy Kunisky, Soledad Villar

Comments 61 pages, 6 figures

2602.10544 2026-02-12 cs.LG cs.NA math.NA

Bridging the Compression-Precision Paradox: A Hybrid Architecture for Clinical EEG Report Generation with Guaranteed Measurement Accuracy

Wuyang Zhang, Zhen Luo, Chuqiao Gu, Jianming Ma, Yebo Cao, Wangming Yuan, Yinzhi Jin

Comments 7 pages

2602.10539 2026-02-12 cs.LG

What Makes Value Learning Efficient in Residual Reinforcement Learning?

Guozheng Ma, Lu Li, Haoyu Wang, Zixuan Liu, Pierre-Luc Bacon, Dacheng Tao

2602.10528 2026-02-12 cs.LG cs.AI

A Swap-Adversarial Framework for Improving Domain Generalization in Electroencephalography-Based Parkinson's Disease Prediction

Seongwon Jin, Hanseul Choi, Sunggu Yang, Sungho Park, Jibum Kim

详情

英文摘要

Electroencephalography (ECoG) offers a promising alternative to conventional electrocorticography (EEG) for the early prediction of Parkinson's disease (PD), providing higher spatial resolution and a broader frequency range. However, reproducible comparisons has been limited by ethical constraints in human studies and the lack of open benchmark datasets. To address this gap, we introduce a new dataset, the first reproducible benchmark for PD prediction. It is constructed from long-term ECoG recordings of 6-hydroxydopamine (6-OHDA)-induced rat models and annotated with neural responses measured before and after electrical stimulation. In addition, we propose a Swap-Adversarial Framework (SAF) that mitigates high inter-subject variability and the high-dimensional low-sample-size (HDLSS) problem in ECoG data, while achieving robust domain generalization across ECoG and EEG-based Brain-Computer Interface (BCI) datasets. The framework integrates (1) robust preprocessing, (2) Inter-Subject Balanced Channel Swap (ISBCS) for cross-subject augmentation, and (3) domain-adversarial training to suppress subject-specific bias. ISBCS randomly swaps channels between subjects to reduce inter-subject variability, and domain-adversarial training jointly encourages the model to learn task-relevant shared features. We validated the effectiveness of the proposed method through extensive experiments under cross-subject, cross-session, and cross-dataset settings. Our method consistently outperformed all baselines across all settings, showing the most significant improvements in highly variable environments. Furthermore, the proposed method achieved superior cross-dataset performance between public EEG benchmarks, demonstrating strong generalization capability not only within ECoG but to EEG data. The new dataset and source code will be made publicly available upon publication.

URL PDF HTML ☆

赞 0 踩 0

2602.10518 2026-02-12 cs.CV

MapVerse: A Benchmark for Geospatial Question Answering on Diverse Real-World Maps

Sharat Bhat, Harshita Khandelwal, Tushar Kataria, Vivek Gupta

AI 大模型

视觉与机器人

科学与医疗

A Vision-Language Foundation Model for Zero-shot Clinical Collaboration and Automated Concept Discovery in Dermatology

How Do Decoder-Only LLMs Perceive Users? Rethinking Attention Masking for User Representation Learning

Pupillometry and Brain Dynamics for Cognitive Load in Working Memory

On the Role of Consistency Between Physics and Data in Physics-Informed Neural Networks

Pitch Angle Control of a Magnetically Actuated Capsule Robot with Nonlinear FEA-based MPC and EKF Multisensory Fusion

Hierarchical Zero-Order Optimization for Deep Neural Networks

Learning Mixture Density via Natural Gradient Expectation Maximization

Neuro-symbolic Action Masking for Deep Reinforcement Learning

Roughness-Informed Federated Learning

Fast Person Detection Using YOLOX With AI Accelerator For Train Station Safety

TRACE: Theoretical Risk Attribution under Covariate-shift Effects

Enhancing Underwater Images via Adaptive Semantic-aware Codebook Learning

Neural Additive Experts: Context-Gated Experts for Controllable Model Additivity

When Gradient Clipping Becomes a Control Mechanism for Differential Privacy in Deep Learning

Flow of Spans: Generalizing Language Models to Dynamic Span-Vocabulary via GFlowNets

LLM-Based Scientific Equation Discovery via Physics-Informed Token-Regularized Policy Optimization

MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning

Gauss-Newton Unlearning for the LLM Era

Online Min-Max Optimization: From Individual Regrets to Cumulative Saddle Points

Morphogenetic Assembly and Adaptive Control for Heterogeneous Modular Robots

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Contrastive Learning for Multi Label ECG Classification with Jaccard Score Based Sigmoid Loss

Enhancing Weakly Supervised Multimodal Video Anomaly Detection through Text Guidance

ReSPEC: A Framework for Online Multispectral Sensor Reconfiguration in Dynamic Environments

RealHD: A High-Quality Dataset for Robust Detection of State-of-the-Art AI-Generated Images

$μ$pscaling small models: Principled warm starts and hyperparameter transfer

Bridging the Compression-Precision Paradox: A Hybrid Architecture for Clinical EEG Report Generation with Guaranteed Measurement Accuracy

What Makes Value Learning Efficient in Residual Reinforcement Learning?

A Swap-Adversarial Framework for Improving Domain Generalization in Electroencephalography-Based Parkinson's Disease Prediction

MapVerse: A Benchmark for Geospatial Question Answering on Diverse Real-World Maps