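arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.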

2604.14532 2026-04-17 cs.LG cs.AI

CSRA: Controlled Spectral Residual Augmentation for Robust Sepsis Prediction

Honglin Guo, Rihao Chang, He Jiao, Weizhi Nie, Zhongheng Zhang, Yuehao Shen

Abstract

Accurate prediction of future risk and disease progression in sepsis is clinically important for early warning and timely intervention in intensive care. However, short-window sepsis prediction remains challenging, because shorter observation windows provide limited historical evidence, whereas longer prediction horizons reduce the number of patient trajectories with valid future supervision. To address this problem, we propose CSRA, a Controlled Spectral Residual Augmentation framework for short-window multi-system ICU time series. CSRA first groups variables by clinical systems and extracts system-level and global representations. It then performs input-adaptive residual perturbation in the spectral domain to generate structured and clinically plausible trajectory variations. To improve augmentation stability and controllability, CSRA is trained end-to-end with the downstream predictor under a unified objective, together with anchor consistency loss and controller regularization. Experiments on a MIMIC-IV sepsis cohort across multiple downstream models show that CSRA is consistently competitive and often superior, reducing regression error by 10.2% in MSE and 3.7% in MAE over the non-augmentation baseline, while also yielding consistent gains on classification. CSRA further maintains more favorable performance under shorter observation windows, longer prediction horizons, and smaller training data scales, while also remaining effective on an external clinical dataset (ZiGongICUinfection), indicating stronger robustness and generalizability in clinically constrained settings.
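As a rough illustration of what a residual perturbation in the spectral domain can look like, the sketch below scales each non-zero frequency bin of a single time series by a small, bounded relative amount and transforms back. This is not the CSRA implementation: the function name `spectral_residual_augment`, the `delta` vector, and the `scale` bound are all hypothetical, and the fixed `delta` stands in for whatever an input-adaptive controller would produce.

```python
import cmath


def dft(x):
    """Naive discrete Fourier transform of a real or complex sequence."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(X):
    """Naive inverse discrete Fourier transform."""
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def spectral_residual_augment(x, delta, scale=0.1):
    """Return x with a bounded relative residual added to its spectrum.

    delta[k - 1] is clipped to [-1, 1] and controls frequency bin k (k >= 1),
    so each bin changes by at most `scale` in relative magnitude.
    Bin 0 (the series mean) is left untouched.
    Both `delta` and `scale` are hypothetical stand-ins for a learned controller.
    """
    n = len(x)
    X = dft(x)
    Y = list(X)
    for k in range(1, n // 2 + 1):
        d = max(-1.0, min(1.0, delta[k - 1]))
        Y[k] = X[k] * (1.0 + scale * d)
        Y[n - k] = Y[k].conjugate()  # keep the spectrum conjugate-symmetric
    return [v.real for v in idft(Y)]


# Example: one 8-step vital-sign-like series (made-up numbers).
series = [1.0, 2.0, 4.0, 3.0, 5.0, 4.0, 6.0, 5.0]
augmented = spectral_residual_augment(series, delta=[0.5, -1.0, 0.0, 2.0])
```

Because bin 0 is never modified, the augmented series keeps the mean of the original, and setting `scale=0` returns the input unchanged. That gives a simple handle on how far the variation may drift from the anchor trajectory.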

2604.14531 2026-04-17 cs.AI

TRACER: Trace-Based Adaptive Cost-Efficient Routing for LLM Classification

Adam Rida

Comments github.com/adrida/tracer

2604.14528 2026-04-17 cs.AI cs.CL

Dissecting Failure Dynamics in Large Language Model Reasoning

Wei Zhu, Jian Zhang, Lixing Yu, Kun Yue, Zhiwen Tang

Comments Accepted by ACL 2026

2604.14527 2026-04-17 cs.CV cs.SY eess.IV eess.SY

Design and Validation of a Low-Cost Smartphone Based Fluorescence Detection Platform Compared with Conventional Microplate Readers

Zhendong Cao, Katrina G. Salvante, Ash Parameswaran, Pablo A. Nepomnaschy, Hongji Dai

Comments 4 pages

2604.14526 2026-04-17 cs.CV

FreqTrack: Frequency Learning based Vision Transformer for RGB-Event Object Tracking

Jinlin You, Muyu Li, Xudong Zhao

2604.14525 2026-04-17 cs.AI

Quantifying Cross-Query Contradictions in Multi-Query LLM Reasoning

Rohit Kumar Salla, Ramya Manasa Amancherla, Manoj Saravanan

Comments Accepted at the ICLR 2026 Workshop on Logical Reasoning of Large Language Models. 9 pages, 6 tables, code and data at https://huggingface.co/datasets/rohitspider/cross_query_benchmark

2604.14520 2026-04-17 cs.CV

Chain of Modality: From Static Fusion to Dynamic Orchestration in Omni-MLLMs

Ziyang Luo, Nian Liu, Junwei Han

2604.14519 2026-04-17 cs.LG cs.CV

CI-CBM: Class-Incremental Concept Bottleneck Model for Interpretable Continual Learning

Amirhosein Javadi, Tuomas Oikarinen, Tara Javidi, Tsui-Wei Weng

Comments 31 pages, 6 figures. Published in Transactions on Machine Learning Research (TMLR), 04/2026

2604.14513 2026-04-17 cs.CL

PeerPrism: Peer Evaluation Expertise vs Review-writing AI

Soroush Sadeghian, Alireza Daqiq, Radin Cheraghi, Sajad Ebrahimi, Negar Arabzadeh, Ebrahim Bagheri

DOI: 10.1145/3805712.3808602

Abstract

Large Language Models (LLMs) are increasingly used in scientific peer review, assisting with drafting, rewriting, expansion, and refinement. However, existing peer-review LLM detection methods largely treat authorship as a binary problem (human vs. AI), without accounting for the hybrid nature of modern review workflows. In practice, evaluative ideas and surface realization may originate from different sources, creating a spectrum of human-AI collaboration. In this work, we introduce PeerPrism, a large-scale benchmark of 20,690 peer reviews explicitly designed to disentangle idea provenance from text provenance. We construct controlled generation regimes spanning fully human, fully synthetic, and multiple hybrid transformations. This design enables systematic evaluation of whether detectors identify the origin of the surface text or the origin of the evaluative reasoning. We benchmark state-of-the-art LLM text detection methods on PeerPrism. While several methods achieve high accuracy on the standard binary task (human vs. fully synthetic), their predictions diverge sharply under hybrid regimes. In particular, when ideas originate from humans but the surface text is AI-generated, detectors frequently disagree and produce contradictory classifications. Accompanied by stylometric and semantic analyses, our results show that current detection methods conflate surface realization with intellectual contribution. Overall, we demonstrate that LLM detection in peer review cannot be reduced to a binary attribution problem. Instead, authorship must be modeled as a multidimensional construct spanning semantic reasoning and stylistic realization. PeerPrism is the first benchmark evaluating human-AI collaboration in these settings. We release all code, data, prompts, and evaluation scripts to facilitate reproducible research at https://github.com/Reviewerly-Inc/PeerPrism.

2604.14507 2026-04-17 cs.CV cs.LG

H2VLR: Heterogeneous Hypergraph Vision-Language Reasoning for Few-Shot Anomaly Detection

Jianghong Huang, Luping Ji, Weiwei Duan, Mao Ye

Comments 9 pages, 5 figures

2604.14506 2026-04-17 cs.CV

Co-distilled attention guided masked image modeling with noisy teacher for self-supervised learning on medical images

Jue Jiang, Aneesh Rangnekar, Harini Veeraraghavan

Comments Accepted at MIDL 2025

2604.14501 2026-04-17 cs.LG cs.AI cs.CC

On the Expressive Power and Limitations of Multi-Layer SSMs

Nikola Zubić, Qian Li, Yuyi Wang, Davide Scaramuzza

Comments 25 pages, 6 theorems

2604.14500 2026-04-17 cs.AI

Geometric Metrics for MoE Specialization: From Fisher Information to Early Failure Detection

Dongxin Guo, Jikun Wu, Siu Ming Yiu

Comments 6 pages, 2 figures, 7 tables

2604.14498 2026-04-17 cs.AI cs.LG stat.ML

Improving Machine Learning Performance with Synthetic Augmentation

Mel Sohm, Charles Dezons, Sami Sellami, Oscar Ninou, Axel Pincon

2604.14487 2026-04-17 cs.LG

Quantization of Spiking Neural Networks Beyond Accuracy

Evan Gibson Smith, Jacob Whitehill, Fatemeh Ganji

2604.14477 2026-04-17 cs.AI

Seeing Through Circuits: Faithful Mechanistic Interpretability for Vision Transformers

Nina Żukowska, Wolfgang Stammer, Bernt Schiele, Jonas Fischer

2604.14475 2026-04-17 cs.AI

Evo-MedAgent: Beyond One-Shot Diagnosis with Agents That Remember, Reflect, and Improve

Weixiang Shen, Bailiang Jian, Jun Li, Che Liu, Johannes Moll, Xiaobin Hu, Daniel Rueckert, Hongwei Bran Li, Jiazhen Pan

2604.14474 2026-04-17 cs.LG

Scouting By Reward: VLM-TO-IRL-Driven Player Selection For Esports

Qing Yan, Wenyu Yang, Yufei Wang, Wenhao Ma, Linchong Hu, Yifei Jin, Anton Dahbura

2604.14473 2026-04-17 cs.AI

Response-Aware User Memory Selection for LLM Personalization

Jillian Fisher, Jennifer Neville, Chan Young Park

Comments Code at: https://github.com/jfisher52/Response_Utility_Optimized_Memory_Selection

2604.14472 2026-04-17 cs.LG cs.AI cs.CE physics.comp-ph

Auxiliary Finite-Difference Residual-Gradient Regularization for PINNs

Stavros Kassinos

Comments 18 pages, 5 figures, 10 tables

2604.14465 2026-04-17 cs.AI

Improving Human Performance with Value-Aware Interventions: A Case Study in Chess

Saumik Narayanan, Raja Panjwani, Siddhartha Sen, Chien-Ju Ho

Abstract

AI systems are increasingly used to assist humans in sequential decision-making tasks, yet determining when and how an AI assistant should intervene remains a fundamental challenge. A potential baseline is to recommend the optimal action according to a strong model. However, such actions assume optimal follow-up actions, which human decision makers may fail to execute, potentially reducing overall performance. In this work, we propose and study value-aware interventions, motivated by a basic principle in reinforcement learning: under the Bellman equation, the optimal policy selects actions that maximize the immediate reward plus the value function. When a decision maker follows a suboptimal policy, this policy-value consistency no longer holds, creating discrepancies between the actions taken by the policy and those that maximize the immediate reward plus the value of the next state. We show that these policy-value inconsistencies naturally identify opportunities for intervention. We formalize this problem in a Markov decision process where an AI assistant may override human actions under an intervention budget. In the single-intervention regime, we show that the optimal strategy is to recommend the action that maximizes the human value function. For settings with multiple interventions, we propose a tractable approximation that prioritizes interventions based on the magnitude of the policy-value discrepancy. We evaluate these ideas in the domain of chess by learning models of humans from large-scale gameplay data. In simulation, our approach consistently outperforms interventions based on the strongest chess engine (Stockfish) in a wide range of settings. A within-subject human study with 20 players and 600 games further shows that our interventions significantly improve performance for low- and mid-skill players while matching expert-engine interventions for high-skill players.
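The contrast between engine-optimal and value-aware recommendations can be sketched on a toy deterministic decision process. Everything below is a hypothetical illustration rather than the paper's model: the states, actions, rewards, and the assumed human policy are made up, and "human value" is read here as the return obtained when the human's own policy is followed after the recommended move.

```python
# Toy deterministic MDP: state -> {action: (reward, next_state)}.
# Every state, action, and reward here is a hypothetical illustration.
TRANSITIONS = {
    "start": {"attack": (0.0, "sharp"), "solid": (0.0, "quiet")},
    "sharp": {"precise": (1.0, "end"), "natural": (0.0, "end")},
    "quiet": {"hold": (0.5, "end"), "blunder": (0.0, "end")},
    "end": {},
}

# Assumed human policy: picks the sharp line but misplays the follow-up.
HUMAN_POLICY = {"start": "attack", "sharp": "natural", "quiet": "hold"}


def human_value(state):
    """Return obtained by following HUMAN_POLICY from `state` to the end."""
    if not TRANSITIONS[state]:
        return 0.0
    reward, nxt = TRANSITIONS[state][HUMAN_POLICY[state]]
    return reward + human_value(nxt)


def optimal_value(state):
    """Return obtained by acting optimally from `state` to the end."""
    actions = TRANSITIONS[state]
    if not actions:
        return 0.0
    return max(r + optimal_value(n) for r, n in actions.values())


def q_human(state, action):
    """Immediate reward plus the human's value of the next state."""
    reward, nxt = TRANSITIONS[state][action]
    return reward + human_value(nxt)


def q_optimal(state, action):
    """Immediate reward plus the optimal value of the next state."""
    reward, nxt = TRANSITIONS[state][action]
    return reward + optimal_value(nxt)


def engine_action(state):
    """Recommendation that assumes optimal follow-up play."""
    return max(TRANSITIONS[state], key=lambda a: q_optimal(state, a))


def value_aware_action(state):
    """Recommendation that assumes the human keeps playing their own policy."""
    return max(TRANSITIONS[state], key=lambda a: q_human(state, a))


def discrepancy(state):
    """Gap between the best human-value action and the human's own choice."""
    best = max(q_human(state, a) for a in TRANSITIONS[state])
    return best - q_human(state, HUMAN_POLICY[state])
```

In this toy setup, `engine_action("start")` picks `"attack"`, which only pays off if the precise follow-up is found, while `value_aware_action("start")` picks `"solid"`, which is worth more given how this human actually continues. Ranking states by `discrepancy` is one simple way to mimic prioritizing interventions under a budget.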

2604.14463 2026-04-17 cs.CL

Psychological Steering of Large Language Models

Leonardo Blas, Robin Jia, Emilio Ferrara

Comments 66 pages, 60 images

2604.14459 2026-04-17 cs.CL

Filling in the Mechanisms: How do LMs Learn Filler-Gap Dependencies under Developmental Constraints?

Atrey Desai, Sathvik Nair

Comments To be published in the 64th Annual Meeting of the Association for Computational Linguistics

2604.14455 2026-04-17 cs.AI

AIBuildAI: An AI Agent for Automatically Building AI Models

Ruiyi Zhang, Peijia Qin, Qi Cao, Li Zhang, Pengtao Xie

2604.14454 2026-04-17 cs.RO cs.CV

CooperDrive: Enhancing Driving Decisions Through Cooperative Perception

Deyuan Qu, Qi Chen, Takayuki Shimizu, Onur Altintas

Comments Accepted at ICRA 2026

2604.14450 2026-04-17 cs.LG

Asynchronous Probability Ensembling for Federated Disaster Detection

Emanuel Teixeira Martins, Rodrigo Moreira, Larissa Ferreira Rodrigues Moreira, Rodolfo S. Villaça, Augusto Neto, Flávio de Oliveira Silva

Comments Paper accepted for publication at 31st IEEE Symposium on Computers and Communications (ISCC) 2026

2604.14449 2026-04-17 cs.CV cs.AI

Crowdsourcing of Real-world Image Annotation via Visual Properties

Xiaolei Diao, Fausto Giunchiglia

2604.14448 2026-04-17 cs.CL

MARCA: A Checklist-Based Benchmark for Multilingual Web Search

Thales Sales Almeida, Giovana Kerche Bonás, Ramon Pires, Celio Larcher, Hugo Abonizio, Marcos Piau, Roseval Malaquias Junior, Rodrigo Nogueira, Thiago Laitz

2604.14442 2026-04-17 cs.CL cs.AI

Hierarchical vs. Flat Iteration in Shared-Weight Transformers

Sang-Il Han

2604.14440 2026-04-17 cs.AI

On Tackling Complex Tasks with Reward Machines and Signal Temporal Logics

Ana María Gómez Ruiz, Thao Dang, Alexandre Donzé