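arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and more.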

2605.05214 2026-05-08 eess.SP cs.AI cs.LG

MedMamba: Recasting Mamba for Medical Time Series Classification

ZhengXiao He, Huayu Li, Xiwen Chen, Janet M Roveda, Jinghao Wen, Siyuan Tian, Ao Li

Abstract

Medical time series, such as electrocardiograms (ECG) and electroencephalograms (EEG), exhibit complex temporal dynamics and structured cross-channel dependencies, posing fundamental challenges for automated analysis. Conventional convolutional and recurrent models struggle to capture long-range dependencies, while Transformer-based approaches incur quadratic complexity and often introduce redundant interactions that are misaligned with the intrinsic structure of physiological signals. To address these limitations, we propose MedMamba, a principle-driven multi-scale bidirectional state space architecture tailored for medical time series classification. Our design is guided by three key inductive biases of physiological signals: spatial centralization, multi-timescale temporal composition, and non-causal contextual dependency. These principles are instantiated through a lightweight channel-mixing module for cross-channel reparameterization, multi-scale convolutional tokenization for temporal decomposition, and bidirectional Mamba blocks for efficient global context modeling with linear complexity. Extensive experiments on six benchmark datasets spanning EEG, ECG, and human activity signals demonstrate that MedMamba consistently outperforms state-of-the-art methods across diverse modalities. Notably, it achieves 85.97% accuracy on PTB and establishes new state-of-the-art performance on the challenging ADFTD dataset (54.72% accuracy and 52.01% F1-score). Strong results on long-sequence benchmarks, such as SleepEDF, further validate its capability in modeling long-range dependencies. Moreover, MedMamba achieves a speedup of 4.6x in inference, highlighting its practicality for real-time clinical deployment. These results suggest that principle-guided state space modeling offers an effective and scalable alternative to Transformer-based approaches for medical time series analysis.

2605.05212 2026-05-08 eess.SP cs.HC cs.LG

MPNet: A Robust and Efficient Manifold Pooling Network for Multi-Rhythm EEG Signal Decoding

Guoqing Cai, Kai Zeng, Shoulin Huang, Ting Ma

2605.05211 2026-05-08 q-fin.PR cs.AI cs.LG q-fin.ST

A Review of Large Language Models for Stock Price Forecasting from a Hedge-Fund Perspective

Olivia Zhang, Zhilin Zhang

Comments: Accepted at the IEEE Conference on Artificial Intelligence, Spain, May 8--10, 2026

2605.04723 2026-05-08 cs.IR cs.LG

Rethinking Convolutional Networks for Attribute-Aware Sequential Recommendation

Shereen Elsayed, Ngoc Son Le, Ahmed Rashed, Lars Schmidt-Thieme

Comments: Accepted at IJCAI-ECAI 2026

2605.04400 2026-05-08 cs.IT cs.LG math.IT

Contextual Memory-Enhanced Source Coding for Low-SNR Communications

Ziqiong Wang, Rongpeng Li

2605.03061 2026-05-08 stat.ML cs.LG q-bio.QM stat.ME

Dynamic Vine Copulas: Detecting and Quantifying Time-Varying Higher-Order Interactions

Houman Safaai, Alessandro Marin Vargas

2605.01669 2026-05-08 stat.ML cs.LG stat.ME

PRCD-MAP: Learning How Much to Trust Imperfect Priors in Causal Discovery

Xihang Shan, Da Zhou

Abstract

External priors of unknown reliability create a brittle trade-off in causal discovery: blind trust amplifies errors, blind rejection wastes signal. Real priors are also heterogeneously reliable -- physical laws are trustworthy, LLM-suggested edges are speculative -- yet existing methods either ignore priors or impose them through globally uniform trust. We propose PRCD-MAP, a soft prior-consumption layer that assigns per-edge trust to an imperfect prior and uses it to modulate a prior-aware $\ell_1$ and prior-weighted $\ell_2$ regularizer in a MAP objective. Trust is calibrated by empirical Bayes on a Laplace-approximated marginal likelihood and propagated along the prior graph by an MLP, so data-confirmed neighborhoods boost trust and contradictions suppress it. PRCD-MAP enjoys a population-level safety guarantee: it is $\varepsilon$-safe in expectation over the prior-generation distribution, with $\varepsilon\leq C\cdot\mathrm{acc}(1{-}\mathrm{acc})\cdot d^2/T$ at the parametric $T^{-1}$ rate and vanishing at the prior-quality endpoints. When the prior is uninformative, learned trust provably collapses to its floor and the method recovers a no-prior baseline. Empirically, on real CausalTime data PRCD-MAP exploits informative LLM priors (LLM-prior gain $+0.067/+0.089$ AUROC on AQI/Medical over a no-prior PRCD-MAP backbone; combined backbone+prior lead $+0.123/+0.043$ over PCMCI+), auto-attenuates on the anonymous-variable Traffic stress test, and retains a lead at $d{=}300$; against BayesDAG, the closest soft-Bayesian baseline, PRCD-MAP wins on every CausalTime dataset under a matched $W_0$-only protocol. A four-way ablation isolates each component: EB calibration and MLP trust propagation jointly carry the plurality of the gain, with positive sign on every dataset. Extensions to nonlinear (NAM) and cross-sectional settings show the calibrated-trust principle is setting-agnostic.

2605.01297 2026-05-08 cs.CY cs.AI

Are we Doomed to an AI Race? Why Self-Interest Could Drive Countries Towards a Moratorium on Superintelligence

Edward Roussel, Lode Lauwaert, Torben Swoboda, Grant Ramsey, Risto Uuk, Leonard Dung, Anthony Aguirre

Comments: 19 pages, 3 figures

2605.00062 2026-05-08 eess.IV cs.LG

RETO: A Rotary-Enhanced Transformer Operator for High-Fidelity Prediction of Automotive Aerodynamics

Bojun Zhang, Huiyu Yang, Yunpeng Wang, Yuntian Chen, Yuanwei Bin, Rikui Zhang, Jianchun Wang

2604.27307 2026-05-08 stat.ML cs.LG

A Novel Computational Framework for Causal Inference: Tree-Based Discretization with ILP-Based Matching

Tianyu Yang, Md. Noor-E-Alam

2604.22158 2026-05-08 math.OC cs.LG

Rate-Optimal Regret for the Safe Learning-based Control of the Constrained Linear Quadratic Regulator

Spencer Hutchinson, Nanfei Jiang, Mahnoosh Alizadeh

2603.20531 2026-05-08 cs.DC cs.AI cs.CL cs.LG

Epistemic Observability in Language Models

Tony Mason, Vaastav Anand

Abstract

We find that models report highest confidence precisely when they are fabricating. Across four model families (OLMo-3, Llama-3.1, Qwen3, Mistral), self-reported confidence inversely correlates with accuracy, with AUC ranging from 0.28 to 0.36 where 0.5 is random guessing. We prove, under explicit formal assumptions, that this is not a capability gap but an observational one. Under text-only observation, where a supervisor sees only the model's output text, no monitoring system can reliably distinguish honest model outputs from plausible fabrications. We prove two results: first, that any policy conditioning only on the query cannot satisfy epistemic honesty across ambiguous world states; second, that no learning algorithm optimizing reward from a text-only supervisor can converge to honest behavior when the supervisor's observations are identical for both grounded and fabricated responses. Within our formal model, these impossibilities hold regardless of model scale or training procedure, including RLHF and instruction tuning. We construct a tensor interface that escapes the impossibility by exporting computational byproducts (per-token entropy and log-probability distributions) that are structurally coupled to correctness under standard training. Per-token entropy achieves pooled AUC 0.757, outperforming all text baselines by 2.5--3.9 percentage points at every budget level tested (10%, 20%, 30%). The entropy signal generalizes across architectures (Spearman $\rho = 0.762$). The core contribution is a cost surface where the empirical mapping from verification budget (fraction of queries receiving expensive checks) to detection accuracy for each judge strategy is a practical lookup for system builders deciding how to allocate verification resources. The contribution is the map. The territory is the system you are building.
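
As a rough illustration of the two quantities the abstract relies on, the sketch below computes per-token Shannon entropy from a next-token probability distribution and a pairwise AUC for a detection score. It is a minimal sketch under stated assumptions, not the paper's tensor interface: the function names `token_entropy`, `mean_entropy`, and `auroc` are hypothetical, and the inputs are toy values rather than data from the paper.

```python
import math


def token_entropy(probs):
    """Shannon entropy (in nats) of one next-token probability distribution."""
    # Zero-probability tokens contribute nothing, so they are skipped.
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def mean_entropy(per_token_probs):
    """Average per-token entropy over a generated sequence (assumed non-empty)."""
    entropies = [token_entropy(p) for p in per_token_probs]
    return sum(entropies) / len(entropies)


def auroc(scores, labels):
    """AUC as the probability that a positive outscores a negative (ties count 0.5).

    labels: 1 for the class to detect, 0 otherwise; both classes must be present.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy usage (hypothetical values): two responses, each a list of per-token distributions.
confident = [[0.97, 0.02, 0.01], [0.99, 0.005, 0.005]]
uncertain = [[0.40, 0.35, 0.25], [0.34, 0.33, 0.33]]
scores = [mean_entropy(confident), mean_entropy(uncertain)]
```

An AUC below 0.5, such as the 0.28 to 0.36 range reported for self-reported confidence, means the score ranks the two classes in the wrong order more often than not.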

2603.13441 2026-05-08 stat.ML cond-mat.mtrl-sci cs.LG

Filtered Spectral Projection for Quantum Principal Component Analysis

Sk Mujaffar Hossain, Satadeep Bhattacharjee

Abstract

Quantum principal component analysis (qPCA) is commonly formulated as the extraction of eigenvalues and eigenvectors of a covariance-encoded density operator. Yet in many qPCA settings the practical goal is simpler: projection onto the dominant spectral subspace. Here we introduce a projection-first framework, the Filtered Spectral Projection Algorithm (FSPA), which bypasses explicit eigenvalue estimation while preserving the relevant spectral structure. FSPA amplifies any nonzero warm-start overlap with the leading subspace and remains robust in small-gap and near-degenerate regimes, without artificial symmetry breaking in the absence of bias. We show that FSPA achieves an oracle complexity $\mathcal{O}((\log(1/\epsilon)+\log(1/|a_1|^2))/\log(\lambda_1/\lambda_2))$, which is tight by a matching lower bound, establishing it as an optimal projection primitive. We derive a convergence rate for degenerate spectra, give a circuit resource analysis with $n+\mathcal{O}(1)$ qubit overhead independent of system dimension, and extend the method to threshold spectral projection, Threshold-FSPA, which converges in $\mathcal{O}(\log(1/\epsilon))$ calls when the threshold lies between eigenvalues. In the density matrix exponentiation access model, FSPA gives an exponential copy-complexity advantage over classical methods. For classical datasets, we show that for amplitude-encoded centered data the ensemble density matrix $\rho=\sum_i p_i|\psi_i\rangle\langle\psi_i|$ equals the covariance matrix. Numerical tests on chemistry density matrices, noisy circuit outputs, Breast Cancer Wisconsin, handwritten Digits, and 1--4-qubit scalability confirm the theory. A minimal Qiskit implementation validates magnitude invariance, signal amplification, and no spurious symmetry breaking. These results establish FSPA as an optimal and deployable quantum spectral projection primitive.
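
One way to see why a bound of this shape is plausible is to assume a plain power-iteration filter. That is an assumption made here for illustration only, since the abstract does not spell out the FSPA filter. Write the normalized warm start in the eigenbasis of $\rho$ as $|\phi\rangle=\sum_i a_i|v_i\rangle$, with eigenvalues $\lambda_1>\lambda_2\geq\dots\geq 0$. After $k$ applications of $\rho$, the weight outside the leading eigenvector relative to the weight on it satisfies:

```latex
\frac{\sum_{i\geq 2}\lambda_i^{2k}|a_i|^2}{\lambda_1^{2k}|a_1|^2}
\;\leq\; \left(\frac{\lambda_2}{\lambda_1}\right)^{2k}\frac{1-|a_1|^2}{|a_1|^2}
\;\leq\; \left(\frac{\lambda_2}{\lambda_1}\right)^{2k}\frac{1}{|a_1|^2}.

% Requiring the right-hand side to be at most \epsilon and taking logarithms:
k \;\geq\; \frac{\log(1/\epsilon)+\log(1/|a_1|^2)}{2\,\log(\lambda_1/\lambda_2)}.
```

Under this assumed filter, the iteration count grows logarithmically in both the target error and the inverse warm-start overlap, and inversely with the log spectral ratio, which matches the form of the stated oracle complexity up to constants.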

2603.04807 2026-05-08 stat.ML cs.LG

Does Sparse Connectivity Improve Generalization? Convolutional Networks Below the Edge of Stability

Tongtong Liang, Esha Singh, Rahul Parhi, Alexander Cloninger, Yu-Xiang Wang

Comments: Under Review. Comments welcome!

2602.23405 2026-05-08 cs.NE cs.LG

Isotropic Activation Functions Enable Deindividuated Neurons and Adaptive Topologies

George Bird

Comments: 33 pages, 5 figures, UPDATED CHANGES: Improved the main body text (same content), slight modification to title and abstract, and updated formatting for clarity and to comply with submission to NeurIPS review. Updated version reflects those changes made

2602.14481 2026-05-08 cs.IT cs.AI math.IT

On the Rate-Distortion-Complexity Tradeoff for Semantic Communication

Jingxuan Chai, Yong Xiao, Guangming Shi

Comments: Accepted at IEEE Internet of Things Journal

2602.06381 2026-05-08 quant-ph cs.LG

HyQuRP: Hybrid quantum-classical neural network with rotational and permutational equivariance

Semin Park, Chae-Yeun Park

Comments: 12+41 pages; 1 figure

2602.01390 2026-05-08 cs.HC cs.AI

Toward Scalable Audio Description Quality Control: A Workflow for Evaluating Human and VLM Raters

Lana Do, Gio Jung, Juvenal Francisco Barajas, Andrew Taylor Scott, Shasta Ihorn, Alexander Mario Blum, Vassilis Athitsos, Ilmi Yoon

2601.21831 2026-05-08 stat.ML cs.LG

Generative Modeling of Discrete Data Using Geometric Latent Subspaces

Daniel Gonzalez-Alvarado, Jonas Cassel, Stefania Petra, Christoph Schnörr

2601.21264 2026-05-08 cs.HC cs.SD eess.AS

Evaluating Spatialized Auditory Cues for Rapid Attention Capture in XR

Yoonsang Kim, Swapnil Dey, Arie Kaufman

Comments: 8 pages, 4 figures. This is the author's version of the article that appeared at the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (IEEE VRW) 2026

2601.17622 2026-05-08 cs.HC cs.CL cs.IR

Memento: Towards Proactive Visualization of Everyday Memories with Personal Wearable AR Assistant

Yoonsang Kim, Yalong Yang, Arie E. Kaufman

Comments: 8 pages, 5 figures. This is the author's version of the article that appeared at the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (IEEE VRW) 2026

2601.10915 2026-05-08 cs.IT cs.LG math.IT

A PAC-Bayesian Analysis of Channel-Induced Degradation in Edge Inference

Yangshuo He, Guanding Yu, Jingge Zhu

2601.09056 2026-05-08 cs.CR cs.CL cs.IR

StegoStylo: Squelching Stylometric Scrutiny through Steganographic Stitching

Robert Dilworth

Comments: 16 pages, 6 figures, 1 table

2512.00751 2026-05-08 quant-ph cs.LG

Fragmentation is Efficiently Learnable by Quantum Neural Networks

Mikhail Mints, Eric R. Anschuetz

Comments: 26 pages, 1 figure

2511.06454 2026-05-08 math.OC cs.LG

Feature weighting for data analysis via evolutionary simulation

Aris Daniilidis, Alberto Domínguez Corella, Philipp Wissgott

2511.02526 2026-05-08 eess.SY cs.LG cs.RO cs.SY

Many-vs-Many Missile Guidance via Virtual Targets

Marc Schneider, Walter Fichter

Comments: Subsequent investigations showed that the proposed method does not generalize beyond the specific scenario considered in this manuscript

2510.18120 2026-05-08 stat.ML cs.LG

Generalization Below the Edge of Stability: The Role of Data Geometry

Tongtong Liang, Alexander Cloninger, Rahul Parhi, Yu-Xiang Wang

Comments: Accepted by ICLR 2026

2509.24814 2026-05-08 stat.ME cs.LG stat.ML

A Greedy PDE Router for Blending Neural Operators and Classical Methods

Sahana Rayan, Yash Patel, Ambuj Tewari

2508.14804 2026-05-08 math.OC cs.LG

Learning from user's behaviour of some well-known congested traffic networks

Isolda Cardoso, Lucas Venturato, Jorgelina Walpen

Comments: 30 pages, 8 figures, 7 tables

2508.11659 2026-05-08 cs.NE cs.AI cs.LG q-bio.NC

Toward Practical Equilibrium Propagation: Brain-inspired Recurrent Neural Network with Feedback Regulation and Residual Connections

Zhuo Liu, Tao Chen