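arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and more.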

2505.12772 2026-03-31 cs.CV

P$^2$HCT: Plug-and-Play Hierarchical C2F Transformer for Multi-Scale Feature Fusion

Junyi Hu, Tian Bai, Fengyi Wu, Zhenming Peng, Yi Zhang

Comments 12 pages, 6 figures, ICME2026

Details

English abstract

Feature fusion plays a pivotal role in achieving high performance in vision models, yet existing attention-based fusion techniques often suffer from substantial computational overhead and implementation complexity, particularly in resource-constrained settings. To address these limitations, we introduce the Plug-and-Play Hierarchical C2F Transformer (P$^2$HCT), a lightweight module that combines coarse-to-fine token selection with shared attention parameters to preserve spatial details while reducing inference cost. P$^2$HCT is trainable using coarse attention alone and can be seamlessly activated at inference to enhance accuracy without retraining. Integrated into real-time detectors such as YOLOv11-N/S/M, P$^2$HCT achieves mAP gains of 0.9%, 0.5%, and 0.4% on MS COCO with minimal latency increase. Similarly, embedding P$^2$HCT into ResNet-18/50/101 backbones improves ImageNet top-1 accuracy by 6.5%, 1.7%, and 1.0%, respectively. These results underscore P$^2$HCT's effectiveness as a hardware-friendly and general-purpose enhancement for both detection and classification tasks.

2505.11349 2026-03-31 cs.LG nlin.CD physics.comp-ph

Context parroting: A simple but tough-to-beat baseline for foundation models in scientific machine learning

Yuanzhao Zhang, William Gilpin

Comments International Conference on Learning Representations (ICLR 2026)

2505.11035 2026-03-31 cs.LG

Deep Latent Variable Model based Vertical Federated Learning with Flexible Alignment and Labeling Scenarios

Kihun Hong, Sejun Park, Ganguk Hwang

Comments Accepted to ICLR 2026

2505.08137 2026-03-31 cs.LG cs.CL cs.GR cs.MM

Large Language Models for Computer-Aided Design: A Survey

Licheng Zhang, Bach Le, Naveed Akhtar, Siew-Kei Lam, Tuan Ngo

2505.01448 2026-03-31 cs.LG cs.MM

OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models

Shengkai Chen, Yifang Yin, Jinming Cao, Shili Xiang, Zhenguang Liu, Roger Zimmermann

Comments Accepted by ICME 2026

2504.14814 2026-03-31 cs.LG

A Diagnostic Evaluation of Neural Networks Trained with the Error Diffusion Learning Algorithm

Kazuhisa Fujita

Details

DOI: 10.1007/s44163-026-01137-y
Journal ref: Discover Artificial Intelligence, 2026

English abstract

The Error Diffusion Learning Algorithm (EDLA) is a learning scheme that performs synaptically local weight updates driven by a single, globally defined error signal. Although originally proposed as an alternative to backpropagation, its behavior has not been systematically characterized. We provide a modern formulation and implementation of EDLA and evaluate multilayer perceptrons trained with EDLA on parity, regression, and image-classification benchmarks (Digits, MNIST, Fashion-MNIST, and CIFAR-10). Following the original formulation, multi-class classification is implemented by training independent single-output networks (one per class), which makes the computational cost scale linearly with the number of classes. Under comparable architectures and training protocols, EDLA consistently underperforms backpropagation-trained baselines on all benchmarks considered. Through an analysis of internal dynamics, we identify a depth-related failure mode in ReLU-based EDLA: activations can grow explosively, causing unstable training and degraded accuracy. To mitigate this instability, we incorporate root mean square normalization (RMSNorm) into EDLA training. RMSNorm substantially improves numerical stability and expands the depth range in which EDLA can be trained, but it does not close the accuracy gap and retains the overhead of the parallel-network implementation. Overall, we offer a diagnostic evaluation of where and why global error diffusion breaks down in deep networks, providing guidance for future development of local, biologically inspired learning rules.

2504.00780 2026-03-31 cs.CL cs.AI

Benchmarking NLP-supported Language Sample Analysis for Swiss Children's Speech

Anja Ryser, Yingqiang Gao, Sarah Ebling

Comments updated preprint

2503.21262 2026-03-31 cs.CV

vGamba: Attentive State Space Bottleneck for efficient Long-range Dependencies in Visual Recognition

Yunusa Haruna, Adamu Lawan, Shamsuddeen Hassan Muhammad, Jiaquan Zhang, Chaoning Zhang

2502.07297 2026-03-31 cs.LG q-bio.QM

MM-DADM: Multimodal Drug-Aware Diffusion Model for Virtual Clinical Trials

Qian Shao, Bang Du, Zepeng Li, Qiyuan Chen, Jiahe Chen, Hongxia Xu, Jimeng Sun, Jian Wu, Jintai Chen

Comments Under review

2501.19111 2026-03-31 cs.CV cs.AI

A Benchmark for Incremental Micro-expression Recognition

Zhengqin Lai, Xiaopeng Hong, Yabin Wang, Xiaobai Li

2501.05675 2026-03-31 cs.AI cs.LG

Synergizing Large Language Models and Task-specific Models for Time Series Anomaly Detection

Feiyi Chen, Leilei Zhang, Guansong Pang, Roger Zimmermann, Shuiguang Deng

Comments This work has been submitted to the IEEE for possible publication

2412.14019 2026-03-31 cs.AI

Retrieving Classes of Causal Orders with Inconsistent Knowledge Bases

Federico Baldo, Simon Ferreira, Charles K. Assaad

Comments CLeaR 2026 & UAI 2025 Workshop on Causal Abstractions and Representations

2408.13516 2026-03-31 cs.CV cs.AI

Bidirectional Multimodal Prompt Learning with Scale-Aware Training for Few-Shot Multi-Class Anomaly Detection

Yujin Lee, Sewon Kim, Daeun Moon, Seoyoon Jang, Hyunsoo Yoon

Comments accepted to CVPR 2026

2407.07603 2026-03-31 cs.CV

iiANET: Inception Inspired Attention Hybrid Network for efficient Long-Range Dependency

Haruna Yunusa, Adamu Lawan, Abdulganiyu Abdu Yusuf

Comments 17 pages, 7 figures. Published in Transactions on Machine Learning Research (TMLR). Available at https://openreview.net/pdf?id=HGSjlgFodQ

2406.10045 2026-03-31 cs.CV

Monitoring Simulated Physical Weakness Using Detailed Behavioral Features and Personalized Modeling

Chen Long-fei, Muhammad Ahmed Raza, Craig Innes, Subramanian Ramamoorthy, Robert B. Fisher

2402.14878 2026-03-31 cs.LG cs.AI cs.AR

Estimation of Energy-dissipation Lower-bounds for Neuromorphic Learning-in-memory

Zihao Chen, Faiek Ahsan, Johannes Leugering, Gert Cauwenberghs, Shantanu Chakrabartty

Comments 16 pages, 6 figures

Details

DOI: 10.1103/497f-nt8h
Journal ref: Phys. Rev. E 113, 035311 (2026)

English abstract

Neuromorphic or neurally-inspired optimizers rely on local but parallel parameter updates to solve problems that range from quadratic programming to Ising machines. An ideal realization of such an optimizer not only uses a compute-in-memory (CIM) paradigm to address the so-called memory-wall (i.e. energy dissipated due to repeated memory read access), but also uses a learning-in-memory (LIM) paradigm to address the energy bottlenecks due to repeated memory writes at the precision required for optimization (the update-wall), and to address the energy bottleneck due to the repeated transfer of information between short-term and long-term memories (the consolidation-wall). In this paper, we derive theoretical estimates for the energy-to-solution metric that can be achieved by this ideal neuromorphic optimizer which is realized by modulating the energy-barrier of the physical memories such that the dynamics of memory updates and memory consolidation matches the optimization or the annealing dynamics. The analysis presented in this paper captures the out-of-equilibrium thermodynamics of learning and the resulting energy-efficiency estimates are model-agnostic which only depend on the number of model-update operations (OPS), the model-size in terms of number of parameters, the speed of convergence, and the precision of the solution. To show the practical applicability of our results, we apply our analysis for estimating the lower-bound on the energy-to-solution metrics for large-scale AI workloads.

2402.11877 2026-03-31 cs.LG cs.AI

Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ

Han-Dong Lim, HyeAnn Lee, Donghwan Lee

2402.05689 2026-03-31 cs.LG math.OC math.PR

Unichain and Aperiodicity are Sufficient for Asymptotic Optimality of Average-Reward Restless Bandits

Yige Hong, Qiaomin Xie, Yudong Chen, Weina Wang

Comments 68 pages, 17 figures

2307.07753 2026-03-31 cs.LG cs.AI stat.ML

Learning Expressive Priors for Generalization and Uncertainty Estimation in Neural Networks

Dominik Schnaus, Jongseok Lee, Daniel Cremers, Rudolph Triebel

Comments Accepted to ICML 2023

2307.01502 2026-03-31 cs.CV cs.AI

Clinical application of HEDI for biomechanical evaluation and visualisation in incisional hernia repair

Philipp D. Lösel, Jacob J. Relle, Samuel Voß, Ramesch Raschidi, Regine Nessel, Johannes Görich, Mark O. Wielpütz, Thorsten Löffler, Vincent Heuveline, Friedrich Kallinowski

Comments 15 pages, 6 figures, this is the author's accepted manuscript of an article published in Communications Medicine (2026). The final version is available online at: https://doi.org/10.1038/s43856-025-01311-w

Details

DOI: 10.1038/s43856-025-01311-w
Journal ref: Communications Medicine 6 (2026) 68

English abstract

Background: Abdominal wall defects, such as incisional hernias, are a common source of pain and discomfort and often require repeated surgical interventions. Traditional mesh repair techniques typically rely on fixed overlap based on defect size, without considering important biomechanical factors like muscle activity, internal pressure, and tissue elasticity. This study aims to introduce a biomechanical approach to incisional hernia repair that accounts for abdominal wall instability and to evaluate a visualisation tool designed to support surgical planning. Methods: We developed HEDI, a tool that uses computed tomography with Valsalva maneuver to automatically assess hernia size, volume, and abdominal wall instability. This tool was applied in the preoperative evaluation of 31 patients undergoing incisional hernia repair. Surgeries were performed concurrently with the development of the tool, and patient outcomes were monitored over a three-year period. Results: Here we show that all 31 patients remain free of pain and hernia recurrence three years after surgery. The tool provides valuable visual insights into abdominal wall dynamics, supporting surgical decision-making. However, it should be used as an adjunct rather than a standalone guide. Conclusions: This study presents a biomechanical strategy for hernia repair and introduces a visualisation tool that enhances preoperative assessment. While early results are promising, the tool's evolving nature and its role as a visual aid should be considered when interpreting outcomes. Further research is needed to validate its broader clinical utility.

2208.10384 2026-03-31 cs.CL cs.IT math.IT

The optimality of word lengths. Theoretical foundations and an empirical study

Sonia Petrini, Antoni Casas-i-Muñoz, Jordi Cluet-i-Martinell, Mengxue Wang, Christian Bentz, Ramon Ferrer-i-Cancho

Comments A substantially revised version. Mathematical content has been moved to appendices. In press in Glottometrics

2106.00839 2026-03-31 cs.LG q-fin.RM stat.ML

Algorithmic Insurance

Dimitris Bertsimas, Agni Orfanoudaki

Details

English abstract

When AI systems make errors in high-stakes domains like medical diagnosis or autonomous vehicles, a single algorithmic flaw across varying operational contexts can generate highly heterogeneous losses that challenge traditional insurance assumptions. Algorithmic insurance constitutes a novel form of financial coverage for AI-induced damages, representing an emerging market that addresses algorithm-driven liability. However, insurers currently struggle to price these risks, while AI developers lack rigorous frameworks connecting system design with financial liability exposure. We analyze the connection between operational choices affecting binary classification performance and tail risk exposure. Using conditional value-at-risk (CVaR) to capture extreme losses, we prove that established approaches like maximizing accuracy can significantly increase worst-case losses compared to tail risk optimization, with penalties growing quadratically as thresholds deviate from optimal. We then propose a liability insurance contract structure that mandates risk-aware classification thresholds and characterize the conditions under which it creates value for AI providers. Our analysis extends to degrading model performance and human oversight scenarios. We validate our findings through a mammography case study, demonstrating that CVaR-optimal thresholds reduce tail risk up to 13-fold compared to accuracy maximization. This risk reduction enables insurance contracts to create 14-16% gains for well-calibrated firms, while poorly calibrated firms benefit up to 65% through risk transfer, mandatory recalibration, and regulatory capital relief. Unlike traditional insurance that merely transfers risk, algorithmic insurance can function as both a financial instrument and an operational governance mechanism, simultaneously enabling efficient risk transfer while improving AI safety.

2603.27768 2026-03-31 cs.CL

TailNLG: A Multilingual Benchmark Addressing Verbalization of Long-Tail Entities

Lia Draetta, Michael Oliverio, Virginia Ramón-Ferrer, Pier Felice Balestrucci, Flaviana Corallo, Carlos Badenes-Olmedo, Alessandro Mazzei, Marco Antonio Stranisci, Rossana Damiano

2603.27766 2026-03-31 cs.LG stat.ML

AutoStan: Autonomous Bayesian Model Improvement via Predictive Feedback

Oliver Dürr

2603.27757 2026-03-31 cs.CV cs.RO

E-TIDE: Fast, Structure-Preserving Motion Forecasting from Event Sequences

Biswadeep Sen, Benoit R. Cottereau, Nicolas Cuperlier, Terence Sim

2603.27752 2026-03-31 cs.CL cs.SE

Retromorphic Testing with Hierarchical Verification for Hallucination Detection in RAG

Boxi Yu, Yuzhong Zhang, Liting Lin, Lionel Briand, Emir Muñoz

2603.27751 2026-03-31 cs.AI

SkyNet: Belief-Aware Planning for Partially-Observable Stochastic Games

Adam Haile

2603.27744 2026-03-31 cs.CV

Data Organization Matters in Multimodal Instruction Tuning: A Controlled Study of Capability Trade-offs

Guowei Tang

Comments 12 pages, 2 figures

2603.27742 2026-03-31 cs.CV

TIR-Agent: Training an Explorative and Efficient Agent for Image Restoration

Yisheng Zhang, Guoli Jia, Haote Hu, Shanxu Zhao, Kaikai Zhao, Long Sun, Xinwei Long, Kai Tian, Che Jiang, Zhaoxiang Liu, Kai Wang, Shiguo Lian, Kaiyan Zhang, Bowen Zhou

2603.27738 2026-03-31 cs.AI

TianJi: An autonomous AI meteorologist for discovering physical mechanisms in atmospheric science

Kaikai Zhang, Xiang Wang, Haoluo Zhao, Nan Chen, Mengyang Yu, Jing-Jia Luo, Tao Song, Fan Meng