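arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.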

2603.11332 2026-03-13 cs.CC cs.LG

On the Computational Hardness of Transformers

Barna Saha, Yinzhan Xu, Christopher Ye, Hantao Yu

Comments 46 pages, 2 figures. Abstract shortened to meet arXiv requirements

English abstract

The transformer has revolutionized modern AI across language, vision, and beyond. It consists of $L$ layers, each running $H$ attention heads in parallel and feeding the combined output to the subsequent layer. In attention, the input consists of $N$ tokens, each a vector of dimension $m$. The attention mechanism involves multiplying three $N \times m$ matrices, applying softmax to an intermediate product. Several recent works have advanced our understanding of the complexity of attention. Known algorithms for transformers compute each attention head independently. This raises a fundamental question that has recurred throughout TCS under the guise of "direct sum" problems: can multiple instances of the same problem be solved more efficiently than solving each instance separately? Many answers to this question, both positive and negative, have arisen in fields spanning communication complexity and algorithm design. Thus, we ask whether transformers can be computed more efficiently than $LH$ independent evaluations of attention. In this paper, we resolve this question in the negative, and give the first non-trivial computational lower bounds for multi-head multi-layer transformers. In the small embedding regime ($m = N^{o(1)}$), computing $LH$ attention heads separately takes $LHN^{2 + o(1)}$ time. We establish that this is essentially optimal under SETH. In the large embedding regime ($m = N$), one can compute $LH$ attention heads separately using $LHN^{\omega + o(1)}$ arithmetic operations (plus exponents), where $\omega$ is the matrix multiplication exponent. We establish that this is optimal, by showing that $LHN^{\omega - o(1)}$ arithmetic operations are necessary when $\omega > 2$. Our lower bound in the large embedding regime relies on a novel application of the Baur-Strassen theorem, a powerful algorithmic tool underpinning the famous backpropagation algorithm.
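To make the baseline in the abstract concrete, here is a minimal sketch of evaluating attention heads independently, written in plain Python. It is an illustration under stated assumptions, not the paper's construction: the function names are hypothetical, the common $1/\sqrt{m}$ scaling is omitted because the abstract does not mention it, and the way head outputs are combined between layers is not modeled. Each call to `attention` builds an $N \times N$ score matrix, which is where the roughly quadratic per-head cost in the small embedding regime comes from.

```python
import math

def softmax(row):
    # Subtract the row maximum before exponentiating for numerical stability.
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def transpose(a):
    # Turn an (r x c) matrix into a (c x r) matrix.
    return [list(col) for col in zip(*a)]

def matmul(a, b):
    # Schoolbook product of an (r x k) matrix and a (k x c) matrix.
    cols = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

def attention(q, k, v):
    # q, k, v are N x m matrices. The N x N matrix q k^T is the
    # intermediate product to which softmax is applied row by row.
    scores = matmul(q, transpose(k))
    weights = [softmax(row) for row in scores]
    return matmul(weights, v)

def independent_heads(heads):
    # Hypothetical baseline: one separate attention call per (q, k, v) triple.
    # A transformer with L layers and H heads per layer makes L * H such calls.
    return [attention(q, k, v) for q, k, v in heads]
```

With the schoolbook `matmul` above, one head costs on the order of $N^2 m$ arithmetic operations, so $LH$ heads cost about $LH \cdot N^2 m$. The question the abstract poses is whether any algorithm can beat this kind of head-by-head evaluation, and the paper's answer is negative.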

2603.11330 2026-03-13 q-bio.QM cs.LG cs.NA math.DS math.NA

Ill-Conditioning in Dictionary-Based Dynamic-Equation Learning: A Systems Biology Case Study

Yuxiang Feng, Niall M Mangan, Manu Jayadharan

2603.11316 2026-03-13 physics.med-ph cs.CV cs.LG

MRI2Qmap: multi-parametric quantitative mapping with MRI-driven denoising priors

Mohammad Golbabaee, Matteo Cencini, Carolin Pirkl, Marion Menzel, Michela Tosetti, Bjoern Menze

2603.11304 2026-03-13 stat.ML cs.AI cs.LG stat.ME

Worst-case low-rank approximations

Anya Fries, Markus Reichstein, David Blei, Jonas Peters

2603.11274 2026-03-13 cs.HC cs.AI cs.CY

"I followed what felt right, not what I was told": Autonomy, Coaching, and Recognizing Bias Through AI-Mediated Dialogue

Atieh Taheri, Hamza El Alaoui, Patrick Carrington, Jeffrey P. Bigham

Comments Accepted to CHI 2026 (ACM Conference on Human Factors in Computing Systems), 23 pages, 5 figures

2603.11264 2026-03-13 eess.SY cs.RO cs.SY

Multi-Robot Multitask Gaussian Process Estimation and Coverage

Lai Wei, Andrew McDonald, Vaibhav Srivastava

2603.11244 2026-03-13 q-bio.GN cs.LG

A Standardized Framework For Evaluating Gene Expression Generative Models

Andrea Rubbi, Andrea Giuseppe Di Francesco, Mohammad Lotfollahi, Pietro Liò

2603.11241 2026-03-13 eess.AS cs.LG cs.SD

Cough activity detection for automatic tuberculosis screening

Joshua Jansen van Vüren, Devendra Singh Parihar, Daphne Naidoo, Kimsey Zajac, Willy Ssengooba, Grant Theron, Thomas Niesler

2603.11229 2026-03-13 stat.ML cs.LG

Trustworthy predictive distributions for rare events via diagnostic transport maps

Elizabeth Cucuzzella, Rafael Izbicki, Ann B. Lee

Comments 19 pages, 5 figures, 2 tables

2603.11212 2026-03-13 cs.CR cs.LG

Security-by-Design for LLM-Based Code Generation: Leveraging Internal Representations for Concept-Driven Steering Mechanisms

Maximilian Wendlinger, Daniel Kowatsch, Konstantin Böttinger, Philip Sperl

Comments to be published in the IEEE European Symposium on Security and Privacy (EuroS&P)'26

2603.11205 2026-03-13 eess.AS cs.SD

Can LLMs Help Localize Fake Words in Partially Fake Speech?

Lin Zhang, Thomas Thebaud, Zexin Cai, Sanjeev Khudanpur, Daniel Povey, Leibny Paola García-Perera, Matthew Wiesner, Nicholas Andrews

Comments Submitted to Interspeech 2026; posted on arXiv in line with Interspeech's requirements: "Interspeech no longer enforces an anonymity period for submissions." and "For authors that prefer to upload their paper online, a note indicating that the paper was submitted for review to Interspeech should be included in the posting."

2603.11200 2026-03-13 cs.CR cs.LG

DNS-GT: A Graph-based Transformer Approach to Learn Embeddings of Domain Names from DNS Queries

Massimiliano Altieri, Ronan Hamon, Roberto Corizzo, Michelangelo Ceci, Ignacio Sanchez

2603.11138 2026-03-13 stat.ML cs.LG math.ST stat.TH

Deep regression learning from dependent observations with minimum error entropy principle

William Kengne, Modou Wade

2603.11134 2026-03-13 math.ST cs.LG stat.TH

Conformal e-prediction in the presence of confounding

Vladimir Vovk, Ruodu Wang

Comments 8 pages, 2 figures

2603.11128 2026-03-13 stat.ML cs.LG cs.NE

Efficient Approximation to Analytic and $L^p$ functions by Height-Augmented ReLU Networks

ZeYu Li, FengLei Fan, TieYong Zeng

2603.11126 2026-03-13 cs.MA cs.CL

Enhancing Value Alignment of LLMs with Multi-agent system and Combinatorial Fusion

Yuanhong Wu, Djallel Bouneffouf, D. Frank Hsu

Comments 5 pages, 3 figures, accepted to 2026 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

2603.11125 2026-03-13 stat.ML cs.LG

Co-Diffusion: An Affinity-Aware Two-Stage Latent Diffusion Framework for Generalizable Drug-Target Affinity Prediction

Yining Qian, Pengjie Wang, Yixiao Li, An-Yang Lu, Cheng Tan, Shuang Li, Lijun Liu

2603.11095 2026-03-13 cs.MM cs.SD eess.SP

Multimodal Self-Attention Network with Temporal Alignment for Audio-Visual Emotion Recognition

Inyong Koo, Yeeun Seong, Minseok Son, Jaehyuk Jang, Changick Kim

Comments 5 pages, 3 figures, accepted to ICASSP 2026

2603.11088 2026-03-13 cs.CR cs.AI

The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey

Juhee Kim, Xiaoyuan Liu, Zhun Wang, Shi Qiu, Bo Li, Wenbo Guo, Dawn Song

Comments Accepted to USENIX Security 2026. This manuscript is an extended version of the conference paper, including additional discussion and updated content

2603.11082 2026-03-13 cs.SE cs.AI

Quality-Driven Agentic Reasoning for LLM-Assisted Software Design: Questions-of-Thoughts (QoT) as a Time-Series Self-QA Chain

Yen-Ku Liu, Yun-Cheng Tsai

2603.11078 2026-03-13 cs.SE cs.AI cs.CL

CR-Bench: Evaluating the Real-World Utility of AI Code Review Agents

Kristen Pereira, Neelabh Sinha, Rajat Ghosh, Debojyoti Dutta

2603.11068 2026-03-13 cond-mat.mtrl-sci cs.AI

From Phase Prediction to Phase Design: A ReAct Agent Framework for High-Entropy Alloy Discovery

Iman Peivaste, Salim Belouettar

2603.11061 2026-03-13 physics.chem-ph cs.AI cs.NE

Hybrid Quantum-Classical Encoding for Accurate Residue-Level pKa Prediction

Van Le, Tan Le

2603.11054 2026-03-13 cs.SI cs.AI cs.CR cs.CY cs.GT

A Survey on Quantitative Modeling of Trust in Online Social Networks

Wenting Song, K. Suzanne Barber

Comments 34 pages, 9 figures, submitted to ACM Computing Surveys

2603.11051 2026-03-13 cs.IR cs.AI cs.CL cs.LG

OpenSanctions Pairs: Large-Scale Entity Matching with LLMs

Chandler Smith, Magnus Sesodia, Friedrich Lindenberg, Christian Schroeder de Witt

2603.10323 2026-03-13 cs.CR cs.CV

The Orthogonal Vulnerabilities of Generative AI Watermarks: A Comparative Empirical Benchmark of Spatial and Latent Provenance

Jesse Yu, Nicholas Wei

Comments 10 pages, 4 figures

English abstract

As open-weights generative AI rapidly proliferates, the ability to synthesize hyper-realistic media has introduced profound challenges to digital trust. Automated disinformation and AI-generated imagery have made robust digital provenance a critical cybersecurity imperative. Currently, state-of-the-art invisible watermarks operate within one of two primary mathematical manifolds: the spatial domain (post-generation pixel embedding) or the latent domain (pre-generation frequency embedding). While existing literature frequently evaluates these models against isolated, classical distortions, there is a critical lack of rigorous, comparative benchmarking against modern generative AI editing tools. In this study, we empirically evaluate two leading representative paradigms, RivaGAN (Spatial) and Tree-Ring (Latent), utilizing an automated Attack Simulation Engine across 30 intensity intervals of geometric and generative perturbations. We formalize an "Adversarial Evasion Region" (AER) framework to measure cryptographic degradation against semantic visual retention (OpenCLIP > 75.0). Our statistical analysis ($n=100$ per interval, $MOE = \pm 3.92\%$) reveals that these domains possess mutually exclusive, mathematically orthogonal vulnerabilities. Spatial watermarks experience severe cryptographic degradation under algorithmic pixel-rewriting (exhibiting a 67.47% AER evasion rate under Img2Img translation), whereas latent watermarks exhibit profound fragility against geometric misalignment (yielding a 43.20% AER evasion rate under static cropping). By proving that single-domain watermarking is fundamentally insufficient against modern adversarial toolsets, this research exposes a systemic vulnerability in current digital provenance standards and establishes the foundational exigence for future multi-domain cryptographic architectures.

2603.10065 2026-03-13 cs.IT cs.AI cs.SY eess.SY math.IT stat.ME

The Epistemic Support-Point Filter: Jaynesian Maximum Entropy Meets Popperian Falsification

Moriba Kemessia Jah

2603.08771 2026-03-13 stat.ML cs.IT cs.LG math.IT

Micro-Diffusion Compression - Binary Tree Tweedie Denoising for Online Probability Estimation

Roberto Tacconelli

Comments 12 pages, 1 figure

2603.08245 2026-03-13 cs.CG cs.CV

Topologically Stable Hough Transform

Stefan Huber, Kristóf Huszár, Michael Kerber, Martin Uray

Comments Extended abstract will be presented at EuroCG'26; 11 pages, 7 figures

2603.07949 2026-03-13 cs.DC cs.RO

RAPID: Redundancy-Aware and Compatibility-Optimal Edge-Cloud Partitioned Inference for Diverse VLA Models

Zihao Zheng, Sicheng Tian, Hangyu Cao, Chenyue Li, Jiayu Chen, Maoliang Li, Xinhao Sun, Hailong Zou, Guojie Luo, Xiang Chen