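arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.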

2604.08720 2026-04-13 cs.SE cs.AI

Demystifying the Silence of Correctness Bugs in PyTorch Compiler

Meiziniu Li, Dongze Li, Jianmeng Liu, Shing-Chi Cheung

Abstract

Performance optimization of AI infrastructure is key to the fast adoption of large language models (LLMs). The PyTorch compiler (torch.compile), a core optimization tool for deep learning (DL) models (including LLMs), has received due attention. However, torch.compile is prone to correctness bugs, which cause incorrect outputs of compiled DL models without triggering exceptions, crashes, or warnings. These bugs pose a serious threat to the reliability of downstream LLM applications. Data from the PyTorch community shows that 19.2% of high-priority issues are incorrect outputs of compiled DL models induced by torch.compile bugs, the second-most-common bug category (only behind program crashes at 19.57%). However, no systematic study has been conducted to specifically characterize and thereby detect these bugs. In this paper, we present the first empirical study of the correctness bugs in torch.compile, examine their characteristics, and assess the effectiveness of existing fuzzers in detecting them. Based on our findings, we propose a proof-of-concept testing technique named AlignGuard, tailored specifically for detecting correctness bugs in torch.compile. AlignGuard incorporates bug characteristics distilled from our empirical study, applying LLM-based test mutation to existing test cases for correctness bug detection. At the time of writing, AlignGuard has successfully detected 23 new correctness bugs in recent torch.compile. All these bugs have been confirmed or fixed by the PyTorch development team, and over half (14/23) of them are even marked as high-priority bugs, underscoring the usefulness of our technique.

2604.08703 2026-04-13 cs.MM cs.DB cs.LG

QoS-QoE Translation with Large Language Model

Yingjie Yu, Mingyuan Wu, Ahmadreza Eslaminia, Lingzhi Zhao, Kaizhuo Yan, Klara Nahrstedt

2604.08669 2026-04-13 cond-mat.quant-gas cs.LG quant-ph

An Algorithm for Fast Assembling Large-Scale Defect-Free Atom Arrays

Tao Zhang, Xiaodi Li, Hui Zhai, Linghui Chen

2604.08661 2026-04-13 quant-ph cond-mat.dis-nn cs.LG physics.comp-ph

Geometry-Induced Long-Range Correlations in Recurrent Neural Network Quantum States

Asif Bin Ayub, Amine Mohamed Aboussalah, Mohamed Hibat-Allah

Comments 16 pages, 4 figures, and 1 table

2604.08648 2026-04-13 astro-ph.HE astro-ph.IM cs.LG hep-ph

High-dimensional inference for the $\gamma$-ray sky with differentiable programming

Siddharth Mishra-Sharma, Tracy R. Slatyer, Yitian Sun, Yuqing Wu

Comments 17 pages, 13 figures. Code available at https://github.com/smsharma/fermi-prob-prog

2604.08628 2026-04-13 cs.CR cs.AI cs.IR

Retrieval Augmented Classification for Confidential Documents

Yeseul E. Chang, Rahul Kailasa, Simon Shim, Byunghoon Oh, Jaewoo Lee

Comments Appears in: KSII The 17th International Conference on Internet (ICONI) 2025, Dec 2025. 7 pages (48-54)

Journal ref: In Proceedings of KSII ICONI 2025, Dec 2025

Abstract

Unauthorized disclosure of confidential documents demands robust, low-leakage classification. In real work environments, documents flow in and out constantly. To keep knowledge continuously updated, we propose a methodology for classifying confidential documents using Retrieval Augmented Classification (RAC). To confirm its effectiveness, we compare RAC and supervised fine-tuning (FT) on the WikiLeaks US Diplomacy corpus under realistic sequence-length constraints. On balanced data, RAC matches FT. On unbalanced data, RAC is more stable while delivering comparable performance (about 96% accuracy on both the original (unbalanced) and augmented (balanced) sets, and up to 94% F1 with proper prompting), whereas FT attains 90% F1 when trained on the augmented, balanced set but drops to 88% F1 when trained on the original, unbalanced set. When robust augmentation is infeasible, RAC provides a practical, security-preserving path to strong classification by keeping sensitive content out of model weights and under your control, and it remains robust as real-world conditions change in class balance, data, context length, or governance requirements. Because RAC grounds decisions in an external vector store with similarity matching, it is less sensitive to label skew, reduces parameter-level leakage, and can incorporate new data immediately via reindexing, a difficult step for FT, which typically requires retraining. The contributions of this paper are threefold: first, a RAC-based classification pipeline and evaluation recipe; second, a controlled study that isolates class imbalance and context-length effects for FT versus RAC in confidential-document grading; and third, actionable guidance on RAC design patterns for governed deployments.
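The idea of grounding a classification decision in an external store with similarity matching can be illustrated with a minimal sketch. This is not the paper's pipeline: the bag-of-words `embed` function, the `VectorStore` class, the k-nearest-neighbour majority vote, and the example labels are all hypothetical stand-ins chosen so the example runs with the Python standard library only.

```python
import math
from collections import Counter


def embed(text):
    """Toy bag-of-words embedding (assumption: stands in for a real encoder)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class VectorStore:
    """Hypothetical in-memory store of (embedding, label) pairs."""

    def __init__(self):
        self.items = []

    def add(self, text, label):
        # "Reindexing" here is just appending; no model weights change.
        self.items.append((embed(text), label))

    def classify(self, text, k=3):
        if not self.items:
            raise ValueError("store is empty")
        query = embed(text)
        # Retrieve the k most similar stored documents.
        ranked = sorted(self.items, key=lambda it: cosine(query, it[0]), reverse=True)
        # Majority vote over the retrieved labels.
        votes = Counter(label for _, label in ranked[:k])
        return votes.most_common(1)[0][0]


store = VectorStore()
store.add("embassy cable on secret negotiations", "confidential")
store.add("secret troop movement report", "confidential")
store.add("public press release on cultural festival", "unclassified")
store.add("public holiday schedule announcement", "unclassified")
print(store.classify("secret cable about negotiations"))
```

A new class can be introduced by calling `store.add(...)` with a new label, after which queries can match it immediately; a fine-tuned classifier would typically need retraining to learn the same label.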

2604.08625 2026-04-13 stat.ML cs.LG math.ST stat.TH

Spectral-Transport Stability and Benign Overfitting in Interpolating Learning

Gustav Olaf Yunus Laitinen-Lundström Fredriksson-Imanov

Comments 50 pages, 7 figures, 4 tables. Research article. Includes full proofs, model-specific corollaries, and synthetic supporting experiments. Submitted to Machine Learning

2604.08606 2026-04-13 cs.GT cs.AI econ.TH

Extrapolating Volition with Recursive Information Markets

Abhimanyu Pallavi Sudhir, Long Tran-Thanh

Comments Accepted to Games, Agents and Incentives Workshop at AAMAS-2026

2604.08602 2026-04-13 cs.DL cs.AI cs.LG

TiAb Review Plugin: A Browser-Based Tool for AI-Assisted Title and Abstract Screening

Yuki Kataoka, Masahiro Banno, Michihito Kyo, Shuri Nakao, Tomoo Sato, Shunsuke Taito, Tomohiro Takayama, Takahiro Tsuge, Yasushi Tsujimoto, Ryuhei So, Toshi A. Furukawa

Comments 25 pages, 2 figures. Abstract submitted to Cochrane Colloquium 2026. Code: https://github.com/youkiti/tiab-review-plugin

Abstract

Background: Server-based screening tools impose subscription costs, while open-source alternatives require coding skills. Objectives: We developed a browser extension that provides no-code, serverless artificial intelligence (AI)-assisted title and abstract screening and examined its functionality. Methods: TiAb Review Plugin is an open-source Chrome browser extension (available at https://chromewebstore.google.com/detail/tiab-review-plugin/alejlnlfflogpnabpbplmnojgoeeabij). It uses Google Sheets as a shared database, requiring no dedicated server and enabling multi-reviewer collaboration. Users supply their own Gemini API key, stored locally and encrypted. The tool offers three screening modes: manual review, large language model (LLM) batch screening, and machine learning (ML) active learning. For ML evaluation, we re-implemented the default ASReview active learning algorithm (TF-IDF with Naive Bayes) in TypeScript to enable in-browser execution, and verified equivalence against the original Python implementation using 10-fold cross-validation on six datasets. For LLM evaluation, we compared 16 parameter configurations across two model families on a benchmark dataset, then validated the optimal configuration (Gemini 3.0 Flash, low thinking budget, TopP=0.95) with a sensitivity-oriented prompt on five public datasets (1,038 to 5,628 records, 0.5 to 2.0 percent prevalence). Results: The TypeScript classifier produced top-100 rankings 100 percent identical to the original ASReview across all six datasets. For LLM screening, recall was 94 to 100 percent with precision of 2 to 15 percent, and Work Saved over Sampling at 95 percent recall (WSS@95) ranged from 48.7 to 87.3 percent. Conclusions: We developed a functional browser extension that integrates LLM screening and ML active learning into a no-code, serverless environment, ready for practical use in systematic review screening.
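For readers unfamiliar with the WSS@95 metric reported above, the following sketch computes it from a ranked screening list using the commonly cited definition, WSS@R = (N - n_screened) / N - (1 - R), where n_screened is the number of records that must be read to reach recall R. The function name `wss_at_recall` and the toy ranking are illustrative assumptions; the abstract does not state exactly how the authors computed the metric.

```python
import math


def wss_at_recall(ranked_labels, target_recall=0.95):
    """Work Saved over Sampling at a target recall.

    ranked_labels: list of 1 (relevant) / 0 (irrelevant), ordered by
    screening priority, highest priority first.
    """
    n = len(ranked_labels)
    total_relevant = sum(ranked_labels)
    if n == 0 or total_relevant == 0:
        raise ValueError("need at least one record and one relevant record")
    # Number of relevant records that must be found to reach the target recall.
    needed = math.ceil(target_recall * total_relevant)
    found = 0
    for screened, label in enumerate(ranked_labels, start=1):
        found += label
        if found >= needed:
            break
    # Fraction of records left unread, minus the fraction that random
    # sampling would leave unread at the same recall.
    return (n - screened) / n - (1.0 - target_recall)


# Toy ranking: 2 relevant records among 10, both near the top.
example = [1, 0, 1, 0, 0, 0, 0, 0, 0, 0]
print(round(wss_at_recall(example), 4))
```

A value near zero or below means the ranking saves no work compared with screening in random order; larger values mean more records can be skipped at the same recall.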

2604.08597 2026-04-13 cs.DB cs.AI

STIndex: A Context-Aware Multi-Dimensional Spatiotemporal Information Extraction System

Wenxiao Zhang, Yu Liu, Qiang Sun, Yihao Ding, Sirui Li, Yanbing Liu, Jin B. Hong, Wei Liu

2604.08594 2026-04-13 q-bio.NC cs.AI cs.HC

Mapping generative AI use in the human brain: divergent neural, academic, and mental health profiles of functional versus socio emotional AI use

Junjie Wang, Xianyang Gan, Dan Liu, Jingxian He, Stefania Ferraro, Keith M. Kendrick, Weihua Zhao, Shuxia Yao, Christian Montag, Benjamin Becker

Comments 45 pages, 20 figures, 5 tables

2604.08585 2026-04-13 cs.DB cs.AI

QCFuse: Query-Centric Cache Fusion for Efficient RAG Inference

Jianxin Yan, Zeheng Qian, Wangze Ni, Zhitao Shen, Zhiping Wang, Haoyang Li, Jia Zhu, Lei Chen, Kui Ren

2604.08580 2026-04-13 math.OC cs.LG

Adjoint Matching through the Lens of the Stochastic Maximum Principle in Optimal Control

Carles Domingo-Enrich, Jiequn Han

2604.08576 2026-04-13 cs.NI cs.AI cs.LG

GAN-Enhanced Deep Reinforcement Learning for Semantic-Aware Resource Allocation in 6G Network Slicing

Daniel Benniah John

Comments 15 pages, 8 figures. Under review. Simulation-based evaluation for 6G network slicing

2604.08552 2026-04-13 cs.DB cs.AI

Automated Standardization of Legacy Biomedical Metadata Using an Ontology-Constrained LLM Agent

Josef Hardi, Martin J. O'Connor, Marcos Martinez-Romero, Jean G. Rosario, Stephen A. Fisher, Mark A. Musen

2604.08551 2026-04-13 cs.CR cs.CY cs.LG

Self-Sovereign Agent

Wenjie Qu, Xuandong Zhao, Jiaheng Zhang, Dawn Song

2604.08550 2026-04-13 cs.IR cs.AI

Unbiased Rectification for Sequential Recommender Systems Under Fake Orders

Qiyu Qin, Yichen Li, Haozhao Wang, Cheng Wang, Rui Zhang, Ruixuan Li

Abstract

Fake orders pose increasing threats to sequential recommender systems by misleading recommendation results through artificially manipulated interactions, including click farming, context-irrelevant substitutions, and sequential perturbations. Unlike injecting carefully designed fake users to influence recommendation performance, fake orders embedded within genuine user sequences aim to disrupt user preferences and mislead recommendation results, thereby manipulating exposure rates of specific items to gain competitive advantages. To protect users' authentic interest preferences and eliminate misleading information, this paper aims to perform precise and efficient rectification on compromised sequential recommender systems while avoiding the enormous computational and time costs of retraining existing models. Specifically, we observe that fake orders are not always harmful: in certain cases, some fake orders can even have a data augmentation effect. Based on this insight, we propose Dual-view Identification and Targeted Rectification (DITaR), which primarily identifies harmful samples to achieve unbiased rectification of the system. The core idea of this method is to obtain differentiated representations from collaborative and semantic views for precise detection, and then to filter the detected suspicious fake orders to select the truly harmful ones for targeted rectification with gradient ascent. This ensures that useful information in fake orders is not removed while preventing bias residue. Moreover, it maintains the original data volume and sequence structure, thus protecting system performance and trustworthiness to achieve optimal unbiased rectification. Extensive experiments on three datasets demonstrate that DITaR achieves superior performance compared to state-of-the-art methods in terms of recommendation quality, computational efficiency, and system robustness.

2604.08549 2026-04-13 cs.IR cs.AI cs.CL

VerifAI: A Verifiable Open-Source Search Engine for Biomedical Question Answering

Miloš Košprdić, Adela Ljajić, Bojana Bašaragin, Darija Medvecki, Lorenzo Cassano, Nikola Milošević

2604.08277 2026-04-13 quant-ph cs.AI cs.LG

QARIMA: A Quantum Approach To Classical Time Series Analysis

Nishikanta Mohanty, Bikash K. Behera, Badshah Mukherjee, Pravat Dash

Comments 17 Algorithms, 19 Figures, 26 Tables

2604.06816 2026-04-13 physics.optics cs.CV

Enhanced Self-Supervised Multi-Image Super-Resolution for Camera Array Images

Yating Chen, Feng Huang, Xianyu Wu, Jing Wu, Ying Shen

2604.03936 2026-04-13 stat.ML cs.LG stat.ME

Biconvex Biclustering

Sam Rosen, Eric C. Chi, Jason Xu

Comments 34 pages, 5 figures

2603.28965 2026-04-13 eess.SY cs.RO cs.SY math.DS

Koopman Operator Framework for Modeling and Control of Off-Road Vehicle on Deformable Terrain

Kartik Loya, Phanindra Tallapragada

Comments 11 pages, 14 figures, 4 tables. Submitted to ASME Journal of Autonomous Vehicles (JAVS-26-1012)

2603.28013 2026-04-13 cs.CR cs.AI cs.LG

Kill-Chain Canaries: Stage-Level Tracking of Prompt Injection Across Attack Surfaces and Model Safety Tiers

Haochuan Kevin Wang, Zechen Zhang

Comments 10 pages, 8 figures. Benchmark code and run logs released

2602.17667 2026-04-13 cs.IR cs.CV cs.LG

When & How to Write for Personalized Demand-aware Query Rewriting in Video Search

Cheng Cheng, Chenxing Wang, Aolin Li, Haijun Wu, Huiyun Hu, Juyuan Wang

2602.07142 2026-04-13 cs.HC cs.AI

Exploring Teachers' Perspectives on Using Conversational AI Agents for Group Collaboration

Prerna Ravi, Carúmey Stevens, Beatriz Flamia Azevedo, Jasmine David, Brandon Hanks, Hal Abelson, Grace Lin, Emma Anderson

Comments Accepted to 27th International Conference on AI in Education (AIED) 2026

2602.04674 2026-04-13 cs.SI cs.AI cs.CL

Overstating Attitudes, Ignoring Networks: LLM Biases in Simulating Misinformation Susceptibility

Eun Cheol Choi, Lindsay E. Young, Emilio Ferrara

Comments Accepted to ICWSM 2026

2602.04418 2026-04-13 cs.MA cs.AI cs.DC cs.ET cs.SE

SPEAR: An Engineering Case Study of Multi-Agent Coordination for Smart Contract Auditing

Indraveni Chebolu, Arnab Mallick, Harmesh Rana

Comments Accepted at 14th International Workshop on Engineering Multi-Agent Systems (EMAS @ AAMAS)

2601.08588 2026-04-13 quant-ph cs.IT cs.LG math.IT math.ST stat.TH

Sample Complexity of Composite Quantum Hypothesis Testing

Jacob Paul Simpson, Efstratios Palias, Sharu Theresa Jose

Comments Accepted to ISIT 2026

2512.01708 2026-04-13 stat.ML cs.LG

Differentially Private and Federated Structure Learning in Bayesian Networks

Ghita Fassy El Fehri, Aurélien Bellet, Philippe Bastien

2511.03913 2026-04-13 cs.NE cs.AI

Evolutionary Optimization Trumps Adam Optimization on Embedding Space Exploration

Domício Pereira Neto, João Correia, Penousal Machado

Comments 34 pages, 6 figures, 3 tables, 18 appendix figures, 1 appendix table