arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.22808 2026-03-31 cs.CR cs.LG

Combinatorial Privacy: Private Multi-Party Bitstream Grand Sum by Hiding in Birkhoff Polytopes

Praneeth Vepakomma

详情

英文摘要

We introduce PolyVeil, a protocol for private Boolean summation across $k$ clients that encodes private bits as permutation matrices in the Birkhoff polytope. A two-layer architecture gives the server perfect simulation-based security (statistical distance zero) while a separate aggregator faces \#P-hard likelihood inference via the permanent and mixed discriminant. Two variants (full and compressed) differ in what the aggregator observes. We develop a finite-sample $(\varepsilon,δ)$-DP analysis with explicit constants. In the full variant, where the aggregator sees a doubly stochastic matrix per client, the log-Lipschitz constant grows as $n^4 K_t$ and a signal-to-noise analysis shows the DP guarantee is non-vacuous only when the private signal is undetectable. In the compressed variant, where the aggregator sees a single scalar, the univariate density ratio yields non-vacuous $\varepsilon$ at moderate SNR, with the optimal decoy count balancing CLT accuracy against noise concentration. This exposes a fundamental tension. \#P-hardness requires the full matrix view (Birkhoff structure visible), while non-vacuous DP requires the scalar view (low dimensionality). Whether both hold simultaneously in one variant remains open. The protocol needs no PKI, has $O(k)$ communication, and outputs exact aggregates.

URL PDF HTML ☆

赞 0 踩 0

2603.21439 2026-03-31 cs.SE cs.AI

LLM-Powered Workflow Optimization for Multidisciplinary Software Development: An Automotive Industry Case Study

Shuai Wang, Yinan Yu, Earl Barr, Dhasarathy Parthasarathy

Comments Accepted to FSE 2026 Industrial Track

2603.20062 2026-03-31 cs.IR cs.AI

The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries

Peiying Zhu, Sidi Chang

Comments 13 pages, 10 tables, Accepted to the 10th Hospitality Finance & Economics Conference (HFE 2026), Tokyo, Japan

2603.19347 2026-03-31 cs.AR cs.LG

Exploring the Agentic Frontier of Verilog Code Generation

Patrick Yubeaton, Siddharth Garg, Chinmay Hegde

2603.12702 2026-03-31 cs.IR cs.CL cs.LG

FGTR: Fine-Grained Multi-Table Retrieval via Hierarchical LLM Reasoning

Chaojie Sun, Bin Cao, Tiantian Li, Chenyu Hou, Ruizhe Li, Jing Fan

Comments work in process;10pages, 5 figures, 4 tables

2603.12681 2026-03-31 cs.CR cs.LG

Colluding LoRA: A Compositional Vulnerability in LLM Safety Alignment

Sihao Ding

Comments Updated manuscript to better reflect the core contribution

2603.11560 2026-03-31 cs.MA cs.AI econ.TH math.DS

Feedback-Coupled Memory Systems: A Dynamical Model for Adaptive Coordination

Stefano Grassi

2603.09964 2026-03-31 cs.HC cs.AI cs.ET

Understanding the Use of a Large Language Model-Powered Guide to Make Virtual Reality Accessible for Blind and Low Vision People

Jazmin Collins, Sharon Y Lin, Tianqi Liu, Andrea Stevenson Won, Shiri Azenkot

Comments 16 pages, 5 figures, 3 tables, Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26), April 13-17, 2026, Barcelona, Spain. ACM

2603.09455 2026-03-31 cs.SE cs.AI cs.LO

Declarative Scenario-based Testing with RoadLogic

Ezio Bartocci, Alessio Gambi, Felix Gigler, Cristinel Mateis, Dejan Ničković

Comments Accepted at the 29th ACM International Conference on Hybrid Systems: Computation and Control (HSCC 2026). The final version will appear in the ACM Digital Library

2603.01499 2026-03-31 cs.CR cs.AI

Towards Privacy-Preserving LLM Inference via Covariant Obfuscation (Technical Report)

Yu Lin, Qizhi Zhang, Wenqiang Ruan, Daode Zhang, Jue Hong, Ye Wu, Hanning Xia, Yunlong Mao, Sheng Zhong

详情

英文摘要

The rapid development of large language models (LLMs) has driven the widespread adoption of cloud-based LLM inference services, while also bringing prominent privacy risks associated with the transmission and processing of private data in remote inference. For privacy-preserving LLM inference technologies to be practically applied in industrial scenarios, three core requirements must be satisfied simultaneously: (1) Accuracy and efficiency losses should be minimized to mitigate degradation in service experience. (2) The inference process can be run on large-scale clusters consist of heterogeneous legacy xPUs. (3) Compatibility with existing LLM infrastructures should be ensured to reuse their engineering optimizations. To the best of our knowledge, none of the existing privacy-preserving LLM inference methods satisfy all the above constraints while delivering meaningful privacy guarantees. In this paper, we propose AloePri, the first privacy-preserving LLM inference method for industrial applications. AloePri protects both the input and output data by covariant obfuscation, which jointly transforms data and model parameters to achieve better accuracy and privacy. We carefully design the transformation for each model component to ensure inference accuracy and data privacy while keeping full compatibility with existing infrastructures of Language Model as a Service. AloePri has been integrated into an industrial system for the evaluation of mainstream LLMs. The evaluation on Deepseek-V3.1-Terminus model (671B parameters) demonstrates that AloePri causes accuracy loss of 0.0%~3.5% and exhibits efficiency equivalent to that of plaintext inference. Meanwhile, AloePri successfully resists state-of-the-art attacks, with less than 5\% of tokens recovered. To the best of our knowledge, AloePri is the first method to exhibit practical applicability to large-scale models in real-world systems.

URL PDF HTML ☆

赞 0 踩 0

2602.20168 2026-03-31 cs.CY cs.AI cs.LG

Benchmarking Early Deterioration Prediction Across Hospital-Rich and MCI-Like Emergency Triage Under Constrained Sensing

KMA Solaiman, Joshua Sebastian, Karma Tobden

Comments Accepted at the 14th IEEE International Conference on Healthcare Informatics (ICHI) 2026. 10 pages, 4 figures, 6 tables

2602.18482 2026-03-31 physics.comp-ph cond-mat.stat-mech cs.LG stat.ML

Boltzmann Generators for Condensed Matter via Riemannian Flow Matching

Emil Hoffmann, Maximilian Schebek, Leon Klein, Frank Noé, Jutta Rogal

Comments Published as a workshop paper at AI4MAT, ICLR 2026

2601.20404 2026-03-31 cs.SE cs.AI cs.ET cs.HC

On the Impact of AGENTS.md Files on the Efficiency of AI Coding Agents

Jai Lal Lulla, Seyedmoein Mohsenimofidi, Matthias Galster, Jie M. Zhang, Sebastian Baltes, Christoph Treude

Comments 5 pages, 1 figure, 1 table

2601.18857 2026-03-31 stat.ML cs.LG

Statistical Inference for Explainable Boosting Machines

Haimo Fang, Kevin Tan, Jonathan Pipping-Gamon, Giles Hooker

Comments Accepted to AISTATS 2026 (poster)

2601.15109 2026-03-31 cs.SI cs.AI cs.CY cs.HC cs.MA

An Agentic Operationalization of DISARM for FIMI Investigation on Social Media

Kevin Tseng, Juan Carlos Toledano, Bart De Clerck, Yuliia Dukach, Phil Tinn

Comments This paper was originally presented at the International Conference on Military Communication and Information Systems (ICMCIS), organized by the Information Systems Technology (IST) Scientific and Technical Committee, IST-224-RSY---the ICMCIS, held in Bath, United Kingdom, 12-13 May 2026

详情

英文摘要

Interoperable data and intelligence flows among allied partners and operational end-users remain essential to NATO's collective defense across both conventional and hybrid threat environments. Foreign Information Manipulation and Interference (FIMI) increasingly spans multiple societal domains and information ecosystems, complicating threat characterization, persistent situational awareness, and coordinated response. Concurrent advances in AI have further lowered the barrier to conducting large-scale, AI-augmented FIMI activities -- including automated generation, personalization, and amplification of manipulative content. While frameworks such as DISARM offer a standardized analytical and metadata schema for characterizing FIMI incidents, their practical application for automating large-scale detection remains challenging. We present a framework-agnostic, agent-based operationalization of DISARM piloted to support FIMI investigation on social platforms. Our agent coordination pipeline integrates general agentic AI components that (1) identify candidate manipulative behaviors in social-media data and (2) map these behaviors to DISARM taxonomies through transparent, auditable reasoning steps. Evaluation on two practitioner-annotated, real-world datasets demonstrates that our approach can effectively scale analytic workflows that are currently manual, time-intensive, and interpretation-heavy. Notably, the experiment surfaced more than 30 previously undetected Russian bot accounts -- deployed for the 2025 election in Moldova -- during the prior non-agentic investigation. By enhancing analytic throughput, interoperability, and explainability, the proposed approach provides a direct contribution to defense policy and planning needs for improved situational awareness, cross-partner data integration, and rapid assessment of information-environment threats.

URL PDF HTML ☆

赞 0 踩 0

2601.07370 2026-03-31 cond-mat.soft cs.RO physics.app-ph physics.flu-dyn physics.med-ph

Magnetically Driven Elastic Microswimmers: Exploiting Hysteretic Collapse for Autonomous Propulsion and Independent Control

Theo Lequy, Andreas M. Menzel

Comments 12 pages, 7 figures, submitted to ACS Nanoscience Au

2512.19846 2026-03-31 eess.SY cs.RO cs.SY

A Class of Axis-Angle Attitude Control Laws for Rotational Systems

Francisco M. F. R. Gonçalves, Ryan M. Bena, Néstor O. Pérez-Arancibia

Comments 6 pages, 4 figures. Published in IEEE Control Systems Letters

2511.15090 2026-03-31 cs.DB cs.AI cs.CV

SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning

Wenhan Yu, Zhaoxi Zhang, Wang Chen, Guanqiang Qi, Weikang Li, Lei Sha, Deguo Xia, Jizhou Huang

Comments 8 pages, 4 figures, 3 tables

2511.02069 2026-03-31 physics.soc-ph cs.CL

Complete asymptotic type-token relationship for growing complex systems with inverse power-law count rankings

Pablo Rosillo-Rodes, Laurent Hébert-Dufresne, Peter Sheridan Dodds

Comments 5 pages, 2 figures

2510.25974 2026-03-31 cs.HC cs.LG

Who Leads? Comparing Human-Centric and Model-Centric Strategies for Defining ML Target Variables

Mengtian Guo, David Gotz, Yue Wang

Comments 23 pages, 6 figures

2510.15058 2026-03-31 stat.ML cs.LG math.ST stat.TH

The Minimax Lower Bound of Kernel Stein Discrepancy Estimation

Jose Cribeiro-Ramallo, Agnideep Aich, Florian Kalinke, Ashit Baran Aich, Zoltán Szabó

Comments Accepted for publication at AISTATS 2026

2510.10324 2026-03-31 stat.ML cs.LG

On some practical challenges of conformal prediction

Liang Hong, Noura Raydan Nasreddine

2510.09328 2026-03-31 cs.CG cs.AI

Randomized HyperSteiner: A Stochastic Delaunay Triangulation Heuristic for the Hyperbolic Steiner Minimal Tree

Aniss Aiman Medbouhi, Alejandro García-Castellanos, Giovanni Luca Marchetti, Daniel Pelt, Erik J Bekkers, Danica Kragic

2509.19315 2026-03-31 eess.SP cs.AI cs.LG

Advancing Few-Shot Pediatric Arrhythmia Classification with a Novel Contrastive Loss and Multimodal Learning

Yiqiao Chen, Zijian Huang, Zhenghui Feng

Comments 12pages, 9 figures

2508.08517 2026-03-31 stat.ML cs.CE cs.LG

Projection-based multifidelity linear regression for data-scarce applications

Vignesh Sella, Julie Pham, Karen Willcox, Anirban Chaudhuri

Comments 23 page, 7 figures, submitted to Machine Learning for Computational Science and Engineering special issue Accelerating Numerical Methods With Scientific Machine Learning

2507.05147 2026-03-31 cond-mat.stat-mech cond-mat.dis-nn cs.LG

Pseudo-likelihood produces associative memories able to generalize, even for asymmetric couplings

Francesco D'Amico, Dario Bocchi, Luca Maria Del Bono, Saverio Rossi, Matteo Negri

2506.17337 2026-03-31 eess.IV cs.AI cs.CV

Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights

Yuan Zhong, Ruinan Jin, Qi Dou, Xiaoxiao Li

Comments version 3

2506.04450 2026-03-31 cs.CR cs.AI cs.CL cs.LG

Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification

Payel Bhattacharjee, Fengwei Tian, Geoffrey D. Rubin, Joseph Y. Lo, Nirav Merchant, Heidi Hanson, John Gounley, Ravi Tandon

Comments Accepted in IEEE ACCESS, 2026

详情

英文摘要

Large Language Models (LLMs) are increasingly adopted across domains such as education, healthcare, and finance. In healthcare, LLMs support tasks including disease diagnosis, abnormality classification, and clinical decision-making. Among these, multi-abnormality classification of radiology reports is critical for clinical workflow automation and biomedical research. Leveraging strong natural language processing capabilities, LLMs enable efficient processing of unstructured medical text and reduce the administrative burden of manual report analysis. To improve performance, LLMs are often fine-tuned on private, institution-specific datasets such as radiology reports. However, this raises significant privacy concerns: LLMs may memorize training data and become vulnerable to data extraction attacks, while sharing fine-tuned models risks exposing sensitive patient information. Despite growing interest in LLMs for medical text classification, privacy-preserving fine-tuning for multi-abnormality classification remains underexplored. To address this gap, we propose a differentially private (DP) fine-tuning framework for multi-abnormality classification from free-text radiology reports. Our approach integrates differential privacy with Low-Rank Adaptation (LoRA) to efficiently fine-tune LLMs on sensitive clinical data while mitigating leakage risks. We further employ labels generated by a larger LLM to train smaller models, enabling efficient inference under strong privacy guarantees. Experiments on MIMIC-CXR and CT-RATE demonstrate the effectiveness of our DP-LoRA framework across varying privacy regimes. On MIMIC-CXR, our method achieves weighted F1-scores up to 0.89 under moderate privacy budgets, approaching non-private LoRA (0.90) and full fine-tuning (0.96), confirming that strong privacy can be achieved with only modest performance trade-offs.

URL PDF HTML ☆

赞 0 踩 0

2506.01399 2026-03-31 eess.SY cs.RO cs.SY

Captivity-Escape Games as a Means for Safety in Online Motion Generation

Christopher Bohn, Manuel Hess, Sören Hohmann

2505.24852 2026-03-31 cs.AR cs.LG

Chameleon: A MatMul-Free Temporal Convolutional Network Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Data

Douwe den Blanken, Charlotte Frenkel

Comments 14 pages, 7 figures; added FSL power consumption measurements at 100 kHz clock speed, fixed typos