arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.05882 2026-05-08 stat.ML cs.AI cs.CY cs.LG

Tuning Derivatives for Causal Fairness in Machine Learning

Filip Edström, Guilherme W. F. Barros, Tetiana Gorbach, Xavier de Luna

详情

英文摘要

Artificial-intelligence systems are becoming ubiquitous in society, yet their predictions typically inherit biases with respect to protected attributes such as race, gender, or age. Classical fairness notions, most notably Statistical Parity (SP), demand that predictions be independent of the protected attributes, but are overly restrictive when these attributes influence mediating variables that are considered business necessities. Recent causal formulations relax SP by distinguishing allowed from not-allowed causal paths and by complementing SP with Predictive Parity (PP), requiring the predictor to replicate the legitimate influence of business-necessities. Existing path-based definitions are mainly practical when applied to categorical attributes. This paper introduces a new framework for fairness in structural causal models that is tailored to continuous protected attributes. We formalize SP and PP through path-specific partial derivatives, establish conditions under which these criteria coincide with prior causal definitions, and characterize when a fair predictor, one that satisfies SP along not-allowed paths while achieving PP along allowed paths, exists. Building on this theory, we propose a fair tuning algorithm that either constructs such a predictor or, when not possible, allows for a trade-off between SP and PP. We present experiments on simulated and real data to evaluate our proposal, compare it with previously proposed methods, and show that it performs better when PP is considered.

URL PDF HTML ☆

赞 0 踩 0

2605.05873 2026-05-08 stat.ML cs.AI cs.LG math.ST stat.ME stat.TH

CITE: Anytime-Valid Statistical Inference in LLM Self-Consistency

Hirofumi Ota, Naoto Iwase, Yuki Ichihara, Junpei Komiyama, Masaaki Imaizumi

2605.05846 2026-05-08 cs.CR cs.AI

LoopTrap: Termination Poisoning Attacks on LLM Agents

Huiyu Xu, Zhibo Wang, Wenhui Zhang, Ziqi Zhu, Yaopeng Wang, Kui Ren, Chun Chen

2605.05818 2026-05-08 cs.CR cs.AI cs.CL

LeakDojo: Decoding the Leakage Threats of RAG Systems

Maosen Zhang, Jianshuo Dong, Boting Lu, Wenyue Li, Xiaoping Zhang, Tianwei Zhang, Han Qiu

Comments Findings of ACL 2026

2605.05808 2026-05-08 stat.ML cs.LG math.ST stat.TH

Ratio-based Loss Functions

Lena Helgerth, Andreas Christmann

2605.05807 2026-05-08 cs.CR cs.AI

LCC-LLM: Leveraging Code-Centric Large Language Models for Malware Attribution

Christopher G. Pedraza Pohlenz, Hassan Jalil Hadi, Ali Hassan, Ali Shoker

详情

英文摘要

LLMs are increasingly explored for malware analysis; however, current LLM-based malware attribution remains limited by unsupported indicators and insufficient code-level grounding for identifying malicious and vulnerable code segments. To address these limitations, this research introduces LCC-LLM, a code-centric benchmark dataset and evidence-grounded framework for malware attribution and multi-task static malware analysis. The proposed LCCD dataset contains approximately 34K PE samples processed through a large-scale reverse-engineering pipeline and represented using decompiled C code, assembly code, CFG/FCG artifacts, hexadecimal data, PE metadata, suspicious API evidence, and structural features. Beyond dataset construction, LCC-LLM integrates LangGraph-orchestrated static analysis with multi-source cybersecurity knowledge to support evidence-grounded malware reasoning. The framework employs a seven-layer retrieval-augmented generation pipeline, CoVe for IoC validation, and a multi-dimensional quality gate to improve factual reliability and analyst-oriented decision support. Curriculum-ordered instruction data is used to fine-tune DeepSeek-R1-Distill-Qwen-14B and Qwen3-Coder-30B-A3B using QLoRA. Evaluation across 43 malware-analysis task types achieves an average semantic similarity of 0.634, with the highest task-level performance in structured report generation, IoC extraction, vulnerability assessment, malware configuration extraction, and malware class detection. In a real-world case study using MalwareBazaar samples, the grounded pipeline achieves a 10/10 structured analysis pass rate, producing CFG/FCG evidence, MITRE ATT&CK mappings, detection guidance, and analyst-ready reports. These results show that code-centric representations, retrieval grounding, and verification-guided reasoning improve the reliability and operational usefulness of LLM-assisted malware attribution.

URL PDF HTML ☆

赞 0 踩 0

2605.05789 2026-05-08 cs.CR cs.CV

Stego Battlefield: Evaluating Image Steganography Attacks and Steganalysis Defenses

Zhen Sun, Zongmin Zhang, Leyi Sheng, Yule Liu, Yifan Liao, Ke Li, Xinhu Zheng, Jiaheng Wei, Wenyuan Yang, Xinlei He

Comments 23 pages

2605.05768 2026-05-08 math.ST cs.LG stat.ML stat.TH

Optimal Confidence Band for Kernel Gradient Flow Estimator

Yuqian Cheng, Zhuo Chen, Qian Lin

2605.05767 2026-05-08 cs.HC cs.CL

Priming, Path-dependence, and Plasticity: Understanding the molding of user-LLM interaction and its implications from (many) chat logs in the wild

Shengqi Zhu, Jeffrey M. Rzeszotarski, David Mimno

2605.05755 2026-05-08 stat.ML cs.AI cs.LG

Transformers Provably Implement In-Context Reinforcement Learning with Policy Improvement

Haodong Liang, Lifeng Lai

Comments 25 pages, 4 figures

2605.05746 2026-05-08 cond-mat.mtrl-sci cs.LG physics.chem-ph physics.comp-ph

Polarizable atomic multipoles for learning long-range electrostatics

Dongjin Kim, Daniel S. King, Yoonjae Park, Roya Savoj, Sebastien Hamel, Xiaoyu Wang, Bingqing Cheng

2605.05724 2026-05-08 cs.MA cs.AI

Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes

Jingjie Ning, Xiaochuan Li, Ji Zeng, Hao Kang, Chenyan Xiong

2605.05705 2026-05-08 math.NA cs.LG cs.NA math.PR stat.ML

Convex-Geometric Error Bounds for Positive-Weight Kernel Quadrature

Satoshi Hayakawa

Comments 22 pages

2605.05700 2026-05-08 cs.SE cs.AI

An Empirical Study of Proactive Coding Assistants in Real-World Software Development

Lehui Li, Ruixuan Jia, Guo-Ye Yang, Jia Li

2605.05699 2026-05-08 cs.PF cs.AI

When Quantization Is Free: An int4 KV Cache That Outruns fp16 on Apple Silicon

Mohamed Amine Bergach

2605.05696 2026-05-08 cs.DC cs.AI cs.LG

Irminsul: MLA-Native Position-Independent Caching for Agentic LLM Serving

Bole Ma, Jan Eitzinger, Harald Köstler

2605.05683 2026-05-08 stat.ML cs.LG

Spectral Lens: Activation and Gradient Spectra as Diagnostics of LLM Optimization

Andy Zeyi Liu, Elliot Paquette, John Sous

2605.05648 2026-05-08 cs.CY cs.AI cs.HC

The Missing Evaluation Axis: What 10,000 Student Submissions Reveal About AI Tutor Effectiveness

Rose Niousha, Samantha Boatright Smith, Bita Akram, Peter Brusilovsky, Arto Hellas, Juho Leinonen, John DeNero, Narges Norouzi

Comments Accepted to the 27th International Conference on Artificial Intelligence in Education (AIED 2026), Main Conference Track

2605.05632 2026-05-08 cs.CR cs.CL cs.LG

Architecture Matters: Comparing RAG Systems under Knowledge Base Poisoning

Samuel Korn

详情

英文摘要

Retrieval-Augmented Generation (RAG) systems are vulnerable to knowledge base poisoning, yet existing attacks have been evaluated almost exclusively against vanilla retrieve-then-generate pipelines. Architectures designed to handle conflicting retrieved information - multi-agent debate, agentic retrieval, recursive language models - remain untested against adversarially optimized contradictions. We evaluate four RAG architectures (vanilla RAG, agentic RAG, MADAM-RAG, and Recursive Language Models) under controlled single-document (N=1) poisoning on 921 Natural Questions QA pairs, comparing a clean baseline, naive injection, and CorruptRAG-AK - an adversarial attack whose meta-epistemic framing targets credibility assessment. Architecture is a high-impact variable in adversarial robustness: under CorruptRAG-AK, attack success rates range from 81.9% (vanilla) to 24.4% (RLM) - a spread of nearly 58 percentage points across architectures with comparable clean accuracy (~92%). Decomposing this gap, once the poisoned document is retrieved, adversarial framing - not retrieval optimization - drives the majority of CorruptRAG-AK's advantage for three of four architectures, localizing the cross-architecture vulnerability at the content-reasoning stage. Our MADAM-RAG reimplementation shows the highest apparent contradiction detection rate, though our LLM judge over-identifies this behavior (~48.5% precision), so reported rates are upper bounds. Regardless of detection, MADAM-RAG cannot resolve contradictions reliably, producing a 41.4% non-answer rate even on clean inputs - though implementation divergences from the original may contribute. We introduce a seven-category behavioral taxonomy capturing contradiction detection, hedging, and failure modes beyond binary accuracy. Code, data, and analysis notebooks are publicly available.

URL PDF HTML ☆

赞 0 踩 0

2605.05625 2026-05-08 quant-ph cs.LG

Quantum Kernels for Parity-Structured Classification: A Hybrid Pipeline

Tushar Pandey

2605.05606 2026-05-08 stat.ML cs.LG math.PR

Variational Smoothing and Inference for SDEs from Sparse Data with Dynamic Neural Flows

Yu Wang, Arnab Ganguly

Comments Yu Wang and Arnab Ganguly contributed equally to this work. Corresponding to Arnab Ganguly

2605.05602 2026-05-08 cs.DS cs.AI

Nearly Optimal Attention Coresets

Edo Liberty, Alexandr Andoni, Eldar Kleiner

2605.05591 2026-05-08 stat.ML cs.LG stat.CO

In-Context Positive-Unlabeled Learning

Siyan Liu, Yi Chang, Manli Cheng, Qinglong Tian, Pengfei Li

Comments 12 pages, 1 figure, 3 tables

2605.05581 2026-05-08 cs.DC cs.LG

A Scalable Digital Twin Framework for Energy Optimization in Data Centers

Raphael Hendrigo de Souza Gonçalves, Wendel Marcos dos Santos

Comments 11 pages, 2 figures

2605.05575 2026-05-08 eess.SY cs.RO cs.SY math.OC

Maximal Controlled Invariant-MPC: Enhancing Feasibility and Reducing Conservatism through Terminal CBF Constraint in Safety-Critical Control

Tanmay Dokania, Yashwanth Kumar Nakka

Comments Under review

2605.05573 2026-05-08 astro-ph.IM cs.AI

AstroAlertBench: Evaluating the Accuracy, Reasoning, and Honesty of Multimodal LLMs in Astronomical Classification

Claire Chen, Jiabao Sean Xiao, Shuze Daniel Liu, Facundo Perez Paolino, Luke Handley, Theophile Jegou du Laz, Ricky Nilsson, Alice Zou, Matthew Graham, Ashish Mahabal

2605.05568 2026-05-08 stat.ML cs.LG

Relaxed Sparsest-Permutation Formulation for Causal Discovery at Scale

Sunmin Oh, Sang-Yun Oh, Gunwoong Park

2605.05554 2026-05-08 eess.AS cs.SD

Optimal Transport Audio Distance with Learned Riemannian Ground Metrics

Wonwoo Jeong

Comments 21 pages, 4 figures, 10 tables. The otadtk toolkit is available at https://github.com/wonwoo-jeong/otadtk

2605.05529 2026-05-08 cs.CE cs.GR cs.LG

Discrete Elastic Ribbons: A Unified Discrete Differential Geometry Framework for One-Dimensional Energy Models

Shivam Kumar Panda, M Khalid Jawed

Comments 59 pages, 9 figures, 5 tables. Source code available on https://github.com/StructuresComp/discrete-elastic-ribbon and https://github.com/StructuresComp/discrete-elastic-ribbon-jax

2605.05525 2026-05-08 cs.DB cs.CL

Anatomy of a Query: W5H Dimensions and FAR Patterns for Text-to-SQL Evaluation

Vicki Stover Hertzberg, Eduardo Valverde, Joyce C. Ho

Comments 13 pages