arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2506.03407 2026-02-17 cs.GR cs.AI cs.CV cs.LG

Multi-Spectral Gaussian Splatting with Neural Color Representation

Lukas Meyer, Josef Grün, Maximilian Weiherer, Bernhard Egger, Marc Stamminger, Linus Franke

Comments for project page, see https://meyerls.github.io/ms_splatting

2505.12185 2026-02-17 cs.SE cs.CL cs.LG

EVALOOOP: A Self-Consistency-Centered Framework for Assessing Large Language Model Robustness in Programming

Sen Fang, Weiyuan Ding, Mengshi Zhang, Zihao Chen, Bowen Xu

Comments 27 pages, 7 figures

详情

英文摘要

Evaluating the programming robustness of large language models (LLMs) is paramount for ensuring their reliability in AI-based software development. However, adversarial attacks exhibit fundamental limitations that compromise fair robustness assessment: they demonstrate contradictory evaluation outcomes where different attack strategies tend to favor different models, and more critically, they operate solely through external perturbations, failing to capture the intrinsic stability essential for autonomous coding agents where subsequent inputs are endogenously generated by the model itself. We introduce EVALOOOP, a novel assessment framework that evaluates robustness from a self-consistency perspective, leveraging the natural duality inherent in software engineering tasks (e.g., code generation and code summarization). EVALOOOP establishes a self-contained feedback loop where an LLM iteratively transforms between code and natural language until functional failure occurs, with robustness quantified by a novel Average Sustainable Loops (ASL) metric-the mean number of iterations maintaining functional correctness across benchmark tasks. This cyclical strategy intrinsically evaluates robustness without relying on external attack configurations, providing a unified metric that reveals how effectively LLMs preserve semantic integrity through sustained self-referential transformations. We evaluate 96 popular LLMs, ranging from 0.5B to 685B parameters, on EVALOOOP equipped with the MBPP Plus benchmark, and found that EVALOOOP typically induces a 2.65%-47.62% absolute drop in pass@1 accuracy within ten loops. Intriguingly, robustness does not always align with initial performance (i.e., one-time query); for instance, Qwen3-235B-A22B-Instruct-2507, despite inferior initial code generation compared to OpenAI's o-series models and DeepSeek-V3, demonstrated the superior robustness (ASL score).

URL PDF HTML ☆

赞 0 踩 0

2504.20903 2026-02-17 cs.MA cs.AI cs.HC

Modeling AI-Human Collaboration as a Multi-Agent Adaptation

Prothit Sen, Sai Mihir Jakkaraju

2504.20630 2026-02-17 eess.AS cs.MM cs.SD

ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting

Yu Zhang, Wenxiang Guo, Changhao Pan, Zhiyuan Zhu, Tao Jin, Zhou Zhao

Comments Accepted by ACM Multimedia 2025

Journal ref MM '2025: Proceedings of the 33rd ACM International Conference on Multimedia Pages 9618 - 9627

2504.20532 2026-02-17 cs.MM cs.CR cs.SD eess.AS

TriniMark: A Robust Generative Speech Watermarking Method for Trinity-Level Traceability

Yue Li, Weizhi Liu, Kaiqing Lin, Dongdong Lin, Kassem Kallas

2504.19062 2026-02-17 eess.AS cs.CL cs.SD

Versatile Framework for Song Generation with Prompt-based Control

Yu Zhang, Wenxiang Guo, Changhao Pan, Zhiyuan Zhu, Ruiqi Li, Jingyu Lu, Rongjie Huang, Ruiyuan Zhang, Zhiqing Hong, Ziyue Jiang, Zhou Zhao

Comments Accepted by Findings of EMNLP 2025

Journal ref Findings of the Association for Computational Linguistics: EMNLP 2025

2504.18367 2026-02-17 physics.comp-ph cs.LG physics.chem-ph q-bio.BM

A Novel 4-D Dataset Paradigm for Studying Complete Ligand-Protein Dissociation Dynamics

Maodong Li, Jiying Zhang, Zhe Wang, Bin Feng, Wenqi Zeng, Dechin Chen, Zhijun Pan, Yu Li, Zijing Liu, Yi Isaac Yang

Comments The dissociation dynamics dataset DD-13M is publicly available at https://huggingface.co/datasets/SZBL-IDEA/MD (For facilitated browsing and categorical download, a dedicated web interface is maintained at: https://aimm.szbl.ac.cn/database/ddd/#/home)

2502.16730 2026-02-17 cs.CR cs.AI

RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents

Sho Nakatani

2502.01713 2026-02-17 cs.CY cs.LG

Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool

Floris Holstege, Mackenzie Jorgensen, Kirtan Padh, Jurriaan Parie, Krsto Prorokovic, Joel Persson, Lukas Snoek

2410.17587 2026-02-17 cs.CE cs.LG econ.GN physics.soc-ph q-fin.EC

Predicting Company Growth using Scaling Theory informed Machine Learning

Ruyi Tao, Veronica R. Cappelli, Kaiwei Liu, Marcus J. Hamilton, Christopher P. Kempes, Geoffrey B. Wes, Jiang Zhang

Comments 28 pages, 13 figures, 3 tables

2409.13832 2026-02-17 eess.AS cs.CL cs.SD

GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Yu Zhang, Changhao Pan, Wenxiang Guo, Ruiqi Li, Zhiyuan Zhu, Jialei Wang, Wenhao Xu, Jingyu Lu, Zhiqing Hong, Chuxin Wang, LiChao Zhang, Jinzheng He, Ziyue Jiang, Yuxin Chen, Chen Yang, Jiecheng Zhou, Xinyu Cheng, Zhou Zhao

Comments Accepted by NeurIPS 2024 (Spotlight)

Journal ref Advances in Neural Information Processing Systems 37 (NeurIPS 2024)

2409.04994 2026-02-17 math.OC cs.LG cs.NA math.NA

Learning nonnegative matrix factorizations from compressed data

Abraar Chaudhry, Elizaveta Rebrova

2402.03931 2026-02-17 cond-mat.mes-hall cs.LG quant-ph

Fully autonomous tuning of a spin qubit

Jonas Schuff, Miguel J. Carballido, Madeleine Kotzagiannidis, Juan Carlos Calvo, Marco Caselli, Jacob Rawling, David L. Craig, Barnaby van Straaten, Brandon Severin, Federico Fedele, Simon Svab, Pierre Chevalier Kwon, Rafael S. Eggli, Taras Patlatiuk, Nathan Korda, Dominik Zumbühl, Natalia Ares

Journal ref Nature Electronics (2026)

2306.14297 2026-02-17 stat.ME cs.LG

Inference for relative sparsity

Samuel J. Weisenthal, Sally W. Thurston, Ashkan Ertefaie

Comments 66 pages, 3 figures

2305.01186 2026-02-17 cs.CY cs.AI

Deconstructing Student Perceptions of Generative AI (GenAI) through an Expectancy Value Theory (EVT)-based Instrument

Cecilia Ka Yuk Chan, Wenxin Zhou

Journal ref Smart Learn. Environ. 10, 64 (2023)

2103.14203 2026-02-17 stat.ML cs.LG

Deep Two-Way Matrix Reordering for Relational Data Analysis

Chihiro Watanabe, Taiji Suzuki

2103.02872 2026-02-17 cs.CR cs.LG

An RL-Based Adaptive Detection Strategy to Secure Cyber-Physical Systems

Ipsita Koley, Sunandan Adhikary, Soumyajit Dey

2602.13675 2026-02-17 cs.HC cs.AI

Transferable XAI: Relating Understanding Across Domains with Explanation Transfer

Fei Wang, Yifan Zhang, Brian Y. Lim

Comments 40 pages, accepted by IUI2026

详情

DOI: 10.1145/3742413.3789124

英文摘要

Current Explainable AI (XAI) focuses on explaining a single application, but when encountering related applications, users may rely on their prior understanding from previous explanations. This leads to either overgeneralization and AI overreliance, or burdensome independent memorization. Indeed, related decision tasks can share explanatory factors, but with some notable differences; e.g., body mass index (BMI) affects the risks for heart disease and diabetes at the same rate, but chest pain is more indicative of heart disease. Similarly, models using different attributes for the same task still share signals; e.g., temperature and pressure affect air pollution but in opposite directions due to the ideal gas law. Leveraging transfer of learning, we propose Transferable XAI to enable users to transfer understanding across related domains by explaining the relationship between domain explanations using a general affine transformation framework applied to linear factor explanations. The framework supports explanation transfer across various domain types: translation for data subspace (subsuming prior work on Incremental XAI), scaling for decision task, and mapping for attributes. Focusing on task and attributes domain types, in formative and summative user studies, we investigated how well participants could understand AI decisions from one domain to another. Compared to single-domain and domain-independent explanations, Transferable XAI was the most helpful for understanding the second domain, leading to the best decision faithfulness, factor recall, and ability to relate explanations between domains. This framework contributes to improving the reusability of explanations across related AI applications by explaining factor relationships between subspaces, tasks, and attributes.

URL PDF HTML ☆

赞 0 踩 0

2602.13672 2026-02-17 cs.NI cs.LG

LEAD-Drift: Real-time and Explainable Intent Drift Detection by Learning a Data-Driven Risk Score

Md. Kamrul Hossain, Walid Aljoby

2602.13671 2026-02-17 cs.MA cs.AI

MAS-on-the-Fly: Dynamic Adaptation of LLM-based Multi-Agent Systems at Test Time

Guangyi Liu, Haojun Lin, Huan Zeng, Heng Wang, Quanming Yao

2602.13625 2026-02-17 cs.HC cs.AI

Anthropomorphism on Risk Perception: The Role of Trust and Domain Knowledge in Decision-Support AI

Manuele Reani, Xiangyang He, Zuolan Bao

2602.13619 2026-02-17 stat.ML cs.IT cs.LG math.IT stat.ME

Locally Private Parametric Methods for Change-Point Detection

Anuj Kumar Yadav, Cemre Cadir, Yanina Shkel, Michael Gastpar

Comments 43 pages, 20 figures

2602.13611 2026-02-17 cs.SE cs.AI

From What to How: Bridging User Requirements with Software Development Using Large Language Models

Xiao He, Ru Chen, Jialun Cao

2602.13606 2026-02-17 cs.NI cs.AI cs.ET cs.LG

Multi-Modal Sensing and Fusion in mmWave Beamforming for Connected Vehicles: A Transformer Based Framework

Muhammad Baqer Mollah, Honggang Wang, Mohammad Ataul Karim, Hua Fang

Comments 13 Pages. arXiv admin note: text overlap with arXiv:2509.11112

Journal ref IEEE Transactions on Vehicular Technology, 2026

2602.13576 2026-02-17 cs.CR cs.AI cs.CL

Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges

Ruomeng Ding, Yifei Pang, He Sun, Yizhong Wang, Zhiwei Steven Wu, Zhun Deng

2602.13562 2026-02-17 cs.CR cs.AI cs.CL

Mitigating the Safety-utility Trade-off in LLM Alignment via Adaptive Safe Context Learning

Yanbo Wang, Minzheng Wang, Jian Liang, Lu Wang, Yongcan Yu, Ran He

Comments Preprint. 18 pages, 6 figures

2602.13556 2026-02-17 cs.IT cs.AI eess.SP math.IT

Discrete-Space Generative AI Pipeline for Semantic Transmission of Signals

Silvija Kokalj-Filipovic, Yagna Kaasaragadda

2602.13554 2026-02-17 cs.ET cs.IT cs.RO math.IT

From Snapshot Sensing to Persistent EM World Modeling: A Generative-Space Perspective for ISAC

Pin-Han Ho, Haoran Mei, Limei Peng, Yiming Miao, Kairan Liang, Yan Jiao

Comments 7 pages, 6 figures/tables

2602.13547 2026-02-17 cs.CR cs.AI

AISA: Awakening Intrinsic Safety Awareness in Large Language Models against Jailbreak Attacks

Weiming Song, Xuan Xie, Ruiping Yin

2602.13543 2026-02-17 cs.IR cs.CL cs.LG

LiveNewsBench: Evaluating LLM Web Search Capabilities with Freshly Curated News

Yunfan Zhang, Kathleen McKeown, Smaranda Muresan

Comments An earlier version of this work was publicly available on OpenReview as an ICLR 2026 submission in September 2025

AI 大模型

视觉与机器人

科学与医疗