arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2510.00771 2026-02-06 eess.AS cs.AI cs.SD eess.SP

UniverSR: Unified and Versatile Audio Super-Resolution via Vocoder-Free Flow Matching

Woongjib Choi, Sangmin Lee, Hyungseob Lim, Hong-Goo Kang

Comments Accepted to ICASSP 2026

2508.06616 2026-02-06 cs.NI cs.AI

Generative AI for Intent-Driven Network Management in 6G RAN: A Case Study on the Mamba Model

Md Arafat Habib, Medhat Elsayed, Yigit Ozcan, Pedro Enrique Iturria-Rivera, Majid Bavand, Melike Erol-Kantarci

Comments Paper submitted to IEEE for possible publication. The contents of this paper may change at any time

2507.16838 2026-02-06 eess.AS cs.AI cs.CL

Segmentation-free Goodness of Pronunciation

Xinwei Cao, Zijian Fan, Torbjørn Svendsen, Giampiero Salvi

Comments The article has been accepted for publication by IEEE TASLPRO

2506.22949 2026-02-06 cs.CR cs.AI cs.LG

A Study on Semi-Supervised Detection of DDoS Attacks under Class Imbalance

Ehsan Hallaji, Vaishnavi Shanmugam, Roozbeh Razavi-Far, Mehrdad Saif

Comments Accepted for publication in IEEE CCECE 2025

Journal ref IEEE Canadian Conference on Electrical and Computer Engineering (CCECE), Vancouver, BC, Canada, 2025, pp. 507-511

2506.19806 2026-02-06 cs.CY cs.CL cs.MA

LLM-Based Social Simulations Require a Boundary

Zengqing Wu, Run Peng, Takayuki Ito, Makoto Onizuka, Chuan Xiao

2506.17208 2026-02-06 cs.SE cs.AI cs.CL

Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems

Matias Martinez, Xavier Franch

Comments Part of this work (RQ1) has been published at the 2026 IEEE/ACM 48th International Conference on Software Engineering (ICSE-SEIP 2026), DOI: 10.1145/3786583.3786904. The published version is also available on arXiv at arXiv:2602.04449

2506.08520 2026-02-06 eess.IV cs.CV

Plug-and-play linear attention with provable guarantees for training-free image restoration

Srinivasan Kidambi, Karthik Palaniappan, Pravin Nair

2505.19447 2026-02-06 eess.IV cs.CV

A Contrastive Learning Foundation Model Based on Perfectly Aligned Sample Pairs for Remote Sensing Images

Hengtong Shen, Haiyan Gu, Haitao Li, Yi Yang, Agen Qiu

Comments This article has been accepted for publication in Geo-spatial Information Science, published by Taylor & Francis

2505.14410 2026-02-06 eess.AS cs.CL cs.SD

Pairwise Evaluation of Accent Similarity in Speech Synthesis

Jinzuomu Zhong, Suyuan Liu, Dan Wells, Korin Richmond

Comments Accepted by INTERSPEECH 2025

2505.06085 2026-02-06 cs.PF cs.AI cs.AR

Assessing Tenstorrent's RISC-V MatMul Acceleration Capabilities

Hiari Pizzini Cavagna, Daniele Cesarini, Andrea Bartolini

Comments Accepted to the Computational Aspects of Deep Learning Workshop at ISC High Performance 2025. To appear in the ISC High Performance 2025 Workshop Proceedings

Journal ref Proc. High Performance Computing: ISC High Performance 2025 International Workshops, Hamburg, Germany

2505.01866 2026-02-06 cs.CR cs.LG

PQS-BFL: A Post-Quantum Secure Blockchain-based Federated Learning Framework

Daniel Commey, Garth V. Crosby

Journal ref Expert Systems with Applications, 131449 (2026)

2504.20741 2026-02-06 cs.HC cs.AI cs.CY cs.LG

In defence of post-hoc explanations in medical AI

Joshua Hatherley, Lauritz Munch, Jens Christian Bjerring

Journal ref 2026. Hastings Center Report 56(1): 40-46

2502.17399 2026-02-06 cs.HC cs.RO

Enriching physical-virtual interaction in AR gaming by tracking identical objects via an egocentric partial observation frame

Liuchuan Yu, Ching-I Huang, Hsueh-Cheng Wang, Lap-Fai Yu

Journal ref Virtual Reality 30 (2026) 51

详情

DOI: 10.1007/s10055-025-01305-y

英文摘要

Augmented reality (AR) games, particularly those designed for head-mounted displays, have grown increasingly prevalent. However, most existing systems depend on pre-scanned, static environments and rely heavily on continuous tracking or marker-based solutions, which limit adaptability in dynamic physical spaces. This is particularly problematic for AR headsets and glasses, which typically follow the user's head movement and cannot maintain a fixed, stationary view of the scene. Moreover, continuous scene observation is neither power-efficient nor practical for wearable devices, given their limited battery and processing capabilities. A persistent challenge arises when multiple identical objects are present in the environment-standard object tracking pipelines often fail to maintain consistent identities without uninterrupted observation or external sensors. These limitations hinder fluid physical-virtual interactions, especially in dynamic or occluded scenes where continuous tracking is infeasible. To address this, we introduce a novel optimization-based framework for re-identifying identical objects in AR scenes using only one partial egocentric observation frame captured by a headset. We formulate the problem as a label assignment task solved via integer programming, augmented with a Voronoi diagram-based pruning strategy to improve computational efficiency. This method reduces computation time by 50% while preserving 91% accuracy in simulated experiments. Moreover, we evaluated our approach in quantitative synthetic and quantitative real-world experiments. We also conducted three qualitative real-world experiments to demonstrate the practical utility and generalizability for enabling dynamic, markerless object interaction in AR environments. Our video demo is available at https://youtu.be/RwptEfLtW1U.

URL PDF HTML ☆

赞 0 踩 0

2501.12911 2026-02-06 cs.CR cs.DC cs.LG

A Selective Homomorphic Encryption Approach for Faster Privacy-Preserving Federated Learning

Abdulkadir Korkmaz, Praveen Rao

Comments 18 pages, 18 figures

Journal ref Proceedings of the IEEE Consumer Communications & Networking Conference (CCNC), 2026

详情

DOI: 10.1109/CCNC65079.2026.11366371

英文摘要

Federated learning (FL) has come forward as a critical approach for privacy-preserving machine learning in healthcare, allowing collaborative model training across decentralized medical datasets without exchanging clients' data. However, current security implementations for these systems face a fundamental trade-off: rigorous cryptographic protections like fully homomorphic encryption (FHE) impose prohibitive computational overhead, while lightweight alternatives risk vulnerable data leakage through model updates. To address this issue, we present FAS (Fast and Secure Federated Learning), a novel approach that strategically combines selective homomorphic encryption, differential privacy, and bitwise scrambling to achieve robust security without compromising practical usability. Our approach eliminates the need for model pretraining phases while dynamically protecting high-risk model parameters through layered encryption and obfuscation. We implemented FAS using the Flower framework and evaluated it on a cluster of eleven physical machines. Our approach was up to 90\% faster than applying FHE on the model weights. In addition, we eliminated the computational overhead that is required by competitors such as FedML-HE and MaskCrypt. Our approach was up to 1.5$\times$ faster than the competitors while achieving comparable security results. Experimental evaluations on medical imaging datasets confirm that FAS maintains similar security results to conventional FHE against gradient inversion attacks while preserving diagnostic model accuracy. These results position FAS as a practical solution for latency-sensitive healthcare applications where both privacy preservation and computational efficiency are requirements.

URL PDF HTML ☆

赞 0 踩 0

2501.01741 2026-02-06 cs.SE cs.AI cs.CL

How Toxic Can You Get? Search-based Toxicity Testing for Large Language Models

Simone Corbo, Luca Bancale, Valeria De Gennaro, Livia Lestingi, Vincenzo Scotti, Matteo Camilli

Journal ref IEEE Transactions on Software Engineering, Nov. 2025, pp. 3056-3071, vol. 51

详情

DOI: 10.1109/TSE.2025.3607625

英文摘要

Language is a deep-rooted means of perpetration of stereotypes and discrimination. Large Language Models (LLMs), now a pervasive technology in our everyday lives, can cause extensive harm when prone to generating toxic responses. The standard way to address this issue is to align the LLM , which, however, dampens the issue without constituting a definitive solution. Therefore, testing LLM even after alignment efforts remains crucial for detecting any residual deviations with respect to ethical standards. We present EvoTox, an automated testing framework for LLMs' inclination to toxicity, providing a way to quantitatively assess how much LLMs can be pushed towards toxic responses even in the presence of alignment. The framework adopts an iterative evolution strategy that exploits the interplay between two LLMs, the System Under Test (SUT) and the Prompt Generator steering SUT responses toward higher toxicity. The toxicity level is assessed by an automated oracle based on an existing toxicity classifier. We conduct a quantitative and qualitative empirical evaluation using five state-of-the-art LLMs as evaluation subjects having increasing complexity (7-671B parameters). Our quantitative evaluation assesses the cost-effectiveness of four alternative versions of EvoTox against existing baseline methods, based on random search, curated datasets of toxic prompts, and adversarial attacks. Our qualitative assessment engages human evaluators to rate the fluency of the generated prompts and the perceived toxicity of the responses collected during the testing sessions. Results indicate that the effectiveness, in terms of detected toxicity level, is significantly higher than the selected baseline methods (effect size up to 1.0 against random search and up to 0.99 against adversarial attacks). Furthermore, EvoTox yields a limited cost overhead (from 22% to 35% on average).

URL PDF HTML ☆

赞 0 踩 0

2410.17790 2026-02-06 eess.AS cs.SD

Regularized autoregressive modeling and its application to audio signal reconstruction

Ondřej Mokrý, Pavel Rajmic

Comments submitted to IEEE Transactions on Audio, Speech, and Language Processing

2409.09143 2026-02-06 cs.CR cs.CL

DomURLs_BERT: Pre-trained BERT-based Model for Malicious Domains and URLs Detection and Classification

Abdelkader El Mahdaouy, Salima Lamsiyah, Meryem Janati Idrissi, Hamza Alami, Zakaria Yartaoui, Ismail Berrada

Journal ref Journal of Network and Systems Management, 34, 36, 2026

2409.01978 2026-02-06 quant-ph cs.LG stat.ML

Application of Langevin Dynamics to Advance the Quantum Natural Gradient Optimization Algorithm

Oleksandr Borysenko, Mykhailo Bratchenko, Ilya Lukin, Mykola Luhanko, Ihor Omelchenko, Andrii Sotnikov, Alessandro Lomi

Comments 11 pages, 3 figures

Journal ref Physica A 682 (2026) 131158

2404.05748 2026-02-06 q-bio.NC cs.LG

Analyzing heterogeneity in Alzheimer Disease using multimodal normative modeling on imaging-based ATN biomarkers

Sayantan Kumar, Tom Earnest, Braden Yang, Deydeep Kothapalli, Andrew J. Aschenbrenner, Jason Hassenstab, Chengie Xiong, Beau Ances, John Morris, Tammie L. S. Benzinger, Brian A. Gordon, Philip Payne, Aristeidis Sotiras

Comments Under review in Alzheimer's & Dementia

Journal ref Alzheimer's Dement. 2025; 21:e70143

2403.11249 2026-02-06 eess.IV cs.CV

YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images

Chun-Tse Chien, Rui-Yang Ju, Kuang-Yi Chou, Jen-Shiun Chiang

Comments Accepted by Electronics Letters

Journal ref Electron. Lett. 60 (2024) e13248

2403.01673 2026-02-06 stat.ML cs.AI cs.LG

CATS: Enhancing Multivariate Time Series Forecasting by Constructing Auxiliary Time Series as Exogenous Variables

Jiecheng Lu, Xu Han, Yan Sun, Shihao Yang

Comments Camera-ready version. Accepted at ICML 2024

Journal ref Proceedings of the Forty-first International Conference on Machine Learning (ICML 2024)

2403.01600 2026-02-06 cs.MA cs.AI

Can Poverty Be Reduced by Acting on Discrimination? An Agent-based Model for Policy Making

Alba Aguilera, Nieves Montes, Georgina Curto, Carles Sierra, Nardine Osman

2310.09488 2026-02-06 stat.ML cs.LG

ARM: Refining Multivariate Forecasting with Adaptive Temporal-Contextual Learning

Jiecheng Lu, Xu Han, Shihao Yang

Comments Camera-ready version. Accepted at ICLR 2024

Journal ref Proceedings of the Twelfth International Conference on Learning Representations (ICLR 2024)

2309.16858 2026-02-06 stat.ML cs.LG

Improved Generalization Bounds for Transductive Learning by Transductive Local Complexity and Its Applications

Yingzhen Yang

Comments The ICML 2025 conference version (https://openreview.net/pdf?id=NRVdvg7VMn) is a special case of this paper where the chain length is fixed at 2 (i.e.,$Q=2$, see Def. 5.1), and its main results follow directly from the results here. This paper further provides a nearly optimal excess risk bound for realizable transductive learning and a stronger bound for transductive kernel learning

2110.04903 2026-02-06 eess.IV cs.LG

Normative Modeling using Multimodal Variational Autoencoders to Identify Abnormal Brain Structural Patterns in Alzheimer Disease

Sayantan Kumar, Philip Payne, Aristeidis Sotiras

Comments Medical Imaging Meets NeurIPS workshop in NeurIPS 2022

Journal ref Proc. SPIE 12465, Medical Imaging 2023: Computer-Aided Diagnosis, 1246503 (7 April 2023)

详情

DOI: 10.1117/12.2654369

英文摘要

Normative modelling is an emerging method for understanding the underlying heterogeneity within brain disorders like Alzheimer Disease (AD) by quantifying how each patient deviates from the expected normative pattern that has been learned from a healthy control distribution. Since AD is a multifactorial disease with more than one biological pathways, multimodal magnetic resonance imaging (MRI) neuroimaging data can provide complementary information about the disease heterogeneity. However, existing deep learning based normative models on multimodal MRI data use unimodal autoencoders with a single encoder and decoder that may fail to capture the relationship between brain measurements extracted from different MRI modalities. In this work, we propose multi-modal variational autoencoder (mmVAE) based normative modelling framework that can capture the joint distribution between different modalities to identify abnormal brain structural patterns in AD. Our multi-modal framework takes as input Freesurfer processed brain region volumes from T1-weighted (cortical and subcortical) and T2-weighed (hippocampal) scans of cognitively normal participants to learn the morphological characteristics of the healthy brain. The estimated normative model is then applied on Alzheimer Disease (AD) patients to quantify the deviation in brain volumes and identify the abnormal brain structural patterns due to the effect of the different AD stages. Our experimental results show that modeling joint distribution between the multiple MRI modalities generates deviation maps that are more sensitive to disease staging within AD, have a better correlation with patient cognition and result in higher number of brain regions with statistically significant deviations compared to a unimodal baseline model with all modalities concatenated as a single input.

URL PDF HTML ☆

赞 0 踩 0

2602.05207 2026-02-06 eess.AS cs.AI

ARCHI-TTS: A flow-matching-based Text-to-Speech Model with Self-supervised Semantic Aligner and Accelerated Inference

Chunyat Wu, Jiajun Deng, Zhengxi Liu, Zheqi Dai, Haolin He, Qiuqiang Kong

Comments Accepted by ICASSP 2026

2602.05184 2026-02-06 hep-th cond-mat.dis-nn cs.AI cs.LG

Towards Worst-Case Guarantees with Scale-Aware Interpretability

Lauren Greenspan, David Berman, Aryeh Brill, Ro Jefferson, Artemy Kolchinsky, Jennifer Lin, Andrew Mack, Anindita Maiti, Fernando E. Rosas, Alexander Stapleton, Lucas Teixeira, Dmitry Vaintrob

2602.05174 2026-02-06 stat.ML cs.AI cs.LG math.ST stat.TH

Total Variation Rates for Riemannian Flow Matching

Yunrui Guan, Krishnakumar Balasubramanian, Shiqian Ma

2602.05167 2026-02-06 physics.comp-ph cond-mat.soft cond-mat.stat-mech cs.LG physics.chem-ph

Path Sampling for Rare Events Boosted by Machine Learning

Porhouy Minh, Sapna Sarupria

Comments 7 pages, 1 figure

Journal ref P. Minh & S. Sarupria, KIM REVIEW, Volume 4, Article 01, 2026

2602.05120 2026-02-06 cs.CC cs.LG

Certifiable Boolean Reasoning Is Universal

Wenhao Li, Anastasis Kratsios, Hrad Ghoukasian, Dennis Zvigelsky

Comments Submitted to COLT 2026

AI 大模型

视觉与机器人

科学与医疗

UniverSR: Unified and Versatile Audio Super-Resolution via Vocoder-Free Flow Matching

Generative AI for Intent-Driven Network Management in 6G RAN: A Case Study on the Mamba Model

Segmentation-free Goodness of Pronunciation

A Study on Semi-Supervised Detection of DDoS Attacks under Class Imbalance

LLM-Based Social Simulations Require a Boundary

Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems

Plug-and-play linear attention with provable guarantees for training-free image restoration

A Contrastive Learning Foundation Model Based on Perfectly Aligned Sample Pairs for Remote Sensing Images

Pairwise Evaluation of Accent Similarity in Speech Synthesis

Assessing Tenstorrent's RISC-V MatMul Acceleration Capabilities

PQS-BFL: A Post-Quantum Secure Blockchain-based Federated Learning Framework

In defence of post-hoc explanations in medical AI

Enriching physical-virtual interaction in AR gaming by tracking identical objects via an egocentric partial observation frame

A Selective Homomorphic Encryption Approach for Faster Privacy-Preserving Federated Learning

How Toxic Can You Get? Search-based Toxicity Testing for Large Language Models

Regularized autoregressive modeling and its application to audio signal reconstruction

DomURLs_BERT: Pre-trained BERT-based Model for Malicious Domains and URLs Detection and Classification

Application of Langevin Dynamics to Advance the Quantum Natural Gradient Optimization Algorithm

Analyzing heterogeneity in Alzheimer Disease using multimodal normative modeling on imaging-based ATN biomarkers

YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images

CATS: Enhancing Multivariate Time Series Forecasting by Constructing Auxiliary Time Series as Exogenous Variables

Can Poverty Be Reduced by Acting on Discrimination? An Agent-based Model for Policy Making

ARM: Refining Multivariate Forecasting with Adaptive Temporal-Contextual Learning

Improved Generalization Bounds for Transductive Learning by Transductive Local Complexity and Its Applications

Normative Modeling using Multimodal Variational Autoencoders to Identify Abnormal Brain Structural Patterns in Alzheimer Disease

ARCHI-TTS: A flow-matching-based Text-to-Speech Model with Self-supervised Semantic Aligner and Accelerated Inference

Towards Worst-Case Guarantees with Scale-Aware Interpretability

Total Variation Rates for Riemannian Flow Matching

Path Sampling for Rare Events Boosted by Machine Learning

Certifiable Boolean Reasoning Is Universal