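arXivDaily daily academic digest: synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and related fields.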

2507.12108 2026-02-16 cs.SI cs.AI cs.CY cs.HC cs.LG

Multimodal Coordinated Online Behavior: Trade-offs and Strategies

Lorenzo Mannocci, Stefano Cresci, Matteo Magnani, Anna Monreale, Maurizio Tesconi

Comments Postprint of the article published in the Information Sciences journal. Please, cite accordingly

2505.04586 2026-02-16 eess.IV cs.CV cs.LG

Active Sampling for MRI-based Sequential Decision Making

Yuning Du, Jingshuai Liu, Rohan Dharmakumar, Sotirios A. Tsaftaris

Comments Under Review

2505.01239 2026-02-16 eess.IV cs.CV

Can Foundation Models Really Segment Tumors? A Benchmarking Odyssey in Lung CT Imaging

Elena Mulero Ayllón, Massimiliano Mantegna, Linlin Shen, Paolo Soda, Valerio Guarrasi, Matteo Tortora

2503.15555 2026-02-16 eess.IV cs.AI

Whole-Body Image-to-Image Translation for a Virtual Scanner in a Healthcare Digital Twin

Valerio Guarrasi, Francesco Di Feola, Rebecca Restivo, Lorenzo Tronchin, Paolo Soda

2503.15058 2026-02-16 eess.IV cs.AI cs.CV

Texture-Aware StarGAN for CT data harmonisation

Francesco Di Feola, Ludovica Pompilio, Cecilia Assolito, Valerio Guarrasi, Paolo Soda

2502.07971 2026-02-16 cs.IR cs.AI cs.LG

Hierarchical Retrieval at Scale: Bridging Transparency and Efficiency

Shubham Gupta, Zichao Li, Tianyi Chen, Cem Subakan, Siva Reddy, Perouz Taslakian, Valentina Zantedeschi

2501.19176 2026-02-16 eess.IV cs.AI cs.CV

Augmented Intelligence for Multimodal Virtual Biopsy in Breast Cancer Using Generative Artificial Intelligence

Aurora Rofena, Claudia Lucia Piccolo, Bruno Beomonte Zobel, Paolo Soda, Valerio Guarrasi

DOI: 10.1016/j.jbi.2025.104971

Abstract

Full-Field Digital Mammography (FFDM) is the primary imaging modality for routine breast cancer screening; however, its effectiveness is limited in patients with dense breast tissue or fibrocystic conditions. Contrast-Enhanced Spectral Mammography (CESM), a second-level imaging technique, offers enhanced accuracy in tumor detection. Nonetheless, its application is restricted due to higher radiation exposure, the use of contrast agents, and limited accessibility. As a result, CESM is typically reserved for select cases, leaving many patients to rely solely on FFDM despite the superior diagnostic performance of CESM. While biopsy remains the gold standard for definitive diagnosis, it is an invasive procedure that can cause discomfort for patients. We introduce a multimodal, multi-view deep learning approach for virtual biopsy, integrating FFDM and CESM modalities in craniocaudal and mediolateral oblique views to classify lesions as malignant or benign. To address the challenge of missing CESM data, we leverage generative artificial intelligence to impute CESM images from FFDM scans. Experimental results demonstrate that incorporating the CESM modality is crucial to enhance the performance of virtual biopsy. When real CESM data is missing, synthetic CESM images proved effective, outperforming the use of FFDM alone, particularly in multimodal configurations that combine FFDM and CESM modalities. The proposed approach has the potential to improve diagnostic workflows, providing clinicians with augmented intelligence tools to improve diagnostic accuracy and patient care. Additionally, as a contribution to the research community, we publicly release the dataset used in our experiments, facilitating further advancements in this field.

2501.12244 2026-02-16 eess.IV cs.CV

Zero-shot Bias Correction: Efficient MR Image Inhomogeneity Reduction Without Any Data

Hongxu Yang, Edina Timko, Brice Fernandez

Comments Accepted by ISBI 2025. Supported by IHI PREDICTOM Project

2411.04551 2026-02-16 math.OC cs.LG stat.ML

Measure-to-measure interpolation using Transformers

Borjan Geshkovski, Philippe Rigollet, Domènec Ruiz-Balet

Comments To appear in Foundations of Computational Mathematics

2409.16407 2026-02-16 stat.ML cs.LG stat.ME

Towards Representation Learning for Weighting Problems in Design-Based Causal Inference

Oscar Clivio, Avi Feller, Chris Holmes

Comments Reference to erroneous result from Clivio et al. (2022) in Section 3.4 fixed

2407.03888 2026-02-16 math.OC cs.LG

Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy

Lijun Bo, Yijie Huang, Xiang Yu, Tingting Zhang

2405.13771 2026-02-16 eess.IV cs.CV cs.LG

Multi-Dataset Multi-Task Learning for COVID-19 Prognosis

Filippo Ruffini, Lorenzo Tronchin, Zhuoru Wu, Wenting Chen, Paolo Soda, Linlin Shen, Valerio Guarrasi

DOI: 10.1007/978-3-031-72390-2_24

Abstract

In the fight against the COVID-19 pandemic, leveraging artificial intelligence to predict disease outcomes from chest radiographic images represents a significant scientific aim. The challenge, however, lies in the scarcity of large, labeled datasets with compatible tasks for training deep learning models without leading to overfitting. Addressing this issue, we introduce a novel multi-dataset multi-task training framework that predicts COVID-19 prognostic outcomes from chest X-rays (CXR) by integrating correlated datasets from disparate sources, distant from conventional multi-task learning approaches, which rely on datasets with multiple and correlated labeling schemes. Our framework hypothesizes that assessing severity scores enhances the model's ability to classify prognostic severity groups, thereby improving its robustness and predictive power. The proposed architecture comprises a deep convolutional network that receives inputs from two publicly available CXR datasets, AIforCOVID for severity prognostic prediction and BRIXIA for severity score assessment, and branches into task-specific fully connected output networks. Moreover, we propose a multi-task loss function, incorporating an indicator function, to exploit multi-dataset integration. The effectiveness and robustness of the proposed approach are demonstrated through significant performance improvements in prognosis classification tasks across 18 different convolutional neural network backbones in different evaluation strategies. This improvement is evident over single-task baselines and standard transfer learning strategies, supported by extensive statistical analysis, showing great application potential.
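
The abstract mentions a multi-task loss that uses an indicator function to combine two datasets, each labeled for a different task. The paper's exact formulation is not given here, so the following is only a minimal sketch of one plausible reading: the indicator selects, per sample, which task head contributes to the loss. All names (`multi_dataset_loss`, the `"prognosis"` and `"severity"` tags, `score_weight`) are hypothetical and not taken from the paper.

```python
import math


def binary_cross_entropy(prob, label):
    """Cross-entropy for one binary prediction; prob is clipped for stability."""
    eps = 1e-12
    prob = min(max(prob, eps), 1.0 - eps)
    return -(label * math.log(prob) + (1 - label) * math.log(1.0 - prob))


def squared_error(pred, target):
    """Squared error for one regression prediction."""
    return (pred - target) ** 2


def multi_dataset_loss(batch, score_weight=1.0):
    """Average a per-sample loss where an indicator picks the active task.

    Each sample is a dict with:
      "dataset": "prognosis" or "severity" (hypothetical dataset tags),
      "prognosis_prob": output of the classification head,
      "score_pred": output of the severity-score head,
      "label": the label available for that sample's own task.
    """
    total = 0.0
    for sample in batch:
        # Indicator: 1 if the sample carries a prognosis label, else 0.
        is_prognosis = sample["dataset"] == "prognosis"
        if is_prognosis:
            total += binary_cross_entropy(sample["prognosis_prob"], sample["label"])
        else:
            total += score_weight * squared_error(sample["score_pred"], sample["label"])
    return total / len(batch)


batch = [
    {"dataset": "prognosis", "prognosis_prob": 0.5, "score_pred": 0.0, "label": 1},
    {"dataset": "severity", "prognosis_prob": 0.9, "score_pred": 2.0, "label": 3.0},
]
loss = multi_dataset_loss(batch)
```

In this sketch a sample from one dataset never contributes to the other task's loss term, so mixed batches from both sources can share one backbone without needing labels for both tasks.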

2403.16640 2026-02-16 eess.IV cs.CV cs.LG

Multi-Scale Texture Loss for CT denoising with GANs

Francesco Di Feola, Lorenzo Tronchin, Valerio Guarrasi, Paolo Soda

2402.02196 2026-02-16 stat.ME cs.LG

Sample-Efficient "Clustering and Conquer" Procedures for Parallel Large-Scale Ranking and Selection

Zishi Zhang, Yijie Peng

2309.14841 2026-02-16 q-bio.NC cs.RO

Towards a Neuronally Consistent Ontology for Robotic Agents

Florian Ahrens, Mihai Pomarlan, Daniel Beßler, Thorsten Fehr, Michael Beetz, Manfred Herrmann

Comments Preprint of paper accepted for the European Conference on Artificial Intelligence (ECAI) 2023 (minor typo corrections)

Journal ref ECAI.2023. 372 (2023) 36-43

2602.12681 2026-02-16 cs.CR cs.LG

Fool Me If You Can: On the Robustness of Binary Code Similarity Detection Models against Semantics-preserving Transformations

Jiyong Uhm, Minseok Kim, Michalis Polychronakis, Hyungjoon Koo

Comments 23 pages, 9 figures, 5 tables. The paper has been accepted by The ACM International Conference on the Foundations of Software Engineering (FSE 2026)

Abstract

Binary code analysis plays an essential role in cybersecurity, facilitating reverse engineering to reveal the inner workings of programs in the absence of source code. Traditional approaches, such as static and dynamic analysis, extract valuable insights from stripped binaries, but often demand substantial expertise and manual effort. Recent advances in deep learning have opened promising opportunities to enhance binary analysis by capturing latent features and disclosing underlying code semantics. Despite the growing number of binary analysis models based on machine learning, their robustness to adversarial code transformations at the binary level remains underexplored. We evaluate the robustness of deep learning models for the task of binary code similarity detection (BCSD) under semantics-preserving transformations. The unique nature of machine instructions presents distinct challenges compared to the typical input perturbations found in other domains. We introduce asmFooler, a system that evaluates the resilience of BCSD models using a diverse set of adversarial code transformations that preserve functional semantics. We construct a dataset of 9,565 binary variants from 620 baseline samples by applying eight semantics-preserving transformations across six representative BCSD models. Our major findings highlight several key insights: i) model robustness relies on the processing pipeline, including code pre-processing, architecture, and feature selection; ii) adversarial transformation effectiveness is bounded by a budget shaped by model-specific constraints like input size and instruction expressive capacity; iii) well-crafted transformations can be highly effective with minimal perturbations; and iv) such transformations efficiently disrupt model decisions (e.g., misleading to false positives or false negatives) by focusing on semantically significant instructions.

2602.12680 2026-02-16 stat.ML cs.LG

A Regularization-Sharpness Tradeoff for Linear Interpolators

Qingyi Hu, Liam Hodgkinson

Comments 29 pages, 4 figures

2602.12641 2026-02-16 cs.NI cs.AI cs.HC cs.MM

Artic: AI-oriented Real-time Communication for MLLM Video Assistant

Jiangkai Wu, Zhiyuan Ren, Junquan Zhong, Liming Liu, Xinggong Zhang

2602.12630 2026-02-16 cs.CR cs.AI

TensorCommitments: A Lightweight Verifiable Inference for Language Models

Oguzhan Baser, Elahe Sadeghi, Eric Wang, David Ribeiro Alves, Sam Kazemian, Hong Kang, Sandeep P. Chinchali, Sriram Vishwanath

Comments 23 pages, 8 figures, under review

2602.12616 2026-02-16 eess.SY cs.RO cs.SY

When Environments Shift: Safe Planning with Generative Priors and Robust Conformal Prediction

Kaizer Rahaman, Jyotirmoy V. Deshmukh, Ashish R. Hota, Lars Lindemann

Abstract

Autonomous systems operate in environments that may change over time. An example is the control of a self-driving vehicle among pedestrians and human-controlled vehicles whose behavior may change based on factors such as traffic density, road visibility, and social norms. Therefore, the environment encountered during deployment rarely mirrors the environment and data encountered during training -- a phenomenon known as distribution shift -- which can undermine the safety of autonomous systems. Conformal prediction (CP) has recently been used along with data from the training environment to provide prediction regions that capture the behavior of the environment with a desired probability. When embedded within a model predictive controller (MPC), one can provide probabilistic safety guarantees, but only when the deployment and training environments coincide. Once a distribution shift occurs, these guarantees collapse. We propose a planning framework that is robust under distribution shifts by: (i) assuming that the underlying data distribution of the environment is parameterized by a nuisance parameter, i.e., an observable, interpretable quantity such as traffic density, (ii) training a conditional diffusion model that captures distribution shifts as a function of the nuisance parameter, (iii) observing the nuisance parameter online and generating cheap, synthetic data from the diffusion model for the observed nuisance parameter, and (iv) designing an MPC that embeds CP regions constructed from such synthetic data. Importantly, we account for discrepancies between the underlying data distribution and the diffusion model by using robust CP. Thus, the plans computed using robust CP enjoy probabilistic safety guarantees, in contrast with plans obtained from a single, static set of training data. We empirically demonstrate safety under diverse distribution shifts in the ORCA simulator.
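
The prediction regions the abstract refers to come from a calibration quantile, which can be sketched as follows. This is a minimal illustration of standard split conformal prediction, not the robust variant or the diffusion-based data generation the authors propose; the function name `conformal_radius` and the example scores are hypothetical.

```python
import math


def conformal_radius(calibration_scores, alpha):
    """Return the split-conformal quantile of nonconformity scores.

    Given n exchangeable calibration scores, the ceil((n + 1) * (1 - alpha))-th
    smallest score upper-bounds a fresh score with probability at least
    1 - alpha. If that rank exceeds n, the region is unbounded.
    """
    n = len(calibration_scores)
    if n == 0:
        raise ValueError("need at least one calibration score")
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie strictly between 0 and 1")
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        return math.inf
    return sorted(calibration_scores)[rank - 1]


# Hypothetical scores, e.g. prediction errors on held-out trajectories.
scores = list(range(1, 20))  # 19 calibration scores: 1, 2, ..., 19
radius = conformal_radius(scores, alpha=0.1)
```

A planner could then treat every point within `radius` of a predicted position as potentially occupied. Under distribution shift this plain quantile loses its guarantee, which is the gap the abstract says robust CP is meant to close.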

2602.12612 2026-02-16 cs.IR cs.AI

Self-EvolveRec: Self-Evolving Recommender Systems with LLM-based Directional Feedback

Sein Kim, Sangwu Park, Hongseok Kang, Wonjoong Kim, Jimin Seo, Yeonjun In, Kanghoon Yoon, Chanyoung Park

2602.12593 2026-02-16 cs.IR cs.AI

RQ-GMM: Residual Quantized Gaussian Mixture Model for Multimodal Semantic Discretization in CTR Prediction

Ziye Tong, Jiahao Liu, Weimin Zhang, Hongji Ruan, Derick Tang, Zhanpeng Zeng, Qinsong Zeng, Peng Zhang, Tun Lu, Ning Gu

Comments Under review

2602.12574 2026-02-16 cs.DB cs.AI

Monte Carlo Tree Search with Reasoning Path Refinement for Small Language Models in Conversational Text-to-NoSQL

Xubang Xiong, Raymond Chi-Wing Wong, Yuanfeng Song

2602.12547 2026-02-16 q-bio.NC cs.AI cs.ET

A consequence of failed sequential learning: A computational account of developmental amnesia

Qi Zhang

Comments 30 pages, 5 figures and 2 tables

Journal ref Cognitive Computation 2009-09

2602.12546 2026-02-16 eess.AS cs.AI cs.CL cs.SD

Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR

Jaeyoung Lee, Masato Mimura

Comments Accepted to ICASSP 2026

2602.12528 2026-02-16 cs.IR cs.CL

DiffuRank: Effective Document Reranking with Diffusion Language Models

Qi Liu, Kun Ai, Jiaxin Mao, Yanzhao Zhang, Mingxin Li, Dingkun Long, Pengjun Xie, Fengbin Zhu, Ji-Rong Wen

Comments The code is available at https://github.com/liuqi6777/DiffusionRank

2602.12510 2026-02-16 cs.IR cs.CV cs.LG

Visual RAG Toolkit: Scaling Multi-Vector Visual Retrieval with Training-Free Pooling and Multi-Stage Search

Ara Yeroyan

Comments 4 pages, 3 figures. Submitted to SIGIR 2026 Demonstrations Track. Project website: https://github.com/Ara-Yeroyan/visual-rag-toolkit

2602.12500 2026-02-16 cs.SE cs.AI cs.CR

Favia: Forensic Agent for Vulnerability-fix Identification and Analysis

André Storhaug, Jiamou Sun, Jingyue Li

Comments 44 pages, 12 figures, 5 tables, 3 listings

2602.12478 2026-02-16 eess.SP cs.LG

Task- and Metric-Specific Signal Quality Indices for Medical Time Series

Jad Haidamous, Christoph Hoog Antink

Comments 5 pages, 3 figures, submitted to EUSIPCO 2026

2602.12476 2026-02-16 cs.CY cs.AI cs.HC

Not a Silver Bullet for Loneliness: How Attachment and Age Shape Intimacy with AI Companions

Raffaele Ciriello, Uri Gal, Ofir Turel
