arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2602.01009 2026-02-05 cs.LG cs.AI

LASS-ODE: Scaling ODE Computations to Connect Foundation Models with Dynamical Physical Systems

Haoran Li, Chenhan Xiao, Lihao Mai, Yang Weng, Erik Blasch

2602.00636 2026-02-05 cs.LG cs.SY eess.SY

On the Equilibrium between Feasible Zone and Uncertain Model in Safe Exploration

Yujie Yang, Zhilong Zheng, Shengbo Eben Li

2602.00158 2026-02-05 cs.LG cs.AI

RAPTOR: Ridge-Adaptive Logistic Probes

Ziqi Gao, Yaotian Zhu, Qingcheng Zeng, Xu Zhao, Ziqing Wang, Feng Ruan, Kaize Ding

Comments Preprint

2601.23228 2026-02-05 cs.AI cs.CL cs.ET cs.MA

Scaling Multiagent Systems with Process Rewards

Ed Li, Junyu Ren, Cat Yan

2601.22778 2026-02-05 cs.CV cs.CR

Color Matters: Demosaicing-Guided Color Correlation Training for Generalizable AI-Generated Image Detection

Nan Zhong, Yiran Xu, Mian Zou

2601.21610 2026-02-05 cs.CV

WMVLM: Evaluating Diffusion Model Image Watermarking via Vision-Language Models

Zijin Yang, Yu Sun, Kejiang Chen, Jiawei Zhao, Jun Jiang, Weiming Zhang, Nenghai Yu

2601.21436 2026-02-05 cs.LG cs.AI cs.CL cs.CV

From Consistency to Complementarity: Aligned and Disentangled Multi-modal Learning for Time Series Understanding and Reasoning

Hang Ni, Weijia Zhang, Fei Wang, Zezhi Shao, Hao Liu

2601.20347 2026-02-05 cs.CV

MMSF: Multitask and Multimodal Supervised Framework for WSI Classification and Survival Analysis

Chengying She, Chengwei Chen, Xinran Zhang, Ben Wang, Lizhuang Liu, Chengwei Shao, Yun Bian

Comments Submitted to "Biomedical Signal Processing and Control"

2601.16419 2026-02-05 cs.CL cs.CV

Learning Domain Knowledge in Multimodal Large Language Models through Reinforcement Fine-Tuning

Qinglong Cao, Yuntian Chen, Chao Ma, Xiaokang Yang

2601.15605 2026-02-05 cs.CL cs.SI

ToxiTwitch: Toward Emote-Aware Hybrid Moderation for Live Streaming Platforms

Baktash Ansari, Elias Martin, Afra Mashhadi

Comments Exploratory study; prior versions submitted to peer review

2601.14445 2026-02-05 cs.RO

Learning-based Force Sensing and Impedance Matching for Safe Haptic Feedback in Robot-assisted Laparoscopic Surgery

Aiden, Mazidi, Majid Roshanfar, Amir Sayadi, Javad Dargahi, Jake Barralet, Liane S. Feldman, Amir Hooshiar

2601.14157 2026-02-05 cs.SD cs.AI cs.LG

ConceptCaps: a Distilled Concept Dataset for Interpretability in Music Models

Bruno Sienkiewicz, Łukasz Neumann, Mateusz Modrzejewski

2601.13632 2026-02-05 cs.AI

Resilient Routing: Risk-Aware Dynamic Routing in Smart Logistics via Spatiotemporal Graph Learning

Zhiming Xue, Sichen Zhao, Yalun Qi, Xianling Zeng, Zihan Yu

2601.12090 2026-02-05 cs.CV

Detecting 3D Line Segments for 6DoF Pose Estimation with Limited Data

Matej Mok, Lukáš Gajdošech, Michal Mesároš, Martin Madaras, Viktor Kocur

Comments 8 pages, Accepted to VISAPP 2026 as Position Paper

2601.11073 2026-02-05 cs.LG cs.AI

Bridging Cognitive Neuroscience and Graph Intelligence: Hippocampus-Inspired Multi-View Hypergraph Learning for Web Finance Fraud

Rongkun Cui, Nana Zhang, Kun Zhu, Qi Zhang

2601.05564 2026-02-05 cs.SD cs.CL cs.HC eess.AS

The ICASSP 2026 HumDial Challenge: Benchmarking Human-like Spoken Dialogue Systems in the LLM Era

Zhixian Zhao, Shuiyuan Wang, Guojian Li, Hongfei Xue, Chengyou Wang, Shuai Wang, Longshuai Xiao, Zihan Zhang, Hui Bu, Xin Xu, Xinsheng Wang, Hexin Liu, Eng Siong Chng, Hung-yi Lee, Lei Xie

Comments Official summary paper for the ICASSP 2026 HumDial Challenge

2601.01778 2026-02-05 cs.CL

BanglaIPA: Towards Robust Text-to-IPA Transcription with Contextual Rewriting in Bengali

Jakir Hasan, Shrestha Datta, Md Saiful Islam, Shubhashis Roy Dipta, Ameya Debnath

Comments Accepted at LoResLM workshop, EACL 2026

2601.01171 2026-02-05 cs.CL

Almost Clinical: Linguistic properties of synthetic electronic health records

Serge Sharoff, John Baker, David Francis Hunt, Alan Simpson

2601.00645 2026-02-05 cs.CV

Quality Detection of Stored Potatoes via Transfer Learning: A CNN and Vision Transformer Approach

Shrikant Kapse, Priyankkumar Dhrangdhariya, Priya Kedia, Manasi Patwardhan, Shankar Kausley, Soumyadipta Maiti, Beena Rai, Shirish Karande

2512.23592 2026-02-05 cs.CV

Same or Not? Enhancing Visual Perception in Vision-Language Models

Damiano Marsili, Aditya Mehta, Ryan Y. Lin, Georgia Gkioxari

Comments Project webpage: https://glab-caltech.github.io/twin/

2512.10326 2026-02-05 cs.CV

StainNet: Scaling Self-Supervised Foundation Models on Immunohistochemistry and Special Stains for Computational Pathology

Jiawen Li, Jiali Hu, Xitong Ling, Yongqiang Lv, Yuxuan Chen, Yizhi Wang, Tian Guan, Yifei Liu, Yonghong He

Comments 26 pages, 7 figures, 10 tables

2512.00771 2026-02-05 cs.CV cs.AI

EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes

Xiaoshan Wu, Yifei Yu, Xiaoyang Lyu, Yihua Huang, Bo Wang, Baoheng Zhang, Zhongrui Wang, Xiaojuan Qi

Comments Accepted at NeurIPS 2025 (spotlight)

2511.22996 2026-02-05 cs.RO math.OC

Analytical Inverse Kinematic Solution for "Moz1" NonSRS 7-DOF Robot arm with novel arm angle

Ke Chen

2511.21752 2026-02-05 cs.CL cs.AI

Semantics as a Shield: Label Disguise Defense (LDD) against Prompt Injection in LLM Sentiment Classification

Yanxi Li, Ruocheng Shan

详情

英文摘要

Large language models are increasingly used for text classification tasks such as sentiment analysis, yet their reliance on natural language prompts exposes them to prompt injection attacks. In particular, class-directive injections exploit knowledge of the model's label set (e.g., positive vs. negative) to override its intended behavior through adversarial instructions. Existing defenses, such as detection-based filters, instruction hierarchies, and signed prompts, either require model retraining or remain vulnerable to obfuscation. This paper introduces Label Disguise Defense (LDD), a lightweight and model-agnostic strategy that conceals true labels by replacing them with semantically transformed or unrelated alias labels(e.g., blue vs. yellow). The model learns these new label mappings implicitly through few-shot demonstrations, preventing direct correspondence between injected directives and decision outputs. We evaluate LDD across nine state-of-the-art models, including GPT-5, GPT-4o, LLaMA3.2, Gemma3, and Mistral variants, under varying few-shot and an adversarial setting. Our results show that the ability of LDD to recover performance lost to the adversarial attack varies across models and alias choices. For every model evaluated, LDD is able to restore a portion of the accuracy degradation caused by the attack. Moreover, for the vast majority of models, we can identify more than one alias pair that achieves higher accuracy than the under-attack baseline, in which the model relies solely on few-shot learning without any defensive mechanism. A linguistic analysis further reveals that semantically aligned alias labels(e.g., good vs. bad) yield stronger robustness than unaligned symbols(e.g., blue vs. yellow). Overall, this study demonstrates that label semantics can serve as an effective defense layer, transforming meaning itself into a shield against prompt injection.

URL PDF HTML ☆

赞 0 踩 0

2511.14265 2026-02-05 cs.LG

Unified Multimodal Vessel Trajectory Prediction with Explainable Navigation Intention

Rui Zhang, Chao Li, Kezhong Liu, Chen Wang, Bolong Zheng, Hongbo Jiang

Journal ref IEEE Transactions on Intelligent Transportation Systems, vol. 27, no. 1, pp. 258-269, Jan. 2026

2511.12169 2026-02-05 cs.AI

Incremental Maintenance of DatalogMTL Materialisations

Kaiyue Zhao, Dingqi Chen, Shaoyu Wang, Pan Hu

Comments Accepted as oral paper at the main track of AAAI 2026

2511.11736 2026-02-05 cs.LG

KAN/H: Kolmogorov-Arnold Network using Haar-like bases

Susumu Katayama

2511.09554 2026-02-05 cs.CV

RF-DETR: Neural Architecture Search for Real-Time Detection Transformers

Isaac Robinson, Peter Robicheaux, Matvei Popov, Deva Ramanan, Neehar Peri

Comments This work has been accepted to the International Conference on Learning Representations (ICLR) 2026. Project Page: https://rfdetr.roboflow.com/

2511.05610 2026-02-05 cs.LG cs.AI math.OC

Conformal Prediction-Driven Adaptive Sampling for Digital Water Twins

Mohammadhossein Homaei, Mehran Tarif, Pablo Garcia Rodriguez, Andres Caro, Mar Avila

Comments 6 Pages, 7 tables, 1 Figure

2511.05059 2026-02-05 cs.CV

SurgiATM: A Physics-Guided Plug-and-Play Model for Deep Learning-Based Smoke Removal in Laparoscopic Surgery

Mingyu Sheng, Jianan Fan, Dongnan Liu, Guoyan Zheng, Ron Kikinis, Weidong Cai

Comments 21 pages, 9 figures, 10 tables. Code available at https://github.com/MingyuShengSMY/SurgiATM

详情

英文摘要

During laparoscopic surgery, smoke generated by tissue cauterization degrade endoscopic frames quality, increasing surgical risk and hindering both clinical decision-making and computer-assisted visual analysis. Therefore, removing surgical smoke is essential for patient safety and operative efficiency. In this study, we propose the Surgical Atmospheric Model (SurgiATM) for surgical smoke removal. SurgiATM statistically bridges a physics-based atmospheric model and data-driven deep learning models, combining the superior generalizability of the former with the high accuracy of the latter. SurgiATM is designed as a lightweight, plug-and-play module that can be seamlessly integrated into diverse surgical desmoking architectures to enhance their accuracy and stability. The proposed method is derived via statistically optimizing MoE model at the output end of arbitrary deep learning methods, with a Laplacian-like error distribution specifically leveraged to model surgical smoke. The output-stage MoE ensures minimal modification to the architecture of the original methods, while the Laplacian-like distribution characteristic of surgical smoke enables a lightweight reconstruction formulation with minimal parameters. Therefore, SurgiATM introduces only two hyperparameters and no extra trainable weights, preserving the original network architecture with minimal overhead. We conduct extensive experiments on three public surgical datasets, involving multiple network architectures and covering diverse procedures, including cholecystectomy, partial nephrectomy, and diaphragm dissection. The results demonstrate that incorporating SurgiATM commonly reduces the restoration errors of existing models and relatively enhances their generalizability, without adding any trainable layers or weights. This highlights the convenience, low cost, effectiveness, and generalizability of the proposed method.

URL PDF HTML ☆

赞 0 踩 0

AI 大模型

视觉与机器人

科学与医疗

LASS-ODE: Scaling ODE Computations to Connect Foundation Models with Dynamical Physical Systems

On the Equilibrium between Feasible Zone and Uncertain Model in Safe Exploration

RAPTOR: Ridge-Adaptive Logistic Probes

Scaling Multiagent Systems with Process Rewards

Color Matters: Demosaicing-Guided Color Correlation Training for Generalizable AI-Generated Image Detection

WMVLM: Evaluating Diffusion Model Image Watermarking via Vision-Language Models

From Consistency to Complementarity: Aligned and Disentangled Multi-modal Learning for Time Series Understanding and Reasoning

MMSF: Multitask and Multimodal Supervised Framework for WSI Classification and Survival Analysis

Learning Domain Knowledge in Multimodal Large Language Models through Reinforcement Fine-Tuning

ToxiTwitch: Toward Emote-Aware Hybrid Moderation for Live Streaming Platforms

Learning-based Force Sensing and Impedance Matching for Safe Haptic Feedback in Robot-assisted Laparoscopic Surgery

ConceptCaps: a Distilled Concept Dataset for Interpretability in Music Models

Resilient Routing: Risk-Aware Dynamic Routing in Smart Logistics via Spatiotemporal Graph Learning

Detecting 3D Line Segments for 6DoF Pose Estimation with Limited Data

Bridging Cognitive Neuroscience and Graph Intelligence: Hippocampus-Inspired Multi-View Hypergraph Learning for Web Finance Fraud

The ICASSP 2026 HumDial Challenge: Benchmarking Human-like Spoken Dialogue Systems in the LLM Era

BanglaIPA: Towards Robust Text-to-IPA Transcription with Contextual Rewriting in Bengali

Almost Clinical: Linguistic properties of synthetic electronic health records

Quality Detection of Stored Potatoes via Transfer Learning: A CNN and Vision Transformer Approach

Same or Not? Enhancing Visual Perception in Vision-Language Models

StainNet: Scaling Self-Supervised Foundation Models on Immunohistochemistry and Special Stains for Computational Pathology

EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes

Analytical Inverse Kinematic Solution for "Moz1" NonSRS 7-DOF Robot arm with novel arm angle

Semantics as a Shield: Label Disguise Defense (LDD) against Prompt Injection in LLM Sentiment Classification

Unified Multimodal Vessel Trajectory Prediction with Explainable Navigation Intention

Incremental Maintenance of DatalogMTL Materialisations

KAN/H: Kolmogorov-Arnold Network using Haar-like bases

RF-DETR: Neural Architecture Search for Real-Time Detection Transformers

Conformal Prediction-Driven Adaptive Sampling for Digital Water Twins

SurgiATM: A Physics-Guided Plug-and-Play Model for Deep Learning-Based Smoke Removal in Laparoscopic Surgery