arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2602.10871 2026-02-12 cs.HC cs.CV

Viewpoint Recommendation for Point Cloud Labeling through Interaction Cost Modeling

Yu Zhang, Xinyi Zhao, Chongke Bi, Siming Chen

Comments Accepted to IEEE TVCG

2602.10867 2026-02-12 stat.ML cs.LG

Deep Learning of Compositional Targets with Hierarchical Spectral Methods

Hugo Tabanelli, Yatin Dandi, Luca Pesce, Florent Krzakala

2602.10829 2026-02-12 eess.AS cs.LG cs.SD

Self-Supervised Learning for Speaker Recognition: A study and review

Theo Lepage, Reda Dehak

Comments accepted for publication in Speech Communication

Journal ref Speech Communication, vol. 176, p. 103333, 2026

详情

DOI: 10.1016/j.specom.2025.103333

英文摘要

Deep learning models trained in a supervised setting have revolutionized audio and speech processing. However, their performance inherently depends on the quantity of human-annotated data, making them costly to scale and prone to poor generalization under unseen conditions. To address these challenges, Self-Supervised Learning (SSL) has emerged as a promising paradigm, leveraging vast amounts of unlabeled data to learn relevant representations. The application of SSL for Automatic Speech Recognition (ASR) has been extensively studied, but research on other downstream tasks, notably Speaker Recognition (SR), remains in its early stages. This work describes major SSL instance-invariance frameworks (e.g., SimCLR, MoCo, and DINO), initially developed for computer vision, along with their adaptation to SR. Various SSL methods for SR, proposed in the literature and built upon these frameworks, are also presented. An extensive review of these approaches is then conducted: (1) the effect of the main hyperparameters of SSL frameworks is investigated; (2) the role of SSL components is studied (e.g., data-augmentation, projector, positive sampling); and (3) SSL frameworks are evaluated on SR with in-domain and out-of-domain data, using a consistent experimental setup, and a comprehensive comparison of SSL methods from the literature is provided. Specifically, DINO achieves the best downstream performance and effectively models intra-speaker variability, although it is highly sensitive to hyperparameters and training conditions, while SimCLR and MoCo provide robust alternatives that effectively capture inter-speaker variability and are less prone to collapse. This work aims to highlight recent trends and advancements, identifying current challenges in the field.

URL PDF HTML ☆

赞 0 踩 0

2602.10808 2026-02-12 cs.SE cs.AI

PELLI: Framework to effectively integrate LLMs for quality software generation

Rasmus Krebs, Somnath Mazumdar

Comments 15 pages

2602.09429 2026-02-12 eess.SY cs.RO cs.SY

First-order friction models with bristle dynamics: lumped and distributed formulations

Luigi Romano, Ole Morten Aamo, Jan Åslund, Erik Frisk

Comments 15 pages, 9 figures. Under review at IEEE Transactions on Control Systems Technology

2602.09427 2026-02-12 eess.SY cs.RO cs.SY

Lateral tracking control of all-wheel steering vehicles with intelligent tires

Luigi Romano, Ole Morten Aamo, Jan Åslund, Erik Frisk

Comments 16 pages, 12 figures. Under review at IEEE Transactions on Intelligent Vehicles

2602.09015 2026-02-12 cs.CR cs.AI

CIC-Trap4Phish: A Unified Multi-Format Dataset for Phishing and Quishing Attachment Detection

Fatemeh Nejati, Mahdi Rabbani, Morteza Eskandarian, Mansur Mirani, Gunjan Piya, Igor Opushnyev, Ali A. Ghorbani, Sajjad Dadkhah

详情

英文摘要

Phishing attacks represents one of the primary attack methods which is used by cyber attackers. In many cases, attackers use deceptive emails along with malicious attachments to trick users into giving away sensitive information or installing malware while compromising entire systems. The flexibility of malicious email attachments makes them stand out as a preferred vector for attackers as they can embed harmful content such as malware or malicious URLs inside standard document formats. Although phishing email defenses have improved a lot, attackers continue to abuse attachments, enabling malicious content to bypass security measures. Moreover, another challenge that researches face in training advance models, is lack of an unified and comprehensive dataset that covers the most prevalent data types. To address this gap, we generated CIC-Trap4Phish, a multi-format dataset containing both malicious and benign samples across five categories commonly used in phishing campaigns: Microsoft Word documents, Excel spreadsheets, PDF files, HTML pages, and QR code images. For the first four file types, a set of execution-free static feature pipeline was proposed, designed to capture structural, lexical, and metadata-based indicators without the need to open or execute files. Feature selection was performed using a combination of SHAP analysis and feature importance, yielding compact, discriminative feature subsets for each file type. The selected features were evaluated by using lightweight machine learning models, including Random Forest, XGBoost, and Decision Tree. All models demonstrate high detection accuracy across formats. For QR code-based phishing (quishing), two complementary methods were implemented: image-based detection by employing Convolutional Neural Networks (CNNs) and lexical analysis of decoded URLs using recent lightweight language models.

URL PDF HTML ☆

赞 0 踩 0

2602.08965 2026-02-12 cs.MA cs.LG

Learning to Coordinate via Quantum Entanglement in Multi-Agent Reinforcement Learning

John Gardiner, Orlando Romero, Brendan Tivnan, Nicolò Dal Fabbro, George J. Pappas

2602.04900 2026-02-12 cs.ET cs.AI cs.DC

Evaluating Kubernetes Performance for GenAI Inference: From Automatic Speech Recognition to LLM Summarization

Sai Sindhur Malleni, Raúl Sevilla, Aleksei Vasilevskii, José Castillo Lema, André Bauer

Comments A accepted at the 17th International Conference on Performance Engineering

2602.03866 2026-02-12 cs.DL cs.AI

PaperX: A Unified Framework for Multimodal Academic Presentation Generation with Scholar DAG

Tao Yu, Minghui Zhang, Zhiqing Cui, Hao Wang, Zhongtian Luo, Shenghua Chai, Junhao Gong, Yuzhao Peng, Yuxuan Zhou, Yujia Yang, Zhenghao Zhang, Haopeng Jin, Xinming Wang, Yufei Xiong, Jiabing Yang, Jiahao Yuan, Hanqing Wang, Hongzhu Yi, Yan Huang, Liang Wang

Comments 29 pages, 9 figures, Project website: https://github.com/yutao1024/PaperX

2602.02310 2026-02-12 physics.chem-ph cs.AI

FragmentFlow: Scalable Transition State Generation for Large Molecules

Ron Shprints, Peter Holderrieth, Juno Nam, Rafael Gómez-Bombarelli, Tommi Jaakkola

2602.00036 2026-02-12 nlin.CG cs.AI cs.FL cs.NE

LOGOS-CA: A Cellular Automaton Using Natural Language as State and Rule

Keishu Utimula

2601.21963 2026-02-12 cs.CY cs.AI cs.CL cs.SI

Industrialized Deception: The Collateral Effects of LLM-Generated Misinformation on Digital Ecosystems

Alexander Loth, Martin Kappes, Marc-Oliver Pahl

Comments Accepted at ACM TheWebConf '26 Companion

2601.06081 2026-02-12 physics.space-ph astro-ph.IM cs.RO eess.SP

First Multi-Constellation Observations of Navigation Satellite Signals in the Lunar Domain by Post-Processing L1/L5 IQ Snapshots

Lorenzo Sciacca, Alex Minetto, Andrea Nardin, Fabio Dovis, Luca Canzian, Mario Musmeci, Claudia Facchinetti, Giancarlo Varacalli

Comments 13 pages, 9 figures, IEEE Transactions on Aerospace and Electronic Systems

2511.18141 2026-02-12 stat.ML cs.LG

Conformal Prediction for Compositional Data

Lucas P. Amaral, Luben M. C. Cabezas, Thiago R. Ramos, Gustavo H. G. A. Pereira

Comments 32 pages, 11 figures

2510.15198 2026-02-12 astro-ph.IM cs.LG eess.IV

HyperAIRI: a plug-and-play algorithm for precise hyperspectral image reconstruction in radio interferometry

Chao Tang, Arwa Dabbech, Adrian Jackson, Yves Wiaux

Comments 24 pages, 10 figures, accepted by ApJS

2509.17143 2026-02-12 eess.AS cs.AI

MaskVCT: Masked Voice Codec Transformer for Zero-Shot Voice Conversion With Increased Controllability via Multiple Guidances

Junhyeok Lee, Helin Wang, Yaohan Guan, Thomas Thebaud, Laureano Moro-Velazquez, Jesús Villalba, Najim Dehak

Comments ICASSP 2026 Accepted

2508.05653 2026-02-12 cs.HC cs.AI

Modeling Interactive Narrative Systems: A Formal Approach

Jules Clerc, Domitile Lourdeaux, Mohamed Sallak, Johann Barbier, Marc Ravaine

Journal ref 8th International Workshop on Computational Models of Narrative, 2025

2507.06109 2026-02-12 cs.GR cs.AI cs.CV

LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures

Seungoh Han, Jaehoon Jang, Hyunsu Kim, Jaeheung Surh, Junhyung Kwak, Hyowon Ha, Kyungdon Joo

Comments WACV 2026

2506.08043 2026-02-12 cs.GR cs.CV cs.LG cs.RO

Neural-Augmented Kelvinlet for Real-Time Soft Tissue Deformation Modeling

Ashkan Shahbazi, Kyvia Pereira, Jon S. Heiselman, Elaheh Akbari, Annie C. Benson, Sepehr Seifi, Xinyuan Liu, Garrison L. Johnston, Jie Ying Wu, Nabil Simaan, Michael I. Miga, Soheil Kolouri

2506.03083 2026-02-12 cs.DS cs.AI cs.CL

Algorithmically Establishing Trust in Evaluators

Adrian de Wynter

2504.18273 2026-02-12 cs.SI cs.LG

Efficient Learning on Large Graphs using a Densifying Regularity Lemma

Jonathan Kouchly, Ben Finkelshtein, Michael Bronstein, Ron Levie

2503.13553 2026-02-12 cs.MA cs.AI cs.CL

LLM-Mediated Guidance of MARL Systems

Philipp D. Siedler, Ian Gemp

2412.03766 2026-02-12 cs.CR cs.LG

End to End Collaborative Synthetic Data Generation

Sikha Pentyala, Geetha Sitaraman, Trae Claar, Martine De Cock

Comments Accepted at PPAI Workshop, AAAI 2025

2407.07295 2026-02-12 eess.IV cs.CE cs.CV

Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis

Jian-Qing Zheng, Yuanhan Mo, Yang Sun, Jiahua Li, Fuping Wu, Ziyang Wang, Tonia Vincent, Bartłomiej W. Papież

Comments accepted by Medical Image Analysis

2407.06211 2026-02-12 q-bio.OT cs.CY cs.LG

Synthetic data: How could it be used for infectious disease research?

Styliani-Christina Fragkouli, Dhwani Solanki, Leyla J Castro, Fotis E Psomopoulos, Núria Queralt-Rosinach, Davide Cirillo, Lisa C Crossman

2406.12326 2026-02-12 cs.SE cs.AI

Towards Better Code Understanding in Decoder-Only Models with Contrastive Learning

Jiayi Lin, Yanlin Wang, Yibiao Yang, Lei Zhang, Yutao Xie

Comments AAAI 2026

2405.20880 2026-02-12 cs.GT cs.AI cs.MA econ.TH

Games with Payments between Learning Agents

Yoav Kolumbus, Joe Halpern, Éva Tardos

2203.00554 2026-02-12 stat.ML cs.LG

Neural Score Matching for High-Dimensional Causal Inference

Oscar Clivio, Fabian Falck, Brieuc Lehmann, George Deligiannidis, Chris Holmes

Comments Fixed erroneous Propositions 5-6-7 and Appendix B from the previous version

2202.11496 2026-02-12 eess.IV cs.CV

MITI: SLAM Benchmark for Laparoscopic Surgery

Regine Hartwig, Daniel Ostler, Jean-Claude Rosenthal, Hubertus Feußner, Dirk Wilhelm, Dirk Wollherr

Comments This submission is withdrawn because it is a duplicate of "Constrained Visual-Inertial Localization With Application And Benchmark in Laparoscopic Surgery" (arXiv:2202.11075). The withdrawn version contains less complete information. Readers are directed to the full version

AI 大模型

视觉与机器人

科学与医疗