arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2602.11145 2026-02-12 cs.SD cs.LG eess.AS

SCRAPL: Scattering Transform with Random Paths for Machine Learning

Christopher Mitcheltree, Vincent Lostanlen, Emmanouil Benetos, Mathieu Lagrange

Comments Accepted to ICLR 2026. Code, audio samples, and Python package provided at https://christhetree.github.io/scrapl/

2602.11116 2026-02-12 eess.SY cs.RO cs.SY math.OC

Multi-UAV Trajectory Optimization for Bearing-Only Localization in GPS Denied Environments

Alfonso Sciacchitano, Liraz Mudrik, Sean Kragelund, Isaac Kaminer

Comments 38 pages, 7 figure, and 6 tables

2602.11082 2026-02-12 cs.RO eess.SP

Digging for Data: Experiments in Rock Pile Characterization Using Only Proprioceptive Sensing in Excavation

Unal Artan, Martin Magnusson, Joshua A. Marshall

Comments Accepted for publication in the IEEE Transactions on Field Robotics

2602.11076 2026-02-12 eess.SY cs.AI cs.SY eess.SP

Interpretable Attention-Based Multi-Agent PPO for Latency Spike Resolution in 6G RAN Slicing

Kavan Fatehi, Mostafa Rahmani Ghourtani, Amir Sonee, Poonam Yadav, Alessandra M Russo, Hamed Ahmadi, Radu Calinescu

Comments This work has been accepted to appear in the IEEE International Conference on Communications (ICC)

2602.11072 2026-02-12 cs.CL cs.SD eess.AS

Simultaneous Speech-to-Speech Translation Without Aligned Data

Tom Labiausse, Romain Fabre, Yannick Estève, Alexandre Défossez, Neil Zeghidour

Comments See inference code at: https://github.com/kyutai-labs/hibiki-zero

2602.11004 2026-02-12 cs.CV cs.AI cs.RO cs.SY eess.SY

Enhancing Predictability of Multi-Tenant DNN Inference for Autonomous Vehicles' Perception

Liangkai Liu, Kang G. Shin, Jinkyu Lee, Chengmo Yang, Weisong Shi

Comments 13 pages, 12 figures

详情

英文摘要

Autonomous vehicles (AVs) rely on sensors and deep neural networks (DNNs) to perceive their surrounding environment and make maneuver decisions in real time. However, achieving real-time DNN inference in the AV's perception pipeline is challenging due to the large gap between the computation requirement and the AV's limited resources. Most, if not all, of existing studies focus on optimizing the DNN inference time to achieve faster perception by compressing the DNN model with pruning and quantization. In contrast, we present a Predictable Perception system with DNNs (PP-DNN) that reduce the amount of image data to be processed while maintaining the same level of accuracy for multi-tenant DNNs by dynamically selecting critical frames and regions of interest (ROIs). PP-DNN is based on our key insight that critical frames and ROIs for AVs vary with the AV's surrounding environment. However, it is challenging to identify and use critical frames and ROIs in multi-tenant DNNs for predictable inference. Given image-frame streams, PP-DNN leverages an ROI generator to identify critical frames and ROIs based on the similarities of consecutive frames and traffic scenarios. PP-DNN then leverages a FLOPs predictor to predict multiply-accumulate operations (MACs) from the dynamic critical frames and ROIs. The ROI scheduler coordinates the processing of critical frames and ROIs with multiple DNN models. Finally, we design a detection predictor for the perception of non-critical frames. We have implemented PP-DNN in an ROS-based AV pipeline and evaluated it with the BDD100K and the nuScenes dataset. PP-DNN is observed to significantly enhance perception predictability, increasing the number of fusion frames by up to 7.3x, reducing the fusion delay by >2.6x and fusion-delay variations by >2.3x, improving detection completeness by 75.4% and the cost-effectiveness by up to 98% over the baseline.

URL PDF HTML ☆

赞 0 踩 0

2602.10976 2026-02-12 eess.SP cs.IT math.IT

Physically Consistent Evaluation of Commonly Used Near-Field Models

Georg Schwan, Alexander Stutz-Tirri, Christoph Studer

Comments Submitted to the 34th edition of EUSIPCO

2602.10963 2026-02-12 eess.SY cs.NA cs.RO cs.SY math.NA

Lie Group Variational Integrator for the Geometrically Exact Rod with Circular Cross-Section Incorporating Cross-Sectional Deformation

Srishti Siddharth, Vivek Natarajan, Ravi N. Banavar

Comments Submitted to: Computers and Mathematics with Applications

2602.10958 2026-02-12 eess.SP

Fluid-Antenna-Enabled Integrated Bistatic Sensing and Backscatter Communication Systems

A. Abdelaziz Salem, Saeed Abdallah, Khawla Alnajjar, Mahmoud A. Albreem, Mohamed Saad, Hayssam Dahrouj, Hesham Elsawy

2602.10936 2026-02-12 eess.SY cs.SY math.OC

Trajectory-based data-driven predictive control and the state-space predictor

Levi D. Reyes Premer, Arash J. Khabbazi, Kevin J. Kircher

2602.10911 2026-02-12 cs.LG cs.SY eess.SY math.OC

Tuning the burn-in phase in training recurrent neural networks improves their performance

Julian D. Schiller, Malte Heinrich, Victor G. Lopez, Matthias A. Müller

Comments Published as a conference paper at ICLR 2026, https://openreview.net/forum?id=jwkdKpioHJ

2602.10906 2026-02-12 eess.IV

Training-Free Stimulus Encoding for Retinal Implants via Sparse Projected Gradient Descent

Henning Konermann, Yuli Wu, Emil Mededovic, Volkmar Schulz, Peter Walter, Johannes Stegmaier

Comments This work has been submitted to the IEEE for possible publication

2602.10888 2026-02-12 eess.SY cs.LG cs.SY

Anomaly Detection with Machine Learning Algorithms in Large-Scale Power Grids

Marc Gillioz, Guillaume Dubuis, Étienne Voutaz, Philippe Jacquod

Comments 12 pages, 9 figures

2602.10837 2026-02-12 eess.IV

FPGA Implementation of Sketched LiDAR for a 192 x 128 SPAD Image Sensor

Zhenya Zang, Mike Davies, Istvan Gyongy

2602.10835 2026-02-12 eess.SY cs.SY

Reference Output Tracking in Boolean Control Networks

Giorgia Disarò, Maria Elena Valcher

2602.10829 2026-02-12 eess.AS cs.LG cs.SD

Self-Supervised Learning for Speaker Recognition: A study and review

Theo Lepage, Reda Dehak

Comments accepted for publication in Speech Communication

Journal ref Speech Communication, vol. 176, p. 103333, 2026

详情

DOI: 10.1016/j.specom.2025.103333

英文摘要

Deep learning models trained in a supervised setting have revolutionized audio and speech processing. However, their performance inherently depends on the quantity of human-annotated data, making them costly to scale and prone to poor generalization under unseen conditions. To address these challenges, Self-Supervised Learning (SSL) has emerged as a promising paradigm, leveraging vast amounts of unlabeled data to learn relevant representations. The application of SSL for Automatic Speech Recognition (ASR) has been extensively studied, but research on other downstream tasks, notably Speaker Recognition (SR), remains in its early stages. This work describes major SSL instance-invariance frameworks (e.g., SimCLR, MoCo, and DINO), initially developed for computer vision, along with their adaptation to SR. Various SSL methods for SR, proposed in the literature and built upon these frameworks, are also presented. An extensive review of these approaches is then conducted: (1) the effect of the main hyperparameters of SSL frameworks is investigated; (2) the role of SSL components is studied (e.g., data-augmentation, projector, positive sampling); and (3) SSL frameworks are evaluated on SR with in-domain and out-of-domain data, using a consistent experimental setup, and a comprehensive comparison of SSL methods from the literature is provided. Specifically, DINO achieves the best downstream performance and effectively models intra-speaker variability, although it is highly sensitive to hyperparameters and training conditions, while SimCLR and MoCo provide robust alternatives that effectively capture inter-speaker variability and are less prone to collapse. This work aims to highlight recent trends and advancements, identifying current challenges in the field.

URL PDF HTML ☆

赞 0 踩 0

2602.10813 2026-02-12 cs.IT cs.SY eess.SY math.IT

Dynamic Interference Management for TN-NTN Coexistence in the Upper Mid-Band

Pradyumna Kumar Bishoyi, Chia Chia Lee, Navid Keshtiarast, Marina Petrova

Comments This work has been accepted for publication in the IEEE ICC 2026 Conference

2602.09429 2026-02-12 eess.SY cs.RO cs.SY

First-order friction models with bristle dynamics: lumped and distributed formulations

Luigi Romano, Ole Morten Aamo, Jan Åslund, Erik Frisk

Comments 15 pages, 9 figures. Under review at IEEE Transactions on Control Systems Technology

2602.09427 2026-02-12 eess.SY cs.RO cs.SY

Lateral tracking control of all-wheel steering vehicles with intelligent tires

Luigi Romano, Ole Morten Aamo, Jan Åslund, Erik Frisk

Comments 16 pages, 12 figures. Under review at IEEE Transactions on Intelligent Vehicles

2602.08484 2026-02-12 eess.AS

Physics-Guided Variational Model for Unsupervised Sound Source Tracking

Luan Vinícius Fiorio, Ivana Nikoloska, Bruno Defraene, Alex Young, Johan David, Ronald M. Aarts

Comments This work has been submitted to the IEEE for possible publication

2602.05363 2026-02-12 eess.SY cs.SY

Policy-Driven Orchestration Framework for Multi-Operator Non-Terrestrial Networks

Yuma Abe, Mariko Sekiguchi, Go Otsuru, Amane Miura

Comments Accepted for publication in IEEE Transactions on Communications

Journal ref IEEE Transactions on Communications, 2026

详情

DOI: 10.1109/TCOMM.2026.3663315

英文摘要

Non-terrestrial networks (NTNs) have gained significant attention for their scalability and wide coverage in next-generation communication systems. A large number of NTN nodes, such as satellites, are required to establish a global NTN, but not all operators have the capability to deploy such a system. Therefore, cooperation among multiple operators, facilitated by an orchestrator, enables the construction of virtually large-scale constellations. In this paper, we propose a weak-control-based orchestration framework that coordinates multiple NTN operators while ensuring that operations align with the policies of both the orchestrator and the individual operators. Unlike centralized orchestration frameworks, where the orchestrator determines the entire route from source to destination, the proposed framework allows each operator to select preferred routes from multiple candidates provided by the orchestrator. To evaluate the effectiveness of our proposed framework, we conducted numerical simulations under various scenarios and network configurations including dynamic NTN environments with time-varying topologies, showing that inter-operator cooperation improves the availability of feasible end-to-end routes. Furthermore, we analyzed the iterative negotiation process to address policy conflicts and quantitatively demonstrated the "price of autonomy," where strict individual policies degrade global feasibility and performance. The results also demonstrate that outcomes of the proposed framework depend on the operators' policies and that hop count and latency increase as the number of operators grows. These findings validate the proposed framework's ability to deliver practical benefits of orchestrated multi-operator collaboration in future NTN environments.

URL PDF HTML ☆

赞 0 踩 0

2601.11827 2026-02-12 cs.LG cs.CV eess.IV

Shortest-Path Flow Matching with Mixture-Conditioned Bases for OOD Generalization to Unseen Conditions

Andrea Rubbi, Amir Akbarnejad, Mohammad Vali Sanian, Aryan Yazdan Parast, Hesam Asadollahzadeh, Arian Amani, Naveed Akhtar, Sarah Cooper, Andrew Bassett, Pietro Liò, Lassi Paavolainen, Sattar Vakili, Mo Lotfollahi

2601.06081 2026-02-12 physics.space-ph astro-ph.IM cs.RO eess.SP

First Multi-Constellation Observations of Navigation Satellite Signals in the Lunar Domain by Post-Processing L1/L5 IQ Snapshots

Lorenzo Sciacca, Alex Minetto, Andrea Nardin, Fabio Dovis, Luca Canzian, Mario Musmeci, Claudia Facchinetti, Giancarlo Varacalli

Comments 13 pages, 9 figures, IEEE Transactions on Aerospace and Electronic Systems

2510.18082 2026-02-12 cs.LG cs.RO cs.SY eess.SY

Provably Optimal Reinforcement Learning under Safety Filtering

Donggeon David Oh, Duy P. Nguyen, Haimin Hu, Jaime F. Fisac

Comments Accepted for publication in the proceedings of The International Association for Safe & Ethical AI (IASEAI) 2026; 17 pages, 3 figures

2510.15198 2026-02-12 astro-ph.IM cs.LG eess.IV

HyperAIRI: a plug-and-play algorithm for precise hyperspectral image reconstruction in radio interferometry

Chao Tang, Arwa Dabbech, Adrian Jackson, Yves Wiaux

Comments 24 pages, 10 figures, accepted by ApJS

2509.17143 2026-02-12 eess.AS cs.AI

MaskVCT: Masked Voice Codec Transformer for Zero-Shot Voice Conversion With Increased Controllability via Multiple Guidances

Junhyeok Lee, Helin Wang, Yaohan Guan, Thomas Thebaud, Laureano Moro-Velazquez, Jesús Villalba, Najim Dehak

Comments ICASSP 2026 Accepted

2508.14600 2026-02-12 cs.LG eess.SP

Energy Injection Identification enabled Disaggregation with Deep Multi-Task Learning

Xudong Wang, Guoming Tang, Junyu Xue, Srinivasan Keshav, Tongxin Li, Chris Ding

Comments Accepted to The 17th ACM International Conference on Future and Sustainable Energy Systems (ACM e-Energy 2026)

2507.10775 2026-02-12 cs.CV cs.AI eess.IV

A New Dataset and Performance Benchmark for Real-time Spacecraft Segmentation in Onboard Computers

Jeffrey Joan Sam, Janhavi Sathe, Nikhil Chigali, Naman Gupta, Radhey Ruparel, Yicheng Jiang, Janmajay Singh, James W. Berck, Arko Barman

详情

英文摘要

Spacecraft deployed in outer space are routinely subjected to various forms of damage due to exposure to hazardous environments. In addition, there are significant risks to the subsequent process of in-space repairs through human extravehicular activity or robotic manipulation, incurring substantial operational costs. Recent developments in image segmentation could enable the development of reliable and cost-effective autonomous inspection systems. While these models often require large amounts of training data to achieve satisfactory results, publicly available annotated spacecraft segmentation data are very scarce. Here, we present a new dataset of nearly 64k annotated spacecraft images that was created using real spacecraft models, superimposed on a mixture of real and synthetic backgrounds generated using NASA's TTALOS pipeline. To mimic camera distortions and noise in real-world image acquisition, we also added different types of noise and distortion to the images. Our dataset includes images with several real-world challenges, including noise, camera distortions, glare, varying lighting conditions, varying field of view, partial spacecraft visibility, brightly-lit city backgrounds, densely patterned and confounding backgrounds, aurora borealis, and a wide variety of spacecraft geometries. Finally, we finetuned YOLOv8 and YOLOv11 models for spacecraft segmentation to generate performance benchmarks for the dataset under well-defined hardware and inference time constraints to mimic real-world image segmentation challenges for real-time onboard applications in space on NASA's inspector spacecraft. The resulting models, when tested under these constraints, achieved a Dice score of 0.92, Hausdorff distance of 0.69, and an inference time of about 0.5 second. The dataset and models for performance benchmark are available at https://github.com/RiceD2KLab/SWiM.

URL PDF HTML ☆

赞 0 踩 0

2504.11717 2026-02-12 cs.RO cs.SY eess.SY

Safety with Agency: Human-Centered Safety Filter with Application to AI-Assisted Motorsports

Donggeon David Oh, Justin Lidard, Haimin Hu, Himani Sinhmar, Elle Lazarski, Deepak Gopinath, Emily S. Sumner, Jonathan A. DeCastro, Guy Rosman, Naomi Ehrich Leonard, Jaime Fernández Fisac

Comments Accepted to Robotics: Science and Systems (R:SS) 2025, 22 pages, 16 figures, 7 tables Updates for v4: typos in Appendix Subsection A revised

Journal ref Proceedings of Robotics: Science and Systems (RSS), 2025

2503.20184 2026-02-12 cs.CV eess.IV

Spectrum from Defocus: Fast Spectral Imaging with Chromatic Focal Stack

M. Kerem Aydin, Yi-Chun Hung, Jaclyn Pytlarz, Qi Guo, Emma Alexander