arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2601.23201 2026-02-02 eess.IV cs.CV cs.LG

Scale-Cascaded Diffusion Models for Super-Resolution in Medical Imaging

Darshan Thaker, Mahmoud Mostapha, Radu Miron, Shihan Qiu, Mariappan Nadar

Comments Accepted at IEEE International Symposium for Biomedical Imaging (ISBI) 2026

2601.23196 2026-02-02 eess.AS

Beyond Omnidirectional: Neural Ambisonics Encoding for Arbitrary Microphone Directivity Patterns using Cross-Attention

Mikko Heikkinen, Archontis Politis, Konstantinos Drossos, Tuomas Virtanen

Comments Accepted to ICASSP 2026

2601.23160 2026-02-02 eess.SY cs.SY math.OC

Robust Control of Constrained Linear Systems using Online Convex Optimization and a Reference Governor

Marko Nonhoff, Mohammad Taher Al Torshan, Matthias A. Müller

Comments Presented at 2024 IEEE 63rd Conference on Decision and Control (CDC)

Journal ref 2024 IEEE 63rd Conference on Decision and Control (CDC), 2024, pp. 6553-6559

2601.23148 2026-02-02 eess.IV cs.LG

Compressed BC-LISTA via Low-Rank Convolutional Decomposition

Han Wang, Yhonatan Kvich, Eduardo Pérez, Florian Römer, Yonina C. Eldar

Comments Inverse Problems, Model Compression, Compressed Sensing, Deep Unrolling, Computational Imaging

2601.23119 2026-02-02 eess.SP cs.SY eess.SY

Interpolation Techniques for Fast Channel Estimation in Ray Tracing

Ruibin Chen, Jayadev Joy, Yaqi Hu, Mingsheng Yin, Marco Mezzavilla, Sundeep Rangan

Comments This is the authors accepted version of a paper published in the Proceedings of the 2024 58th Asilomar Conference on Signals, Systems, and Computers

Journal ref Proc. IEEE 58th Asilomar Conference on Signals, Systems, and Computers, 2024, pp. 1383-1388

2601.23108 2026-02-02 eess.SY cs.SY

Energy Management Strategies for Electric Aircraft Charging Leveraging Active Landside Vehicle-to-Grid

Finn Vehlhaber, Mauro Salazar

2601.23103 2026-02-02 eess.IV cs.CV

Vision-Language Controlled Deep Unfolding for Joint Medical Image Restoration and Segmentation

Ping Chen, Zicheng Huang, Xiangming Wang, Yungeng Liu, Bingyu Liang, Haijin Zeng, Yongyong Chen

Comments 18 pages, medical image

2601.23076 2026-02-02 eess.SP cs.SY eess.SY

Learning-Based Signal Recovery in Nonlinear Systems with Spectrally Separated Interference

Jayadev Joy, Sundeep Rangan

2601.23004 2026-02-02 eess.AS

Layer-Aware Early Fusion of Acoustic and Linguistic Embeddings for Cognitive Status Classification

Krystof Novotny, Laureano Moro-Velázquez, Jiri Mekyska

Comments 5 pages, 3 figures, paper accepted for ICASSP 2026 conference

2601.22989 2026-02-02 eess.SP

Fluid Antenna Systems under Channel Uncertainty and Hardware Impairments: Trends, Challenges, and Future Research Directions

Saeid Pakravan, Mohsen Ahmadzadeh, Ming Zeng, Wessam Ajib, Ji Wang, Xingwang Li

Comments 12 pages

2601.22938 2026-02-02 cs.CR cs.AI eess.IV eess.SP

A Real-Time Privacy-Preserving Behavior Recognition System via Edge-Cloud Collaboration

Huan Song, Shuyu Tian, Junyi Hao, Cheng Yuan, Zhenyu Jia, Jiawei Shao, Xuelong Li

2601.22915 2026-02-02 eess.SP

Intrinsic MIMO Particle Communication Channel with Random Advection

Fatih Merdan, Ozgur B. Akan

Comments 6 pages, 5 figures

2601.22878 2026-02-02 eess.IV cs.CV

Development of Domain-Invariant Visual Enhancement and Restoration (DIVER) Approach for Underwater Images

Rajini Makam, Sharanya Patil, Dhatri Shankari T M, Suresh Sundaram, Narasimhan Sundararajan

Comments Submitted to IEEE Journal of Oceanic Engineering

详情

英文摘要

Underwater images suffer severe degradation due to wavelength-dependent attenuation, scattering, and illumination non-uniformity that vary across water types and depths. We propose an unsupervised Domain-Invariant Visual Enhancement and Restoration (DIVER) framework that integrates empirical correction with physics-guided modeling for robust underwater image enhancement. DIVER first applies either IlluminateNet for adaptive luminance enhancement or a Spectral Equalization Filter for spectral normalization. An Adaptive Optical Correction Module then refines hue and contrast using channel-adaptive filtering, while Hydro-OpticNet employs physics-constrained learning to compensate for backscatter and wavelength-dependent attenuation. The parameters of IlluminateNet and Hydro-OpticNet are optimized via unsupervised learning using a composite loss function. DIVER is evaluated on eight diverse datasets covering shallow, deep, and highly turbid environments, including both naturally low-light and artificially illuminated scenes, using reference and non-reference metrics. While state-of-the-art methods such as WaterNet, UDNet, and Phaseformer perform reasonably in shallow water, their performance degrades in deep, unevenly illuminated, or artificially lit conditions. In contrast, DIVER consistently achieves best or near-best performance across all datasets, demonstrating strong domain-invariant capability. DIVER yields at least a 9% improvement over SOTA methods in UCIQE. On the low-light SeaThru dataset, where color-palette references enable direct evaluation of color restoration, DIVER achieves at least a 4.9% reduction in GPMAE compared to existing methods. Beyond visual quality, DIVER also improves robotic perception by enhancing ORB-based keypoint repeatability and matching performance, confirming its robustness across diverse underwater environments.

URL PDF HTML ☆

赞 0 踩 0

2601.22873 2026-02-02 eess.AS cs.AI cs.CL cs.SD

EmoShift: Lightweight Activation Steering for Enhanced Emotion-Aware Speech Synthesis

Li Zhou, Hao Jiang, Junjie Li, Tianrui Wang, Haizhou Li

Comments Activation Steering; Emotion-Aware TTS; Speech Synthesis; Accepted by ICASSP 2026

2601.22779 2026-02-02 eess.AS cs.SD

Streaming Speech Recognition with Decoder-Only Large Language Models and Latency Optimization

Genshun Wan, Wenhui Zhang, Jing-Xuan Zhang, Shifu Xiong, Jianqing Gao, Zhongfu Ye

Comments accepted to ICASSP 2026

2601.22765 2026-02-02 eess.SP cs.LG

Bayesian Matrix Completion Under Geometric Constraints

Rohit Varma Chiluvuri, Santosh Nannuru

Comments 4 pages, 3 figures, Accepted to ICASSP 2026

2601.22732 2026-02-02 eess.IV cs.CV

Active Learning-Driven Lightweight YOLOv9: Enhancing Efficiency in Smart Agriculture

Hung-Chih Tu, Bo-Syun Chen, Yun-Chien Cheng

2601.21706 2026-02-02 cs.LG cs.SY eess.SY

SmartMeterFM: Unifying Smart Meter Data Generative Tasks Using Flow Matching Models

Nan Lin, Yanbo Wang, Jacco Heres, Peter Palensky, Pedro P. Vergara

Comments 10 pages, 6 figures, 6 tables

2601.12526 2026-02-02 eess.IV cs.CV

Deep Lightweight Unrolled Network for High Dynamic Range Modulo Imaging

Brayan Monroy, Jorge Bacca

2601.00734 2026-02-02 eess.SP

Conformal Reconfigurable Intelligent Surfaces: A Cylindrical Geometry Perspective

Filippo Pepe, Ivan Iudice, Giuseppe Castaldi, Marco Di Renzo, Vincenzo Galdi

Comments 20 pages, 9 figures

2601.00159 2026-02-02 eess.SP

AI-Driven Channel State Information (CSI) Extrapolation for 6G: Current Situations, Challenges and Future Research

Yuan Gao, Zichen Lu, Xinyi Wu, Wenjun Yu, Shengli Liu, Jianbo Du, Yanliang Jin, Shunqing Zhang, Xiaoli Chu, Shugong Xu

Comments This manuscript has been accepted by IEEE Communications Surveys and Tutorials

2512.21572 2026-02-02 cs.LG eess.SP

RefineBridge: Generative Bridge Models Improve Financial Forecasting by Foundation Models

Anthony Bolton, Wuyang Zhou, Zehua Chen, Giorgos Iacovides, Danilo Mandic

2512.17937 2026-02-02 eess.AS cs.SD

LIWhiz: A Non-Intrusive Lyric Intelligibility Prediction System for the Cadenza Challenge

Ram C. M. C. Shekar, Iván López-Espejo

Comments Accepted to ICASSP 2026

2511.05771 2026-02-02 eess.SP

Environment-Aware MIMO Channel Estimation in Pilot-Constrained Upper Mid-Band Systems

Seyed Alireza Javid, Nuria González-Prelcic

Comments Accepted from ICASSP 2026

2511.01431 2026-02-02 eess.SP

Robust Radar Mounting Angle Estimation in Operational Driving Conditions

Simin Zhu, Satish Ravindran, Lihui Chen, Alexander Yarovoy, Francesco Fioranelli

Comments 11 pages, 6 figures, under review at IEEE Transactions on Radar Systems

2509.17797 2026-02-02 eess.SP

SSNet: Flexible and robust channel extrapolation for fluid antenna systems enabled by an self-supervised learning framework

Yuan Gao, Yiming Liu, Runze Yu, Shengli Liu, Yanliang Jin, Shunqing Zhang, Shugong Xu, Xiaoli Chu

详情

DOI: 10.1109/JSAC.2025.3619472

英文摘要

Fluid antenna systems (FAS) signify a pivotal advancement in 6G communication by enhancing spectral efficiency and robustness. However, obtaining accurate channel state information (CSI) in FAS poses challenges due to its complex physical structure. Traditional methods, such as pilot-based interpolation and compressive sensing, are not only computationally intensive but also lack adaptability. Current extrapolation techniques relying on rigid parametric models do not accommodate the dynamic environment of FAS, while data-driven deep learning approaches demand extensive training and are vulnerable to noise and hardware imperfections. To address these challenges, this paper introduces a novel self-supervised learning network (SSNet) designed for efficient and adaptive channel extrapolation in FAS. We formulate the problem of channel extrapolation in FAS as an image reconstruction task. Here, a limited number of unmasked pixels (representing the known CSI of the selected ports) are used to extrapolate the masked pixels (the CSI of unselected ports). SSNet capitalizes on the intrinsic structure of FAS channels, learning generalized representations from raw CSI data, thus reducing dependency on large labelled datasets. For enhanced feature extraction and noise resilience, we propose a mix-of-expert (MoE) module. In this setup, multiple feedforward neural networks (FFNs) operate in parallel. The outputs of the MoE module are combined using a weighted sum, determined by a gating function that computes the weights of each FFN using a softmax function. Extensive simulations validate the superiority of the proposed model. Results indicate that SSNet significantly outperforms benchmark models, such as AGMAE and long short-term memory (LSTM) networks by using a much smaller labelled dataset.

URL PDF HTML ☆

赞 0 踩 0

2509.15804 2026-02-02 cs.SD eess.AS

CompSpoof: A Dataset and Joint Learning Framework for Component-Level Audio Anti-spoofing Countermeasures

Xueping Zhang, Yechen Wang, Linxi Li, Liwei Jin, Ming Li

Comments accepted at ICASSP 2026

2509.01125 2026-02-02 eess.SP

Enabling 6G Through Multi-Domain Channel Extrapolation: Opportunities and Challenges of Generative Artificial Intelligence

Yuan Gao, Zichen Lu, Yifan Wu, Yanliang Jin, Shunqing Zhang, Xiaoli Chu, Shugong Xu, Cheng-Xiang Wang

详情

DOI: 10.1109/MCOM.001.2500246

英文摘要

Channel extrapolation has attracted wide attention due to its potential to acquire channel state information (CSI) with high accuracy and minimal overhead. This is becoming increasingly crucial as the sixth-generation (6G) mobile networks aim to support complex scenarios, for example, high-mobility communications utilizing ultra-massive multiple-input multiple-output (MIMO) technologies and broad spectrum bands, necessitating multi-domain channel extrapolation. Current research predominantly addresses channel extrapolation within a single domain, lacking a comprehensive approach to multi-domain channel extrapolation. To bridge the gap, we propose the concept of multi-domain channel extrapolation, detailing the essential performance requirements for 6G networks. These include precise channel extrapolation, adaptability to varying scenarios, and manageable computational complexity during both training and inference stages. In light of these requirements, we elaborate the potential and challenges of incorporating generative artificial intelligence (GAI)-based models for effective multi-domain channel extrapolation. Given the ability of the Transformer to capture long-range dependencies and hidden patterns, we propose a novel Transformer encoder-like model by eliminating the positional encoding module and replacing the original multi-head attention with a multilayer perceptron (MLP) for multi-domain channel extrapolation. Simulation results indicate that this model surpasses existing baseline models in terms of extrapolation accuracy and inference speed. Ablation studies further demonstrate the effectiveness of the module design of the proposed design. Finally, we pose several open questions for the development of practical GAI-based multi-domain channel extrapolation models, including the issues of explainability, generalization, and dataset collection.

URL PDF HTML ☆

赞 0 踩 0

2508.14798 2026-02-02 cs.AR eess.SP

ListenToJESD204B: A Lightweight Open-Source JESD204B IP Core for FPGA-Based Ultrasound Acquisition systems

Soumyo Bhattacharjee, Federico Villani, Christian Vogt, Andrea Cossettini, Luca Benini

Comments This work has been accepted for publication in IEEE IWASI Conference proceedings. The final published version will be available via IEEE Xplore

2506.10754 2026-02-02 cs.SD cs.AI eess.AS

BNMusic: Blending Environmental Noises into Personalized Music

Chi Zuo, Martin B. Møller, Pablo Martínez-Nuevo, Huayang Huang, Yu Wu, Ye Zhu

Comments This paper has been accepted by NeurIPS 2025