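arXivDaily: a daily academic digest that syncs the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and other fields.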

2603.11030 2026-03-12 eess.SP

Exploiting Spatial Modulation for Strong Phase Noise Mitigation in mmWave Massive MIMO

Oshin Daoud, Haifa Fares, Amor Nafkha, Yahia Medjahdi, Laurent Clavier

Abstract

This letter investigates phase noise (PN) mitigation in generalized receiver spatial modulation (GRSM) massive MIMO systems at mmWave under a common local oscillator (CLO). Under CLO, the received energy remains invariant relative to the no-PN scenario, enabling reliable energy-based spatial detection using the no-PN threshold. PN-sensitivity and geometry-based metrics are introduced to design compact, PN-resilient MQAM symbol pools with low detection complexity. PN robustness is further improved through an enhanced PN-aware GRSM-MQAM system that exploits spatial modulation (SM) to recover part of the MQAM bits and strategically maps spatial-pattern Hamming weights to reduce the effective PN impact. In addition, a practical single-stage PN estimation/compensation architecture is proposed, while a benchmark double-stage compensation is adopted to quantify the upper bound achievable via separate Tx/Rx PN mitigation. Results show that under PN, the overall BER is mainly dominated by MQAM symbol detection errors, especially for denser constellations, whereas spatial detection remains robust. The proposed single-stage compensation improves PN resilience, while the benchmark double-stage compensation approaches near PN-free performance.

2603.10958 2026-03-12 eess.SP

Distortion Is Not Noise: On the Limits of the Kappa Model for Monostatic ISAC

Haofan Dong, Ozgur B. Akan

2603.10947 2026-03-12 eess.IV

Regularizing INR with diffusion prior self-supervised 3D reconstruction of neutron computed tomography data

Maliha Hossain, Haley Duba-Sullivan, Amirkoushyar Ziabari

2602.17929 2026-03-12 cs.CV cs.LG eess.IV

ZACH-ViT: Regime-Dependent Inductive Bias in Compact Vision Transformers for Medical Imaging

Athanasios Angelakis

Comments 24 pages, 15 figures, 5 tables. Code and models available at https://github.com/Bluesman79/ZACH-ViT

2601.03410 2026-03-12 cs.LG cs.CV eess.IV

Inferring Clinically Relevant Molecular Subtypes of Pancreatic Cancer from Routine Histopathology Using Deep Learning

Abdul Rehman Akbar, Alejandro Levya, Ashwini Esnakula, Elshad Hasanov, Anne Noonan, Lingbin Meng, Susan Tsai, Vaibhav Sahai, Midhun Malla, Sarbajit Mukherjee, Upender Manne, Anil Parwani, Wei Chen, Ashish Manne, Muhammad Khalid Khan Niazi

Abstract

Molecular subtyping of PDAC into basal-like and classical subtypes has established prognostic and predictive value. However, its use in clinical practice is limited by cost, turnaround time, and tissue requirements, thereby restricting its application in the management of PDAC. We introduce PanSubNet, an interpretable deep learning framework that predicts therapy-relevant molecular subtypes directly from standard H&E-stained WSIs. PanSubNet was developed using data from 1,055 patients across two multi-institutional cohorts (PANCAN, n=846; TCGA, n=209) with paired histology and RNA-seq data. Ground-truth labels were derived using the validated Moffitt 50-gene signature refined by GATA6 expression. The model employs a dual-scale architecture that fuses cellular-level morphology with tissue-level architecture, leveraging attention mechanisms for multi-scale representation learning and transparent feature attribution. On internal validation within PANCAN using five-fold cross-validation, PanSubNet achieved a mean AUC of 88.5% with balanced sensitivity and specificity. External validation on the independent TCGA cohort without fine-tuning demonstrated robust generalizability (AUC 84.0%). PanSubNet preserved and, in metastatic disease, strengthened prognostic stratification compared to RNA-seq-based labels. Prediction uncertainty was linked to intermediate transcriptional states, not classification noise. Model predictions are aligned with established transcriptomic programs, differentiation markers, and DNA damage repair signatures. By enabling rapid, cost-effective molecular stratification from routine H&E-stained slides, PanSubNet offers a clinically deployable and interpretable tool for genetic subtyping. We are gathering data from two institutions to validate and assess real-world performance, supporting integration into digital pathology workflows and advancing precision oncology for PDAC.

2510.00676 2026-03-12 eess.SY cs.SY

Formation Control via Rotation Symmetry Constraints

Zamir Martinez, Daniel Zelazo

2503.11627 2026-03-12 cs.SD cs.LG eess.AS

Are Deep Speech Denoising Models Robust to Adversarial Noise?

Will Schwarzer, Neel Chaudhari, Philip S. Thomas, Andrea Fanelli, Xiaoyu Liu

Comments 22 pages, 14 figures. Related conference version accepted to ICLR 2026: see https://openreview.net/forum?id=WtH2JxKJKf

2603.10909 2026-03-12 eess.SP

Level Crossing Rate Analysis for Optimal Single-user RIS Systems

Amy S. Inwood, Peter J. Smith, Philippa A. Martin, Graeme K. Woodward

2603.10906 2026-03-12 eess.SY cs.SY

Towards Polynomial Immersion of Port-Hamiltonian Systems

Mohammad Itani, Manuel Schaller, Karl Worthmann, Timm Faulwasser

2603.10901 2026-03-12 eess.SP

Phase Selection and Analysis for Multi-frequency Multi-user RIS Systems Employing Subsurfaces

Amy S. Inwood, Peter J. Smith, Philippa A. Martin, Graeme K. Woodward

2603.10890 2026-03-12 cs.RO cs.SY eess.SY

A gripper for flap separation and opening of sealed bags

Sergi Foix, Jaume Oriol, Carme Torras, Júlia Borràs

Comments 8 pages, Accepted at the 2026 IEEE International Conference on Robotics & Automation (ICRA2026)

2603.10812 2026-03-12 math.OC cs.SY eess.SY

Distributed Stability Certification and Control from Local Data

Surya Malladi, Nima Monshizadeh

2603.10802 2026-03-12 cs.NI cs.AI cs.LG cs.SY eess.SY

Towards Intelligent Spectrum Management: Spectrum Demand Estimation Using Graph Neural Networks

Mohamad Alkadamani, Amir Ghasemi, Halim Yanikomeroglu

Comments 13 pages, 10 figures. Submitted to IEEE Transactions on Machine Learning in Communications and Networking

2603.10800 2026-03-12 cs.LG cs.AI cs.SY eess.SY

AI-Enhanced Spatial Cellular Traffic Demand Prediction with Contextual Clustering and Error Correction for 5G/6G Planning

Mohamad Alkadamani, Colin Brown, Halim Yanikomeroglu

Comments 5 pages, 8 figures. Submitted to IEEE Wireless Communications Letters

2603.10791 2026-03-12 eess.IV

Semantic Satellite Communications for Synchronized Audiovisual Reconstruction

Fangyu Liu, Peiwen Jiang, Wenjin Wang, Chao-Kai Wen, Xiao Li, Shi Jin

2603.10763 2026-03-12 cs.LG cs.IT eess.SP math.IT

Prioritizing Gradient Sign Over Modulus: An Importance-Aware Framework for Wireless Federated Learning

Yiyang Yue, Jiacheng Yao, Wei Xu, Zhaohui Yang, George K. Karagiannidis, Dusit Niyato

2603.10671 2026-03-12 cs.AR cs.CV eess.IV

An FPGA Implementation of Displacement Vector Search for Intra Pattern Copy in JPEG XS

Qiyue Chen, Yao Li, Jie Tao, Song Chen, Li Li, Dong Liu

2603.10670 2026-03-12 cs.RO cs.SY eess.SY

Dynamic Modeling and Attitude Control of a Reaction-Wheel-Based Low-Gravity Bipedal Hopper

Shriram Hari, M Venkata Sai Nikhil, R Prasanth Kumar

Comments Preprint. Under review

2603.10656 2026-03-12 eess.SY cs.SY

Distributed State Estimation of Discrete-Time LTI Systems via Jordan Canonical Representation

Giulio Fattore, Maria Elena Valcher, Rui Gao, Guang-Hong Yang

Comments Extended version of the conference paper accepted for presentation at the 24th European Control Conference (ECC) in Reykjavík, Iceland

2603.10635 2026-03-12 eess.SP cs.SY eess.SY

Propagation and Rate-Aware Cell Switching Optimization in HAPS-Assisted Wireless Networks

Mehmet Eren Uluçınar, Özgün Ersoy, Berk Ciloglu, Metin Ozturk, Ali Gorcin

2603.10629 2026-03-12 eess.SP

Flexible Multi-Target Angular Emulation for Over-the-Air Testing of Large-Scale ISAC Base Stations: Principle and Experimental Verification

Chunhui Li, Hao Sun, Wei Fan

2603.10623 2026-03-12 eess.AS cs.LG cs.SD

Geo-ATBench: A Benchmark for Geospatial Audio Tagging with Geospatial Semantic Context

Yuanbo Hou, Yanru Wu, Qiaoqiao Ren, Shengchen Li, Stephen Roberts, Dick Botteldooren

Abstract

Environmental sound understanding in computational auditory scene analysis (CASA) is often formulated as an audio-only recognition problem. This formulation leaves a persistent drawback in multi-label audio tagging (AT): acoustic similarity can make certain events difficult to separate from waveforms alone. In such cases, disambiguating cues often lie outside the waveform. Geospatial semantic context (GSC), derived from geographic information system data, e.g., points of interest (POI), provides location-tied environmental priors that can help reduce this ambiguity. A systematic study of this direction is enabled through the proposed geospatial audio tagging (Geo-AT) task, which conditions multi-label sound event tagging on GSC alongside audio. To benchmark Geo-AT, Geo-ATBench is introduced as a polyphonic audio benchmark with geographical annotations, containing 10.71 hours of audio across 28 event categories; each clip is paired with a GSC representation from 11 semantic context categories. GeoFusion-AT is proposed as a unified geo-audio fusion framework that evaluates feature-, representation-, and decision-level fusion on representative audio backbones, with audio- and GSC-only baselines. Results show that incorporating GSC improves AT performance, especially on acoustically confounded labels, indicating that geospatial semantics provide effective priors beyond audio alone. A crowdsourced listening study with 10 participants on 579 samples shows that there is no significant difference in performance between models on Geo-ATBench labels and aggregated human labels, supporting Geo-ATBench as a human-aligned benchmark. The Geo-AT task, the Geo-ATBench benchmark, and the reproducible geo-audio fusion framework GeoFusion-AT provide a foundation for studying AT with geospatial semantic context within the CASA community. The dataset, code, and models are available on the project homepage (https://github.com/WuYanru2002/Geo-ATBench).

2603.10585 2026-03-12 eess.SP

Path Planning for Sound Speed Profile Estimation

Ludvig Lindström, Tadas Paskevicius, Andreas Jakobsson, Isaac Skog

Comments Submitted to FUSION 2026, Trondheim, 6 pages, 7 figures

2603.10549 2026-03-12 cs.CV cs.AI eess.SP

Towards Cognitive Defect Analysis in Active Infrared Thermography with Vision-Text Cues

Mohammed Salah, Eman Ouda, Giuseppe Dell'Avvocato, Fabrizio Sarasini, Ester D'Accardi, Jorge Dias, Davor Svetinovic, Stefano Sfarra, Yusra Abdulrahman

Abstract

Active infrared thermography (AIRT) is currently witnessing a surge of artificial intelligence (AI) methodologies being deployed for automated subsurface defect analysis of high-performance carbon fiber-reinforced polymers (CFRP). Deploying AI-based AIRT methodologies for inspecting CFRPs requires the creation of time-consuming and expensive datasets of CFRP inspection sequences to train neural networks. To address this challenge, this work introduces a novel language-guided framework for cognitive defect analysis in CFRPs using AIRT and vision-language models (VLMs). Unlike conventional learning-based approaches, the proposed framework does not require developing training datasets for extensive training of defect detectors; instead, it relies solely on pretrained multimodal VLM encoders coupled with a lightweight adapter to enable generative zero-shot understanding and localization of subsurface defects. By leveraging pretrained multimodal encoders, the proposed system enables generative zero-shot understanding of thermographic patterns and automatic detection of subsurface defects. Given the domain gap between thermographic data and the natural images used to train VLMs, an AIRT-VLM Adapter is proposed to enhance the visibility of defects while aligning the thermographic domain with the learned representations of VLMs. The proposed framework is validated using three representative VLMs: GroundingDINO, Qwen-VL-Chat, and CogVLM. Validation is performed on 25 CFRP inspection sequences with impacts introduced at different energy levels, reflecting realistic defects encountered in industrial scenarios. Experimental results demonstrate that the AIRT-VLM Adapter achieves signal-to-noise ratio (SNR) gains exceeding 10 dB compared with conventional thermographic dimensionality-reduction methods, while enabling zero-shot defect detection with intersection-over-union values reaching 70%.

2603.10527 2026-03-12 cs.LG cs.SY eess.SY

World Model for Battery Degradation Prediction Under Non-Stationary Aging

Kai Chin Lim, Khay Wai See

Comments 18 pages, 3 figures

2603.10515 2026-03-12 eess.SP

A Harmony Composition-Inspired Tensor Modalization Method for Near-Field IRS Channel Estimation

Wenzhou Cao, Yashuai Cao, Tiejun Lv, Jie Zeng

Comments This work has been accepted for publication in IEEE Transactions on Vehicular Technology

2603.10443 2026-03-12 eess.SP

3D Spectrum Awareness for Radio Dynamic Zones Using Kriging and Matrix Completion

Mushfiqur Rahman, Sung Joon Maeng, Ismail Guvenc, Chau-Wai Wong

Comments Published in IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN), 2024

2603.10426 2026-03-12 cs.IT eess.SP math.IT

3-D Trajectory Optimization for Robust Direction Sensing in Movable Antenna Systems

Wenyan Ma, Lipeng Zhu, Xiaodan Shao, Rui Zhang

2603.10421 2026-03-12 eess.SP cs.NI

Spyglass: Directional Spectrum Sensing with Single-shot AoA Estimation and Virtual Arrays

Raghav Subbaraman, Akshit Agarwal, Wenhao Chen, Dinesh Bharadia

2603.10420 2026-03-12 eess.AS cs.SD

FireRedASR2S: A State-of-the-Art Industrial-Grade All-in-One Automatic Speech Recognition System

Kaituo Xu, Yan Jia, Kai Huang, Junjie Chen, Wenpeng Li, Kun Liu, Feng-Long Xie, Xu Tang, Yao Hu