arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.09472 2026-04-13 eess.AS

Data Selection Effects on Self-Supervised Learning of Audio Representations for French Audiovisual Broadcasts

Valentin Pelloin, Lina Bekkali, Reda Dehak, David Doukhan

Comments To be published in the Fifteenth International Conference on Language Resources and Evaluation (LREC 2026)

2604.09468 2026-04-13 eess.IV cs.CV

DSVTLA: Deep Swin Vision Transformer-Based Transfer Learning Architecture for Multi-Type Cancer Histopathological Cancer Image Classification

Muazzem Hussain Khan, Tasdid Hasnain, Md. Jamil khan, Ruhul Amin, Md. Shamim Reza, Md. Al Mehedi Hasan, Md Ashad Alam

Comments 25 [ages. 9 Figures

详情

英文摘要

In this study, we proposed a deep Swin-Vision Transformer-based transfer learning architecture for robust multi-cancer histopathological image classification. The proposed framework integrates a hierarchical Swin Transformer with ResNet50-based convolution features extraction, enabling the model to capture both long-range contextual dependencies and fine-grained local morphological patterns within histopathological images. To validate the efficiency of the proposed architecture, an extensive experiment was executed on a comprehensive multi-cancer dataset including Breast Cancer, Oral Cancer, Lung and Colon Cancer, Kidney Cancer, and Acute Lymphocytic Leukemia (ALL), including both original and segmented images were analyzed to assess model robustness across heterogeneous clinical imaging conditions. Our approach is benchmarked alongside several state-of-the-art CNN and transfer models, including DenseNet121, DenseNet201, InceptionV3, ResNet50, EfficientNetB3, multiple ViT variants, and Swin Transformer models. However, all models were trained and validated using a unified pipeline, incorporating balanced data preprocessing, transfer learning, and fine-tuning strategies. The experimental results demonstrated that our proposed architecture consistently gained superior performance, reaching 100% test accuracy for lung-colon cancer, segmented leukemia datasets, and up to 99.23% accuracy for breast cancer classification. The model also achieved near-perfect precision, f1 score, and recall, indicating highly stable scores across divers cancer types. Overall, the proposed model establishes a highly accurate, interpretable, and also robust multi-cancer classification system, demonstrating strong benchmark for future research and provides a unified comparative assessment useful for designing reliable AI-assisted histopathological diagnosis and clinical decision-making.

URL PDF HTML ☆

赞 0 踩 0

2604.09446 2026-04-13 eess.SP cs.LG

Continuous Orthogonal Mode Decomposition: Haptic Signal Prediction in Tactile Internet

Mohammad Ali Vahedifar, Mojtaba Nazari, Qi Zhang

2604.09421 2026-04-13 eess.IV cs.CV cs.MM

Multi-task Just Recognizable Difference for Video Coding for Machines: Database, Model, and Coding Application

Junqi Liu, Yun Zhang, Xiaoxia Huang, Long Xu, Weisi Lin

Comments Submitted to IEEE Transactions on Circuits and Systems for Video Technology

详情

英文摘要

Just Recognizable Difference (JRD) boosts coding efficiency for machine vision through visibility threshold modeling, but is currently limited to a single-task scenario. To address this issue, we propose a Multi-Task JRD (MT-JRD) dataset and an Attribute-assisted MT-JRD (AMT-JRD) model for Video Coding for Machines (VCM), enhancing both prediction accuracy and coding efficiency. First, we construct a dataset comprising 27,264 JRD annotations from machines, supporting three representative tasks including object detection, instance segmentation, and keypoint detection. Secondly, we propose the AMT-JRD prediction model, which integrates Generalized Feature Extraction Module (GFEM) and Specialized Feature Extraction Module (SFEM) to facilitate joint learning across multiple tasks. Thirdly, we innovatively incorporate object attribute information into object-wise JRD prediction through the Attribute Feature Fusion Module (AFFM), which introduces prior knowledge about object size and location. This design effectively compensates for the limitations of relying solely on image features and enhances the model's capacity to represent the perceptual mechanisms of machine vision. Finally, we apply the AMT-JRD model to VCM, where the accurately predicted JRDs are applied to reduce the coding bit rate while preserving accuracy across multiple machine vision tasks. Extensive experimental results demonstrate that AMT-JRD achieves precise and robust multi-task prediction with a mean absolute error of 3.781 and error variance of 5.332 across three tasks, outperforming the state-of-the-art single-task prediction model by 6.7% and 6.3%, respectively. Coding experiments further reveal that compared to the baseline VVC and JPEG, the AMT-JRD-based VCM improves an average of 3.861% and 7.886% Bjontegaard Delta-mean Average Precision (BD-mAP), respectively.

URL PDF HTML ☆

赞 0 踩 0

2604.09363 2026-04-13 eess.SP

GreenScatter: Through-Canopy Soil Moisture Sensing with UAV-Mounted Radar

Luke Jacobs, Ishfaq Aziz, Benhao Lu, Alireza Tabatabaeenejad, Mohamad Alipour, Elahe Soltanaghai

2604.09351 2026-04-13 eess.SY cs.MA cs.RO cs.SY

Decentralized Opinion-Integrated Decision making at Unsignalized Intersections via Signed Networks

Bhaskar Varma, Ying Shuai Quan, Karl D. von Ellenrieder, Paolo Falcone

Comments Submitted to CDC 2026 with L-CSS Parallel option

2604.09339 2026-04-13 eess.SP

Periodic OFDMA: A Low-PAPR Multiple Access Scheme for Uplink Communications in 5G and Beyond

Gokce Hacioglu, Serkan Vela

2604.09335 2026-04-13 eess.SP

Optimal symmetric low-rank BD-RIS configuration maximizing the determinant of a MIMO link

Ignacio Santamaria, Mohammad Soleymani, Jesus Gutierrez, Eduard Jorswieck

Comments 12 pages, 5 figures

2604.09332 2026-04-13 eess.AS

Phonemes vs. Projectors: An Investigation of Speech-Language Interfaces for LLM-based ASR

Ziwei Li, Lukuang Dong, Saierdaer Yusuyin, Xianyu Zhao, Zhijian Ou

Comments Update after INTERSPEECH2026 submission

2604.09331 2026-04-13 cs.LG cs.SY eess.SY

Stability Enhanced Gaussian Process Variational Autoencoders

Carl R. Richardson, Jichen Zhang, Ethan King, Ján Drgoňa

2604.09321 2026-04-13 eess.IV cs.CV

UHD Low-Light Image Enhancement via Real-Time Enhancement Methods with Clifford Information Fusion

Xiaohan Wang, Chen Wu, Dawei Zhao, Guangwei Gao, Dianjie Lu, Guijuan Zhang, Linwei Fan, Xu Lu, Shuai Wu, Hang Wei, Zhuoran Zheng

2604.09313 2026-04-13 eess.IV cs.CV

Compositional-Degradation UAV Image Restoration: Conditional Decoupled MoE Network and A Benchmark

Jinquan Yan, Zhicheng Zhao, Zhengzheng Tu, Chenglong Li, Jin Tang, Bin Luo

详情

英文摘要

UAV images are critical for applications such as large-area mapping, infrastructure inspection, and emergency response. However, in real-world flight environments, a single image is often affected by multiple degradation factors, including rain, haze, and noise, undermining downstream task performance. Current unified restoration approaches typically rely on implicit degradation representations that entangle multiple factors into a single condition, causing mutual interference among heterogeneous corrections. To this end, we propose DAME-Net, a Degradation-Aware Mixture-of-Experts Network that decouples explicit degradation perception from degradation-conditioned reconstruction for compositional UAV image restoration. Specifically, we design a Factor-wise Degradation Perception module(FDPM) to provide explicit per-factor degradation cues for the restoration stage through multi-label prediction with label-similarity-guided soft alignment, replacing implicit entangled conditions with interpretable and generalizable degradation descriptions. Moreover, we develop a Conditioned Decoupled MoE module(CDMM) that leverages these cues for stage-wise conditioning, spatial-frequency hybrid processing, and mask-constrained decoupled expert routing, enabling selective factor-specific correction while suppressing irrelevant interference. In addition, we construct the Multi-Degradation UAV Restoration benchmark (MDUR), the first large-scale UAV benchmark for compositional UAV image restoration, with 43 degradation configurations from single degradations to four-factor composites and standardized seen/unseen splits.Extensive experiments on MDUR demonstrate consistent improvements over representative unified restoration methods, with greater gains on unseen and higher-order composite degradations. Downstream experiments further validate benefits for UAV object detection.

URL PDF HTML ☆

赞 0 踩 0

2604.09303 2026-04-13 cs.RO cs.LG cs.SY eess.SY

Online Intention Prediction via Control-Informed Learning

Tianyu Zhou, Zihao Liang, Zehui Lu, Shaoshuai Mou

2604.09295 2026-04-13 quant-ph eess.SP

Dyadic-Order Quantum Fractional Transforms: Circuit Constructions and Applications to Hartley and Cosine Transform Families

Matheus J. A. Oliveira, Israel F. Araujo, José R. de Oliveira Neto, Juliano B. Lima

Comments 16 pages 8 figures

2603.20489 2026-04-13 eess.SP

Realization of a Fully Connected Neural Layer Over-the-Air through Multi-hop Amplify-and-Forward Relays

Tolga Girici, Meng Hua, Deniz Gündüz

Comments Accepted to VTC 2026 Spring, Nice, France

2512.22187 2026-04-13 cs.RO cs.ET cs.SY eess.SY

Joint UAV-UGV Positioning and Trajectory Planning via Meta A3C for Reliable Emergency Communications

Ndagijimana Cyprien, Mehdi Sookhak, Hosein Zarini, Chandra N Sekharan, Mohammed Atiquzzaman

2511.13499 2026-04-13 math.OC cs.SY eess.SY

Uniform Feasibility For Smoothed Backup Control Barrier Functions

Anil Alan, Bart De Schutter

Comments 8 pages, final version for ECC 2026

2511.08451 2026-04-13 math.OC cs.SY eess.SY

Solving Quadratic Programs with Slack Variables via ADMM without Increasing the Problem Size

Thomas Lew, Marcus Greiff, John Subosits, Brian Plancher

Comments European Control Conference (ECC) 2026

2409.04765 2026-04-13 math.OC cs.SY eess.SY

Continuous-Time Distributed Seeking for Variational Generalized Nash Equilibrium of Online Game

Jianing Chen, Sichen Qian, Chuangyin Dang, Sitian Qin

Comments Accepted by IEEE Transactions on Automatic Control

2604.09280 2026-04-13 eess.IV cs.CV

AMO-ENE: Attention-based Multi-Omics Fusion Model for Outcome Prediction in Extra Nodal Extension and HPV-associated Oropharyngeal Cancer

Gautier Hénique, William Le, Gabriel Dayan, Coralie Brodeur, Kristoff Nelson, Apostolos Christopoulos, Edith Filion, Phuc-Felix Nguyen-Tan, Laurent Letourneau-Guillon, Houda Bahig, Samuel Kadoury

详情

英文摘要

Extranodal extension (ENE) is an emerging prognostic factor in human papillomavirus (HPV)-associated oropharyngeal cancer (OPC), although it is currently omitted as a clinical staging criteria. Recent works have advocated for the inclusion of iENE as a prognostic marker in HPV-positive OPC staging. However, several practical limitations continue to hinder its clinical integration, including inconsistencies in segmentation, low contrast in the periphery of metastatic lymph nodes on CT imaging, and laborious manual annotations. To address these limitations, we propose a fully automated end-to-end pipeline that uses computed tomography (CT) images with clinical data to assess the status of nodal ENE and predict treatment outcomes. Our approach includes a hierarchical 3D semi-supervised segmentation model designed to detect and delineate relevant iENE from radiotherapy planning CT scans. From these segmentations, a set of radiomics and deep features are extracted to train an imaging-detected ENE grading classifier. The predicted ENE status is then evaluated for its prognostic value and compared with existing staging criteria. Furthermore, we integrate these nodal features with primary tumor characteristics in a multimodal, attention-based outcome prediction model, providing a dynamic framework for outcome prediction. Our method is validated in an internal cohort of 397 HPV-positive OPC patients treated with radiation therapy or chemoradiotherapy between 2009 and 2020. For outcome prediction at the 2-year mark, our pipeline surpassed baseline models with 88.2% (4.8) in AUC for metastatic recurrence, 79.2% (7.4) for overall survival, and 78.1% (8.6) for disease-free survival. We also obtain a concordance index of 83.3% (6.5) for metastatic recurrence, 71.3% (8.9) for overall survival, and 70.0% (8.1) for disease-free survival, making it feasible for clinical decision making.

URL PDF HTML ☆

赞 0 踩 0

2604.09267 2026-04-13 eess.SY cs.SY

On the Existence of Quadratic Control Lyapunov Functions for Koopman-Operator based Bilinear Systems

Sami Leon Noel Aziz Hanna, Nicolas Hoischen, Sandra Hirche, Armin Lederer

Comments Accepted at the European Control Conference (ECC)

2604.09261 2026-04-13 eess.SP

Joint Device Pairing and Bandwidth Allocation Optimisation for Semantic Feature Multiple Access Networks

Jiaxiang Wang, Zhaohui Yang, Mingzhe Chen, Mohammad Shikh-Bahaei

Comments 6 pages, 3 figures, accepted by ICC 2026 workshop

2604.09255 2026-04-13 eess.SP

Semantic Feature Multiple Access Empowered Integrated Learning and Communication Networks

Jiaxiang Wang, Zhouxiang Zhao, Yahao Ding, Zhijin Qin, Zhaohui Yang, Mingzhe Chen, Mohammad Shikh-Bahaei

Comments 13 pages, 8 figures

2604.09252 2026-04-13 math.OC cs.SY eess.SY

A Unified Control-Theoretic Framework for Saddle-Point Dynamics in Constrained Optimization

Veronica Centorrino, Rawan Hoteit, Efe C. Balta, John Lygeros

Comments 8 Pages, 3 Figures

2604.09233 2026-04-13 eess.IV

A GPU-enhanced workflow for non-Fourier SENSE reconstruction

Samuel Bianchi, Klaas P. Pruessmann

Comments 31 pages, 10 figures, 1 table

2604.09227 2026-04-13 eess.IV cs.CV

Training-free, Perceptually Consistent Low-Resolution Previews with High-Resolution Image for Efficient Workflows of Diffusion Models

Wongi Jeong, Hoigi Seo, Se Young Chun

2604.09179 2026-04-13 eess.SY cs.SY math.DS

Discrete-Time Model of a Two-Speed PowerShift suitable for Real-Time Control and Simulation

Riccardo Morselli, Davide Tebaldi, Roberto Zanasi

2604.09128 2026-04-13 eess.SP

Flexible Cylindrical Array-Aided Secure Wireless Communications

Xiangyu Dong, Ran Yang, Songjie Yang, Weidong Mei, Lipeng Zhu, Yue Xiu, Zhongpei Zhang

2604.09118 2026-04-13 eess.SY cs.SY

Efficient Uniform Feasible Set Sampling for Approximate Linear MPC

Elias Milios, Felix Berkel, Felix Gruber, Melanie N. Zeilinger, Kim P. Wabersich

2604.09102 2026-04-13 eess.SY cs.SY

Scheduling Cause-Effect Chains without Timing Anomalies in End-to-End Latency

Yixuan Zhu, Bo Zhang, Yinkang Gao, Haoyuan Ren, Cheng Tang, Caixu Zhao, Lei Gong, Teng Wang, Wenqi Lou, Xi Li