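arXivDaily daily academic digest: synchronized with the full arXiv data set, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.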

2605.00777 2026-05-04 cs.SD cs.CL eess.AS

LASE: Language-Adversarial Speaker Encoding for Indic Cross-Script Identity Preservation

Venkata Pushpak Teja Menta

Comments 7 pages, 2 figures, 2 tables. Code, model, and datasets at https://github.com/praxelhq/lase

Abstract

A speaker encoder used in multilingual voice cloning should treat the same speaker identically regardless of which script the audio was uttered in. Off-the-shelf encoders do not, and the failure is accent-conditional. On a 1043-pair Western-accented voice corpus across English, Hindi, Telugu, and Tamil, WavLM-base-plus-sv loses 0.082 absolute cosine similarity when the same voice changes script and ECAPA-TDNN loses 0.105. On a 1369-pair Indian-accented voice corpus, the gap shrinks to 0.006 (WavLM-SV) and 0.044 (ECAPA-TDNN). The leak is largest where it matters most for cross-script TTS: when a system projects a non-Indic-trained voice into Indic scripts. We present LASE (Language-Adversarial Speaker Encoder), a small projection head over frozen WavLM-base-plus trained with two losses: a supervised contrastive loss over voice identity, and a gradient-reversal cross-entropy against a 4-language classifier that pushes the embedding to be language-uninformative while remaining speaker-informative. Trained on 1118 quality-gated cross-script pairs synthesised from 8 commercial multilingual voices, LASE's residual gap is consistent with zero on both corpora (Delta = 0.013 Western, Delta = 0.026 Indian; both bootstrap 95% CIs include zero) and amplifies the cross-script-vs-floor margin 2.4-2.7x over both baselines. An ECAPA+GRL ablation shows the GRL objective improves either backbone but the WavLM choice contributes too. In synthetic multi-speaker diarisation, LASE matches ECAPA-TDNN on cross-script speaker recall (0.788 vs 0.789) with ~100x less training data. We release the r1 checkpoint, both corpora, and the bootstrap recipe.
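
The cross-script gap and its bootstrap interval mentioned in the abstract can be illustrated with a minimal sketch. It assumes the gap is the mean same-script cosine similarity minus the mean cross-script cosine similarity, and that the interval is a percentile bootstrap with the two groups resampled independently; the paper's released recipe may differ. The names `cosine` and `bootstrap_gap_ci` are hypothetical and are not taken from the LASE repository.

```python
import math
import random
from statistics import fmean


def cosine(a, b):
    """Cosine similarity between two embedding vectors given as sequences."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))


def bootstrap_gap_ci(same_script, cross_script, n_boot=2000, alpha=0.05, seed=0):
    """Return (gap, lower, upper) for mean(same_script) - mean(cross_script).

    Assumption: a percentile bootstrap that resamples each list of
    similarity scores independently, with replacement.
    """
    rng = random.Random(seed)
    gaps = []
    for _ in range(n_boot):
        s = rng.choices(same_script, k=len(same_script))
        c = rng.choices(cross_script, k=len(cross_script))
        gaps.append(fmean(s) - fmean(c))
    gaps.sort()
    lower = gaps[int((alpha / 2) * (n_boot - 1))]
    upper = gaps[int((1 - alpha / 2) * (n_boot - 1))]
    return fmean(same_script) - fmean(cross_script), lower, upper
```

Under this reading, a gap is "consistent with zero" when the returned interval contains 0.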

2605.00769 2026-05-04 eess.SY cs.SY

Voltage Ride-Through in Large Loads - A Dual PQ Approach

Amir Norouzi, Michael Morel

Comments 10 pages

2605.00761 2026-05-04 cs.IT eess.SP math.IT

The Benefit of Decoder-Provided Pilots in Highly Dynamic Channels

Duschia Bodet, Muriel Médard, Muralidhar Rangaswamy, Ken Duffy

Comments This work has been submitted to the IEEE for possible publication

2605.00752 2026-05-04 eess.SY cs.LO cs.SY

HyperCertificates: Verification of Discrete-time Dynamical Systems against HyperLTL Specifications

Vishnu Murali, Amin Falah, Ashutosh Trivedi, Majid Zamani

Comments 24 pages, 3 figures, 1 table

2605.00746 2026-05-04 q-bio.NC eess.SP physics.optics

Functional Connectivity-Guided Band Selection for Motor Imagery Brain-Computer Interfaces

Natália Araújo do Carmo, Aarthy Nagarajan

Abstract

Reliable control in motor imagery brain-computer interfaces (MI-BCIs) requires the precise decoding of user-specific neural rhythms, which vary significantly across individuals. The Common Spatial Pattern (CSP) algorithm is a cornerstone of MI-BCI decoding, yet its performance depends strongly on the spectral range of the input EEG data. Although Filter Bank CSP (FBCSP) extends this as a data-driven decoding framework, its frequency sub-bands are predefined rather than selected using subject-specific physiological criteria. This paper presents a proof-of-concept study of static functional connectivity (FC)-guided band selection for MI-BCI, demonstrated using a conventional FBCSP-based pipeline. The proposed method identifies the most discriminative spectral bands by calculating phase-based connectivity across four sensorimotor channels using wPLI, PLV, and PLI. Nine bands in a 4-40 Hz filter bank are ranked by the effect size of their hemispheric coupling differences and pruned to the top K bands for feature extraction and classification via FBCSP and a Support Vector Regressor. This framework was tested for K values ranging from 1 to 8 across the BCI Competition IV-2a (n = 9) and OpenBMI (n = 54) datasets. Performance was benchmarked against standard nine-band FBCSP and random ablation to determine the minimum number of bands (K*) required to maintain accuracy within a 2% baseline equivalence zone. Results show FC-guided selection can outperform random ablation and achieve near-baseline performance while reducing required CSP fits by 22.2% to 77.8%. PLV enables the most aggressive dimensionality reduction by prioritizing the μ and low-β ranges, while wPLI demonstrates superior inter-session robustness by mitigating volume conduction. These findings establish FC-guided selection as a principled and interpretable alternative to heuristic filter bank designs.
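
As a rough illustration of the phase-based measures and the pruning step named in the abstract, the sketch below uses the standard textbook definitions of PLV and PLI on precomputed instantaneous phases (in radians). It is not the authors' pipeline. The function names, the ranking by absolute effect size, and the assumption that one CSP fit is needed per retained band are all hypothetical choices made here for illustration.

```python
import cmath
import math


def plv(phase_a, phase_b):
    """Phase Locking Value: magnitude of the mean unit phasor of the phase difference."""
    n = len(phase_a)
    return abs(sum(cmath.exp(1j * (a - b)) for a, b in zip(phase_a, phase_b)) / n)


def pli(phase_a, phase_b):
    """Phase Lag Index: magnitude of the mean sign of sin(phase difference)."""
    def sign(x):
        return (x > 0) - (x < 0)
    n = len(phase_a)
    return abs(sum(sign(math.sin(a - b)) for a, b in zip(phase_a, phase_b)) / n)


def top_k_bands(effect_sizes, k):
    """Keep the k bands with the largest absolute effect size (assumed ranking rule)."""
    ranked = sorted(effect_sizes, key=lambda band: abs(effect_sizes[band]), reverse=True)
    return ranked[:k]


def csp_fit_reduction(k, n_bands=9):
    """Fraction of CSP fits saved when only k of n_bands are kept (one fit per band assumed)."""
    return (n_bands - k) / n_bands
```

With a nine-band filter bank, `csp_fit_reduction(7)` is about 22.2% and `csp_fit_reduction(2)` is about 77.8%, which would be consistent with the range quoted in the abstract if the minimum band count K* falls between 2 and 7.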

2605.00734 2026-05-04 eess.SY cs.SY

Economic Valuation and Optimal Deployment of Static Synchronous Series Compensators for U.S. Power System Expansion

Wei Ai, Vladimir Dvorkin, Michael T. Craig

Comments 10 pages, 7 figures

2601.05949 2026-05-04 eess.SY cs.SY math.SP

Generalized Spectral Clustering of Low-Inertia Power Networks

Gerald Ogbonna, C. Lindsay Anderson

Comments This manuscript has been submitted to IEEE Transactions on Power Systems

2509.26388 2026-05-04 eess.AS cs.AI cs.CL

Game-Time: Evaluating Temporal Dynamics in Spoken Language Models

Kai-Wei Chang, En-Pei Hu, Chun-Yi Kuan, Wenze Ren, Wei-Chih Chen, Guan-Ting Lin, Yu Tsao, Shao-Hua Sun, Hung-yi Lee, James Glass

Comments Accepted to ICASSP 2026

2605.00726 2026-05-04 eess.SY cs.SY

Multi-Regional Traffic Control with Travel and Charging Demand Co-Management

Yixun Wen, Stelios Timotheou, Boli Chen

2605.00721 2026-05-04 cs.SD cs.AI eess.AS eess.SP

Towards Improving Speaker Distance Estimation through Generative Impulse Response Augmentation

Anton Ratnarajah, Mehmet Ergezer, Arun Nair, Mrudula Athi

Comments Accepted to Generative Data Augmentation for Real-World Signal Processing Applications (GenDA 2025). An ICASSP 2025 Satellite Workshop and IEEE Data Science and Learning Workshop: Room Acoustics and Speaker Distance Estimation Challenge

2605.00698 2026-05-04 eess.IV cs.LG

FedKPer: Tackling Generalization and Personalization in Medical Federated Learning via Knowledge Personalization

Zoe Fowler, Ghassan AlRegib

Comments Accepted to IEEE International Conference on Image Processing (ICIP)

2605.00681 2026-05-04 eess.SY cs.SY

Deployment-Efficient Short-Term Load Forecasting in AI Data Centers via Sequence-to-Point Knowledge Distillation

Lei Wang, Jiahao Chen, Fanping Sui, Ying Zhang, Di Shi

Comments 7 pages, 4 figures, 3 tables

2605.00630 2026-05-04 cs.CV cs.MM eess.IV

CMTA: Leveraging Cross-Modal Temporal Artifacts for Generalizable AI-Generated Video Detection

Hang Wang, Chao Shen, Chenhao Lin, Minghui Yang, Lei Zhang, Cong Wang

Comments 15 pages, 4 figures

2605.00607 2026-05-04 cs.CL eess.AS

Beyond Decodability: Reconstructing Language Model Representations with an Encoding Probe

Gaofei Shen, Martijn Bentum, Tom Lentz, Afra Alishahi, Grzegorz Chrupała

2605.00585 2026-05-04 eess.SP cs.NA math.NA

Local Geometry of Least Squares for Unmixing Signals with Parameter-Dependent Dictionaries

Santos Michelena, Maxime Ferreira Da Costa, José Picheral

Comments 13 pages, 11 figures. Submitted to IEEE TSP

2605.00535 2026-05-04 eess.SP

From Pilot to Precoding Design: Blind Angular Spoofing For Location Privacy in MIMO Systems

Priyanka Maity, Lorenzo Italiano, Alireza Pourafzal, Gonzalo Seco-Granados, Hui Chen, Monica Nicoli, Henk Wymeersch

2605.00527 2026-05-04 eess.IV cs.CV cs.LG

Multi-frame Restoration for High-rate Lissajous Confocal Laser Endomicroscopy

Minhee Lee, Sangyoon Lee, Jiwook Lee, Minki Hong, Kyuyoung Kim, Wonhwa Kim, Jaeho Lee

2605.00494 2026-05-04 eess.AS

Transformer-based End-to-End Control Filter Generation for Active Noise Control

Ziyi Yang, Zhengding Luo, Yisong Zou, Boxiang Wang, Qirui Huang, Woon-Seng Gan

2605.00486 2026-05-04 eess.SP

Development of Multivariate Attention LSTM Model For Dynamic Line Rating Forecasting

Anushka Bandara, Sahan Siriwardena, Akila Wijethunge, Janaka Ekanayake

2605.00461 2026-05-04 eess.IV cs.CV

Combined Dictionary Unfolding Network with Gradient-Adaptive Fidelity for Transferable Multi-Source Fusion

Ge Luo, Jun-Jie Huang, Qi Yu, Tianrui Liu, Ke Liang, Yuming Xiang, Wentao Zhao, Xinwang Liu, Meng Wang

2605.00458 2026-05-04 cs.LG eess.SP

Federated Learning with Hypergradient-based Online Update of Aggregation Weights

Ayano Nakai-Kasai, Tadashi Wadayama

2605.00449 2026-05-04 cs.IT cs.LG eess.SP math.IT

Soft Graph Diffusion Transformer for MIMO Detection

Nan Jiang, Jiadong Hong, Lei Liu, Xinyu Bian, Wenjie Wang, Zhaoyang Zhang

Comments 6 pages, 4 figures, 2 tables

2605.00448 2026-05-04 cs.CV eess.IV

Learning from Compressed CT: Feature Attention Style Transfer and Structured Factorized Projections for Resource-Efficient Medical Image Analysis

Shadid Yousuf, S. M. Mahbubur Rahman, Mohammed Imamul Hassan Bhuiyan

2605.00431 2026-05-04 cs.SD cs.CV cs.LG eess.AS

MMAudioReverbs: Video-Guided Acoustic Modeling for Dereverberation and Room Impulse Response Estimation

Akira Takahashi, Ryosuke Sawata, Shusuke Takahashi, Yuki Mitsufuji

Comments Accepted to the CVPR 2026 Sight and Sound Workshop

2605.00428 2026-05-04 stat.ME cs.PF cs.SY eess.SY

How to Do Statistical Evaluations in ECE/CS Papers: A Practical Playbook for Defensible Results

Bhaskar Krishnamachari

Comments 30 pages, 8 figures; Tutorial paper; companion student workbook and Claude skill available as ancillary material

2605.00404 2026-05-04 eess.SY cs.SY

Electric Grid Topology and Admittance Estimation using Phasor Measurements

Norak Rin, Iman Shames, Ian Petersen, Elizabeth Ratnam

2605.00329 2026-05-04 cs.SD eess.AS

Fast Text-to-Audio Generation with One-Step Sampling via Energy-Scoring and Auxiliary Contextual Representation Distillation

Kuan-Po Huang, Bo-Ru Lu, Byeonggeun Kim, Mihee Lee, Zalan Fabian, Renard Korzeniowski, Qingming Tang, Greg Ver Steeg, Hung-yi Lee, Chieh-Chi Kao, Chao Wang

2605.00317 2026-05-04 eess.SY cs.SY

Real-Time Neural Distributed Energy Resources Dispatch with Feasibility Guarantees

Jie Zhu, Yinliang Xu, Hongbin Sun

2605.00306 2026-05-04 cs.IT eess.SP math.IT

Artificial-Noise Aided Design for Movable-Antenna Enabled Physical-Layer Service Integration

Zhifeng Tang, Guangchen Wang, Nan Yang, Xiangyun Zhou, Salman Durrani

2605.00258 2026-05-04 cs.IT cs.SY eess.SY math.IT

Joint Accuracy and Confidentiality in Semantic-Aware Secure Remote Reconstruction

Bowen Li, Nikolaos Pappas