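arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and related fields.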

2604.25897 2026-04-29 cs.RO cs.LG cs.SY eess.SY

Variational Neural Belief Parameterizations for Robust Dexterous Grasping under Multimodal Uncertainty

Clinton Enwerem, Shreya Kalyanaraman, John S. Baras, Calin Belta

Comments 11 pages, 10 figures

Abstract

Contact variability, sensing uncertainty, and external disturbances make grasp execution stochastic. Expected-quality objectives ignore tail outcomes and often select grasps that fail under adverse contact realizations. Risk-sensitive POMDPs address this failure mode, but many use particle-filter beliefs that scale poorly, obstruct gradient-based optimization, and estimate Conditional Value-at-Risk (CVaR) with high-variance approximations. We instead formulate grasp acquisition as variational inference over latent contact parameters and object pose, representing the belief with a differentiable Gaussian mixture. We use Gumbel-Softmax component selection and location-scale reparameterization to express samples as smooth functions of the belief parameters, enabling pathwise gradients through a differentiable CVaR surrogate for direct optimization of tail robustness. In simulation, our variational neural belief improves robust grasp success under contact-parameter uncertainty and exogenous force perturbations while reducing planning time by roughly an order of magnitude relative to particle-filter model-predictive control. On a serial-chain robot arm with a multifingered hand, we validate grasp-and-lift success under object-pose uncertainty against a Gaussian baseline. Both methods succeed on the tested perturbations, but our controller terminates in fewer steps and less wall-clock time while achieving a higher tactile grasp-quality proxy. Our learned belief also calibrates risk more accurately, keeping mean absolute calibration error below 0.14 across tested simulation regimes, compared with 0.58 for a Cross-Entropy Method planner.
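As a rough illustration of the ideas named in the abstract above, the Python sketch below draws a relaxed sample from a one-dimensional Gaussian mixture using Gumbel-Softmax weights and a location-scale transform, then scores a batch of samples with an empirical CVaR. It is not the authors' implementation: the function names, the scalar setting, the blending of component samples by relaxed weights, and the toy loss are all assumptions made for illustration, and plain Python floats carry no gradients.

```python
import math
import random


def gumbel_softmax_weights(logits, tau, rng):
    """Relaxed one-hot weights: softmax((logits + Gumbel noise) / tau)."""
    noisy = []
    for logit in logits:
        u = rng.random()
        while u <= 0.0:  # random() can return 0.0; avoid log(0)
            u = rng.random()
        gumbel = -math.log(-math.log(u))
        noisy.append((logit + gumbel) / tau)
    top = max(noisy)  # subtract the max for numerical stability
    exps = [math.exp(v - top) for v in noisy]
    total = sum(exps)
    return [e / total for e in exps]


def relaxed_mixture_sample(logits, means, stds, tau, rng):
    """Location-scale sample mu_k + sigma_k * eps, blended by relaxed weights.

    Assumption: blending components by the relaxed weights is one simple way
    to make the sample a smooth function of (logits, means, stds).
    """
    weights = gumbel_softmax_weights(logits, tau, rng)
    eps = rng.gauss(0.0, 1.0)
    return sum(w * (mu + sd * eps) for w, mu, sd in zip(weights, means, stds))


def empirical_cvar(losses, alpha):
    """Mean of the worst ceil(alpha * n) losses (larger loss = worse)."""
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    if not losses:
        raise ValueError("losses must be non-empty")
    k = math.ceil(alpha * len(losses))
    worst = sorted(losses, reverse=True)[:k]
    return sum(worst) / k


# Hypothetical usage: a two-component belief over a scalar contact parameter.
rng = random.Random(0)
samples = [
    relaxed_mixture_sample([0.0, 1.0], [0.3, 0.8], [0.05, 0.1], 0.5, rng)
    for _ in range(200)
]
toy_losses = [abs(s - 0.5) for s in samples]  # toy loss, not a grasp metric
print("CVaR of worst 10%:", empirical_cvar(toy_losses, 0.1))
```

In an automatic-differentiation framework the same two transforms would let gradients of a tail-risk objective flow back to the mixture parameters, which is the property the abstract refers to as pathwise gradients.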

2604.25887 2026-04-29 cs.CV cs.AI cs.RO cs.SY eess.SY

No Pedestrian Left Behind: Real-Time Detection and Tracking of Vulnerable Road Users for Adaptive Traffic Signal Control

Anas Gamal Aly, Hala ElAarag

Comments © Anas Gamal Aly and Hala ElAarag, 2026. This is the authors' version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record will be published in Proceedings of the 2026 ACM Southeast Conference (ACMSE 2026)

2604.25815 2026-04-29 eess.SY cs.SY math.AP

Backstepping Observer for the Quasilinear Heat Equation with Linear Design Gains: Beyond Local Stability

Mohamed Camil Belhadjoudja, Kirsten A. Morris

Comments This is a working document of a work in progress

2604.25777 2026-04-29 eess.SP cs.DC

SpecFed: Accelerating Federated LLM Inference with Speculative Decoding and Compressed Transmission

Ce Zheng, Xinghan Wang, Jiahong Ning, Yuxuan Shi, Ning Huang, Tingting Yang

Comments IEEE International Symposium on Information Theory (ISIT), 2026

2604.25757 2026-04-29 cs.CR cs.AI cs.RO cs.SY eess.SY

Threat-Oriented Digital Twinning for Security Evaluation of Autonomous Platforms

Thomas J. Neubert, Laxima Niure Kandel, Berker Peköz

Comments Camera ready accepted for presentation at and publication in the proceedings of 2026 56th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W): Dependable and Secure Autonomous Systems (DSAS)

2604.25738 2026-04-29 eess.SY cs.SY

Local Shifted Passivity Analysis of the Single-Machine Infinite-Bus System

Xinyuan Jiang

Comments 14 pages

2604.25728 2026-04-29 eess.SP

Joint Design of Doppler-Resilient Unimodular Discrete-Phase Waveforms and Receiving Filters for MIMO Radars

Junpeng Ma, Yuke Li, Junbo Wang, Yongxing Zhou

Comments 14 pages, 7 figures

2604.25685 2026-04-29 eess.IV cs.CV

Robustness Evaluation of a Foundation Segmentation Model Under Simulated Domain Shifts in Abdominal CT: Implications for Health Digital Twin Deployment

Sanghati Basu

Comments 8 Pages, 5 Tables, 2 Figures

2604.25680 2026-04-29 cs.CV eess.IV

Exploring Remote Photoplethysmography for Neonatal Pain Detection from Facial Videos

Ashutosh Dhamaniya, Anup Kumar Gupta, Trishna Saikia, Puneet Gupta

Comments 25 pages, 9 figures, 10 tables. Proposed rPPG-based method for neonatal pain detection from facial videos, with multimodal (rPPG + audio) analysis and extensive ablation studies on the iCOPEvid dataset

2604.25650 2026-04-29 cs.SE cs.SY eess.SY

Using Large Language Models for Black-Box Testing of FMU-Based Simulations

Abdullah Mughees, Gaadha Sudheerbabu, Tanwir Ahmad, Dragos Truscan, Mikael Manngård, Kristian Klemets

2604.25624 2026-04-29 eess.AS

UNet-Based Fusion and Exponential Moving Average Adaptation for Noise-Robust Speaker Recognition

Chong-Xin Gan, Peter Bell, Man-Wai Mak, Zhe Li, Zezhong Jin, Zilong Huang, Kong Aik Lee

Comments Submitted to Interspeech 2026

2604.25592 2026-04-29 q-bio.NC eess.SP

A geometry aware framework enhances noninvasive mapping of whole human brain dynamics

Song Wang, Kexin Lou, Chen Wei, Zhiyuan Sheng, Jiahao Tang, Kaining Peng, Xinke Shen, Shuhao Mei, Liang Chen, Dongfeng Gu, Quanying Liu

2604.25591 2026-04-29 eess.AS cs.AI cs.CL cs.LG cs.SD

Walking Through Uncertainty: An Empirical Study of Uncertainty Estimation for Audio-Aware Large Language Models

Chun-Yi Kuan, Wei-Ping Huang, Hung-yi Lee

Comments Manuscript in progress

2604.25541 2026-04-29 eess.SP cs.RO

Bridging the Indoor-Outdoor Gap: Cross-Technology Ranging for Seamless Robot Navigation

Paul Schwarzbach

2604.22821 2026-04-29 cs.SD cs.LG eess.AS

Audio2Tool: Speak, Call, Act -- A Dataset for Benchmarking Speech Tool Use

Ramit Pahwa, Apoorva Beedu, Parivesh Priye, Rutu Gandhi, Saloni Takawale, Aruna Baijal, Zengli Yang

2512.22578 2026-04-29 eess.SP

A Novel Geometry-Aware GPR-Based Energy-Efficient and Low-Overhead Channel Estimation Scheme

Syed Luqman Shah, Nurul Huda Mahmood

Comments Submitted for possible publication in IEEE

2512.05552 2026-04-29 eess.SY cs.SY

Inverse Linear-Quadratic Gaussian Differential Games

Lucas Günther, Felix Thömmes, Karl Handwerker, Balint Varga, Sören Hohmann

2509.08470 2026-04-29 eess.AS cs.AI

Joint Learning using Mixture-of-Expert-Based Representation for Speech Enhancement and Robust Emotion Recognition

Jing-Tong Tzeng, Carlos Busso, Chi-Chun Lee

Comments Accepted by IEEE Transactions on Audio, Speech and Language Processing (TASLP)

DOI: 10.1109/TASLPRO.2026.3688928

Abstract

Speech emotion recognition (SER) plays a critical role in building emotion-aware speech systems, but its performance degrades significantly under noisy conditions. Although speech enhancement (SE) can improve robustness, it often introduces artifacts that obscure emotional cues and adds computational overhead to the pipeline. Multi-task learning (MTL) offers an alternative by jointly optimizing SE and SER tasks. However, conventional shared-backbone models frequently suffer from gradient interference and representational conflicts between tasks. To address these challenges, we propose the Sparse Mixture-of-Experts Representation Integration Technique (Sparse MERIT), a flexible MTL framework that applies frame-wise expert routing over self-supervised speech representations. Sparse MERIT incorporates task-specific gating networks that dynamically select from a shared pool of experts for each frame, enabling parameter-efficient and task-adaptive representation learning. Experiments on the MSP-Podcast corpus show that Sparse MERIT consistently outperforms baseline models on both SER and SE tasks. Under the most challenging condition of -5 dB signal-to-noise ratio (SNR), Sparse MERIT improves SER F1-macro by an average of 12.0% over a baseline relying on a SE pre-processing strategy, and by 3.4% over a naive MTL baseline, with statistical significance on unseen noise conditions. For SE, Sparse MERIT improves segmental SNR (SSNR) by 28.2% over the SE pre-processing baseline and by 20.0% over the naive MTL baseline. These results demonstrate that Sparse MERIT provides robust and generalizable performance for both emotion recognition and enhancement tasks in noisy environments.
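The frame-wise routing described in the abstract above can be pictured with a small Python sketch in which two task-specific gates choose from one shared pool of experts. This is only an assumed reading: the abstract does not say that selection is top-k, and the scalar frames, lambda experts, gate scores, and function names here are hypothetical stand-ins rather than the Sparse MERIT architecture.

```python
import math


def top_k_gate(scores, k):
    """Return {expert_index: weight} for the k highest scores, softmax-normalized."""
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    peak = max(scores[i] for i in top)  # subtract the max for stability
    exps = {i: math.exp(scores[i] - peak) for i in top}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}


def route_frames(frames, gate_scores, experts, k):
    """Mix the selected experts' outputs for each frame.

    frames:      one scalar feature per frame (a stand-in for a feature vector)
    gate_scores: one list of per-expert scores per frame, from a task's gate
    experts:     shared pool of callables, reused by every task
    """
    outputs = []
    for x, scores in zip(frames, gate_scores):
        weights = top_k_gate(scores, k)
        outputs.append(sum(w * experts[i](x) for i, w in weights.items()))
    return outputs


# Hypothetical shared expert pool and two task-specific sets of gate scores.
experts = [lambda x: x, lambda x: 2.0 * x, lambda x: -x]
frames = [1.0, 3.0]
ser_scores = [[5.0, 0.0, 0.0], [0.0, 0.0, 5.0]]  # emotion-recognition gate
se_scores = [[0.0, 5.0, 0.0], [0.0, 5.0, 0.0]]   # enhancement gate

print("SER features:", route_frames(frames, ser_scores, experts, k=1))
print("SE features:", route_frames(frames, se_scores, experts, k=1))
```

Because both gates index into the same `experts` list, the parameter count grows with the pool size rather than with the number of tasks, which is one way to read the abstract's claim of parameter efficiency.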

2501.17653 2026-04-29 cs.LG cs.CE eess.SP

Drivetrain simulation using variational autoencoders

Pallavi Sharma, Jorge-Humberto Urrea-Quintero, Bogdan Bogdan, Adrian-Dumitru Ciotec, Laura Vasilie, Henning Wessels, Matteo Skull

Comments 27 pages

2501.10842 2026-04-29 eess.SY cs.SY

BOOST: Microgrid Sizing using Ordinal Optimization

Mohamad Chehade, Sami Karaki

2412.02315 2026-04-29 eess.SY cs.SY

Topology Reconstruction of a Resistor Network with Limited Boundary Measurements: An Optimization Approach

Shivanagouda Biradar, Deepak U Patil

2410.06723 2026-04-29 eess.IV cs.CV cs.LG

Evaluating Computational Pathology Foundation Models for Prostate Cancer Grading under Distribution Shifts

Fredrik K. Gustafsson, Mattias Rantalainen

2211.12080 2026-04-29 cs.SD eess.AS

Robust Training for Speaker Verification against Noisy Labels

Zhihua Fang, Liang He, Hanhan Ma, Xiaochen Guo, Lin Li

Comments Accepted by INTERSPEECH 2023

1803.11131 2026-04-29 eess.SP cs.NA math.NA

Novel Fourier Quadrature Transforms and Analytic Signal Representations for Nonlinear and Non-stationary Time Series Analysis

Pushpendra Singh

Comments 25 pages, 13 figures

2604.25527 2026-04-29 eess.SY cs.SY

Multi-layer barrier adaptation of the discrete-time super-twisting controller

Antoine Thibault Vié, Leonid Fridman, Roberto Galeazzi, Dimitrios Papageorgiou

Comments 6 pages, accepted to 18th International Workshop on Variable Structure Systems

2604.25473 2026-04-29 eess.SY cs.SY

Complex-Vector Power and Cross-Phase Unbalance in Three-Phase Systems

Juan Carlos Bravo-Rodríguez, Juan Carlos del-Pino-López, Francisco Casado-Machado

Comments 8 pages, 1 figure, submitted to IEEE Trans. on Power Delivery

2604.25468 2026-04-29 eess.SY cs.SY math.OC math.ST stat.TH

Distributed adaptive estimation for stochastic large regression models

Die Gan, Siyu Xie, Zhixin Liu, Xuebo Zhang

Comments 13 pages, submitted to IEEE TAC

2604.25453 2026-04-29 eess.SP

Polarization-diverse Detection at Microwave Frequencies Using A Passive Metasurface Aperture

Md. Abrar A Mushfik, Mohammad Ali Kaisar, Mohiminul Islam Bhuiyan Sahed, Idban Alamzadeh

Comments 9 pages, Journal (TAP)

2604.25441 2026-04-29 cs.SD cs.CL eess.AS

Praxy Voice: Voice-Prompt Recovery + BUPS for Commercial-Class Indic TTS from a Frozen Non-Indic Base at Zero Commercial-Training-Data Cost

Venkata Pushpak Teja Menta

Comments 9 pages, 6 figures, 6 tables. Companion paper to PSP benchmark. Code: https://github.com/praxelhq/praxy ; Model: https://huggingface.co/Praxel/praxy-voice-r6 ; Demo: https://huggingface.co/spaces/Praxel/praxy-voice-demo

2604.25430 2026-04-29 eess.SY cs.SY eess.SP

A Miniaturized Broadband 1-Bit Coding Reconfigurable Intelligent Surface for NLOS UE Localization and Uplink Communication

Khagendra Joshi, Deepak Kumar Sahoo, Kamalesh Kumar K, Debidas Kundu, Vivek A. Bohara, Amalendu Patnaik