arXivDaily arXiv每日学术速递 周一至周五更新
重置
2603.12202 2026-03-13 eess.SY cs.SY

Technology configurations for decarbonizing residential heat supply through district heating and implications for the electricity network

Christian Doh Dinga, Francesco Lombardi, Roald Arkesteijn, Arjan van Voorden, Sander van Rijn, Laurens James de Vries, Milos Cvetkovic

详情
英文摘要

District heating networks (DHNs) have significant potential to decarbonize residential heating and accelerate the energy transition. However, designing carbon-neutral DHNs requires balancing several objectives, including economic costs, social acceptance, long-term uncertainties, and grid-integration challenges from electrification. By combining modeling-to-generate-alternatives with power flow simulation techniques, we develop a decision-support method for designing carbon-neutral DHNs that are cost-effective, socially acceptable, robust to future risks, and impose minimal impacts on the electricity grid. Applying our method to a Dutch case, we find substantial diversity in how carbon-neutral DHNs can be designed. The flexibility in technology choice, sizing, and location enables accommodating different real-world needs and achieving high electrification levels without increasing grid loading. For instance, intelligently located heat pumps and thermal storage can limit grid stress even when renewable baseload heat sources and green-fuel boilers are scarce. Using our method, planners can explore diverse carbon-neutral DHN designs and identify the design that best balances stakeholders' preferences.

2603.12187 2026-03-13 eess.SY cs.SY

Integrated Online Monitoring and Adaption of Process Model Predictive Controllers

Samuel Mallick, Laura Boca de de Giuli, Alessio La Bella, Azita Dabiri, Bart De Schutter, Riccardo Scattolini

Comments 6 pages, 3 figures, submitted to IEEE L-CSS

详情
英文摘要

This paper addresses the design of an event-triggered, data-based, and performance-oriented adaption method for model predictive control (MPC). The performance of such a strategy strongly depends on the accuracy of the prediction model, which may require online adaption to prevent performance degradation under changing operating conditions. Unlike existing methods that continuously update model and control parameters from data, potentially leading to catastrophic forgetting and unnecessary control modifications, we propose a novel approach based on statistical monitoring of closed-loop performance indicators. This framework enables the detection of performance degradation, and, when required, controller adaption is performed via reinforcement learning and identification techniques. The proposed strategy is validated on a high-fidelity simulation of a district heating system benchmark.

2603.12172 2026-03-13 eess.SP

Simultaneous Multi-Modal Covert Communications: Analysis and Optimization

Justin H. Kong, Terrence J. Moore, Fikadu T. Dagefu

详情
英文摘要

This paper investigates the problem of covert communications in a heterogeneous wireless network where multiple communication modalities are used simultaneously. In this setup, a legitimate transmitter sends confidential data to its receiver by selecting multiple modalities with the goal of maximizing communication covertness against a passive adversary (Willie) while satisfying a transmission rate requirement. We analyze two distinct scenarios for a given observation time by Willie. The two scenarios are: (i) Willie knows the modalities selected by the friendly transmitter, and (ii) Willie is unaware of the selected modalities. We first derive the optimal detector for Willie that minimizes the detection error probability (DEP) in both cases. For the first scenario, we derive an exact expression for the DEP and provide a computationally efficient approximation. For the second scenario, we introduce the DEP expressions in the low-signal-to-noise ratio (SNR) regime at Willie. Building on this analysis, we propose a novel low-complexity modality set selection technique designed to maximize the DEP subject to a rate constraint. Numerical simulations validate the derived analytical expressions and demonstrate that the proposed modality set selection technique achieves near-optimal performance, outperforming benchmark schemes.

2603.12144 2026-03-13 cs.CV cs.RO eess.IV

O3N: Omnidirectional Open-Vocabulary Occupancy Prediction

Mengfei Duan, Hao Shi, Fei Teng, Guoqiang Zhao, Yuheng Zhang, Zhiyong Li, Kailun Yang

Comments The source code will be made publicly available at https://github.com/MengfeiD/O3N

详情
英文摘要

Understanding and reconstructing the 3D world through omnidirectional perception is an inevitable trend in the development of autonomous agents and embodied intelligence. However, existing 3D occupancy prediction methods are constrained by limited perspective inputs and predefined training distribution, making them difficult to apply to embodied agents that require comprehensive and safe perception of scenes in open world exploration. To address this, we present O3N, the first purely visual, end-to-end Omnidirectional Open-vocabulary Occupancy predictioN framework. O3N embeds omnidirectional voxels in a polar-spiral topology via the Polar-spiral Mamba (PsM) module, enabling continuous spatial representation and long-range context modeling across 360°. The Occupancy Cost Aggregation (OCA) module introduces a principled mechanism for unifying geometric and semantic supervision within the voxel space, ensuring consistency between the reconstructed geometry and the underlying semantic structure. Moreover, Natural Modality Alignment (NMA) establishes a gradient-free alignment pathway that harmonizes visual features, voxel embeddings, and text semantics, forming a consistent "pixel-voxel-text" representation triad. Extensive experiments on multiple models demonstrate that our method not only achieves state-of-the-art performance on QuadOcc and Human360Occ benchmarks but also exhibits remarkable cross-scene generalization and semantic scalability, paving the way toward universal 3D world modeling. The source code will be made publicly available at https://github.com/MengfeiD/O3N.

2603.12098 2026-03-13 eess.SY cs.SY math.CO math.OC

Maximum-Entropy Random Walks on Hypergraphs

Anqi Dong, Anzhi Sheng, Xin Mao, Can Chen

详情
英文摘要

Random walks are fundamental tools for analyzing complex networked systems, including social networks, biological systems, and communication infrastructures. While classical random walks focus on pairwise interactions, many real-world systems exhibit higher-order interactions naturally modeled by hypergraphs. Existing random walk models on hypergraphs often focus on undirected structures or do not incorporate entropy-based inference, limiting their ability to capture directional flows, uncertainty, or information diffusion in complex systems. In this article, we develop a maximum-entropy random walk framework on directed hypergraphs with two interaction mechanisms: broadcasting where a pivot node activates multiple receiver nodes and merging where multiple pivot nodes jointly influence a receiver node. We infer a transition kernel via a Kullback--Leibler divergence projection onto constraints enforcing stochasticity and stationarity. The resulting optimality conditions yield a multiplicative scaling form, implemented using Sinkhorn--Schrödinger-type iterations with tensor contractions. We further analyze ergodicity, including projected linear kernels for broadcasting and tensor spectral criteria for polynomial dynamics in merging. The effectiveness of our framework is demonstrated with both synthetic and real-world examples.

2603.12083 2026-03-13 cs.CV cs.RO eess.IV physics.optics

Towards Universal Computational Aberration Correction in Photographic Cameras: A Comprehensive Benchmark Analysis

Xiaolong Qian, Qi Jiang, Yao Gao, Lei Sun, Zhonghua Yi, Kailun Yang, Luc Van Gool, Kaiwei Wang

Comments Accepted to CVPR 2026. Benchmarks, codes, and Zemax files will be available at https://github.com/XiaolongQian/UniCAC

详情
英文摘要

Prevalent Computational Aberration Correction (CAC) methods are typically tailored to specific optical systems, leading to poor generalization and labor-intensive re-training for new lenses. Developing CAC paradigms capable of generalizing across diverse photographic lenses offers a promising solution to these challenges. However, efforts to achieve such cross-lens universality within consumer photography are still in their early stages due to the lack of a comprehensive benchmark that encompasses a sufficiently wide range of optical aberrations. Furthermore, it remains unclear which specific factors influence existing CAC methods and how these factors affect their performance. In this paper, we present comprehensive experiments and evaluations involving 24 image restoration and CAC algorithms, utilizing our newly proposed UniCAC, a large-scale benchmark for photographic cameras constructed via automatic optical design. The Optical Degradation Evaluator (ODE) is introduced as a novel framework to objectively assess the difficulty of CAC tasks, offering credible quantification of optical aberrations and enabling reliable evaluation. Drawing on our comparative analysis, we identify three key factors -- prior utilization, network architecture, and training strategy -- that most significantly influence CAC performance, and further investigate their respective effects. We believe that our benchmark, dataset, and observations contribute foundational insights to related areas and lay the groundwork for future investigations. Benchmarks, codes, and Zemax files will be available at https://github.com/XiaolongQian/UniCAC.

2603.12075 2026-03-13 cs.RO cs.SY eess.SY

Decentralized Cooperative Localization for Multi-Robot Systems with Asynchronous Sensor Fusion

Nivand Khosravi, Niusha Khosravi, Mohammad Bozorg, Masoud S. Bahraini

Comments Presented at the 13th RSI International Conference on Robotics and Mechatronics (ICRoM 2025)

详情
英文摘要

Decentralized cooperative localization (DCL) is a promising approach for nonholonomic mobile robots operating in GPS-denied environments with limited communication infrastructure. This paper presents a DCL framework in which each robot performs localization locally using an Extended Kalman Filter, while sharing measurement information during update stages only when communication links are available and companion robots are successfully detected by LiDAR. The framework preserves cross-correlation consistency among robot state estimates while handling asynchronous sensor data with heterogeneous sampling rates and accommodating accelerations during dynamic maneuvers. Unlike methods that require pre-aligned coordinate systems, the proposed approach allows robots to initialize with arbitrary reference-frame orientations and achieves automatic alignment through transformation matrices in both the prediction and update stages. To improve robustness in feature-sparse environments, we introduce a dual-landmark evaluation framework that exploits both static environmental features and mobile robots as dynamic landmarks. The proposed framework enables reliable detection and feature extraction during sharp turns, while prediction accuracy is improved through information sharing from mutual observations. Experimental results in both Gazebo simulation and real-world basement environments show that DCL outperforms centralized cooperative localization (CCL), achieving a 34% reduction in RMSE, while the dual-landmark variant yields an improvement of 56%. These results demonstrate the applicability of DCL to challenging domains such as enclosed spaces, underwater environments, and feature-sparse terrains where conventional localization methods are ineffective.

2603.12069 2026-03-13 cs.DB cs.SY eess.SY

Numerical benchmark for damage identification in Structural Health Monitoring

Francesca Marafini, Giacomo Zini, Alberto Barontini, Nuno Mendes, Alice Cicirello, Michele Betti, Gianni Bartoli

Comments Submitted for peer review to Data Centric Engineering, Cambridge University Press

详情
英文摘要

The availability of a dataset for validation and verification purposes of novel data-driven strategies and/or hybrid physics-data approaches is currently one of the most pressing challenges in the engineering field. Data ownership, security, access and metadata handiness are currently hindering advances across many fields, particularly in Structural Health Monitoring (SHM) applications. This paper presents a simulated SHM dataset, comprised of dynamic and static measurements (i.e., acceleration and displacement), and includes the conceptual framework designed to generate it. The simulated measurements were generated to incorporate the effects of Environmental and Operational Variations (EOVs), different types of damage, measurement noise and sensor faults and malfunctions, in order to account for scenarios that may occur during real acquisitions. A fixed-fixed steel beam structure was chosen as reference for the numerical benchmark. The simulated monitoring was operated under the assumptions of a Single Degree of Freedom (SDOF) for generating acceleration records and of the Euler-Bernoulli beam for the simulated displacement measurements. The generation process involved the use of parallel computation, which is detailed within the provided open-source code. The generated data is also available open-source, thus ensuring reproducibility, repeatability and accessibility for further research. The comprehensive description of data types, formats, and collection methodologies makes this dataset a valuable resource for researchers aiming to develop or refine SHM techniques, fostering advancements in the field through accessible, high-quality synthetic data.

2603.12059 2026-03-13 cs.RO cs.SY eess.SY

Flight through Narrow Gaps with Morphing-Wing Drones

Julius Wanner, Hoang-Vu Phan, Charbel Toumieh, Dario Floreano

详情
英文摘要

The size of a narrow gap traversable by a fixed-wing drone is limited by its wingspan. Inspired by birds, here, we enable the traversal of a gap of sub-wingspan width and height using a morphing-wing drone capable of temporarily sweeping in its wings mid-flight. This maneuver poses control challenges due to sudden lift loss during gap-passage at low flight speeds and the need for precisely timed wing-sweep actuation ahead of the gap. To address these challenges, we first develop an aerodynamic model for general wing-sweep morphing drone flight including low flight speeds and post-stall angles of attack. We integrate longitudinal drone dynamics into an optimal reference trajectory generation and Nonlinear Model Predictive Control framework with runtime adaptive costs and constraints. Validated on a 130 g wing-sweep-morphing drone, our method achieves an average altitude error of 5 cm during narrow-gap passage at forward speeds between 5 and 7 m/s, whilst enforcing fully swept wings near the gap across variable threshold distances. Trajectory analysis shows that the drone can compensate for lift loss during gap-passage by accelerating and pitching upwards ahead of the gap to an extent that differs between reference trajectory optimization objectives. We show that our strategy also allows for accurate gap passage on hardware whilst maintaining a constant forward flight speed reference and near-constant altitude.

2603.12027 2026-03-13 eess.SP

A Joint JSCC-Resource Allocation Framework for QoS-Aware Semantic Communication in LEO Satellite-based EO Missions

Hung Nguyen-Kha, Ti Ti Nguyen, Vu Nguyen Ha, Eva Lagunas, Symeon Chatzinotas, Bjorn Ottersten

Comments Accepted for publishing in the proceeding IEEE ICC 2026

详情
英文摘要

In Earth observation (EO) missions with Low Earth orbit (LEO) satellites, high-resolution image acquisition generates a massive data volume that poses a significant challenge for transmission under the limited satellite power budget, while LEO movement introduces dynamic systems. To enable efficient image transmission, this paper employs semantic communication (SemCom) with joint source-channel coding (JSCC), which focuses on transmitting meaningful information to reduce power consumption. Under a quality-of-service (QoS) requirement defined by image reconstruction quality, this work aims to minimize the total transmit power by jointly optimizing the JSCC encoder-decoder parameters and resource allocation. However, the implicit relationship among JSCC parameters, link quality, and image quality, coupled with the presence of mixed integer-continuous variables, makes the problem difficult to solve directly. To address this, a curve-fitting model is proposed to approximate the JSCC compression-SNR-quality relationship. Then, the joint compression ratio-resource allocation (JCRRA) algorithm is proposed to address the underlying problem. Numerical results demonstrate that the proposed method achieves substantial power savings compared to both greedy algorithms and conventional transmission paradigms.

2603.12014 2026-03-13 eess.SP

Array Geometry-Centric Axial Sidelobe Interference Analysis for Near-Field Multi-User MIMO

Ahmed Hussain, Asmaa Abdallah, Abdulkadir Celik, Ahmed M. Eltawil

详情
英文摘要

With the deployment of large antenna arrays at high-frequency bands, future wireless communication systems are likely to operate in the radiative near-field (NF). Unlike far-field beam steering, NF beams can be focused on a spatial region with finite depth, enabling user multiplexing in both range and angle. In NF multiuser multiple-input multiple-output (MU-MIMO) systems, achievable rates are limited by interference arising from sidelobes in both the axial (range) and lateral (angle) dimensions. This work investigates how axial sidelobes (ASLs) vary with array geometry. Closed-form array gain expressions are derived to characterize ASLs for uniform planar arrays. Analytical results show that the uniform square array (USA) yields the lowest ASLs, followed by the uniform concentric circular array (UCCA), uniform linear array (ULA), and uniform circular array (UCA). Specifically, the USA achieves a peak sidelobe level (PSLL) of -17.6 dB versus -7.9 dB for the UCA. Numerical simulations confirm that the USA provides superior sidelobe suppression and highest sumrate performance.

2603.11982 2026-03-13 quant-ph cs.SY eess.SY

Approximate Reduced Lindblad Dynamics via Algebraic and Adiabatic Methods

Tommaso Grigoletto, Alain Sarlette, Francesco Ticozzi, Lorenza Viola

详情
英文摘要

We present an algebraic framework for approximate model reduction of Markovian open quantum dynamics that guarantees complete positivity and trace preservation by construction. First, we show that projecting a Lindblad generator on its center manifold -- the space spanned by eigenoperators with purely imaginary eigenvalue -- yields an asymptotically exact reduced quantum dynamical semigroup whose dynamics is unitary, with exponentially decaying transient error controlled by the generator's spectral gap. Second, for analytic perturbations of a Lindblad generator with a tractable center manifold, we propose a perturbative reduction that keeps the reduced space fixed at the unperturbed center manifold. The resulting generator is shown to remain a valid Lindbladian for arbitrary perturbation strengths, and explicit finite-time error bounds, that quantify leakage from the unperturbed center sector, are provided. We further clarify the connection to adiabatic elimination methods, by both showing how the algebraic reduction can be directly related to a first-order adiabatic-elimination and by providing sufficient conditions under which the latter method can be applied while preserving complete positivity. We showcase the usefulness of our techniques in dissipative many-body quantum systems exhibiting non-stationary long-time dynamics.

2603.11978 2026-03-13 eess.SY cs.SY

Robust Parametric Microgrid Dispatch Under Endogenous Uncertainty of Operation- and Temperature-Dependent Battery Degradation

Rui Xie, Jun Wang, Jiaxu Duan, Chao Ma, Yunhui Liu, Yue Chen

Comments 8 pages, 4 figures

详情
英文摘要

Batteries play a critical role in microgrid energy management by ensuring power balance, enhancing renewable utilization, and reducing operational costs. However, battery degradation poses a significant challenge, particularly under extreme temperatures. This paper investigates the optimal trade-off between battery degradation and operational costs in microgrid dispatch to find a robust cost-effective strategy from a full life-cycle perspective. A key challenge arises from the endogenous uncertainty (or decision-dependent uncertainty, DDU) of battery degradation: Dispatch decisions influence the probability distribution of battery degradation, while in turn degradation changes battery operation model and thus affects dispatch. In this paper, we first develop an XGBoost-based probabilistic degradation model trained on experimental data across varying temperature conditions. We then formulate a parametric model predictive control (MPC) framework for microgrid dispatch, where the weight parameters of the battery degradation penalty terms are tuned through long-term simulation of degradation and dispatch interactions. Case studies validate the effectiveness of the proposed approach.

2603.11959 2026-03-13 eess.SP

Near-Field Multiuser Beam Training for XL-MIMO: An End-to-End Interference-Aware Approach with Pilot Limitations

Xinyang Li, Songjie Yang, Xiang Ling, Jianhui Song, Yibo Wang, Hua Chen

Comments 5 pages, 5 figures, submitted to IEEE Wireless Communications Letters

详情
英文摘要

Near-field propagation in extremely large-scale MIMO (XL-MIMO) enlarges the beam training (BT) search space by introducing an additional range dimension, which makes conventional codebook-based beam sweeping prohibitively expensive under limited pilot resources, especially for multiuser sub-connected hybrid architectures. This letter proposes a deep-learning-based interference-aware multiuser BT framework (DL-IABT) that directly predicts analog beam indices from a small number of uplink sensing measurements. By exploiting a subarray-level approximation, a far-field codebook is adopted to represent each subarray response with negligible mismatch. To enable end-to-end (E2E) learning, we derive a variant-MSE surrogate loss by eliminating the digital precoder through a closed-form MMSE solution from KKT conditions, which implicitly accounts for multiuser interference (MUI). The proposed network integrates a complex-valued sensing front-end, a shared complex-valued encoder, a Transformer-based multiuser predictor, and a scalable Gumbel--Softmax beam selection head. Simulation results show that DL-IABT achieves near-optimal sum-rate performance while providing markedly higher effective throughput under pilot overhead constraints.

2603.11947 2026-03-13 cs.SD cs.CL cs.MM eess.AS

Resurfacing Paralinguistic Awareness in Large Audio Language Models

Hao Yang, Minghan Wang, Tongtong Wu, Lizhen Qu, Ehsan Shareghi, Gholamreza Haffari

Comments Submitted to Interspeech 2026

详情
英文摘要

Large Audio Language Models (LALMs) have expanded the interaction with human to speech modality, which introduces great interactive potential, due to the paralinguistic cues implicitly indicating the user context. However, building on the current content-centred paradigm, LALMs usually neglect such paralinguistic cues and respond solely based on query content. In this work, to resurface the paralinguistic awareness in LALMs, we introduce five diverse layer-wise analyses to jointly identify paralinguistic layers and semantic understanding layers. Based on these insights, we propose a paralinguistic-enhanced fine-tuning (PE-FT) protocol accordingly to equip LALMs with paralinguistic-aware capabilities, including (1) selective-layer fine-tuning, and (2) an auxiliary dual-level classification head. Our experiments demonstrate that PE-FT protocol efficiently and effectively resurfaces the paralinguistic awareness, even surpassing the performance of the all-layer fine-tuning strategy.

2603.11943 2026-03-13 eess.SY cs.SY

Emergency-Aware and Frequency-Constrained HVDC Planning for A Multi-Area Asynchronously Interconnected Grid

Yiliu He, Haiwang Zhong, Grant Ruan, Yan Xu, Chongqing Kang

详情
英文摘要

High-voltage direct current (HVDC) technology has played a crucial role for long-distance transmission of renewable power generation. However, the integration of large-capacity HVDC lines introduces significant frequency security challenges during HVDC fault emergencies. This paper proposes an emergency-aware and frequency-constrained HVDC planning method to optimize the capacity of inter-area HVDC tie-lines in a multi-area asynchronously interconnected grid. Firstly, a coordinated emergency frequency control scheme is proposed to allocate the emergency control resources during HVDC faults. Then, an enhanced system frequency response model integrating event-driven emergency frequency control is developed and a weighted oblique decision tree approach is employed to extract frequency nadir security constraints. The proposed planning model considers all potential HVDC fault emergencies while treating candidate HVDC capacities as decision variables. Simulation results demonstrate superior performance in balancing economic efficiency with frequency security requirements, providing a practical solution for inter-area HVDC planning.

2603.10970 2026-03-13 cs.ET cs.AR cs.DC cs.SY eess.SY

Reference Architecture of a Quantum-Centric Supercomputer

Seetharami Seelam, Jerry M. Chow, Antonio Córcoles, Sarah Sheldon, Tushar Mittal, Abhinav Kandala, Sean Dague, Ian Hincks, Hiroshi Horii, Blake Johnson, Michael Le, Hani Jamjoom, Jay M. Gambetta

Comments 20 pages, 5 figures, minor fixes

详情
英文摘要

Quantum computers have demonstrated utility in simulating quantum systems beyond brute-force classical approaches. As the community builds on these demonstrations to explore using quantum computing for applied research, algorithms and workflows have emerged that require leveraging both quantum computers and classical high-performance computing (HPC) systems to scale applications, especially in chemistry and materials, beyond what either system can simulate alone. Today, these disparate systems operate in isolation, forcing users to manually orchestrate workloads, coordinate job scheduling, and transfer data between systems -- a cumbersome process that hinders productivity and severely limits rapid algorithmic exploration. These challenges motivate the need for flexible and high-performance Quantum-Centric Supercomputing (QCSC) systems that integrate Quantum Processing Units (QPUs), Graphics Processing Units (GPUs), and Central Processing Units (CPUs) to accelerate discovery of such algorithms across applications. These systems will be co-designed across quantum and classical HPC infrastructure, middleware, and application layers to accelerate the adoption of quantum computing for solving critical computational problems. We envision QCSC evolution through three distinct phases: (1) quantum systems as specialized compute offload engines within existing HPC complexes; (2) heterogeneous quantum and classical HPC systems coupled through advanced middleware, enabling seamless execution of hybrid quantum-classical algorithms; and (3) fully co-designed heterogeneous quantum-HPC systems for hybrid computational workflows. This article presents a reference architecture and roadmap for these QCSC systems.

2603.09600 2026-03-13 q-bio.NC cs.AI cs.NE cs.SY eess.SY physics.bio-ph

A Variational Latent Equilibrium for Learning in Neuronal Circuits

Simon Brandt, Paul Haider, Walter Senn, Federico Benitez, Mihai A. Petrovici

详情
英文摘要

Brains remain unrivaled in their ability to recognize and generate complex spatiotemporal patterns. While AI is able to reproduce some of these capabilities, deep learning algorithms remain largely at odds with our current understanding of brain circuitry and dynamics. This is prominently the case for backpropagation through time (BPTT), the go-to algorithm for learning complex temporal dependencies. In this work we propose a general formalism to approximate BPTT in a controlled, biologically plausible manner. Our approach builds on, unifies and extends several previous approaches to local, time-continuous, phase-free spatiotemporal credit assignment based on principles of energy conservation and extremal action. Our starting point is a prospective energy function of neuronal states, from which we calculate real-time error dynamics for time-continuous neuronal networks. In the general case, this provides a simple and straightforward derivation of the adjoint method result for neuronal networks, the time-continuous equivalent to BPTT. With a few modifications, we can turn this into a fully local (in space and time) set of equations for neuron and synapse dynamics. Our theory provides a rigorous framework for spatiotemporal deep learning in the brain, while simultaneously suggesting a blueprint for physical circuits capable of carrying out these computations. These results reframe and extend the recently proposed Generalized Latent Equilibrium (GLE) model.

2601.13799 2026-03-13 eess.SY cs.SY

Linear viscoelastic rheological FrBD models

Luigi Romano, Ole Morten Aamo, Jan Åslund, Erik Frisk

Comments 6 pages, 3 figures. Under review at IEEE LCSS

详情
英文摘要

In [1], a new modeling paradigm for developing rate-and-state-dependent, control-oriented friction models was introduced. The framework, termed Friction with Bristle Dynamics (FrBD), combines nonlinear analytical expressions for the friction coefficient with constitutive equations for bristle-like elements. Within the FrBD framework, this letter introduces two novel formulations based on the two most general linear viscoelastic models for solids: the Generalized Maxwell (GM) and Generalized Kelvin-Voigt (GKV) elements. Both are analyzed in terms of boundedness and passivity, revealing that these properties are satisfied for any physically meaningful parametrization. An application of passivity for control design is also illustrated, considering an example from robotics. The findings of this letter systematically integrate rate-and-state dynamic friction models with linear viscoelasticity.

2509.15423 2026-03-13 cs.RO cs.SY eess.SY

Online Slip Detection and Friction Coefficient Estimation for Autonomous Racing

Christopher Oeltjen, Carson Sobolewski, Saleh Faghfoorian, Lorant Domokos, Giancarlo Vidal, Sriram Yerramsetty, Ivan Ruchkin

Comments Equal contribution by the first three authors

详情
英文摘要

Accurate knowledge of the tire-road friction coefficient (TRFC) is essential for vehicle safety, stability, and performance, especially in autonomous racing, where vehicles often operate at the friction limit. However, TRFC cannot be directly measured with standard sensors, and existing estimation methods either depend on vehicle or tire models with uncertain parameters or require large training datasets. In this paper, we present a lightweight approach for online slip detection and TRFC estimation. Our approach relies solely on IMU and LiDAR measurements and the control actions, without special dynamical or tire models, parameter identification, or training data. Slip events are detected in real time by comparing commanded and measured motions, and the TRFC is then estimated directly from observed accelerations under no-slip conditions. Experiments with a 1:10-scale autonomous racing car across different friction levels demonstrate that the proposed approach achieves accurate and consistent slip detections and friction coefficients, with results closely matching ground-truth measurements. These findings highlight the potential of our simple, deployable, and computationally efficient approach for real-time slip monitoring and friction coefficient estimation in autonomous driving.

2509.14065 2026-03-13 eess.SY cs.SY math.OC

Identifying Network Structure of Linear Dynamical Systems: Observability and Edge Misclassification

Jaidev Gill, Jing Shuang Li

Comments To appear 2026 ACC

详情
英文摘要

This work studies the limitations of uniquely identifying the structure (i.e., topology) of a networked linear system from partial measurements of its nodal dynamics. In general, many networks can be consistent with these measurements; this is a consideration often neglected by standard network inference methods. We show that the space of these networks are related through the nullspace of the observability matrix for the true network. We establish relevant metrics to investigate this space, including an analytic characterization of the most structurally dissimilar network that can be inferred, as well as the possibility of mis-inferring presence or absence of edges. In simulations, we find that when observing over 6\% of nodes in random network models (e.g., Erd\H os-R\' enyi and Watts-Strogatz), approximately 99\% of edges are correctly classified. Extending this discussion, we construct a family of networks that keep measurements $ε$-close to each other, and connect the identifiability of these networks to the spectral properties of an augmented observability Gramian.

2509.13505 2026-03-13 eess.SY cs.SY math.OC

Identifying Network Structure of Nonlinear Dynamical Systems: Contraction and Kuramoto Oscillators

Jaidev Gill, Jing Shuang Li

Comments To appear 2026 ACC

详情
英文摘要

In this work, we study the identifiability of network structures (i.e., topologies) for networked nonlinear systems when partial measurements of the nodal dynamics are taken. We explore scenarios where different candidate structures can yield similar measurements, thus limiting identifiability. To do so, we apply the contraction theory framework to facilitate comparisons between different networks. We show that semicontraction in the observable space is a sufficient condition for two systems to become indistinguishable from one another based on partial measurements. We apply this framework to study networks of Kuramoto oscillators, and discuss scenarios in which different network structures (both connected and disconnected) become indistinguishable.

2603.11918 2026-03-13 eess.SP

Indirect and Direct Multiuser Hybrid Beamforming for Far-Field and Near-Field Communications: A Deep Learning Approach

Xinyang Li, Songjie Yang, Boyu Ning, Zongmiao He, Xiang Ling, Chau Yuen

Comments 16 pages, 14 figures, submitted to IEEE Transactions on Wireless Communications

详情
英文摘要

Hybrid beamforming for extremely large-scale multiple-input multiple-output (XL-MIMO) systems is challenging in the near field because the channel depends jointly on angle and distance, and the multiuser interference (MUI) is strong. Existing deep learning methods typically follow either a decoupled design that optimizes analog beamforming without explicitly accounting for MUI, or an end-to-end (E2E) joint analog-digital optimization that can be unstable under nonconvex constant-modulus (CM), pronounced analog-digital coupling, and gradient pattern of sum-rate loss. To address both issues, we develop a complex-valued E2E framework based on a variant minimum mean square error (variant-MMSE) criterion, where the digital precoder is eliminated in closed form via Karush-Kuhn-Tucker (KKT) conditions so that analog learning is trained with a stable objective. The network employs a grouped complex-convolution sensing front-end for uplink (UL) measurements, a shared complex multi-layer perceptron (MLP) for per-user feature extraction, and a merged constant-modulus head to output the analog precoder. In the indirect mode, the network designs hybrid beamformers from estimated channel state information (CSI). In the direct mode where explicit CSI is unavailable, the network learns the sensing operator and the analog mapping from short pilots, after which additional pilots estimate the equivalent channel and enable a KKT closed-form digital precoder. Simulations show that the indirect mode approaches the performance of iterative variant-MMSE optimization with a complexity reduction proportional to the antenna number. In the direct mode, the proposed method improves spectral efficiency over sparse-recovery pipelines and recent deep learning baselines under the same pilot budget.

2603.11905 2026-03-13 eess.SY cs.SY

Risk-Based Dynamic Thermal Rating in Distribution Transformers via Probabilistic Forecasting

Scott Angus, Jethro Browell, David Greenwood, Matthew Deakin

Comments Submitted to 24th Power Systems Computation Conference (PSCC 2026). 8 pages, 8 figures

详情
英文摘要

Low voltage (LV) distribution transformers face accelerating demand growth while replacement lead times and costs continue to rise, making improved utilisation of existing assets essential. Static and conservative protection devices (PDs) in distribution transformers are inflexible and limit the available headroom of the transformer. This paper presents a probabilistic framework for dynamically forecasting optimal thermal protection settings. The proposed approach directly predicts the day-ahead scale factor which maximises the dynamic thermal rating of the transformer from historical load, temperature, and metadata using clustered quantile regression models trained on 644 UK LV transformers. Probabilistic forecasting quantifies overheating risk directly through the prediction percentile, enabling risk-informed operational decisions. Results show a 10--12\% additional capacity gain compared to static settings, with hotspot temperature risk matching the selected percentile, including under realistic temperature forecast errors. These results demonstrate a practical approach for distribution network operators to take advantage of PDs with adaptive settings to maximise capacity and manage risk on operational time scales.

2603.11886 2026-03-13 eess.SP

Beyond the Limits of Rigid Arrays: Flexible Intelligent Metasurfaces for Next-Generation Wireless Networks

Ahmed Magbool, Vaibhav Kumar, Marco Di Renzo, Mark F. Flanagan

Comments Submitted to IEEE for possible publication

详情
英文摘要

Following recent advances in flexible electronics and programmable metasurfaces, flexible intelligent metasurfaces (FIMs) have emerged as a promising enabling technology for next-generation wireless networks. A FIM is a morphable electromagnetic surface capable of dynamically adjusting its physical geometry to influence the radiation and propagation of electromagnetic waves. Unlike conventional rigid arrays, FIMs introduce an additional spatial degree of design freedom enabled by mechanical flexibility, which can enhance beamforming, spatial focusing, and adaptation to dynamic wireless environments. This added capability enables wireless systems to shape the propagation environment not only through electromagnetic tuning but also through controllable geometric reconfiguration. This article explores the potential of FIMs for next-generation wireless networks. We first introduce the main hardware architectures of FIMs and explain how they can be integrated into wireless communication systems. We then present representative application scenarios, highlighting the advantages of FIMs for future wireless networks and comparing them with other emerging flexible wireless technologies. To illustrate their potential impact, we present case studies comparing FIM-enabled architectures with conventional rigid-array systems, demonstrating the performance gains enabled by surface flexibility for both communication and sensing applications. Finally, we discuss key opportunities, practical challenges, and open research directions that must be addressed to fully realize the potential of FIM technology in future wireless communication systems.

2603.11877 2026-03-13 eess.AS

Silent Speech Interfaces in the Era of Large Language Models: A Comprehensive Taxonomy and Systematic Review

Kele Xu, Yifan Wang, Ming Feng, Qisheng Xu, Wuyang Chen, Yutao Dou, Cheng Yang, Huaimin Wang

Comments 20 pages, 4 figures

详情
英文摘要

Human-computer interaction has traditionally relied on the acoustic channel, a dependency that introduces systemic vulnerabilities to environmental noise, privacy constraints, and physiological speech impairments. Silent Speech Interfaces (SSIs) emerge as a transformative paradigm that bypasses the acoustic stage by decoding linguistic intent directly from the neuro-muscular-articulatory continuum. This review provides a high-level synthesis of the SSI landscape, transitioning from traditional transducer-centric analysis to a holistic intent-to-execution taxonomy. We systematically evaluate sensing modalities across four critical physiological interception points: neural oscillations, neuromuscular activation, articulatory kinematics (ultrasound/magnetometry), and pervasive active probing via acoustic or radio-frequency sensing. Critically, we analyze the current paradigm shift from heuristic signal processing to Latent Semantic Alignment. In this new era, Large Language Models (LLMs) and deep generative architectures serve as high-level linguistic priors to resolve the ``informational sparsity'' and non-stationarity of biosignals. By mapping fragmented physiological gestures into structured semantic latent spaces, modern SSI frameworks have, for the first time, approached the Word Error Rate usability threshold required for real-world deployment. We further examine the transition of SSIs from bulky laboratory instrumentation to ``invisible interfaces'' integrated into commodity-grade wearables, such as earables and smart glasses. Finally, we outline a strategic roadmap addressing the ``user-dependency paradox'' through self-supervised foundation models and define the ethical boundaries of ``neuro-security'' to protect cognitive liberty in an increasingly interfaced world.

2603.11876 2026-03-13 cs.CR cs.MM eess.IV

On the Possible Detectability of Image-in-Image Steganography

Antoine Mallet, Patrick Bas

详情
英文摘要

This paper investigates the detectability of popular imagein-image steganography schemes [1, 2, 3, 4, 5]. In this paradigm, the payload is usually an image of the same size as the Cover image, leading to very high embedding rates. We first show that the embedding yields a mixing process that is easily identifiable by independent component analysis. We then propose a simple, interpretable steganalysis method based on the first four moments of the independent components estimated from the wavelet decomposition of the images, which are used to distinguish between the distributions of Cover and Stego components. Experimental results demonstrate the efficiency of the proposed method, with eight-dimensional input vectors attaining up to 84.6% accuracy. This vulnerability analysis is supported by two other facts: the use of keyless extraction networks and the high detectability w.r.t. classical steganalysis methods, such as the SRM combined with support vector machines, which attains over 99% accuracy.

2603.11847 2026-03-13 eess.AS

Reconstruction of the Vocal Tract from Speech via Phonetic Representations Using MRI Data

Sofiane Azzouz, Pierre-André Vuissoz, Yves Laprie

详情
英文摘要

Articulatory acoustic inversion aims to reconstruct the complete geometry of the vocal tract from the speech signal. In this paper, we present a comparative study of several levels of phonetic segmentation accuracy, together with a comparison to the baseline introduced in our previous work, which is based on Mel-Frequency Cepstral Coefficients (MFCCs). All the approaches considered are based on a denoised speech signal and aim to investigate the impact of incorporating phonetic information through three successive levels: an uncorrected automatic transcription, a temporally aligned phonetic segmentation, and an expert manual correction following alignment. The models are trained to predict articulatory contours extracted from vocal tract MRI images using an automatic contour tracking method. The results show that, among the models relying on phonetic representations, manual correction after alignment yields the best performance, approaching that of the baseline.

2603.11845 2026-03-13 eess.AS

Acoustic-to-Articulatory Inversion of Clean Speech Using an MRI-Trained Model

Sofiane Azzouz, Pierre-André Vuissoz, Yves Laprie

详情
英文摘要

Articulatory acoustic inversion reconstructs vocal tract shapes from speech. Real-time magnetic resonance imaging (rt-MRI) allows simultaneous acquisition of both the acoustic speech signal and articulatory information. Besides the complexity of rt-MRI acquisition, the recorded audio is heavily corrupted by scanner noise and requires denoising to be usable. For practical use, it must be possible to invert speech recorded without MRI noise. In this study, we investigate the use of speech recorded in a clean acoustic environment as an alternative to denoised MRI speech. To this end we compare two signals from the same speaker with identical sentences which are aligned using phonetic segmentation. A model trained on denoised MRI speech is evaluated on both denoised MRI and clean speech. We also assess a model trained and tested only on clean speech. Results show that clean speech supports articulatory inversion effectively, achieving an RMSE of 1.56 mm, close to MRI-based performance.

2603.11841 2026-03-13 eess.AS

ReDimNet2: Scaling Speaker Verification via Time-Pooled Dimension Reshaping

Ivan Yakovlev, Anton Okhotnikov

Comments Submitted to Interspeech 2026

详情
英文摘要

We present ReDimNet2, an improved neural network architecture for extracting utterance-level speaker representations that builds upon the ReDimNet dimension-reshaping framework. The key modification in ReDimNet2 is the introduction of pooling over the time dimension within the 1D processing pathway. This operation preserves the nature of the 1D feature space, since 1D features remain a reshaped version of 2D features regardless of temporal resolution, while enabling significantly more aggressive scaling of the channel dimension without proportional compute increase. We introduce a family of seven model configurations (B0-B6) ranging from 1.1M to 12.3M parameters and 0.33 to 13 GMACS. Experimental results on VoxCeleb1 benchmarks demonstrate that ReDimNet2 improves the Pareto front of computational cost versus accuracy at every scale point compared to ReDimNet, achieving 0.287% EER on Vox1-O with 12.3M parameters and 13 GMACS.