arXivDaily arXiv每日学术速递 周一至周五更新
重置
EESS电气与系统 133
2604.01211 2026-04-02 eess.SY cs.SY

Making Every Bit Count for $A$-Optimal State Estimation

Cameron Khanpour, Daniel Turizo, Samuel Talkington

详情
英文摘要

We study the problem of controlling how a limited communication bandwidth budget is allocated across heterogeneously quantized sensor measurements. The performance criterion is the trace of the error covariance matrix of the linear minimum mean square error (LMMSE) state estimator, i.e., an $A$-optimal design criterion. Minimizing this criterion with a bit budget constraint yields a nonconvex optimization problem. We derive a formula that reduces each evaluation of the gradient to a single Cholesky factorization. This enables efficient optimization by both a projection-free Frank-Wolfe method (with a computable convergence certificate) and an interior point method with L-BFGS Hessian approximation over the problem's continuous relaxation. A largest remainder rounding procedure recovers integer bit allocations with a bound on the quality of the rounded solution. Numerical experiments in IEEE power grid test cases with up to 300 buses compare both solvers and demonstrate that the analytic gradient is the key computational enabler for both methods. Additionally, the heterogeneous bit allocation is compared to standard uniform bit allocation on the 500 bus IEEE power grid test case.

2604.01188 2026-04-02 eess.SY cs.SY

Learning Neural Network Controllers with Certified Robust Performance via Adversarial Training

Neelay Junnarkar, Yasin Sonmez, Murat Arcak

详情
英文摘要

Neural network (NN) controllers achieve strong empirical performance on nonlinear dynamical systems, yet deploying them in safety-critical settings requires robustness to disturbances and uncertainty. We present a method for jointly synthesizing NN controllers and dissipativity certificates that formally guarantee robust closed-loop performance using adversarial training, in which we use counterexamples to the robust dissipativity condition to guide training. Verification is done post-training using alpha,beta-CROWN, a branch-and-bound-based method that enables direct analysis of the nonlinear dynamical system. The proposed method uses quadratic constraints (QCs) only for characterization of non-parametric uncertainties. The method is tested in numerical experiments on maximizing the volume of the set on which a system is certified to be robustly dissipative. Our method certifies regions up to 78 times larger than the region certified by a linear matrix inequality-based approach that we derive for comparison.

2512.09344 2026-04-02 eess.SP physics.optics

389.3-Tb/s 1017-km C-band Transmission over Field-Installed 12-Coupled-Core Fiber Cable with >12-Tb/s Spatial MIMO Channels

Akira Kawai, Kohki Shibahara, Megumi Hoshi, Masanori Nakamura, Takayuki Kobayashi, Ryota Imada, Takayoshi Mori, Taiji Sakamoto, Yusuke Yamada, Kazuhide Nakajima, Munehiko Nagatani, Hitoshi Wakita, Yuta Shiratori, Hiroshi Yamazaki, Hiroyuki Takahashi, Soichi Endo, Takemi Hasegawa, Ryo Nagase, Yutaka Miyamoto

Comments Published in 50th European Conference on Optical Communication (ECOC2024), Postdeadline paper Th3B.1

详情
英文摘要

We demonstrate 4.65-THz WDM/SDM transmission of 140-Gbaud PS-QAM signals over field-installed 12-coupled-core fiber cable with standard cladding diameter, achieving a record 0.455 Pb/s coupled-core capacity in a field environment. We also demonstrate 0.389 Pb/s over-1000-km transmission of spatial MIMO channels with >12 Tb/s/wavelength net bitrate.

2604.01173 2026-04-02 eess.SY cs.LG cs.SY math.OC

Safe learning-based control via function-based uncertainty quantification

Abdullah Tokmak, Toni Karvonen, Thomas B. Schön, Dominik Baumann

Comments Under review for CDC 2026

详情
英文摘要

Uncertainty quantification is essential when deploying learning-based control methods in safety-critical systems. This is commonly realized by constructing uncertainty tubes that enclose the unknown function of interest, e.g., the reward and constraint functions or the underlying dynamics model, with high probability. However, existing approaches for uncertainty quantification typically rely on restrictive assumptions on the unknown function, such as known bounds on functional norms or Lipschitz constants, and struggle with discontinuities. In this paper, we model the unknown function as a random function from which independent and identically distributed realizations can be generated, and construct uncertainty tubes via the scenario approach that hold with high probability and rely solely on the sampled realizations. We integrate these uncertainty tubes into a safe Bayesian optimization algorithm, which we then use to safely tune control parameters on a real Furuta pendulum.

2604.01167 2026-04-02 eess.IV cs.AI cs.CV

AdaLoRA-QAT: Adaptive Low-Rank and Quantization-Aware Segmentation

Prantik Deb, Srimanth Dhondy, N. Ramakrishna, Anu Kapoor, Raju S. Bapi, Tapabrata Chakraborti

Comments Accepted to ISBI 2026(Oral Presentation)

详情
英文摘要

Chest X-ray (CXR) segmentation is an important step in computer-aided diagnosis, yet deploying large foundation models in clinical settings remains challenging due to computational constraints. We propose AdaLoRA-QAT, a two-stage fine-tuning framework that combines adaptive low-rank encoder adaptation with full quantization-aware training. Adaptive rank allocation improves parameter efficiency, while selective mixed-precision INT8 quantization preserves structural fidelity crucial for clinical reliability. Evaluated across large-scale CXR datasets, AdaLoRA-QAT achieves 95.6% Dice, matching full-precision SAM decoder fine-tuning while reducing trainable parameters by 16.6\times and yielding 2.24\times model compression. A Wilcoxon signed-rank test confirms that quantization does not significantly degrade segmentation accuracy. These results demonstrate that AdaLoRA-QAT effectively balances accuracy, efficiency, and structural trust-worthiness, enabling compact and deployable foundation models for medical image segmentation. Code and pretrained models are available at: https://prantik-pdeb.github.io/adaloraqat.github.io/

2604.01156 2026-04-02 eess.SY cs.SY

Data-based Low-conservative Nonlinear Safe Control Learning

Amir Modares, Bahare Kiumarsi, Hamidreza Modares

详情
英文摘要

This paper develops a data-driven safe control framework for nonlinear discrete-time systems with parametric uncertainty and additive disturbances. The proposed approach constructs a data-consistent closed-loop representation that enables controller synthesis and safety certification directly from data. Unlike existing methods that treat unmodeled nonlinearities as global worst-case uncertainties using Lipschitz bounds, the proposed approach embeds nonlinear terms directly into the invariance conditions via a geometry-aware difference-of-convex formulation. This enables facet- and direction-specific convexification, avoiding both nonlinearity cancellation and the excessive conservatism induced by uniform global bounds. We further propose a vertex-dependent controller construction that enforces convexity and contractivity conditions locally on the active facets associated with each vertex, thereby enlarging the class of certifiable invariant sets. For systems subject to additive disturbances, disturbance effects are embedded directly into the verification conditions through optimized, geometry-dependent bounds, rather than via uniform margin inflation, yielding less conservative robust safety guarantees. As a result, the proposed methods can certify substantially larger safe sets, naturally accommodate joint state and input constraints, and provide data-driven safety guarantees. The simulation results show a significant improvement in both nonlinearity tolerance and the size of the certified safe set.

2604.01149 2026-04-02 math.OC cs.SY eess.SY

Spectral Decomposition of Discrete-Time Controllability Gramian and Its Inverse via System Eigenvalues

Alexey Iskakov

详情
英文摘要

This paper develops a closed-form spectral decomposition framework for the Gramian matrices of discrete-time linear dynamical systems. The main results provide explicit decompositions of the discrete-time controllability Gramian and its inverse in terms of the eigenvalues of the dynamics matrix, yielding a mode-resolved representation of these matrices. In contrast to the more common use of aggregate Gramian characteristics, such as eigenvalues, singular values, determinants, and trace-based metrics, the proposed approach describes the internal structure of the Gramian itself through contributions associated with individual modes and their pairwise combinations. The framework is extended further to the solution of the discrete-time Lyapunov difference equation, placing the obtained formulas in a broader context relevant to the analysis and computation of time-varying and nonlinear systems. In addition, the decomposition is generalized to systems whose dynamics matrix has multiple eigenvalues, enabling a closed-form estimation of the effects of resonant interactions between eigenmodes. The proposed results provide a structural tool for the analysis of controllability, observability and stability in discrete-time systems and complement existing Gramian-based methods used in model reduction, estimation, actuator and sensor selection, and energy-aware control. Beyond their theoretical interest, the derived decompositions may support the development of improved computational procedures and more informative performance criteria for a range of discrete-time control problems.

2604.01144 2026-04-02 eess.SY cs.SY

Schrodinger Bridges and Density Steering Problems for Gaussian Mixtures Models in Discrete-Time

George Rapakoulias, Fengjiao Liu, Panagiotis Tsiotras

详情
英文摘要

In this work, we revisit the discrete-time Schrödinger Bridge (SB) and Density Steering (DS) problems for Gaussian mixture model (GMM) boundary distributions. Building on the existing literature, we construct a set of feasible Markovian policies that transport the initial distribution to the final distribution, and are expressed as mixtures of elementary component-to-component optimal policies. We then study the policy optimization within this feasible set in the context of discrete-time SBs and density-steering problems, respectively. We show that for minimum-effort density-steering problems, the proposed policy achieves the same control cost as existing approaches in the literature. For discrete-time SB problems, the proposed policy yields a cost smaller than or equal to that in the literature, resulting in a less conservative approximation. Finally, we study the continuous-time limit of our proposed discrete-time approach and show that it agrees with recently proposed approximations to the continuous-time SB for GMM boundary distributions. We illustrate this new result through two numerical examples.

2604.01141 2026-04-02 cs.CV cs.AI eess.IV

Looking into a Pixel by Nonlinear Unmixing -- A Generative Approach

Maofeng Tang, Hairong Qi

详情
英文摘要

Due to the large footprint of pixels in remote sensing imagery, hyperspectral unmixing (HU) has become an important and necessary procedure in hyperspectral image analysis. Traditional HU methods rely on a prior spectral mixing model, especially for nonlinear mixtures, which has largely limited the performance and generalization capacity of the unmixing approach. In this paper, we address the challenging problem of hyperspectral nonlinear unmixing (HNU) without explicit knowledge of the mixing model. Inspired by the principle of generative models, where images of the same distribution can be generated as that of the training images without knowing the exact probability distribution function of the image, we develop an invertible mixing-unmixing process via a bi-directional GAN framework, constrained by both the cycle consistency and the linkage between linear and nonlinear mixtures. The combination of cycle consistency and linear linkage provides powerful constraints without requiring an explicit mixing model. We refer to the proposed approach as the linearly-constrained CycleGAN unmixing net, or LCGU net. Experimental results indicate that the proposed LCGU net exhibits stable and competitive performance across different datasets compared with other state-of-the-art model-based HNU methods.

2604.01134 2026-04-02 cs.RO cs.DB eess.IV

VRUD: A Drone Dataset for Complex Vehicle-VRU Interactions within Mixed Traffic

Ziyu Wang, Hongrui Kou, Cheng Wang, Ruochen Li, Hubert P. H. Shum, Amir Atapour-Abarghouei, Yuxin Zhang

详情
英文摘要

The Operational Design Domain (ODD) of urbanoriented Level 4 (L4) autonomous driving, especially for autonomous robotaxis, confronts formidable challenges in complex urban mixed traffic environments. These challenges stem mainly from the high density of Vulnerable Road Users (VRUs) and their highly uncertain and unpredictable interaction behaviors. However, existing open-source datasets predominantly focus on structured scenarios such as highways or regulated intersections, leaving a critical gap in data representing chaotic, unstructured urban environments. To address this, this paper proposes an efficient, high-precision method for constructing drone-based datasets and establishes the Vehicle-Vulnerable Road User Interaction Dataset (VRUD), as illustrated in Figure 1. Distinct from prior works, VRUD is collected from typical "Urban Villages" in Shenzhen, characterized by loose traffic supervision and extreme occlusion. The dataset comprises 4 hours of 4K/30Hz recording, containing 11,479 VRU trajectories and 1,939 vehicle trajectories. A key characteristic of VRUD is its composition: VRUs account for about 87% of all traffic participants, significantly exceeding the proportions in existing benchmarks. Furthermore, unlike datasets that only provide raw trajectories, we extracted 4,002 multi-agent interaction scenarios based on a novel Vector Time to Collision (VTTC) threshold, supported by standard OpenDRIVE HD maps. This study provides valuable, rare edge-case resources for enhancing the safety performance of ADS in complex, unstructured urban environments. To facilitate further research, we have made the VRUD dataset open-source at: https://zzi4.github.io/VRUD/.

2604.01122 2026-04-02 eess.IV

Region-Adaptive Generative Compression with Spatially Varying Diffusion Models

Lucas Relic, Roberto Azevedo, Yang Zhang, Stephan Mandt, Markus Gross, Christopher Schroers

详情
英文摘要

Generative image codecs aim to optimize perceptual quality, producing realistic and detailed reconstructions. However, they often overlook a key property of human vision: our tendency to focus on particular aspects of a visual scene (e.g., salient objects) while giving less importance to other regions. An ideal perceptual codec should be able to exploit this property by allocating more representational capacity to perceptually important areas. To this end, we propose a region-adaptive diffusion-based image codec that supports non-uniform bit allocation within an image. We design a novel spatially varying diffusion model capable of denoising varying amounts of noise per pixel according to arbitrary importance maps. We further identify that these maps can serve as effective priors on the latent representation, and integrate them into our entropy model, improving rate-distortion performance. Built on these contributions, our spatially-adaptive diffusion-based codec outperforms state-of-the-art ROI-controllable baselines in both full-image and ROI-masked perceptual quality.

2604.01120 2026-04-02 eess.AS

Diff-VS: Efficient Audio-Aware Diffusion U-Net for Vocals Separation

Yun-Ning, Hung, Richard Vogl, Filip Korzeniowski, Igor Pereira

Comments Accepted at ICASSP 2026

详情
英文摘要

While diffusion models are best known for their performance in generative tasks, they have also been successfully applied to many other tasks, including audio source separation. However, current generative approaches to music source separation often underperform on standard objective metrics. In this paper, we address this issue by introducing a novel generative vocal separation model based on the Elucidated Diffusion Model (EDM) framework. Our model processes complex short-time Fourier transform spectrograms and employs an improved U-Net architecture based on music-informed design choices. Our approach matches discriminative baselines on objective metrics and achieves perceptual quality comparable to state-of-the-art systems, as assessed by proxy subjective metrics. We hope these results encourage broader exploration of generative methods for music source separation

2604.01115 2026-04-02 eess.SY cs.SY

A Distributed SOS Program For Local Stability Analysis of Polynomial PDEs in the PIE Representation

Carl R Richardson, Declan S Jagt, Matthew M Peet, Antonis Papachristodoulou

详情
英文摘要

It has recently been shown that the evolution of a state, described by a Partial Differential Equation (PDE), can be more conveniently represented as the evolution of the state's highest spatial derivative (the ``fundamental state''), which lies in $L_2$ and has no boundary conditions (BCs) or continuity constraints. For linear PDEs, this yields a Partial Integral Equation (PIE) parametrized by Partial Integral (PI) operators mapping the fundamental state to the PDE state. In this paper, we show that for polynomial PDEs, the dynamics of the fundamental state can instead be compactly expressed as a distributed polynomial in the fundamental state, parametrized by a new tensor algebra of PI operators acting on the tensor product of the fundamental state. We further define a SOS parametrization of the distributed polynomial and use this to construct a distributed SOS program, for testing local stability of polynomial PDEs.

2604.01104 2026-04-02 eess.SY cs.SY

Maximizing Power Flexibility of Hybrid Energy Systems for Capacity Market

Tanmay Mishra, Mads R Almassalkhi

详情
英文摘要

Hybrid Energy Systems (HES), integrating generation sources, energy storage, and controllable loads, are well-positioned to provide real-time grid flexibility. However, quantifying this maximum flexibility is challenging due to renewable generation uncertainty and the complexity of power allocation across multiple assets in real time. This paper presents a rule-based framework for characterizing HES flexibility and systematically allocating power among its constituent assets. The flexibility envelope defines the dynamic power boundary within which the HES can inject or absorb power without violating operational constraints. Shaped in real time by capacity bids, available solar generation, and power allocation protocol, it enables reliable and predictable HES participation in regulation markets. Depending on the operational objective, the framework supports both symmetric and asymmetric flexibility cases. Further, the proposed power-allocation rule is benchmarked against an optimal dispatch, providing a performance reference under realistic conditions. Finally, state of charge drift correction control is presented to ensure sustained battery operation and system reliability. This work, therefore, offers a rigorous and practical framework for integrating HES into capacity markets through effective flexibility characterization.

2604.01081 2026-04-02 cs.CV cs.LG cs.RO eess.IV

ProOOD: Prototype-Guided Out-of-Distribution 3D Occupancy Prediction

Yuheng Zhang, Mengfei Duan, Kunyu Peng, Yuhang Wang, Di Wen, Danda Pani Paudel, Luc Van Gool, Kailun Yang

Comments Accepted to CVPR 2026. The source code is publicly available at https://github.com/7uHeng/ProOOD

详情
英文摘要

3D semantic occupancy prediction is central to autonomous driving, yet current methods are vulnerable to long-tailed class bias and out-of-distribution (OOD) inputs, often overconfidently assigning anomalies to rare classes. We present ProOOD, a lightweight, plug-and-play method that couples prototype-guided refinement with training-free OOD scoring. ProOOD comprises (i) prototype-guided semantic imputation that fills occluded regions with class-consistent features, (ii) prototype-guided tail mining that strengthens rare-class representations to curb OOD absorption, and (iii) EchoOOD, which fuses local logit coherence with local and global prototype matching to produce reliable voxel-level OOD scores. Extensive experiments on five datasets demonstrate that ProOOD achieves state-of-the-art performance on both in-distribution 3D occupancy prediction and OOD detection. On SemanticKITTI, it surpasses baselines by +3.57% mIoU overall and +24.80% tail-class mIoU; on VAA-KITTI, it improves AuPRCr by +19.34 points, with consistent gains across benchmarks. These improvements yield more calibrated occupancy estimates and more reliable OOD detection in safety-critical urban driving. The source code is publicly available at https://github.com/7uHeng/ProOOD.

2604.01060 2026-04-02 eess.SP

Data-Model Co-Driven Continuous Channel Map Construction: A Perceptive Foundation for Embodied Intelligent Agents in 6G Networks

Tianrun Qi, Cheng-Xiang Wang, Chen Huang, Junling Li, John S Thompson

详情
英文摘要

Future 6G networks will host massive numbers of embodied intelligent agents, which require real-time channel awareness over continuous-space for autonomous decision-making. By pre-obtaining location-specific channel state information (CSI), channel map can be served as a foundational world model for embodied intelligence to achieve wireless channel perception. However, acquiring CSI via measurements is costly, so in practice only sparse observations are available, leaving agents blind to channel conditions at unvisited locations. Meanwhile, purely model-driven channel maps can provide dense CSI but often yields unsatisfactory accuracy and robustness, while purely data-driven interpolation from sparse measurements is computationally prohibitive for real-time updates. To address these challenges, this paper proposes a data-model co-driven (DMcD) framework that performs a two-stage interpolation toward a space-time continuous channel map, First, a hybrid ray tracing and geometry-based channel model (H-RT/GBSM) is developed to capture dynamic scatterers, providing dense, time-variant channel properties that match measurement statistics as a physically consistent prior. Then, an inductive edge-conditioned graph neural network (InductE-GNN) fuses the prior with sparse measurements to perform real-time spatial interpolation, enabling rapid online adaptation without retraining, ensuring the synchronization with the dynamic physical reality. Evaluations with measured datasets show that the proposed DMcD framework significantly outperforms data-only and model-only baselines, providing accurate and queryable channel information for embodied intelligent agents.

2604.01056 2026-04-02 eess.SY cs.SY

A Functional Learning Approach for Team-Optimal Traffic Coordination

Weihao Sun, Gehui Xu, Alessio Moreschini, Thomas Parisini, Andreas A. Malikopoulos

Comments 8 pages, 7 figures, conference

详情
英文摘要

In this paper, we develop a kernel-based policy iteration functional learning framework for computing team-optimal strategies in traffic coordination problems. We consider a multi-agent discrete-time linear system with a cost function that combines quadratic regulation terms and nonlinear safety penalties. Building on the Hilbert space formulation of offline receding-horizon policy iteration, we seek approximate solutions within a reproducing kernel Hilbert space, where the policy improvement step is implemented via a discrete Fréchet derivative. We further study the model-free receding-horizon scenario, where the system dynamics are estimated using recursive least squares, followed by updating the policy using rolling online data. The proposed method is tested in signal-free intersection scenarios via both model-based and model-free simulations and validated in SUMO.

2604.00995 2026-04-02 eess.SP

Robust Multidimensional Chinese Remainder Theorem (MD-CRT) with Non-Diagonal Moduli and Multi-Stage Framework

Guangpu Guo, Xiang-Gen Xia

详情
英文摘要

The Chinese remainder theorem (CRT) provides an efficient way to reconstruct an integer from its remainders modulo several integer moduli, and has been widely applied in signal processing and information theory. Its multidimensional extension (MD-CRT) generalizes this principle to integer vectors and integer matrix moduli, enabling reconstruction in multidimensional signal processing scenarios. However, since matrices are generally non-commutative, the multidimensional extension introduces new theoretical and algorithmic challenges. When all matrix moduli are diagonal, the system is equivalent to applying the one-dimensional CRT independently along each dimension. This work first investigates whether non-diagonal (non-separable) moduli offer fundamental advantages over traditional diagonal ones. We show that under the same determinant constraint, non-diagonal matrices do not increase the dynamic range but yield more balanced and better-conditioned sampling patterns. More importantly, they generate lattices with longer shortest vectors, leading to higher robustness to vector remainder errors, compared to diagonal ones. To further improve the robustness, we develop a multi-stage robust MD-CRT framework that improves the robustness level without reducing the dynamic range. Due to the multidimensional nature and modulo matrix forms, it is challenging and not straightforward to extend the existing one-dimensional multi-stage robust CRT. In this paper, we obtain a new condition for matrix moduli, which can be easily checked, such that a multi-stage robust MD-CRT can be implemented. Both theoretical analysis and simulation results demonstrate that the proposed multi-stage robust MD-CRT achieves stronger error tolerance and more reliable reconstruction under erroneous vector remainders than that of single-stage robust MD-CRT.

2604.00992 2026-04-02 eess.SY cs.SY

Tube-Based Safety for Anticipative Tracking in Multi-Agent Systems

Armel Koulong, Ali Pakniyat

Comments This work has been submitted to the 65th IEEE Conference on Decision and Control for possible publication

详情
英文摘要

A tube-based safety framework is presented for robust anticipative tracking in nonlinear Brunovsky multi-agent systems subject to bounded disturbances. The architecture establishes robust safety certificates for a feedforward-augmented ancillary control policy. By rendering the state-deviation dynamics independent of the agents' internal nonlinearities, the formulation strictly circumvents the restrictive Lipschitz-bound feasibility conditions otherwise required for robust stabilization. Consequently, this structure admits an explicit, closed-form robust positively invariant (RPI) tube radius that systematically attenuates the exponential control barrier function (eCBF) tightening margins, thereby mitigating constraint conservatism while preserving formal forward invariance. Within the distributed model predictive control (MPC) layer, mapping the local tube radii through the communication graph yields a closed-form global formation error bound formulated via the minimum singular value of the augmented Laplacian. Robust inter-agent safety is enforced with minimal communication overhead, requiring only a single scalar broadcast per neighbor at initialization. Numerical simulations confirm the framework's efficacy in safely navigating heterogeneous formations through cluttered environments.

2604.00982 2026-04-02 eess.AS

VisG AV-HuBERT: Viseme-Guided AV-HuBERT

Aristeidis Papadopoulos, Rishabh Jain, Naomi Harte

Comments Includes Supplementary Material. Accepted for Publication at International Conference on Pattern Recognition 2026 - ICPR 2026. Code is available at https://github.com/aristosp/visg_avhubert

详情
英文摘要

Audio-Visual Speech Recognition (AVSR) systems nowadays integrate Large Language Model (LLM) decoders with transformer-based encoders, achieving state-of-the-art results. However, the relative contributions of improved language modelling versus enhanced audiovisual encoding remain unclear. We propose Viseme-Guided AV-HuBERT (VisG AV-HuBERT), a multi-task fine-tuning framework that incorporates auxiliary viseme classification to strengthen the model's reliance on visual articulatory features. By extending AV-HuBERT with a lightweight viseme prediction sub-network, this method explicitly guides the encoder to preserve visual speech information. Evaluated on LRS3, VisG AV-HuBERT achieves comparable or improved performance over the baseline AV-HuBERT, with notable gains under heavy noise conditions. WER reduces from 13.59% to 6.60% (51.4% relative improvement) at -10 dB Signal-to-Noise Ratio (SNR) for Speech noise. Deeper analysis reveals substantial reductions in substitution errors across noise types, demonstrating improved speech unit discrimination. Evaluation on LRS2 confirms generalization capability. Our results demonstrate that explicit viseme modelling enhances encoder representations, and provides a foundation for enhancing noise-robust AVSR through encoder-level improvements.

2604.00950 2026-04-02 eess.SY cs.SY

Mean-Field Control of Adherence in Participation-Coupled Vehicle Rebalancing Systems

Avalpreet Singh Brar, Rong Su, Jaskaranveer Kaur, Gioele Zardini

详情
英文摘要

Human driver participation is a critical source of uncertainty in Mobility-on-Demand (MoD) rebalancing. Drivers follow platform recommendations probabilistically, and their willingness to comply evolves with experienced outcomes. This creates a closed-loop feedback in which stronger recommendations increase participation, participation increases congestion, congestion lowers allocation success, and realized allocations update adherence beliefs. We propose a microscopic stochastic model that couples (i) belief-driven participation, (ii) Poisson demand, (iii) uniform matching, and (iv) Beta--Bernoulli belief updates. Under a large-population closure, we derive a deterministic mean-field recursion for the population adherence state under platform actuation. For i.i.d. Poisson demand and constant recommendation intensity, we prove global well-posedness and invariance of the recursion, establish equilibrium existence, provide uniqueness conditions, and show global convergence in the regime where platform recommendations are no weaker than baseline participation. We then define steady-state adherence and throughput, characterize the induced performance frontier, and show that adherence and throughput cannot, in general, be simultaneously maximized under uniform time-invariant actuation. This yields a throughput-maximization problem with an adherence floor. Exploiting the monotone frontier structure, we show the optimal uniform time-invariant policy is the maximal feasible recommendation intensity and provide an efficient bisection-based algorithm.

2604.00935 2026-04-02 eess.SY cs.SY math.OC

Polynomial Parametric Koopman Operators for Stochastic MPC

Efstathios Iliakis, Wallace Gian Yion Tan, Liang Wu, Jan Drgona, Richard D. Braatz

Comments 8 pages, 5 figures, submitted to CDC 2026

详情
英文摘要

This paper develops a parametric Koopman operator framework for Stochastic Model Predictive Control (SMPC), where the Koopman operator is parametrized by Polynomial Chaos Expansions (PCEs). The model is learned from data using the Extended Dynamic Mode Decomposition -- Dictionary Learning (EDMD-DL) method, which preserves the convex least-squares structure for the PCE coefficients of the EDMD matrix. Unlike conventional stochastic Galerkin projection approaches, we derive a condensed deterministic reformulation of the SMPC problem whose dimension scales only with the control horizon and input dimension, and is independent of both the lifted state dimension and the number of retained PCE terms. Our framework, therefore, enables efficient nonlinear SMPC problems with expectation and second-order moment constraints with standard convex optimization solvers. Numerical examples demonstrate the efficacy of our framework for uncertainty-aware SMPC of nonlinear systems.

2604.00926 2026-04-02 eess.SY cs.SY

Dispatch-Embedded Long-Term Tail Risk Assessment and Mitigation via CVaR for Renewable Power Systems

Kai Kang, Feng Liu

Comments 2026 PESIM BEST PAPER AWARD

详情
英文摘要

Renewable energy (RE) generation exhibits pronounced seasonality and variability, and neglecting these features can lead to significant underestimation of long-term power system risks in power supply. While long-term dispatch strategies are essential for evaluating and mitigating tail risks, they are often excluded from existing models due to their complexity. This paper proposes a long-term tail risk assessment and mitigation framework for renewable power systems, explicitly embedding dispatch strategies. A representative scenario generation method is designed, combining multi-timescale Copula modeling to capture RE's long-range variability and correlation. Building on these scenarios, an evolution-based risk assessment model is established, where Conditional Value-at-Risk (CVaR) is employed as a robust metric to quantify tail risks. Finally, a controlled evolution-based risk mitigation scheme is introduced to refine long-term dispatch strategies for mitigating tail risks. Case studies on a modified IEEE-39 bus system incorporating real-world data substantiate the efficacy of the proposed method.

2604.00900 2026-04-02 eess.SY cs.SY math.OC

Soft projections for robust data-driven control

András Sasfi, Jaap Eising, Florian Dörfler

详情
英文摘要

We consider data-based predictive control based on behavioral systems theory. In the linear setting this means that a system is described as a subspace of trajectories, and predictive control can be formulated using a projection onto the intersection of this behavior and a constraint set. Instead of learning the model, or subspace, we focus on determining this projection from data. Motivated by the use of regularization in data-enabled predictive control (DeePC), we introduce the use of soft projections, which approximate the true projector onto the behavior from noisy data. In the simplest case, these are equivalent to known regularized DeePC schemes, but they exhibit a number of benefits. First, we provide a bound on the approximation error consisting of a bias and a variance term that can be traded-off by the regularization weight. The derived bound is independent of the true system order, highlighting the benefit of soft projections compared to low-dimensional subspace estimates. Moreover, soft projections allow for intuitive generalizations, one of which we show has superior performance on a case study. Finally, we provide update formulas for soft projectors enabling the efficient adaptation of the proposed data-driven control methods in the case of streaming data.

2604.00864 2026-04-02 eess.SP

DOA Estimation for Low-Altitude Networks: HAD Architectures, Methods, and Challenges

Ye Tian, Tuo Wu, Jintao Wu, He Xu, Yuanjun Shen, Xianfu Lei, Kin-Fai Tong

详情
英文摘要

With the rapid expansion of low-altitude economy (LAE) services and the growing demand for integrated sensing and communication (ISAC) in air-ground networks, reliable direction-of-arrival (DOA) estimation has become essential for both directional communication and sensing functions. DOA underpins beam alignment, spatial-reuse scheduling, and ISAC-critical tasks such as airspace situational awareness and multi-target monitoring. Hybrid analog-digital (HAD) architectures have emerged as a practical solution for large-aperture directional operation under stringent radio frequency (RF), analog-to-digital converter (ADC), and size, weight, and power (SWaP) constraints. However, HAD compresses antenna-domain observations through analog combining, fundamentally reshaping the measurement model and introducing new algorithmic and system-level challenges for DOA estimation. This article first reviews the principles and representative architectures of HAD, highlighting their advantages for scalable beam-centric and ISAC-oriented operation in LAE scenarios. We then provide a structured overview of HAD-enabled DOA estimation methodologies, including spatial covariance matrix (SCM) reconstruction, multi-combiner scan-based acquisition, and pilot-aided estimation, along with key design tradeoffs. Finally, we discuss open challenges and outline reliability-driven research directions toward robust, deployable HAD-enabled DOA solutions for practical ISAC-enabled low-altitude environments.

2604.00863 2026-04-02 eess.SP

Optimal Anchor Placement for Wireless Localization in Mixed LOS and NLOS Scenarios

Gaurav Duggal, R. Michael Buehrer, Harpreet S. Dhillon, Jeffrey H. Reed

Comments Under Review

详情
英文摘要

We develop a unified Fisher-information framework for localization in environments with both Line-of-Sight (LOS) and Non-Line-of-Sight (NLOS) paths, focusing on diffraction-dominated NLOS propagation characteristic of Outdoor-to-Indoor (O2I) signal propagation. The model couples anchor geometry with a physically grounded path-loss law that is continuous across the LOS/NLOS boundary and serves as an optimization objective for our optimal anchor placement problem. As the first step, we analyze single-target anchor placement and derive the classical A-, D-, and E-optimality criteria. Under a specific path-loss assumption, these criteria collapse to a polygon-closure condition in the complex plane: A-, D-, and E-optimal designs coincide, yielding necessary and sufficient conditions for optimal placement. Next, we extend the notion of optimal anchor placement with respect to a single target to optimality over a feasible region (multi-target setting) using a general formulation that explicitly includes a realistic path loss model. This is achieved by recasting the anchor placement as a combinatorial anchor-selection problem with provable guarantees. Next, we specify E- and D-optimal objectives over multiple targets in a predefined feasible target region and show that E-optimality straddles A-optimality (within a constant factor), while D-optimality provides looser bounds. These insights yield two practical algorithms, both mixed-integer second-order cone programs (MISOCP) with exact E-optimal and exact D-optimal objectives that produce robust, region-wide designs under mixed LOS/NLOS conditions.

2604.00846 2026-04-02 eess.SP

Spatial Upper Bound of Radiated Power in Active Antenna Systems

Dominique Nussbaum, Christ Rizk, Eric Seguenot, Florian Kaltenberger, Andrea Moro, Alessandro Sinicco, Laura Pometcu

详情
英文摘要

The assessment of unwanted radiated emissions from Active Antenna Systems (AAS) has become a critical issue in adjacent-band coexistence scenarios. In this paper, we establish the existence of a deterministic spatial upper bound on the radiated power of active antenna arrays. We show that the maximum radiated power always occurs in the boresight direction, irrespective of frequency or signal nature (useful signal, nonlinear distortion, or noise), or instantaneous beamforming configuration, thereby defining a conservative spatial upper bound whose angular envelope is solely determined by the elementary radiating building block of the antenna architecture, i.e., the element or sub-array radiation pattern. Starting from a two-element array with third-order nonlinearities, we derive the spatial envelope and extend the result to realistic AAS architectures. The theoretical findings are validated by over-the-air (OTA) measurements performed on a 3.5 GHz Massive Multiple-Input Multiple-Output (MIMO) antenna. The proposed approach offers a simple, robust, and measurement-oriented methodology for coexistence assessments involving beamformed radio systems.

2604.00826 2026-04-02 eess.SY cs.SY

Bridging RL and MPC for mixed-integer optimal control with application to Formula 1 race strategies

Joschua Wüthrich, Romir Damle, Giona Fieni, Melanie N. Zeilinger, Christopher H. Onder, Andrea Carron

Comments 8 pages, 5 figures; This work has been submitted to the IEEE for possible publication

详情
英文摘要

We propose a hybrid reinforcement learning (RL) and model predictive control (MPC) framework for mixed-integer optimal control, where discrete variables enter the cost and dynamics but not the constraints. Existing hierarchical approaches use RL only for the discrete action space, leaving continuous optimization to MPC. Unlike these methods, we train the RL agent on the full hybrid action space, ensuring consistency with the cost of the underlying Markov decision process. During deployment, the RL actor is rolled out over the prediction horizon to parametrize an integer-free nonlinear MPC through the discrete action sequence and provide a continuous warm-start. The learned critic serves as a terminal cost to capture long-term performance. We prove recursive feasibility, and validate the framework on a Formula 1 race strategy problem. The hybrid method achieves near-optimal performance relative to an offline mixed-integer nonlinear program benchmark, outperforming a standalone RL agent. Moreover, the hybrid scheme enables adaptation to unseen disturbances through modular MPC extensions at zero retraining cost.

2604.00825 2026-04-02 eess.SY cs.SY math.OC

Min-Max Grassmannian Optimization for Online Subspace Tracking

Shreyas Bharadwaj, Bamdev Mishra, Cyrus Mostajeran, Alberto Padoan, Jeremy Coulson, Ravi Banavar

Comments Submitted to the 65th IEEE Conference on Decision and Control, December 15-18 2026, Honolulu, Hawaii, USA

详情
英文摘要

This paper discusses robustness guarantees for online tracking of time-varying subspaces from noisy data. Building on recent work in optimization over a Grassmannian manifold, we introduce a new approach for robust subspace tracking by modeling data uncertainty in a Grassmannian ball. The robust subspace tracking problem is cast into a min-max optimization framework, for which we derive a closed-form solution for the worst-case subspace, enabling a geometric robustness adjustment that is both analytically tractable and computationally efficient, unlike iterative convex relaxations. The resulting algorithm, GeRoST (Geometrically Robust Subspace Tracking), is validated on two case studies: tracking a linear time-varying system and online foreground-background separation in video.

2604.00823 2026-04-02 eess.SP physics.optics

Novel Single Clad Ho-doped Fiber with High Slope Efficiency and Low Ion Pairing

Robert E. Tench, Wiktor Walasik, Alexandre Amavigan, Jean-Marc Delavaux, Colin C. Baker, Daniel Rhonehouse

Comments 6 pages, 7 figures, 1 table

详情
英文摘要

We report the design and experimental and simulated performance for a 2050 nm band fiber amplifier with high optical-optical slope efficiency and low ion pairing, using a novel high performance single clad Ho-doped fiber from the Naval Research Laboratory (NRL). We measure an optical-optical slope efficiency of 57% using 1 mW input signal power and 1860 nm pumping which we believe is the highest slope efficiency obtained to date for a single clad single stage copumped HDFA. A new method for non-destructive measurement of the ion pairing coefficient in Ho-doped fibers is introduced and validated. Using this method, we link our 57% slope efficiency to a low ion pairing coefficient of 4% in the NRL Ho-doped fiber as derived from our experimental data. We present an overview and survey of the ion pairing results for Ho-doped fiber amplifiers and lasers reported so far in the literature.