arXivDaily arXiv每日学术速递 周一至周五更新
重置
EESS电气与系统 174
2603.08654 2026-03-10 eess.SY cs.SY

Carbon-aware Market Participation for Building Energy Management Systems

Young-ho Cho, Mohamad Chehade, Fatima Al-Janahi, Sol Lim, Javad Mohammadi, Hao Zhu

详情
英文摘要

Tackling climate change requires the rapid and deep decarbonization of electric power systems. While energy management systems (EMSs) play a central role in this transition, conventional EMSs focus mainly on economic efficiency and often overlook the environmental impact of operational decisions. To address this gap, this paper proposes a unified, real-time building-level carbon-aware EMS (CAEMS) capable of simultaneously co-optimizing grid imports, energy storage, and flexible demand within a single integrated framework. We formulate a mixed-integer linear program (MILP) model that directly integrates time-varying marginal carbon intensity signals into the EMS objective for coordinated participation in both day-ahead (DA) and real-time (RT) markets. To relax the unrealistic assumption of perfect foresight, we incorporate a model predictive control (MPC) extension driven by a Transformer-based forecaster that jointly predicts electricity prices and carbon intensity. The proposed CAEMS is validated using real-world data from the PJM electricity market. Simulation results demonstrate that modest carbon prices can achieve a significant 22.5% reduction in emissions with only a 1.7% increase in cost.

2603.08633 2026-03-10 eess.SY cs.SY

Reachability-based Temporal Logic Verification for Reliable LLM-guided Human-Autonomy Teaming

Joonwon Choi, Kartik Anand Pant, Karthik Nune, Inseok Hwang

详情
英文摘要

We propose a reachability-based framework for reliable LLM-guided human-autonomy teaming (HAT) using signal temporal logic (STL). In the proposed framework, LLM is leveraged as a translator that transfers natural language commands given by a human operator into corresponding STL specifications or vice versa. An STL feasibility filter (SFF) is proposed to check the feasibility of the generated STL. The SFF first decomposes the complex and nested LLM translation into a set of simpler subformulas for parallelization and informative feedback generation. The reachability analysis method is then applied to verify if each subformula is feasible for a target dynamical system: if feasible, perform mission planning, otherwise, reject it. The proposed SFF can identify infeasible subformulas, more than simply providing the boolean verification results for the whole STL, thereby facilitating the feedback generation of LLM to request modification of the command to the human. Consequently, the proposed framework can allow more reliable HAT by enabling safe and informative communication between the human operator and the autonomous agent. Our experiments demonstrate that the proposed framework can successfully filter out infeasible subformulas and generate informative feedback based on such information.

2603.08591 2026-03-10 eess.SP

On the SNR Statistics in Coupled-Core Multi-Core Fiber Transmissions with Mode-Dependent Loss

Chiara Lasagni, Paolo Serena, Alberto Bononi, Lucas A. Zischler, Giammarco Di Sciullo, Antonio Mecozzi, Cristian Antonelli

详情
英文摘要

We investigate the impact of mode-dependent loss (MDL) on the statistics of the signal-to-noise ratio (SNR) in coupled-core multi-core fiber (CC-MCF) systems. Through numerical and theoretical simulations, we present an in-depth analysis of the impact of MDL on received amplified spontaneous emission (ASE) noise and nonlinear interference (NLI), as well as their joint contribution to the SNR. We show that MDL induces different statistics on the two noises and discuss the differences with single-mode polarization-dependent loss. Moreover, we investigate the impact of spatial mode dispersion (SMD) on the MDL-induced impairment, offering insights on their joint effects on ASE and NLI.

2603.08535 2026-03-10 math.OC cs.SY eess.SY

Rethinking Strict Dissipativity for Economic MPC

Mario Zanon

详情
英文摘要

Stability of economic model predictive control can be proven under the assumption that a strict dissipativity condition holds. This assumption has a clear interpretation in terms of the so-called rotated stage cost, which must have its minimum at the optimal steady state. However, contrary to dissipativity, for strict dissipativity the storage function cannot be immediately related to the value function of an optimal control problem formulated with the economic stage cost. We propose the novel concept of two-storage strict dissipativity, which requires two storage functions to satisfy dissipativity and be separated by a positive definite function. This new condition can be immediately related to optimal control by means of value functions and might be easier to verify than strict dissipativity. Furthermore, we prove that two-storage strict dissipativity is sufficient and necessary for asymptotic stability, it is related to strict dissipativity, and also to alternative approaches relying on the so-called cost-to-travel. Finally, we discuss commonly used and new terminal cost designs that guarantee asymptotic stability in the finite-horizon case.

2603.08521 2026-03-10 cs.CV cs.RO eess.IV

OccTrack360: 4D Panoptic Occupancy Tracking from Surround-View Fisheye Cameras

Yongzhi Lin, Kai Luo, Yuanfan Zheng, Hao Shi, Mengfei Duan, Yang Liu, Kailun Yang

Comments The benchmark and source code will be made publicly available at https://github.com/YouthZest-Lin/OccTrack360

详情
英文摘要

Understanding dynamic 3D environments in a spatially continuous and temporally consistent manner is fundamental for robotics and autonomous driving. While recent advances in occupancy prediction provide a unified representation of scene geometry and semantics, progress in 4D panoptic occupancy tracking remains limited by the lack of benchmarks that support surround-view fisheye sensing, long temporal sequences, and instance-level voxel tracking. To address this gap, we present OccTrack360, a new benchmark for 4D panoptic occupancy tracking from surround-view fisheye cameras. OccTrack360 provides substantially longer and more diverse sequences (174~2234 frames) than prior benchmarks, together with principled voxel visibility annotations, including an all-direction occlusion mask and an MEI-based fisheye field-of-view mask. To establish a strong fisheye-oriented baseline, we further propose Focus on Sphere Occ (FoSOcc), a framework that addresses two core challenges in fisheye occupancy tracking: distorted spherical projection and inaccurate voxel-space localization. FoSOcc includes a Center Focusing Module (CFM) to enhance instance-aware spatial localization through supervised focus guidance, and a Spherical Lift Module (SLM) that extends perspective lifting to fisheye imaging under the Unified Projection Model. Extensive experiments on Occ3D-Waymo and OccTrack360 show that our method improves occupancy tracking quality with notable gains on geometrically regular categories, and establishes a strong baseline for future research on surround-view fisheye 4D occupancy tracking. The benchmark and source code will be made publicly available at https://github.com/YouthZest-Lin/OccTrack360.

2603.08503 2026-03-10 cs.CV cs.GR cs.RO eess.IV

Spherical-GOF: Geometry-Aware Panoramic Gaussian Opacity Fields for 3D Scene Reconstruction

Zhe Yang, Guoqiang Zhao, Sheng Wu, Kai Luo, Kailun Yang

Comments The source code and dataset will be released at https://github.com/1170632760/Spherical-GOF

详情
英文摘要

Omnidirectional images are increasingly used in robotics and vision due to their wide field of view. However, extending 3D Gaussian Splatting (3DGS) to panoramic camera models remains challenging, as existing formulations are designed for perspective projections and naive adaptations often introduce distortion and geometric inconsistencies. We present Spherical-GOF, an omnidirectional Gaussian rendering framework built upon Gaussian Opacity Fields (GOF). Unlike projection-based rasterization, Spherical-GOF performs GOF ray sampling directly on the unit sphere in spherical ray space, enabling consistent ray-Gaussian interactions for panoramic rendering. To make the spherical ray casting efficient and robust, we derive a conservative spherical bounding rule for fast ray-Gaussian culling and introduce a spherical filtering scheme that adapts Gaussian footprints to distortion-varying panoramic pixel sampling. Extensive experiments on standard panoramic benchmarks (OmniBlender and OmniPhotos) demonstrate competitive photometric quality and substantially improved geometric consistency. Compared with the strongest baseline, Spherical-GOF reduces depth reprojection error by 57% and improves cycle inlier ratio by 21%. Qualitative results show cleaner depth and more coherent normal maps, with strong robustness to global panorama rotations. We further validate generalization on OmniRob, a real-world robotic omnidirectional dataset introduced in this work, featuring UAV and quadruped platforms. The source code and the OmniRob dataset will be released at https://github.com/1170632760/Spherical-GOF.

2603.08480 2026-03-10 eess.SY cs.SY math.DS

Input Dexterity and Output Negotiation in Feedback-Linearizable Nonlinear Systems

Mirko Mizzoni, Pieter van Goor, Barbara Bazzana, Antonio Franchi

详情
英文摘要

We introduce a task-relative taxonomy of actuator inputs for nonlinear systems within the input-output feedback-linearization framework. Given a flat output specifying the task, inputs are classified as essential, redundant, or dexterity: essential inputs are required for exact linearization, redundant inputs can be removed without effect, and dexterity inputs can be deactivated while preserving exact linearization of a reduced task. We show that a subset is dexterity if and only if, under a suitable dynamic prolongation, it can appear as additional output channels (flat-input complement) on a common validity set. Whenever a family of systems obtained by (de)activating dexterity inputs admits a common prolongation, the family can be interpreted as a single prolonged system endowed with different output selections. This enables a unified linearizing controller that negotiates between full and reduced tasks without transients on shared outputs under compatibility and dwell-time conditions. Simulations on a fully actuated aerial platform illustrate graceful task downgrades from six-dimensional pose tracking as lateral-force channels are deactivated.

2603.08477 2026-03-10 eess.SY cs.SY

Behavioral Generative Agents for Power Dispatch and Auction

Shaoze Li, Justin S. Kim, Cong Chen

详情
英文摘要

This paper presents positive initial evidence that generative agents can relax the rigidity of traditional mathematical models for human decision-making in power dispatch and auction settings. We design two proof-of-concept energy experiments with generative agents powered by a large language model (LLM). First, we construct a home battery management testbed with stochastic electricity prices and blackout interventions, and benchmark LLM decisions against dynamic programming. By incorporating an in-context learning (ICL) module, we show that behavioral patterns discovered by a stronger reasoning model can be transferred to a smaller LLM via example-based prompting, leading agents to prioritize post-blackout energy reserves over short-term profit. Second, we study LLM agents in simultaneous ascending auctions (SAA) for power network access, comparing their behavior with an optimization benchmark, the straightforward bidding strategy. By designing ICL prompts with rule-based, myopic, and strategic objectives, we find that structured prompting combined with ICL enables LLM agents to both reproduce economically rational strategies and exhibit systematic behavioral deviations. Overall, these results suggest that LLM-powered agents provide a flexible and expressive testbed for modeling human decision-making in power system applications.

2603.08472 2026-03-10 eess.SP cs.SY eess.SY

Multi-Mode Pinching-Antenna Systems: Mode Selection or Mode Combining?

Xiaoxia Xu, Xidong Mu, Yuanwei Liu, Arumugam Nallanathan

Comments Submitted to IEEE. Code is available at https://github.com/xiaoxiaxusummer/multi_mode_pinching_antenna

详情
英文摘要

This letter investigates multi-mode pinching antenna systems (PASS), where signals of multiple orthogonal modes can be transmitted within a dielectric waveguide and radiated by pinching antennas (PAs). This enables mode-domain multiplexing for efficient multi-user communications using a single waveguide. In particular, two operating protocols are proposed, namely mode selection and mode combining. Mode selection enforces each PA to predominantly radiate signal power of one single mode, while mode combining allows each PA to flexibly radiate power of multiple modes. Based on the two protocols, a sum rate maximization problem is formulated for multi-mode PASS-enabled multi-user downlink communications, where the transmit beamforming, PA positions, and PA propagation constants are jointly optimized. To address this rapidly oscillating and highly nonconvex problem, a particle swarm optimization (PSO) based Karush-Kuhn-Tucker (KKT)-parameterized beamforming (PSO- KPBF) algorithm is proposed. KKT-conditioned solutions are exploited to guide the swarm search, thus reducing the search space and achieving fast convergence. Numerical results demonstrate that: 1) Even using a simple uniform mode-combining design, the multi-mode PASS significantly outperform conventional single-mode PASS and hybrid beamforming systems; and 2) Mode combining achieves high spectral efficiency, while mode selection approximates its performance with a lower hardware complexity. Code is released at https://github.com/xiaoxiaxusummer/multi_mode_pinching_antenna

2603.08468 2026-03-10 eess.SY cs.LG cs.SY

Integrating Lagrangian Neural Networks into the Dyna Framework for Reinforcement Learning

Shreya Das, Kundan Kumar, Muhammad Iqbal, Outi Savolainen, Dominik Baumann, Laura Ruotsalainen, Simo Särkkä

Comments 5 pages, 3 figures

详情
英文摘要

Model-based reinforcement learning (MBRL) is sample-efficient but depends on the accuracy of the learned dynamics, which are often modeled using black-box methods that do not adhere to physical laws. Those methods tend to produce inaccurate predictions when presented with data that differ from the original training set. In this work, we employ Lagrangian neural networks (LNNs), which enforce an underlying Lagrangian structure to train the model within a Dyna-based MBRL framework. Furthermore, we train the LNN using stochastic gradient-based and state-estimation-based optimizers to learn the network's weights. The state-estimation-based method converges faster than the stochastic gradient-based method during neural network training. Simulation results are provided to illustrate the effectiveness of the proposed LNN-based Dyna framework for MBRL.

2603.08457 2026-03-10 cs.RO cs.LG cs.SY eess.SP eess.SY physics.data-an

Adaptive Entropy-Driven Sensor Selection in a Camera-LiDAR Particle Filter for Single-Vessel Tracking

Andrei Starodubov, Yaqub Aris Prabowo, Andreas Hadjipieris, Ioannis Kyriakides, Roberto Galeazzi

Comments 8 pages, 5 figures, submitted to FUSION 2026 conference proceedings

详情
英文摘要

Robust single-vessel tracking from fixed coastal platforms is hindered by modality-specific degradations: cameras suffer from illumination and visual clutter, while LiDAR performance drops with range and intermittent returns. We present a heterogeneous multi-sensor fusion particle-filter tracker that incorporates an information-gain (entropy-reduction) adaptive sensing policy to select the most informative configuration at each fusion time bin. The approach is validated in a real maritime deployment at the CMMI Smart Marina Testbed (Ayia Napa Marina, Cyprus), using a shore-mounted 3D LiDAR and an elevated fixed camera to track a rigid inflatable boat with onboard GNSS ground truth. We compare LiDAR-only, camera-only, all-sensors, and adaptive configurations. Results show LiDAR dominates near-field accuracy, the camera sustains longer-range coverage when LiDAR becomes unavailable, and the adaptive policy achieves a favorable accuracy-continuity trade-off by switching modalities based on information gain. By avoiding continuous multi-stream processing, the adaptive configuration provides a practical baseline for resilient and resource-aware maritime surveillance.

2603.08438 2026-03-10 eess.SP

Graph Based Semantic Encoder Decoder Framework for Task Oriented Communications in Connected Autonomous Vehicles

Soheyb Ribouh, Phil Polo Ditsia Di Ngoma

详情
英文摘要

Connected autonomous vehicles (CAVs) require reliable and efficient communication frameworks to support safety critical and task-oriented applications such as collision avoidance, cooperative perception, and traffic risk assessment. Traditional communication paradigms, which focus on transmitting raw bits, often incur excessive bandwidth consumption and fail to preserve the semantic relevance of transmitted information. To bridge this gap, we propose a Graph-Based Semantic Encoder-Decoder (GBSED) architecture tailored for task-oriented communications in CAV networks. The encoder leverages scene graphs to capture spatial and semantic relationships among road entities, combined with a semantic compression algorithm that reduces the size of the extracted graph based representations by up to 99% compared to raw images, while the decoder reconstructs task relevant representations rather than raw data. This design enables a significant reduction in communication overhead while maintaining high semantic fidelity, exceeding 0.9 at SNR levels above 10dB, for downstream vehicular tasks. We evaluate the proposed framework through simulations in autonomous driving scenarios, where the semantic encoder and decoder are integrated into a MIMO OFDM physical layer system. The results demonstrate high prediction success rates for risk assessment, improved robustness under the 3GPP CDL channel, and significant compression gains, confirming that the proposed semantic communication framework is a promising solution for future 6G systems.

2603.08425 2026-03-10 cs.AI cs.HC cs.LG cs.MA cs.SY eess.SY

IronEngine: Towards General AI Assistant

Xi Mo

Comments Technical Report

详情
英文摘要

This paper presents IronEngine, a general AI assistant platform organized around a unified orchestration core that connects a desktop user interface, REST and WebSocket APIs, Python clients, local and cloud model backends, persistent memory, task scheduling, reusable skills, 24-category tool execution, MCP-compatible extensibility, and hardware-facing integration. IronEngine introduces a three-phase pipeline -- Discussion (Planner--Reviewer collaboration), Model Switch (VRAM-aware transition), and Execution (tool-augmented action loop) -- that separates planning quality from execution capability. The system features a hierarchical memory architecture with multi-level consolidation, a vectorized skill repository backed by ChromaDB, an adaptive model management layer supporting 92 model profiles with VRAM-aware context budgeting, and an intelligent tool routing system with 130+ alias normalization and automatic error correction. We present experimental results on file operation benchmarks achieving 100\% task completion with a mean total time of 1541 seconds across four heterogeneous tasks, and provide detailed comparisons with representative AI assistant systems including ChatGPT, Claude Desktop, Cursor, Windsurf, and open-source agent frameworks. Without disclosing proprietary prompts or core algorithms, this paper analyzes the platform's architectural decomposition, subsystem design, experimental performance, safety boundaries, and comparative engineering advantages. The resulting study positions IronEngine as a system-oriented foundation for general-purpose personal assistants, automation frameworks, and future human-centered agent platforms.

2603.08402 2026-03-10 eess.SP

Deep Learning based Cross-Receiver Radio Frequency Fingerprint Identification Under Varying Channels

Jiashuo He, Yumeng Wang, Feiyang He, Sai Huang, Yiheng Liu, Shuo Chang, Zhiyong Feng

详情
英文摘要

Radio frequency fingerprint identification (RFFI) exploits device-specific hardware impairments for transmitter recognition, but its performance is highly vulnerable to receiver variations and changing wireless channels in cross-receiver deployment. To address both challenges, this paper proposes a novel cross-receiver RFFI framework with channel robustness. In the enrollment stage, a channel-robust preprocessing method is developed to construct denoised spectral quotient (DSQ) sequences, and a DSQ-based convolutional neural network (DSQCNN) is trained using data collected from the source receiver. In the cross-receiver deployment stage, a calibration dataset is built from signals captured by both the source and target receivers, and a trainable calibration neural network (TCNN) is designed to learn the nonlinear mapping between them. The cascaded TCNN-DSQCNN framework then enables robust transmitter classification on the target receiver under varying channel conditions. To the best of our knowledge, this is the first work to jointly address channel and receiver portability through combined channel suppression and nonlinear receiver calibration. Simulations with twelve WiFi transmitters and three receivers show that the proposed method achieves reliable cross-receiver classification, reaching over 90\% accuracy at an SNR of 24 dB.

2603.08397 2026-03-10 eess.AS

NLE: Non-autoregressive LLM-based ASR by Transcript Editing

Avihu Dekel, Samuel Thomas, Takashi Fukada, George Saon

Comments Preprint

详情
英文摘要

While autoregressive (AR) LLM-based ASR systems achieve strong accuracy, their sequential decoding limits parallelism and incurs high latency. We propose NLE, a non-autoregressive (NAR) approach that formulates speech recognition as conditional transcript editing, enabling fully parallel prediction. NLE extracts acoustic embeddings and an initial hypothesis from a pretrained speech encoder, then refines the hypothesis using a bidirectional LLM editor trained with a latent alignment objective. An interleaved padding strategy exploits the identity mapping bias of Transformers, allowing the model to focus on corrections rather than full reconstruction. On the Open ASR leaderboard, NLE++ achieves 5.67% average WER with an RTFx (inverse real-time factor) of 1630. In single-utterance scenarios, NLE achieves 27x speedup over the AR baseline, making it suitable for real-time applications.

2603.08389 2026-03-10 eess.SP

Mitigating Mixed-field Interference in Near-field and Far-field Communications: An Antenna Selection Approach

Tianyu Liu, Changsheng You, Chao Zhou, Mingjiang Wu, Ming-Min Zhao, Zhaocheng Wang

详情
英文摘要

In mixed near-field and far-field systems, the nonorthogonality between near-field and far-field channels may cause severe inter-user interference and hence degrade rate performance, when the analog beamforming is designed based on the low-complexity full-array maximum ratio transmission (MRT). To tackle this issue, we propose in this paper an antenna selection-based transmission framework to effectively suppress mixed-field interference without mechanically altering antenna structures. To this end, an optimization problem is formulated to maximize the sum-rate of mixed-field systems, by jointly designing antenna selection and power allocation under the MRT-based analog beamforming. As the problem is non-convex and generally difficult to solve optimally, we first consider a typical two-user scenario to obtain useful insights. Interestingly, we analytically show that the strong mixed-field interference can be substantially mitigated by deactivating only a small portion of antennas, yet without compromising array gains too much. Moreover, an inherent tradeoff is revealed in antenna selection between interference suppression and array-gain enhancement, based on which a suboptimal number of deactivated antennas for achieving the maximum sum-rate is obtained. Next, for the general multi-user case, we develop an efficient penalty dual decomposition (PDD)-based two-layer framework to obtain its high quality solution by using the block coordinate descent (BCD) and successive convex approximation (SCA) techniques. To further reduce the computational complexity, a low-complexity antenna deactivation strategy is proposed capitalizing on an interference suppression criterion. Last, numerical results demonstrate that the proposed scheme achieves a favorable trade-off between interference suppression and array gain loss, hence achieving significant performance gains over various baseline schemes.

2603.08351 2026-03-10 eess.SY cs.SY

Eigenvalue Patterns and Participation Analysis of Symmetric Renewable Energy Power Systems

Yao Qin, Yitong Li, Wei Wang, Shaoze Zhou, Zheng Wei, Jinjun Liu

Comments 17 pages, 15 figures

详情
英文摘要

State-space analysis is widely employed for examining power system dynamics but faces challenges in large-scale power systems integrated with numerous inverter-based resources (IBRs), where the significant increase of system states complicates modal analysis. Notably, renewable energy power systems often consist of multiple homogeneous generation units. This uniformity, termed symmetry in this paper, can facilitate the system stability analysis. Eigenvalue patterns and participation factors in three types of symmetric renewable energy power systems are investigated, including ideally-, quasi-, and group-symmetric systems. An ideally-symmetric (quasi-symmetric) system comprises a group of identical (similar) subsystems connected to an external grid. A system containing multiple such groups is termed group-symmetric. In these symmetric systems, two types of modes are defined to characterize different interactions: inner-group modes, which describe the interactions among subsystems within a single group, and group-grid modes, which describe the interactions between the groups and the external grid. A new concept termed group participation factor is also proposed to extend the use of conventional participation factors for repeated and close modes. In addition, the invariance properties of the inner-group modes and group-grid modes are discussed. The findings provide insights for stability analysis and targeted optimization in power systems. Theoretical advances are validated through numerical results and electromagnetic transient (EMT) simulations on example power systems of varied types and scales.

2603.08339 2026-03-10 eess.SP cs.AI cs.LG

Electrocardiogram Classification with Transformers Using Koopman and Wavelet Features

Sucheta Ghosh, Zahra Monfared

详情
英文摘要

Electrocardiogram (ECG) analysis is vital for detecting cardiac abnormalities, yet robust automated classification is challenging due to the complexity and variability of physiological signals. In this work, we investigate transformer-based ECG classification using features derived from the Koopman operator and wavelet transforms. Two tasks are studied: (1) binary classification (Normal vs. Non-normal), and (2) four-class classification (Normal, Atrial Fibrillation, Ventricular Arrhythmia, Block). We use Extended Dynamic Mode Decomposition (EDMD) to approximate the Koopman operator. Our results show that wavelet features excel in binary classification, while Koopman features, when paired with transformers, achieve superior performance in the four-class setting. A simple hybrid of Koopman and wavelet features does not improve accuracy. However, selecting an appropriate EDMD dictionary -- specifically a radial basis function dictionary with tuned parameters -- yields significant gains, surpassing the wavelet-only baseline and the hybrid wavelet-Koopman system. We also present a Koopman-based reconstruction analysis for interpretable insights into the learned dynamics and compare against a recurrent neural network baseline. Overall, our findings demonstrate the effectiveness of Koopman-based feature learning with transformers and highlight promising directions for integrating dynamical systems theory into time-series classification.

2603.08335 2026-03-10 eess.SY cs.SY

The coordination between TSO and DSO in the context of energy transition - A review

Hang Nguyen, Koen Kok, Trung Thai Tran, Phuong H. Nguyen

Comments Published in: 2024 59th International Universities Power Engineering Conference (UPEC)

详情
英文摘要

Nowadays, energy transition is ongoing in many countries, aiming to reduce dependence on fossil fuels and CO2 emissions. Besides the positive impacts on the environment, this transition brings technical challenges to the system operators, such as the intricacies of energy system integration, diminishing uncertainty, and incentivizing customers with advanced transaction models. The coordination between the Transmission system operator (TSO) and the Distribution system operator (DSO) is one of the most important aspects of encountering these obstacles. This coordination enhances the utilization of flexibility from Distributed energy resources (DERs) by incentivizing the market parties with better willingness to pay schemes. This paper provides an overview of the coordination schemes (CS), their classification, assessment of the current situation and the challenges associated with applying these schemes in practical context. The main purpose is to investigate the most effective way for TSO/DSOs to use the flexibility resource to maintain the balancing of the entire system while ensuring no congestion occurs in the network. A broad range of possible coordination schemes along with exploiting flexibility services is presented and the pros and cons are analyzed. Additionally, the study presents a general scenario that describes the interaction between the operators and the third party in providing service to the balancing market, considering cases with and without coordination.

2603.08307 2026-03-10 eess.SY cs.SY

Adaptive Tracking Control of Euler-Lagrange Systems with Time-Varying State and Input Constraints

Poulomee Ghosh, Shubhendu Bhasin

详情
英文摘要

This paper presents an adaptive control framework for Euler-Lagrange (E-L) systems that enforces user-defined time-varying state and input constraints in the presence of parametric uncertainties and bounded disturbances. The proposed design integrates a time-varying barrier Lyapunov Function (TVBLF) with a saturated control law to guarantee constraint satisfaction without resorting to real-time optimization. A key contribution is the development of an offline, verifiable feasibility condition that certifies the existence of a feasible control policy for any prescribed pair of time-varying state and input envelopes. Additionally, we prove boundedness of all closed-loop signals. Real-time experiments conducted on a 2-DoF helicopter model validate the efficacy and practical viability of the proposed method.

2603.08283 2026-03-10 cs.LG cs.SY eess.SY math.OC

PolyFormer: learning efficient reformulations for scalable optimization under complex physical constraints

Yilin Wen, Yi Guo, Bo Zhao, Wei Qi, Zechun Hu, Colin Jones, Jian Sun

Comments Code availability: All the data and code are made openly available at https://github.com/wenyl16/PolyFormer

详情
英文摘要

Real-world optimization problems are often constrained by complex physical laws that limit computational scalability. These constraints are inherently tied to complex regions, and thus learning models that incorporate physical and geometric knowledge, i.e., physics-informed machine learning (PIML), offer a promising pathway for efficient solution. Here, we introduce PolyFormer, which opens a new direction for PIML in prescriptive optimization tasks, where physical and geometric knowledge is not merely used to regularize learning models, but to simplify the problems themselves. PolyFormer captures geometric structures behind constraints and transforms them into efficient polytopic reformulations, thereby decoupling problem complexity from solution difficulty and enabling off-the-shelf optimization solvers to efficiently produce feasible solutions with acceptable optimality loss. Through evaluations across three important problems (large-scale resource aggregation, network-constrained optimization, and optimization under uncertainty), PolyFormer achieves computational speedups up to 6,400-fold and memory reductions up to 99.87%, while maintaining solution quality competitive with or superior to state-of-the-art methods. These results demonstrate that PolyFormer provides an efficient and reliable solution for scalable constrained optimization, expanding the scope of PIML to prescriptive tasks in scientific discovery and engineering applications.

2603.08249 2026-03-10 eess.AS cs.CL eess.IV

Bootstrapping Audiovisual Speech Recognition in Zero-AV-Resource Scenarios with Synthetic Visual Data

Pol Buitrago, Pol Gàlvez, Oriol Pareras, Javier Hernando

Comments 6 pages, 3 figures, Submitted to Interspeech 2026

详情
英文摘要

Audiovisual speech recognition (AVSR) combines acoustic and visual cues to improve transcription robustness under challenging conditions but remains out of reach for most under-resourced languages due to the lack of labeled video corpora for training. We propose a zero-AV-resource AVSR framework that relies on synthetic visual streams generated by lip-syncing static facial images with real audio. We first evaluate synthetic visual augmentation on Spanish benchmarks, then apply it to Catalan, a language with no annotated audiovisual corpora. We synthesize over 700 hours of talking-head video and fine-tune a pre-trained AV-HuBERT model. On a manually annotated Catalan benchmark, our model achieves near state-of-the-art performance with much fewer parameters and training data, outperforms an identically trained audio-only baseline, and preserves multimodal advantages in noise. Scalable synthetic video thus offers a viable substitute for real recordings in zero-AV-resource AVSR.

2603.08248 2026-03-10 eess.SY cs.SY

Coupling Europe's Capacity Markets

Kamal Adekola, Laurens de Vries, Kenneth Bruninx

详情
英文摘要

European Member States are increasingly introducing national capacity mechanisms (CMs) to manage growing adequacy risks. However, isolated national CMs are inefficient in highly interconnected electricity systems, such as the European system. While progress has been made in facilitating cross-border participation by generation capacity in CMs, existing arrangements are prone to under- or over-investment and do not properly value the contribution of interconnection capacity to Member States' adequacy targets. In this paper, we propose a novel conceptual design for a coupled European capacity market that utilises the logic of flow-based market coupling. In a comparative analysis of different market design scenarios in an illustrative multi-zone case study, using a bespoke long-run equilibrium problem, we show that the proposed flow-based coupling of capacity markets reduces system costs by harnessing available capacity in neighbouring market zones while ensuring deliverability with respect to network constraints in all scarcity situations.

2603.08244 2026-03-10 eess.SP

Fluid Antennas Meet Intelligent Surfaces: Security Analysis of NOMA Systems Under Hardware Impairments

Tuo Wu, Jianchao Zheng, Xiazhi Lai, Maged Elkashlan, Hyundong Shin, Naofal Al-Dhahir

详情
英文摘要

The revolutionary convergence of fluid antenna systems (FAS) and reconfigurable intelligent surfaces (RIS) creates unprecedented opportunities for secure wireless communications, yet the practical implications of hardware impairments on this promising combination remain largely unexplored. This paper investigates the security performance of non-orthogonal multiple access (NOMA) systems when fluid antennas (FAs) meet intelligent surfaces under realistic hardware constraints. We develop a comprehensive analytical framework that captures the complex interplay between adaptive spatial diversity, intelligent signal reflection, and hardware-induced distortions in short-packet communications. Through novel piecewise linear approximations and block-correlation models, we derive tractable expressions for average secure block error rate (BLER) that reveal fundamental performance limits imposed by hardware impairments. Our analysis demonstrates that while the synergy between FAs and intelligent surfaces offers remarkable degrees of freedom for security enhancement, practical hardware imperfections create performance ceilings that persist regardless of spatial diversity gains. The theoretical framework exposes critical design trade-offs between system complexity and achievable security performance, showing that hardware quality becomes a decisive factor in realizing the full potential of FAS-RIS architectures. Extensive simulations validate our analytical insights and provide practical design guidelines for implementing secure NOMA systems that effectively balance the benefits of fluid-intelligent cooperation against the constraints of realistic hardware limitations.

2603.08231 2026-03-10 eess.AS cs.CL

Quantifying Cross-Lingual Transfer in Paralinguistic Speech Tasks

Pol Buitrago, Oriol Pareras, Federico Costa, Javier Hernando

Comments 6 pages, 5 figures, Submitted to Interspeech 2026

详情
英文摘要

Paralinguistic speech tasks are often considered relatively language-agnostic, as they rely on extralinguistic acoustic cues rather than lexical content. However, prior studies report performance degradation under cross-lingual conditions, indicating non-negligible language dependence. Still, these studies typically focus on isolated language pairs or task-specific settings, limiting comparability and preventing a systematic assessment of task-level language dependence. We introduce the Cross-Lingual Transfer Matrix (CLTM), a systematic method to quantify cross-lingual interactions between pairs of languages within a given task. We apply the CLTM to two paralinguistic tasks, gender identification and speaker verification, using a multilingual HuBERT-based encoder, to analyze how donor-language data affects target-language performance during fine-tuning. Our results reveal distinct transfer patterns across tasks and languages, reflecting systematic, language-dependent effects.

2603.08230 2026-03-10 cs.SD cs.AI eess.AS

Disentangling Reasoning in Large Audio-Language Models for Ambiguous Emotion Prediction

Xiaofeng Yu, Jiaheng Dong, Jean Honorio, Abhirup Ghosh, Hong Jia, Ting Dang

Comments The paper was submitted to Interspeech for review

详情
英文摘要

Speech emotion recognition plays an important role in various applications. However, most existing approaches predict a single emotion label, oversimplifying the inherently ambiguous nature of human emotional expression. Recent large audio-language models show promise in generating richer outputs, but their reasoning ability for ambiguous emotional understanding remains limited. In this work, we reformulate ambiguous emotion recognition as a distributional reasoning problem and present the first systematic study of ambiguity-aware reasoning in LALMs. Our framework comprises two complementary components: an ambiguity-aware objective that aligns predictions with human perceptual distributions, and a structured ambiguity-aware chain-of-thought supervision that guides reasoning over emotional cues. Experiments on IEMOCAP and CREMA-D demonstrate consistent improvements across SFT, DPO, and GRPO training strategies.

2603.08229 2026-03-10 eess.SP

From Design to Validation: Preparing a LEO-Capable UE for End-to-End System Evaluation

Amedeo Giuliani, Pol Henarejos, Erislandy Mozo, Màrius Caus, Miguel Ángel Solis Gallego, Jaime Suárez García, Rami Othman, Justin Tallon

详情
英文摘要

The extension of 5G connectivity through Low-Earth Orbit satellite systems introduces significant technical challenges, particularly due to time-varying propagation delays and high Doppler shifts resulting from satellite motion. While the Third Generation Partnership Project Release 17 established the initial framework for non-terrestrial networks, the ongoing developments in Release 19 further enhance this effort by introducing support for regenerative payload architectures, where part of the communication protocol stack is processed directly on board the satellite. In this work, we present the design of a 5G user equipment adapted for Low-Earth Orbit satellite connectivity, with specific focus on strategies for managing variable delay and Doppler compensation. Additionally, we describe a custom experimental platform based on a drone-mounted software-defined radio platform capable of emulating both transparent and regenerative satellite payloads. Although full end-to-end system validation is not yet complete, initial laboratory tests confirm the feasibility of the architecture and lay the groundwork for future experimental campaigns.

2603.08217 2026-03-10 eess.IV

3-D Near-Field Passive Radar Imaging Using Multiple Illumination Sources

Quanfeng Wang, Mei Song Tong, Thomas F. Eibert

Comments This article has been accepted for publication in 2025 IEEE International Symposium on Antennas and Propagation and North American Radio Science Meeting. This is the version of the author which has not been fully edited and content may change prior to final publication

详情
英文摘要

Near-field (NF) passive radar imaging depends on the illumination of the imaging scene by a non-cooperative transmitter (Tx). It is demonstrated that combining imaging results obtained with Tx antennas at different positions can enhance the performance of passive radar imaging. On the one hand, multiple Tx antennas provide diverse illumination perspectives, reducing the likelihood of unilluminated regions on the targets of interest (TOIs). On the other hand, the coherent summation of imaging results obtained for different illuminations helps to suppress potential artifacts. This approach is in particular advantageous for imaging complex objects with concave structures such as dihedral arrangements, where the ghosts due to multiple reflections are highly configuration-dependent. For each illuminating configuration, a single-frequency inverse source solver is utilized to reconstruct the equivalent sources of the TOIs and the resulting single-frequency images are then superimposed coherently with corresponding phase and magnitude correction methods. The obtained multi-frequency images are finally coherently combined to enhance the imaging quality. Both simulation and measurement results are presented to validate the effectiveness of the approach.

2603.08216 2026-03-10 eess.AS cs.CL cs.SD

DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining

Shangeth Rajaa

Comments Submitted to Interspeech 2026

详情
英文摘要

Speech-to-speech models handle turn-taking naturally but offer limited support for tool-calling or complex reasoning, while production ASR-LLM-TTS voice pipelines offer these capabilities but rely on silence timeouts, which lead to unnatural turn-taking. We present DualTurn, which narrows this gap through generative pretraining on dual-channel conversational audio. The model generates both speakers' future audio autoregressively, implicitly learning conversational dynamics without any labels, and is then fine-tuned to predict interpretable turn-taking signals that map directly to agent actions. DualTurn monitors both channels continuously, anticipating turn boundaries and producing five agent actions. On standard benchmarks, DualTurn (0.5B) outperforms both VAP on agent action prediction (wF1 0.633 vs. 0.389) and a 3.1B audio-text model on word-level turn prediction (AUC 0.930 vs. 0.880), while anticipating turn boundaries earlier with fewer interruptions.

2603.08198 2026-03-10 eess.SP

Instantaneous Frequency Estimation in Noisy Multicomponent Signals with Interfering Modes Based on Prony Method and Spline Approximation

Basile Dubois-Bonnaire, Sylvain Meignen, Kévin Polisano

详情
英文摘要

In this paper, we propose a novel estimator of the instantaneous frequencies (IFs) of the modes making up multicomponent signals (MCSs). We are particularly interested in dealing with noisy MCSs containing close modes in the time-frequency plane. Though it is possible to adapt Prony approach to estimate IFs in such situations, interference between the modes generates oscillations in the obtained estimations. After having investigated the nature of these oscillations, we propose an algorithm to remove these in IFs estimation, based on spline approximation. Numerical applications in various situations illustrate the benefit of mixing Prony technique with spline approximation for IF estimation in noisy MCSs containing close modes.