arXivDaily arXiv每日学术速递 周一至周五更新
重置
2603.26647 2026-03-30 cs.LG cs.SY eess.SY

An LP-based Sampling Policy for Multi-Armed Bandits with Side-Observations and Stochastic Availability

Ashutosh Soni, Peizhong Ju, Atilla Eryilmaz, Ness B. Shroff

详情
英文摘要

We study the stochastic multi-armed bandit (MAB) problem where an underlying network structure enables side-observations across related actions. We use a bipartite graph to link actions to a set of unknowns, such that selecting an action reveals observations for all the unknowns it is connected to. While previous works rely on the assumption that all actions are permanently accessible, we investigate the more practical setting of stochastic availability, where the set of feasible actions (the "activation set") varies dynamically in each round. This framework models real-world systems with both structural dependencies and volatility, such as social networks where users provide side-information about their peers' preferences, yet are not always online to be queried. To address this challenge, we propose UCB-LP-A, a novel policy that leverages a Linear Programming (LP) approach to optimize exploration-exploitation trade-offs under stochastic availability. Unlike standard network bandit algorithms that assume constant access, UCB-LP-A computes an optimal sampling distribution over the realizable activation sets, ensuring that the necessary observations are gathered using only the currently active arms. We derive a theoretical upper bound on the regret of our policy, characterizing the impact of both the network structure and the activation probabilities. Finally, we demonstrate through numerical simulations that UCB-LP-A significantly outperforms existing heuristics that ignore either the side-information or the availability constraints.

2603.26636 2026-03-30 physics.app-ph cs.SY eess.SY

Patched-Wall Quasistatic Cavity Resonators for 3-D Wireless Power Transfer

Takuya Sasatani, Yoshihiro Kawahara

Comments 5 pages, 6 figures

详情
英文摘要

Traditional wireless power transfer (WPT) systems are largely limited to 1-D charging pads or 2-D charging surfaces and therefore do not support a truly ubiquitous device-powering experience. Although room-scale WPT based on multimode quasistatic cavity resonance (QSCR) has demonstrated full-volume coverage by leveraging multiple resonant modes, existing high-coverage implementations require obstructive internal conductive structures, such as a central pole. This letter presents a new structure, termed the patched-wall QSCR, that eliminates such internal obstructions while preserving full-volume coverage. By using conductive wall segments interconnected by capacitors, the proposed structure supports two complementary resonant modes that cover both the peripheral and central regions without obstructions within the charging volume. Electromagnetic simulations show that, by selectively exciting these two resonant modes, the proposed structure achieves a minimum power-transfer efficiency of 48.1% across the evaluated 54 m^3 charging volume while preserving an unobstructed interior space.

2603.26621 2026-03-30 eess.SY cs.SY

Inclusion conditions for the Constrained Polynomial Zonotopic case

Bogdan Gheorghe, Amr Alanwar, Florin Stoican

详情
英文摘要

Set operations are well understood for convex sets but become considerably more challenging in the non-convex case due to the loss of structural properties in their representation. Constrained polynomial zonotopes (CPZs) offer an effective compromise, as they can capture complex, typically non-convex geometries while maintaining an algebraic structure suitable for further manipulation. Building on this, we propose novel nonlinear encodings that provide sufficient conditions for testing inclusion between two CPZs and adapt them for seamless integration within optimization frameworks.

2603.26578 2026-03-30 eess.SY cs.SY

Port-Transversal Barriers: Graph-Theoretic Safety for Port-Hamiltonian Systems

Chi Ho Leung, Philip E. Paré

详情
英文摘要

We study port-Hamiltonian systems with energy functions that split into local storage terms. From the interconnection and dissipation structure, we construct a graph on the energy compartments. From this graph, we show that the shortest-path distance from a constrained compartment to the nearest actuated one gives a lower bound on the relative degree of the corresponding safety constraint. We also show that no smooth static feedback can reduce it when no path exists. When the relative degree exceeds one and the immediate graph neighbors of the constrained compartment is connected to at least one input port, we reshape the constraint by subtracting their shifted local storages, producing a candidate barrier function of relative degree one. We then identify sufficient regularity conditions that recover CBF feasibility under bounded inputs. We validate the framework on an LC ladder network, where the enforceability of a capacitor charge constraint depends only on the input topology.

2603.26566 2026-03-30 eess.SP cs.IT math.IT

Beam-Coherence-Aware Two-Stage Digital Combining for mmWave MU-MIMO Systems

Yasaman Khorsandmanesh, Emil Bjornson, Joakim Jalden, Bengt Lindoff

Comments arXiv admin note: substantial text overlap with arXiv:2508.04214

详情
英文摘要

This paper considers a wideband millimeter-wave MIMO system with fully digital transceivers at both the base station and the user equipment (UE), focusing on mobile scenarios. To reduce the baseband processing burden at the UE, we propose a two-stage digital combining architecture, where the received signals are compressed from $K$ antennas to dimension $N_{\mathrm c}$ before baseband processing. The first-stage combining matrix exploits channel geometry and is updated on the beam-coherence timescale, which is longer than the channel coherence time, while the second stage is updated per channel coherence time. We develop a pilot-based channel estimation framework tailored to the proposed two-stage digital combining architecture, leveraging maximum likelihood estimation. Furthermore, we propose a time-domain method that exploits the finite delay spread to reconstruct the full channel from a reduced number of pilot subcarriers. Precoding and combining schemes are designed accordingly, and spectral efficiency expressions with imperfect channel state information are derived. Numerical results show that the proposed time-domain approach outperforms hybrid beamforming while reducing pilot overhead. We further demonstrate that the framework extends to multi-user MIMO and retains its performance advantages. These results highlight the potential of two-stage fully digital transceivers for future wideband systems.

2603.26454 2026-03-30 eess.SP

Near-Field MMSE Channel Estimation for THz RIS-aided Communications with Electromagnetic Interference

Wen-Xuan Long, Marco Moretti, Giacomo Bacci, Luca Sanguinetti

详情
英文摘要

This letter investigates the channel estimation problem in THz wireless communications where a RIS is employed to assist wireless transmission between different devices. Unlike existing studies, we consider a novel scenario where specific devices are all located in the radiative NF region of the RIS. Meanwhile, we also account for the impact on channel estimation of the random electromagnetic interference occurring at the RIS location. A linear minimum mean-square error estimator is employed, where the estimation error is fully determined by the RIS configuration. Optimizing the RIS involves solving a non-convex problem, which is addressed using an alternating optimization approach based on the diagonally scaled gradient descent algorithm. Numerical results in the THz band highlight the importance of leveraging NF channel statistics over far-field approximations and demonstrate that the proposed estimator achieves substantial improvements in normalized mean-square error compared to existing methods.

2603.26393 2026-03-30 eess.IV cs.CV

Adapting Frozen Mono-modal Backbones for Multi-modal Registration via Contrast-Agnostic Instance Optimization

Yi Zhang, Yidong Zhao, Qian Tao

Comments MICCAI Learn2Reg Challenge

详情
英文摘要

Deformable image registration remains a central challenge in medical image analysis, particularly under multi-modal scenarios where intensity distributions vary significantly across scans. While deep learning methods provide efficient feed-forward predictions, they often fail to generalize robustly under distribution shifts at test time. A straightforward remedy is full network fine-tuning, yet for modern architectures such as Transformers or deep U-Nets, this adaptation is prohibitively expensive in both memory and runtime when operating in 3D. Meanwhile, the naive fine-tuning struggles more with potential degradation in performance in the existence of drastic domain shifts. In this work, we propose a registration framework that integrates a frozen pretrained \textbf{mono-modal} registration model with a lightweight adaptation pipeline for \textbf{multi-modal} image registration. Specifically, we employ style transfer based on contrast-agnostic representation generation and refinement modules to bridge modality and domain gaps with instance optimization at test time. This design is orthogonal to the choice of backbone mono-modal model, thus avoids the computational burden of full fine-tuning while retaining the flexibility to adapt to unseen domains. We evaluate our approach on the Learn2Reg 2025 LUMIR validation set and observe consistent improvements over the pretrained state-of-the-art mono-modal backbone. In particular, the method ranks second on the multi-modal subset, third on the out-of-domain subset, and achieves fourth place overall in Dice score. These results demonstrate that combining frozen mono-modal models with modality adaptation and lightweight instance optimization offers an effective and practical pathway toward robust multi-modal registration.

2603.26387 2026-03-30 eess.IV

Rethinking Feature Conditioning for Robust Forged Media Detection in Edge AI Sensing Systems

Izaldein Al-Zyoud, Abdulmotaleb El Saddik

详情
英文摘要

Generalization under manipulation and dataset shift remains a core challenge in forged media detection for AI-driven edge sensing systems. Frozen vision foundation models with linear probes are strong baselines, but most pipelines use default backbone outputs without testing conditioning at the frozen feature interface. We present the first controlled probing study on DINOv3 ConvNeXt and show that, without task-specific fine-tuning, linear probing alone yields competitive forged-media detection performance, indicating that ViT-7B self-supervised distillation transfers to security-critical vision workloads at edge-compatible inference cost. Backbone, head, data, and optimization are fixed while conditioning is varied; LN-Affine, the default ConvNeXt head output, is the natural baseline. On FaceForensics++ c23, five conditioning variants are evaluated under in-distribution testing, leave-one-manipulation-out (LOMO), and cross-dataset transfer to Celeb-DF v2 and DeepFakeDetection. In ConvNeXt-Tiny, conditioning alone changes LOMO mean AUC by 6.1 points and reverses ID-vs-OOD ranking: LN-Affine is strongest on external datasets, while LayerNorm is strongest in-distribution. In ConvNeXt-Base replication, the OOD winner becomes protocol-dependent, and ID-optimal selection still fails as a robust deployment rule. Results show that feature conditioning is a first-order design variable and should be selected with robustness-oriented validation, not ID accuracy alone.

2603.26384 2026-03-30 eess.SP

PARAFAC-Based Channel Estimation for Beyond Diagonal Reconfigurable Surfaces

Gilderlan Tavares de Araújo, Bruno Sokal, André L. F. de Almeida

详情
英文摘要

Channel estimation is a central bottleneck in BD-RIS-assisted MIMO systems. The richer inter-element coupling that enables large performance gains also makes training and hardware control substantially harder than in diagonal RIS architectures. Existing estimators either target only cascaded channels or require block-by-block reconfiguration of the BD-RIS interconnections, which is costly and difficult to implement in practice. To overcome this limitation, we propose a pilot-assisted tensor framework for group-connected BD-RIS under a two-timescale protocol, where the scattering structure is designed as a low-rank PARAFAC model with fixed factor matrices. This design keeps the interconnection topology constant across blocks and updates only phase shifts, enabling practical operation without sacrificing estimation quality. Building on this structure, we develop a PARAFAC-based alternating least-squares (PALS) receiver that recovers the individual channels. Numerical results confirm that PALS delivers markedly lower composite-channel NMSE than conventional LS, matches the accuracy of state-of-the-art tensor receivers, and sharply reduces BD-RIS design complexity

2603.26367 2026-03-30 eess.SP

WiMamba: Linear-Scale Wireless Foundation Model

Tomer Raviv, Nir Shlezinger

详情
英文摘要

Foundation models learn transferable representations, motivating growing interest in their application to wireless systems. Existing wireless foundation models are predominantly based on transformer architectures, whose quadratic computational and memory complexity can hinder practical deployment for large-scale channels. In this work, we introduce WiMamba, a wireless foundation model built upon the recently proposed Mamba architecture, which replaces attention mechanisms with selective state-space models and enables linear-time sequence modeling. Leveraging this architectural advantage combined with adaptive preprocessing, WiMamba achieves scalable and low-latency inference while maintaining strong representational expressivity. We further develop a dedicated task-agnostic, self-supervised pre-training framework tailored to wireless channels, resulting in a genuine foundation model that learns transferable channel representations. Evaluations across four downstream tasks demonstrate that WiMamba matches or outperforms transformer-based wireless foundation models, while offering dramatic latency and memory reductions.

2603.26347 2026-03-30 cs.RO cs.SY eess.SY

Optimal Prioritized Dissipation and Closed-Form Damping Limitation under Actuator Constraints for Haptic Interfaces

Camilla Celli, Andrea Bini, Valerio Novelli, Alessandro Filippeschi, Francesco Porcini, Antonio Frisoli

详情
英文摘要

In haptics, guaranteeing stability is essential to ensure safe interaction with remote or virtual environments. One of the most relevant methods at the state-of-the-art is the Time Domain Passivity Approach (TDPA). However, its high conservatism leads to a significant degradation of transparency. Moreover, the stabilizing action may conflict with the device's physical limitations. State-of-the-art solutions have attempted to address these actuator limits, but they still fail to account simultaneously for the power limits of each actuator while maximizing transparency. This work proposes a new damping limitation method based on prioritized dissipation actions. It prioritizes an optimal dissipation direction that minimizes actuator load, while any excess dissipation is allocated to the orthogonal hyperplane. The solution provides a closed-form formulation and is robust in multi-DoF scenarios, even in the presence of actuator and motion anisotropies. The method is experimentally validated using a parallel haptic interface interacting with a virtual environment and tested under different operating conditions.

2603.26344 2026-03-30 stat.ML cs.LG cs.SD eess.AS eess.SP

A Power-Weighted Noncentral Complex Gaussian Distribution

Toru Nakashika

详情
英文摘要

The complex Gaussian distribution has been widely used as a fundamental spectral and noise model in signal processing and communication. However, its Gaussian structure often limits its ability to represent the diverse amplitude characteristics observed in individual source signals. On the other hand, many existing non-Gaussian amplitude distributions derived from hyperspherical models achieve good empirical fit due to their power-law structures, while they do not explicitly account for the complex-plane geometry inherent in complex-valued observations. In this paper, we propose a new probabilistic model for complex-valued random variables, which can be interpreted as a power-weighted noncentral complex Gaussian distribution. Unlike conventional hyperspherical amplitude models, the proposed model is formulated directly on the complex plane and preserves the geometric structure of complex-valued observations while retaining a higher-dimensional interpretation. The model introduces a nonlinear phase diffusion through a single shape parameter, enabling continuous control of the distributional geometry from arc-shaped diffusion along the phase direction to concentration of probability mass toward the origin. We formulate the proposed distribution and analyze the statistical properties of the induced amplitude distribution. The derived amplitude and power distributions provide a unified framework encompassing several widely used distributions in signal modeling, including the Rice, Nakagami, and gamma distributions. Experimental results on speech power spectra demonstrate that the proposed model consistently outperforms conventional distributions in terms of log-likelihood.

2603.26339 2026-03-30 cs.LG cs.RO cs.SY eess.SY

Curvature-aware Expected Free Energy as an Acquisition Function for Bayesian Optimization

Ajith Anil Meera, Wouter Kouw

Comments under review

详情
英文摘要

We propose an Expected Free Energy-based acquisition function for Bayesian optimization to solve the joint learning and optimization problem, i.e., optimize and learn the underlying function simultaneously. We show that, under specific assumptions, Expected Free Energy reduces to Upper Confidence Bound, Lower Confidence Bound, and Expected Information Gain. We prove that Expected Free Energy has unbiased convergence guarantees for concave functions. Using the results from these derivations, we introduce a curvature-aware update law for Expected Free Energy and show its proof of concept using a system identification problem on a Van der Pol oscillator. Through rigorous simulation experiments, we show that our adaptive Expected Free Energy-based acquisition function outperforms state-of-the-art acquisition functions with the least final simple regret and error in learning the Gaussian process.

2603.26264 2026-03-30 cs.LG cs.SY eess.SY

Topology-Aware Graph Reinforcement Learning for Energy Storage Systems Optimal Dispatch in Distribution Networks

Shuyi Gao, Stavros Orfanoudakis, Shengren Hou, Peter Palensky, Pedro P. Vergara

Comments 15 pages, 10 figures

详情
英文摘要

Optimal dispatch of energy storage systems (ESSs) in distribution networks involves jointly improving operating economy and voltage security under time-varying conditions and possible topology changes. To support fast online decision making, we develop a topology-aware Reinforcement Learning architecture based on Twin Delayed Deep Deterministic Policy Gradient (TD3), which integrates graph neural networks (GNNs) as graph feature encoders for ESS dispatch. We conduct a systematic investigation of three GNN variants: graph convolutional networks (GCNs), topology adaptive graph convolutional networks (TAGConv), and graph attention networks (GATs) on the 34-bus and 69-bus systems, and evaluate robustness under multiple topology reconfiguration cases as well as cross-system transfer between networks with different system sizes. Results show that GNN-based controllers consistently reduce the number and magnitude of voltage violations, with clearer benefits on the 69-bus system and under reconfiguration; on the 69-bus system, TD3-GCN and TD3-TAGConv also achieve lower saved cost relative to the NLP benchmark than the NN baseline. We also highlight that transfer gains are case-dependent, and zero-shot transfer between fundamentally different systems results in notable performance degradation and increased voltage magnitude violations. This work is available at: https://github.com/ShuyiGao/GNNs_RL_ESSs and https://github.com/distributionnetworksTUDelft/GNNs_RL_ESSs.

2603.26246 2026-03-30 cs.CL cs.AI cs.LG eess.AS

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Shashi Kumar, Esaú Villatoro-Tello, Sergio Burdisso, Kadri Hacioglu, Thibault Bañeras-Roux, Hasindri Watawana, Dairazalia Sanchez-Cortes, Srikanth Madikeri, Petr Motlicek, Andreas Stolcke

Comments 11 pages

详情
英文摘要

Standard LLM-based speech recognition systems typically process utterances in isolation, limiting their ability to leverage conversational context. In this work, we study whether multimodal context from prior turns improves LLM-based ASR and how to represent that context efficiently. We find that, after supervised multi-turn training, conversational context mainly helps with the recognition of contextual entities. However, conditioning on raw context is expensive because the prior-turn audio token sequence grows rapidly with conversation length. To address this, we propose Abstract Compression, which replaces the audio portion of prior turns with a fixed number of learned latent tokens while retaining corresponding transcripts explicitly. On both in-domain and out-of-domain test sets, the compressed model recovers part of the gains of raw-context conditioning with a smaller prior-turn audio footprint. We also provide targeted analyses of the compression setup and its trade-offs.

2603.26216 2026-03-30 eess.SP

Antenna Elements' Trajectory Optimization for Throughput Maximization in Continuous-Trajectory Fluid Antenna-Aided Wireless Communications

Shuaixin Yang, Yijia Li, Yue Xiao, Yong Liang Guan, Kai-Kit Wong, Hyundong Shin, Chau Yuen

Comments 35 pages, 3 figures

详情
英文摘要

Fluid antenna (FA) systems offer novel spatial degrees of freedom (DoFs) with the potential for significant performance gains. Compared to existing works focusing solely on optimizing FA positions at discrete time instants, we introduce the concept of continuous-trajectory fluid antenna (CTFA), which explicitly considers the antenna element's movement trajectory across continuous time intervals and incorporates the inherent kinematic constraints present in practical FA implementations. Accordingly, we formulate the total throughput maximization problem in CTFA-aided wireless communication systems, addressing the joint optimization of continuous antenna trajectories in conjunction with the transmit covariance matrices under kinematic constraints. To effectively solve this non-convex problem with highly coupled optimization variables, we develop an iterative algorithm based on block coordinate descent (BCD) and majorization-minimization (MM) principles with the aid of the weighted minimum mean square error (WMMSE) method. Finally, numerical results are presented to validate the efficacy of the proposed algorithms and to quantify the substantial total throughput advantages afforded by the conceived CTFA-aided system compared to conventional fixed-position antenna (FPA) benchmarks and alternative approaches employing simplified trajectories.

2603.26155 2026-03-30 eess.SY cs.SY

Aging States Estimation and Monitoring Strategies of Li-Ion Batteries Using Incremental Capacity Analysis and Gaussian Process Regression

Moritz Landwehr, Patrick Hoher, Johannes Reuter

详情
英文摘要

Existing approaches for battery health forecasting often rely on extensive cycling histories and continuously monitored cells. In contrast, many real-world scenarios provide only sparse information, e.g. a single diagnostic cycle. In our study, we investigate state of health (SoH)- and remaining useful life (RUL) estimation of previously unseen lithium-ion cells, relying on cycling data from begin of life (BOL) to end of life (EOL) of multiple similar cells by using the publicly available Oxford battery aging dataset. The estimator applies incremental capacity analysis (ICA)-based feature extraction in combination with data-efficient regression methods. Particular emphasis is placed on a multi-model Gaussian process regression ensemble approach (GPRn), which also provides uncertainty quantification. Due to a rather cell invariant behaviour, the mapping of ICA features to SoH estimation is highly precise and points out a normalized mean absolute error (NMAE) of 1.3%. The more cell variant mapping to RUL estimation is challenging, reflecting in a NMAE of 5.3%. Using the estimation results, a RUL monitoring strategy is derived. The objective is to safely operate a battery cell from BOL to EOL by only taking sparse diagnostic measurements. On average, only four diagnostic measurements are required during a cell's lifetime of 3300 to 5000 cycles.

2603.26153 2026-03-30 eess.SP

Movable-Antenna Index Modulation (MA-IM): System Framework and Performance Analysis

Bang Huang, Shunyuan Shang, Mohamed-Slim Alouini

详情
英文摘要

This paper proposes a movable-antenna-based index modulation (MA-IM) framework that exploits the spatial mobility of a single reconfigurable antenna to create additional information-bearing dimensions for next-generation wireless systems. By discretizing the continuous movable region into a dense set of candidate sampling points and selecting representative anchors for indexing, the proposed framework converts spatial degrees of freedom into a practical modulation resource. Building on this framework, we develop a family of anchor-selection strategies with different levels of channel awareness, including geometry-based, SNR-based, max--min channel-domain, and joint constellation-aware designs. For the resulting MA-IM schemes, joint maximum-likelihood (ML) detectors are derived, along with a low-complexity two-stage detector, and unified analytical upper bounds on the average bit error probability (ABEP) are established based on the joint index--modulation constellation. The results reveal that directly indexing all sampling points is generally unreliable, highlighting the necessity of anchor optimization. The performance of MA-IM is shown to depend on key system parameters, including channel richness, spatial correlation, the number of index states, and the modulation order. In particular, increasing the number of index states and increasing the QAM order affect MA-IM in fundamentally different ways, even under the same transmission rate. Among the proposed schemes, the joint constellation-aware anchor design achieves the best error performance, demonstrating that optimizing channel-domain separation alone is insufficient and that effective MA-IM design must account for the geometry of the joint signal constellation. Simulation results further show that, with properly designed anchors, MA-IM can approach or even outperform same-spectral-efficiency QAM baselines.

2603.26150 2026-03-30 eess.SP

Vector Similarity Search-Based MCS Selection in Massive Multi-User MIMO-OFDM

Fuga Kobayashi, Takumi Takahashi, Shinsuke Ibi, Takanobu Doi, Kazushi Muraoka, Hideki Ochiai

Comments 17 pages, 41 figures

详情
英文摘要

This paper proposes a novel modulation and coding scheme (MCS) selection framework that integrates mutual information (MI) prediction based on vector similarity search (VSS) for massive multi-user multiple-input multiple-output orthogonal frequency-division multiplexing (MU-MIMO-OFDM) systems with advanced uplink multi-user detection (MUD). The framework performs MCS selection at the transport block (TB)-level MI and establishes the mapping from post-MUD MI to post-decoding block error rate (BLER) using a prediction function generated from extrinsic information transfer (EXIT) curves. A key innovation is the VSS-based MI prediction scheme, which addresses the challenge of analytically predicting MI in iterative detectors such as expectation propagation (EP). In this scheme, an offline vector database (VDB) stores feature vectors derived from channel state information (CSI) and average received signal-to-noise ratio (SNR), together with corresponding MI values achieved with advanced MUD. During online operation, an approximate nearest neighbor (ANN) search on graphics processing units (GPUs) enables ultra-fast and accurate MI prediction, effectively capturing iterative detection gains. Simulation results under fifth-generation new radio (5G NR)-compliant settings demonstrate that the proposed framework significantly improves both system and user throughput, ensuring that the detection gains of advanced MUD are faithfully translated into tangible system-level performance improvements.

2603.26143 2026-03-30 eess.SP

Optimized Non-Uniform Pilot Pattern for OFDM Sensing

Amir Bouziane, Huseyin Arslan

详情
英文摘要

Standard periodic pilot patterns in orthogonal frequency division multiplexing (OFDM) systems induce severe delay-domain grating lobes, compromising radar sensing. This paper proposes a two-stage framework to design non-periodic pilot patterns that minimize the peak sidelobe level (PSL) while strictly enforcing communication anchor constraints. We black solve this combinatorial problem using a low-complexity hybrid greedy-stochastic cyclic coordinate descent (SCCD) algorithm. This approach shatters cyclic periodicities to suppress deterministic grating lobes beneath the impassable data-to-pilot interference (DPI) noise floor. System-level evaluations demonstrate the performance of the proposed design in resolving the sensing-communication trade-off, showing improved range root mean square error (RMSE) without degrading the primary communication bit error rate (BER).

2603.26125 2026-03-30 cs.IT eess.SP math.IT

CL-SEC: Cross-Layer Semantic Error Correction Empowered by Language Models

Yirun Wang, Yuyang Du, Soung Chang Liew, Yuchen Pan, Feifan Zhang, Lihao Zhang

详情
英文摘要

Achieving reliable communication has long been a fundamental challenge in networked systems. Semantic Error Correction (SEC) leverages the semantic understanding capabilities of language models (LMs) to perform application-layer error correction, complementing conventional channel decoding. While promising, existing SEC approaches rely solely on context captured by LMs at the application layer, ignoring the rich information available at the physical layer. To address this limitation, this paper introduces Cross-Layer SEC (CL-SEC), an LM-empowered error correction framework that integrates cross-layer information from both the physical and application layers to jointly correct corrupted words in text communication. Using a Bayesian combination in product form tailored to this framework, CL-SEC achieves significantly improved performance over methods that process information in isolated layers. CL-SEC shows substantial gains across multiple error-correction metrics, including bit-error rate, word-error rate, and semantic fidelity scores. Importantly, unlike most semantic communication systems that focus solely on recovering the semantic meaning of transmitted messages, CL-SEC aims to reconstruct the original transmitted message verbatim, leveraging the semantic understanding capabilities of LMs for precise reconstruction.

2603.26117 2026-03-30 eess.IV cs.CV

FINDER: Zero-Shot Field-Integrated Network for Distortion-free EPI Reconstruction in Diffusion MRI

Namgyu Han, Seong Dae Yun, Chaeeun Lim, Sunghyun Seok, Sunju Kim, Yoonhwan Kim, Yohan Jun, Tae Hyung Kim, Berkin Bilgic, Jaejin Cho

Comments 11 pages, 4 figures

详情
英文摘要

Echo-planar imaging (EPI) remains the cornerstone of diffusion MRI, but it is prone to severe geometric distortions due to its rapid sampling scheme that renders the sequence highly sensitive to $B_{0}$ field inhomogeneities. While deep learning has helped improve MRI reconstruction, integrating robust geometric distortion correction into a self-supervised framework remains an unmet need. To address this, we present FINDER (Field-Integrated Network for Distortion-free EPI Reconstruction), a novel zero-shot, scan-specific framework that reformulates reconstruction as a joint optimization of the underlying image and the $B_{0}$ field map. Specifically, we employ a physics-guided unrolled network that integrates dual-domain denoisers and virtual coil extensions to enforce robust data consistency. This is coupled with an Implicit Neural Representation (INR) conditioned on spatial coordinates and latent image features to model the off-resonance field as a continuous, differentiable function. Employing an alternating minimization strategy, FINDER synergistically updates the reconstruction network and the field map, effectively disentangling susceptibility-induced geometric distortions from anatomical structures. Experimental results demonstrate that FINDER achieves superior geometric fidelity and image quality compared to state-of-the-art baselines, offering a robust solution for high-quality diffusion imaging.

2603.26113 2026-03-30 cs.MM cs.SD eess.AS

Cinematic Audio Source Separation Using Visual Cues

Kang Zhang, Suyeon Lee, Arda Senocak, Joon Son Chung

Comments CVPR 2026. Project page: https://cass-flowmatching.github.io

详情
英文摘要

Cinematic Audio Source Separation (CASS) aims to decompose mixed film audio into speech, music, and sound effects, enabling applications like dubbing and remastering. Existing CASS approaches are audio-only, overlooking the inherent audio-visual nature of films, where sounds often align with visual cues. We present the first framework for audio-visual CASS (AV-CASS), leveraging visual context to enhance separation quality. Our method formulates CASS as a conditional generative modeling problem using conditional flow matching, enabling multimodal audio source separation. To address the lack of cinematic datasets with isolated sound tracks, we introduce a training data synthesis pipeline that pairs in-the-wild audio and video streams (e.g., facial videos for speech, scene videos for effects) and design a dedicated visual encoder for this dual-stream setup. Trained entirely on synthetic data, our model generalizes effectively to real-world cinematic content and achieves strong performance on synthetic, real-world, and audio-only CASS benchmarks. Code and demo are available at \url{https://cass-flowmatching.github.io}.

2603.26101 2026-03-30 eess.SP

Joint Sensing and Covert Communications in RIS-NOMA Systems

Jiayi Lei, Xidong Mu, Tiankui Zhang, Wenjun Xu, Ping Zhang

详情
英文摘要

A reconfigurable intelligent surface (RIS)-assisted non-orthogonal multiple access (NOMA) system is investigated, where the transmitter (Alice) is a dual functional radar communication (DFRC) base station (BS) that aims to sense the location of a potential warden (Willie), while simultaneously transmitting public and covert signals to the legitimate users, Carol and Bob, respectively. Both cases of known and unknown Willie locations are considered. For the known-location case, assuming perfect channel state information (CSI) at Willie, a covert rate maximization is formulated with the joint optimization of active and passive beamforming, which is solved using successive convex approximation (SCA), penalty method, and semidefinite relaxation (SDR). For the unknown-location case, we propose to estimate Willie's location via radar sensing and develop a sensing-based imperfect CSI model. In particular, the CSI error uncertainty is bounded by the sensing accuracy, which is characterized by the Cramer-Rao bound (CRB). Subsequently, a robust communication rate maximization problem is formulated under the constraints on quality-of-service (QoS) of Carol, sensing accuracy, and covertness level. The Schur complement and S-procedure are employed to handle the non-convex constraints. Numerical results compare the system performance under the two cases, and demonstrate the significant covert performance superiority of the sensing-based imperfect CSI model and NOMA over the general norm-bounded imperfect CSI model and the orthogonal multiple access scheme. Furthermore, the dual yet contradictory effects of sensing on covert communications are revealed. It is also found that Alice primarily utilizes Carol's signal for sensing, while allocating almost all of Bob's signal for communication.

2603.26087 2026-03-30 eess.SP cs.IT math.IT

Repeater-Assisted MIMO Can Also Boost Frequency Diversity: A Semi-Analytic Study

Hiroki Iimori, Yuto Hama

Comments 6 pages, 2 figures. This manuscript has been submitted to IEEE for possible publication

详情
英文摘要

Massive multiple-input multiple-output (MIMO) has enabled substantial spatial multiplexing and array gains in real-world systems, while distributed MIMO (D-MIMO) improves macro-diversity over wide areas at the cost of deployment complexity. Repeater-assisted massive MIMO (RA-MIMO) is a lower-cost alternative that can recover key distributed-MIMO advantages. This paper asks whether repeater assistance can also enhance frequency diversity. We study an uncoded discrete Fourier transform-spread orthogonal frequency-division multiplexing (DFT-s-OFDM) uplink with one-tap single-carrier frequency-domain equalization (SC-FDE) based on minimum mean-square error (MMSE) and derive a receiver-matched semi-analytic bit-error rate (BER) expression by averaging over channel and interference realizations, without Gaussian approximation of residual despreading interference. The analysis clarifies how repeater delay reshapes frequency correlation, and waveform simulations confirm tight agreement with the derived expression together with improved high-signal-to-noise ratio (SNR) BER decay, highlighting delay as a practical tuning knob.

2603.26081 2026-03-30 eess.SY cs.CV cs.SY

Experimental study on surveillance video-based indoor occupancy measurement with occupant-centric control

Irfan Qaisar, Kailai Sun, Qingshan Jia, Qianchuan Zhao

详情
英文摘要

Accurate occupancy information is essential for closed-loop occupant-centric control (OCC) in smart buildings. However, existing vision-based occupancy measurement methods often struggle to provide stable and accurate measurements in real indoor environments, and their implications for downstream HVAC control remain insufficiently studied. To achieve Net Zero emissions by 2050, this paper presents an experimental study of large language models (LLMs)-enhanced vision-based indoor occupancy measurement and its impact on OCC-enabled HVAC operation. Detection-only, tracking-based, and LLM-based refinement pipelines are compared under identical conditions using real surveillance data collected from a research laboratory in China, with frame-level manual ground-truth annotations. Results show that tracking-based methods improve temporal stability over detection-only measurement, while LLM-based refinement further improves occupancy measurement performance and reduces false unoccupied prediction. The best-performing pipeline, YOLOv8+DeepSeek, achieves an accuracy of 0.8824 and an F1-score of 0.9320. This pipeline is then integrated into an HVAC supervisory model predictive control framework in OpenStudio-EnergyPlus. Experimental results demonstrate that the proposed framework can support more efficient OCC operation, achieving a substantial HVAC energy-saving potential of 17.94%. These findings provide an effective methodology and practical foundation for future research in AI-enhanced smart building operations.

2603.26080 2026-03-30 eess.SY cs.SY

LQR for Systems with Probabilistic Parametric Uncertainties: A Gradient Method

Leilei Cui, Richard D. Braatz

Comments 16 pages, 5 figures

详情
英文摘要

A gradient-based method is proposed for solving the linear quadratic regulator (LQR) problem for linear systems with nonlinear dependence on time-invariant probabilistic parametric uncertainties. The approach explicitly accounts for model uncertainty and ensures robust performance. By leveraging polynomial chaos theory (PCT) in conjunction with policy optimization techniques, the original stochastic system is lifted into a high-dimensional linear time-invariant (LTI) system with structured state-feedback control. A first-order gradient descent algorithm is then developed to directly optimize the structured feedback gain and iteratively minimize the LQR cost. We rigorously establish linear convergence of the gradient descent algorithm and show that the PCT-based approximation error decays algebraically at a rate $O(N^{-p})$ for any positive integer $p$, where $N$ denotes the order of the polynomials. Numerical examples demonstrate that the proposed method achieves significantly higher computational efficiency than conventional bilinear matrix inequality (BMI)-based approaches.

2603.26050 2026-03-30 eess.SY cs.SY

Hierarchical Control Framework Integrating LLMs with RL for Decarbonized HVAC Operation

Dianyu Zhong, Tian Xing, Kailai Sun, Xu Yang, Heye Huang, Irfan Qaisar, Tinggang Jia, Shaobo Wang, Qianchuan Zhao

详情
英文摘要

Heating, ventilation, and air conditioning (HVAC) systems account for a substantial share of building energy consumption. Environmental uncertainty and dynamic occupancy behavior bring challenges in decarbonized HVAC control. Reinforcement learning (RL) can optimize long-horizon comfort-energy trade-offs but suffers from exponential action-space growth and inefficient exploration in multi-zone buildings. Large language models (LLMs) can encode semantic context and operational knowledge, yet when used alone they lack reliable closed-loop numerical optimization and may result in less reliable comfort-energy trade-offs. To address these limitations, we propose a hierarchical control framework in which a fine-tuned LLM, trained on historical building operation data, generates state-dependent feasible action masks that prune the combinatorial joint action space into operationally plausible subsets. A masked value-based RL agent then performs constrained optimization within this reduced space, improving exploration efficiency and training stability. Evaluated in a high-fidelity simulator calibrated with real-world sensor and occupancy data from a 7-zone office building, the proposed method achieves a mean PPD of 7.30%, corresponding to reductions of 39.1% relative to DQN, the best vanilla RL baseline in comfort, and 53.1% relative to the best vanilla LLM baseline, while reducing daily HVAC energy use to 140.90~kWh, lower than all vanilla RL baselines. The results suggest that LLM-guided action masking is a promising pathway toward efficient multi-zone HVAC control.

2603.25238 2026-03-30 eess.SP cs.IT math.IT

Rate-Splitting Multiple Access with a SIC-Free Receiver: An Experimental Study

Guoqian Sun, Xinze Lyu, Bruno Clerckx

详情
英文摘要

Most Rate-Splitting Multiple Access (RSMA) implementations rely on successive interference cancellation (SIC) at the receiver, whose performance is inherently limited by error propagation during common-stream decoding. This paper addresses this issue by developing a SIC-free RSMA receiver based on joint demapping (JD), which directly evaluates bit vectors over a composite constellation. Using a two-user Multiple-Input Single-Output (MISO) prototype, we conduct over-the-air measurements to systematically compare SIC and JD-based receivers. The results show that the proposed SIC-free receiver provides stronger reliability and better practicality over a wider operating range, with all observations being consistent with theoretical expectations.

2603.25211 2026-03-30 eess.SY cs.SY

On Port-Hamiltonian Formulation of Hysteretic Energy Storage Elements: The Backlash Case

Jurrien Keulen, Bayu Jayawardhana, Arjan van der Schaft

详情
英文摘要

This paper presents a port-Hamiltonian formulation of hysteretic energy storage elements. First, we revisit the passivity property of backlash-driven storage elements by presenting a family of storage functions associated to the dissipativity property of such elements. We explicitly derive the corresponding available storage and required supply functions `a la Willems [1], and show the interlacing property of the aforementioned family of storage functions sandwiched between the available storage and required supply functions. Second, using the proposed family of storage functions, we present a port-Hamiltonian formulation of hysteretic inductors as prototypical storage elements in port-Hamiltonian systems. In particular, we show how a Hamiltonian function can be chosen from the family of storage functions and how the hysteretic elements can be expressed as port-Hamiltonian system with feedthrough term, where the feedthrough term represents energy dissipation. Correspondingly, we illustrate its applicability in describing an RLC circuit (in parallel and in series) containing a hysteretic inductor element.