arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2506.02797 2026-03-11 eess.AS cs.SD

Fast-Converging Distributed Signal Estimation in Topology-Unconstrained Wireless Acoustic Sensor Networks

Paul Didier, Toon van Waterschoot, Simon Doclo, Jörg Bitzer, Marc Moonen

详情

英文摘要

This paper focuses on distributed signal estimation in topology-unconstrained wireless acoustic sensor networks (WASNs) where sensor nodes only transmit fused versions of their local sensor signals. For this task, the topology-independent (TI) distributed adaptive node-specific signal estimation (DANSE) algorithm (TI-DANSE) has previously been proposed. It converges towards the centralized signal estimation solution in non-fully connected and time-varying network topologies. However, the applicability of TI-DANSE in real-world scenarios is limited due to its slow convergence. The latter results from the fact that, in TI-DANSE, nodes only have access to the in-network sum of all fused signals in the WASN. We address this low convergence speed by introducing an improved TI-DANSE algorithm, referred to as TI-DANSE+, in which updating nodes separately use the partial in-network sums of fused signals coming from each of their neighbors. Nodes can maximize the number of available degrees of freedom in their local optimization problem, leading to faster convergence. This is further exploited by combining TI-DANSE+ with a tree-pruning strategy that maximizes the number of neighbors at the updating node. In fully connected WASNs, TI-DANSE+ converges as fast as the original DANSE algorithm (the latter only defined for fully connected WASNs) while using peer-to-peer data transmission instead of broadcasting and thus saving communication bandwidth. If link failures occur, the convergence of TI-DANSE+ towards the centralized solution is preserved without any change in its formulation. Altogether, the proposed TI-DANSE+ algorithm can be viewed as an all-round alternative to DANSE and TI-DANSE which (i) merges the advantages of both, (ii) reconciliates their differences into a single formulation, and (iii) shows advantages of its own in terms of communication bandwidth usage.

URL PDF HTML ☆

赞 0 踩 0

2505.17655 2026-03-11 eess.AS cs.SD

Textless and Non-Parallel Speech-to-Speech Emotion Style Transfer

Soumya Dutta, Avni Jain, Sriram Ganapathy

Comments 11 pages, 10 figures, 6 tables

2504.08999 2026-03-11 cs.CR cs.AI

MCP Bridge: A Lightweight, LLM-Agnostic RESTful Proxy for Model Context Protocol Servers

Arash Ahmadi, Sarah Sharif, Yaser M. Banad

Comments 42 pages, 28 figures

2503.21735 2026-03-11 cs.SE cs.AI cs.CL cs.MA

GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics

Arsham Gholamzadeh Khoee, Shuai Wang, Robert Feldt, Dhasarathy Parthasarathy, Yinan Yu

详情

英文摘要

Ensuring reliable data-driven decisions is crucial in domains where analytical accuracy directly impacts safety, compliance, or operational outcomes. Decision support in such domains relies on large tabular datasets, where manual analysis is slow, costly, and error-prone. While Large Language Models (LLMs) offer promising automation potential, they face challenges in analytical reasoning, structured data handling, and ambiguity resolution. This paper introduces GateLens, an LLM-based architecture for reliable analysis of complex tabular data. Its key innovation is the use of Relational Algebra (RA) as a formal intermediate representation between natural-language reasoning and executable code, addressing the reasoning-to-code gap that can arise in direct generation approaches. In our automotive instantiation, GateLens translates natural language queries into RA expressions and generates optimized Python code. Unlike traditional multi-agent or planning-based systems that can be slow, opaque, and costly to maintain, GateLens emphasizes speed, transparency, and reliability. We validate the architecture in automotive software release analytics, where experimental results show that GateLens outperforms the existing Chain-of-Thought (CoT) + Self-Consistency (SC) based system on real-world datasets, particularly in handling complex and ambiguous queries. Ablation studies confirm the essential role of the RA layer. Industrial deployment demonstrates over 80% reduction in analysis time while maintaining high accuracy across domain-specific tasks. GateLens operates effectively in zero-shot settings without requiring few-shot examples or agent orchestration. This work advances deployable LLM system design by identifying key architectural features--intermediate formal representations, execution efficiency, and low configuration overhead--crucial for domain-specific analytical applications.

URL PDF HTML ☆

赞 0 踩 0

2503.08104 2026-03-11 cond-mat.mtrl-sci cs.LG

Functional Unit: A New Perspective on Materials Science Research Paradigms

Caichao Ye, Tao Feng, Weishu Liu, Wenqing Zhang

2501.17901 2026-03-11 q-bio.BM cs.LG

Molecular Fingerprints Are Strong Models for Peptide Function Prediction

Jakub Adamczyk, Piotr Ludynia, Wojciech Czech

2411.13862 2026-03-11 eess.IV cs.CV cs.RO

Image Compression Using Novel View Synthesis Priors

Luyuan Peng, Mandar Chitre, Hari Vishnu, Yuen Min Too, Bharath Kalyan, Rajat Mishra, Soo Pieng Tan

Comments Preprint submitted to IEEE Journal of Oceanic Engineering (v2.0)

2410.07409 2026-03-11 eess.SY cs.LG cs.MA cs.RO cs.SY

Learning responsibility allocations for multi-agent interactions: A differentiable optimization approach with control barrier functions

Isaac Remy, David Fridovich-Keil, Karen Leung

Comments 8 pages, 7 figures

2401.06340 2026-03-11 cs.HC cs.AI

A Temporal-Spectral Fusion Transformer with Subject-Specific Adapter for Enhancing RSVP-BCI Decoding

Xujin Li, Wei Wei, Shuang Qiu, Huiguang He

Comments 19 pages, 10 figures

详情

DOI: 10.1016/j.neunet.2024.106844
Journal ref: Neural Networks, 2025, 181: 106844

英文摘要

The Rapid Serial Visual Presentation (RSVP)-based Brain-Computer Interface (BCI) is an efficient technology for target retrieval using electroencephalography (EEG) signals. The performance improvement of traditional decoding methods relies on a substantial amount of training data from new test subjects, which increases preparation time for BCI systems. Several studies introduce data from existing subjects to reduce the dependence of performance improvement on data from new subjects, but their optimization strategy based on adversarial learning with extensive data increases training time during the preparation procedure. Moreover, most previous methods only focus on the single-view information of EEG signals, but ignore the information from other views which may further improve performance. To enhance decoding performance while reducing preparation time, we propose a Temporal-Spectral fusion transformer with Subject-specific Adapter (TSformer-SA). Specifically, a cross-view interaction module is proposed to facilitate information transfer and extract common representations across two-view features extracted from EEG temporal signals and spectrogram images. Then, an attention-based fusion module fuses the features of two views to obtain comprehensive discriminative features for classification. Furthermore, a multi-view consistency loss is proposed to maximize the feature similarity between two views of the same EEG signal. Finally, we propose a subject-specific adapter to rapidly transfer the knowledge of the model trained on data from existing subjects to decode data from new subjects. Experimental results show that TSformer-SA significantly outperforms comparison methods and achieves outstanding performance with limited training data from new subjects. This facilitates efficient decoding and rapid deployment of BCI systems in practical use.

URL PDF HTML ☆

赞 0 踩 0

2603.09072 2026-03-11 cs.HC cs.AI

A Text-Native Interface for Generative Video Authoring

Xingyu Bruce Liu, Mira Dontcheva, Dingzeyu Li

2603.09067 2026-03-11 stat.ML cond-mat.stat-mech cs.LG math-ph math.MP

Verifying Good Regulator Conditions for Hypergraph Observers: Natural Gradient Learning from Causal Invariance via Established Theorems

Max Zhuravlev

Comments 18 pages, 15 formal results. Part of a series of companion papers submitted simultaneously; cross-references updated with arXiv IDs in v2

2603.09058 2026-03-11 stat.ME cs.LG

Adaptive Active Learning for Online Reliability Prediction of Satellite Electronics

Shixiang Li, Yubin Tian, Dianpeng Wang, Piao Chen, Mengying Ren

2603.09034 2026-03-11 eess.AS cs.SD

Trade-offs Between Capacity and Robustness in Neural Audio Codecs for Adversarially Robust Speech Recognition

Jordan Prescott, Thanathai Lertpetchpun, Shrikanth Narayanan

Comments Submitted to Interspeech 2026

2603.09029 2026-03-11 cs.SE cs.AI cs.ET

Automating Detection and Root-Cause Analysis of Flaky Tests in Quantum Software

Janakan Sivaloganathan, Ainaz Jamshidi, Andriy Miranskyy, Lei Zhang

Comments 27 pages, 2 figures

2603.09023 2026-03-11 cs.OS cs.AI cs.SE

The Missing Memory Hierarchy: Demand Paging for LLM Context Windows

Tony Mason

2603.09020 2026-03-11 cs.HC cs.AI

AI Phenomenology for Understanding Human-AI Experiences Across Eras

Bhada Yun, Evgenia Taranova, Dana Feng, Renn Su, April Yi Wang

Comments This is an accepted workshop paper at CHI '26, "W37: Human-AI Interaction Alignment: Designing, Evaluating, and Evolving Value-Centered AI For Reciprocal Human-AI Futures", or https://bialign-workshop.github.io/2026/cfp

2603.09009 2026-03-11 stat.ML cs.LG

Statistical Inference via Generative Models: Flow Matching and Causal Inference

Shinto Eguchi

2603.08993 2026-03-11 cs.SE cs.AI cs.CR cs.PL

Arbiter: Detecting Interference in LLM Agent System Prompts

Tony Mason

2603.08979 2026-03-11 math.OC cs.LG stat.ML

Data-driven robust Markov decision processes on Borel spaces: performance guarantees via an axiomatic approach

Sivaramakrishnan Ramani

2603.08957 2026-03-11 cs.MS cs.AI cs.DB

Automated Tensor-Relational Decomposition for Large-Scale Sparse Tensor Computation

Yuxin Tang, Zhiyuan Xin, Zhimin Ding, Xinyu Yao, Daniel Bourgeois, Tirthak Patel, Chris Jermaine

2603.08947 2026-03-11 stat.ML cs.LG

Towards Reliable Simulation-based Inference

Arnaud Delaunoy

Comments PhD thesis

详情

英文摘要

Scientific knowledge expands by observing the world, hypothesizing some theories about it, and testing them against collected data. When those theories take the form of statistical models, statistical analyses are involved in the process of testing and refining scientific hypotheses. In this thesis, we focus on statistical models that take the form of scientific simulators and provide background about how machine learning can be used for statistical analyses in this context. The first part of this thesis is about showing empirically that performing statistical analyses with machine learning involves a degree of approximation. Specifically, all statistical analyses involve a level of uncertainty in the conclusions drawn, and we show that approximations can lead to overconfident conclusions. We draw caution regarding such overconfident conclusions and introduce a criterion to diagnose overconfident approximations. In the second part, we introduce balancing, a way to regularize machine learning models to reduce overconfidence and favor calibrated or underconfident approximations. Balancing is first introduced for neural ratio estimation algorithms and then extended to other algorithms. Intuition about why balancing leads to less overconfident solutions is provided, and it is shown empirically that balanced algorithms are often either close to calibrated or underconfident. The third part shows that Bayesian neural networks can also be used to mitigate the overconfidence of approximations. Unlike balancing, no regularization is required, and this solution can then work with few training samples and, hence, computationally expensive simulators. To that end, a new Bayesian neural network prior tailored for simulation-based inference is developed, and empirical results show a reduction in overconfidence compared to similar solutions without Bayesian neural networks.

URL PDF HTML ☆

赞 0 踩 0

2603.08945 2026-03-11 math.ST cs.LG stat.ML stat.TH

Kernel Debiased Plug-in Estimation based on the Universal Least Favorable Submodel

Haiyi Chen, Yang Liu, Ivana Malenica

2603.08931 2026-03-11 cs.NI cs.LG cs.SY eess.SY

Optimizing Reinforcement Learning Training over Digital Twin Enabled Multi-fidelity Networks

Hanzhi Yu, Hasan Farooq, Julien Forgeat, Shruti Bothe, Kristijonas Cyras, Md Moin Uddin Chowdhury, Mingzhe Chen

2603.08911 2026-03-11 cs.DC cs.AI cs.LG

FedLECC: Cluster- and Loss-Guided Client Selection for Federated Learning under Non-IID Data

Daniel M. Jimenez-Gutierrez, Giovanni Giunta, Mehrdad Hassanzadeh, Aris Anagnostopoulos, Ioannis Chatzigiannakis, Andrea Vitaletti

Comments Accepted to the IEEE International Workshop on Intelligent Cloud Computing and Networking (ICCN) from the IEEE International Conference on Computer Communications (INFOCOM) 2026

2603.08901 2026-03-11 cs.CR cs.AI

NetDiffuser: Deceiving DNN-Based Network Attack Detection Systems with Diffusion-Generated Adversarial Traffic

Pratyay Kumar, Abu Saleh Md Tayeen, Satyajayant Misra, Huiping Cao, Jiefei Liu, Qixu Gong, Jayashree Harikumar

2603.08881 2026-03-11 cond-mat.mtrl-sci cs.CL

From Word2Vec to Transformers: Text-Derived Composition Embeddings for Filtering Combinatorial Electrocatalysts

Lei Zhang, Markus Stricker

Comments 15 pages, 3 figures

2603.08865 2026-03-11 cs.NI cs.LG cs.RO

Why Channel-Centric Models are not Enough to Predict End-to-End Performance in Private 5G: A Measurement Campaign and Case Study

Nils Jörgensen

详情

英文摘要

Communication-aware robot planning requires accurate predictions of wireless network performance. Current approaches rely on channel-level metrics such as received signal strength and signal-to-noise ratio, assuming these translate reliably into end-to-end throughput. We challenge this assumption through a measurement campaign in a private 5G industrial environment. We evaluate throughput predictions from a commercial ray-tracing simulator as well as data-driven Gaussian process regression models against measurements collected using a mobile robot. The study uses off-the-shelf user equipment in an underground, radio-shielded facility with detailed 3D modeling, representing a best-case scenario for prediction accuracy. The ray-tracing simulator captures the spatial structure of indoor propagation and predicts channel-level metrics with reasonable fidelity. However, it systematically over-predicts throughput, even in line-of-sight regions. The dominant error source is shown to be over-estimation of sustainable MIMO spatial layers: the simulator assumes near-uniform four-layer transmission while measurements reveal substantial adaptation between one and three layers. This mismatch inflates predicted throughput even when channel metrics appear accurate. In contrast, a Gaussian process model with a rational quadratic kernel achieves approximately two-thirds reduction in prediction error with near-zero bias by learning end-to-end throughput directly from measurements. These findings demonstrate that favorable channel conditions do not guarantee high throughput; communication-aware planners relying solely on channel-centric predictions risk overly optimistic trajectories that violate reliability requirements. Accurate throughput prediction for 5G systems requires either extensive calibration of link-layer models or data-driven approaches that capture real system behavior.

URL PDF HTML ☆

赞 0 踩 0

2603.08856 2026-03-11 cs.HC cs.AI

Unpacking Interpretability: Human-Centered Criteria for Optimal Combinatorial Solutions

Dominik Pegler, Frank Jäkel, David Steyrl, Frank Scharnowski, Filip Melinscak

Comments 66 pages (42 main text, 24 appendix), 18 figures (5 in main text, 13 in appendix)

2603.08806 2026-03-11 cs.SE cs.AI

Test-Driven AI Agent Definition (TDAD): Compiling Tool-Using Agents from Behavioral Specifications

Tzafrir Rehan

Comments 9 pages, 2 figures, open benchmark at https://github.com/f-labs-io/tdad-paper-code

2603.08801 2026-03-11 quant-ph cs.AI

Large Language Model-Assisted Superconducting Qubit Experiments

Shiheng Li, Jacob M. Miller, Phoebe J. Lee, Gustav Andersson, Christopher R. Conner, Yash J. Joshi, Bayan Karimi, Amber M. King, Howard L. Malc, Harsh Mishra, Hong Qiao, Minseok Ryu, Xuntao Wu, Siyuan Xing, Haoxiong Yan, Jian Shi, Andrew N. Cleland

Comments 10 pages, 5 figures