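arXivDaily daily academic digest: synchronized with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.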

2603.03280 2026-03-04 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY

How to Peel with a Knife: Aligning Fine-Grained Manipulation with Human Preference

Toru Lin, Shuying Deng, Zhao-Heng Yin, Pieter Abbeel, Jitendra Malik

Comments Project page can be found at https://toruowo.github.io/peel

English abstract

Many essential manipulation tasks - such as food preparation, surgery, and craftsmanship - remain intractable for autonomous robots. These tasks are characterized not only by contact-rich, force-sensitive dynamics, but also by their "implicit" success criteria: unlike pick-and-place, task quality in these domains is continuous and subjective (e.g. how well a potato is peeled), making quantitative evaluation and reward engineering difficult. We present a learning framework for such tasks, using peeling with a knife as a representative example. Our approach follows a two-stage pipeline: first, we learn a robust initial policy via force-aware data collection and imitation learning, enabling generalization across object variations; second, we refine the policy through preference-based finetuning using a learned reward model that combines quantitative task metrics with qualitative human feedback, aligning policy behavior with human notions of task quality. Using only 50-200 peeling trajectories, our system achieves over 90% average success rates on challenging produce including cucumbers, apples, and potatoes, with performance improving by up to 40% through preference-based finetuning. Remarkably, policies trained on a single produce category exhibit strong zero-shot generalization to unseen in-category instances and to out-of-distribution produce from different categories while maintaining over 90% success rates.
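The abstract does not say how the reward model is trained from human feedback. As a rough illustration only, the sketch below fits a linear reward to pairwise preferences with a Bradley-Terry (logistic) loss, a common choice for preference learning that is assumed here, not taken from the paper. The feature names (`peeled_fraction`, `flesh_removed`), the toy data, and the function name are all hypothetical.

```python
import numpy as np

def train_preference_reward(preferred, rejected, lr=0.1, epochs=500):
    """Fit a linear reward r(f) = w . f from pairwise preferences.

    Assumption: Bradley-Terry model, P(a preferred over b) = sigmoid(r(a) - r(b)).
    """
    diff = np.asarray(preferred, float) - np.asarray(rejected, float)
    w = np.zeros(diff.shape[1])
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(diff @ w)))             # P(preferred wins)
        grad = -((1.0 - p)[:, None] * diff).mean(axis=0)  # gradient of mean -log p
        w -= lr * grad
    return w

# Hypothetical per-trajectory features: [peeled_fraction, flesh_removed].
preferred = np.array([[0.90, 0.10], [0.80, 0.20], [0.95, 0.30]])
rejected = np.array([[0.50, 0.10], [0.70, 0.60], [0.60, 0.30]])

w = train_preference_reward(preferred, rejected)
margins = (preferred - rejected) @ w  # positive when the model agrees with the label
```

In this toy setup the learned weights reward more peeled area and penalize removed flesh; a reward of this kind could then score new trajectories during finetuning.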

2603.03196 2026-03-04 math.NA cs.IT cs.LG cs.NA eess.SP math.IT math.PR

Infinite dimensional generative sensing

Paolo Angella, Vito Paolo Pastore, Matteo Santacesaria

2603.03184 2026-03-04 eess.SP

Continuous-Aperture Array-Based ISAC Over Fading Channels

Boqun Zhao, Chongjun Ouyang, Xingqi Zhang, Yuanwei Liu

2603.03173 2026-03-04 eess.SY cs.SY

Can a Learner Regret Using a No-Regret Algorithm? A Control-Theoretic Study of Performance Dominance

Hassan Abdelraouf, Jeff S. Shamma

English abstract

No-regret learning dynamics ensure that a learner asymptotically achieves an average reward no worse than that of any fixed strategy. This no-regret guarantee does not determine the value of the asymptotic average reward. Indeed, it is possible for different no-regret learning dynamics to exhibit different asymptotic average rewards when facing the same environment while both assure the no-regret guarantee. This paper asks whether a "free-lunch" phenomenon can arise among no-regret algorithms. Namely, is it possible for one no-regret learning rule to uniformly outperform another no-regret learning rule across all payoff environments? Stated differently, can a learner regret not using a particular no-regret algorithm? We consider generalized replicator dynamics (RD) as a cascade interconnection between a linear time-invariant (LTI) system and the softmax nonlinearity. Varying this LTI system leads to different realizations of replicator dynamics, including so-called anticipatory RD, exponential RD, and other forms of higher-order RD. Setting the LTI system to be an integrator realizes standard RD, which is known to satisfy the no-regret property. Within this framework, we analyze and compare various realizations of generalized RD by varying the LTI system. We first formulate performance comparison as a passivity property of an associated comparison system and establish "local" dominance results, i.e., comparing the asymptotic performance near an equilibrium payoff vector. We then cast performance comparison between a form of anticipatory RD and standard RD as an optimal-control problem. We show that the minimal achievable cumulative reward gap is zero, thereby establishing global dominance of anticipatory RD across all payoff environments and establishing a "free lunch" among no-regret learning dynamics.
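The cascade described in the abstract, an LTI system feeding a softmax, can be sketched for the one case the abstract pins down: an integrator, which realizes standard RD. This is a minimal sketch, not the authors' code. The forward-Euler discretization, step size, function names, and constant payoff vector are assumptions made for illustration. The anticipatory and higher-order variants are omitted because the abstract does not specify their LTI systems.

```python
import numpy as np

def softmax(z):
    z = z - np.max(z)  # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def simulate_standard_rd(payoff_fn, n_actions, dt=0.01, steps=5000):
    """Integrator (the LTI block) followed by softmax, stepped with forward Euler."""
    z = np.zeros(n_actions)  # integrator state: accumulated payoffs
    x = softmax(z)           # mixed strategy on the simplex
    total_reward = 0.0
    for k in range(steps):
        p = payoff_fn(k * dt, x)            # payoff vector from the environment
        total_reward += float(x @ p) * dt   # reward earned by the current strategy
        z = z + dt * p                      # integrator update
        x = softmax(z)
    return x, total_reward / (steps * dt)

# Toy environment (assumed): a constant payoff vector, so action 0 is the best fixed choice.
final_x, avg_reward = simulate_standard_rd(lambda t, x: np.array([1.0, 0.5, 0.0]), 3)
```

In this toy environment the strategy concentrates on action 0 and the time-averaged reward approaches that of the best fixed action, which is the behavior the no-regret property describes.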

2603.03127 2026-03-04 eess.SY cs.SY math.DS

Deep Q-Learning-Based Gain Scheduling for Nonlinear Quadcopter Dynamics

Hossein Rastgoftar, Muhammad J. H. Zahed

2603.03102 2026-03-04 eess.SP

KA band mobile antenna for satellite communication

Sidra Tul. Muntaha, Ahmad Arfeen, Kashan Raza

2603.03082 2026-03-04 eess.SY cs.LG cs.SY math.DS math.OC

Safe and Robust Domains of Attraction for Discrete-Time Systems: A Set-Based Characterization and Certifiable Neural Network Estimation

Mohamed Serry, Maxwell Fitzsimmons, Jun Liu

2603.03060 2026-03-04 eess.IV eess.AS

DLIOS: An LLM-Augmented Real-Time Multi-Modal Interactive Enhancement Overlay System for Douyin Live Streaming

Shuide Wen, Sungil Seok, Beier Ku, Richee Li, Yubin He, Bowen Qu, Yang Yang, Ping Su, Can Jiao

Comments 14 pages, 13 figures, 6 tables, 7 algorithms, 16 references, submitted to ACM/IEEE International Conference on Systems and Software Engineering

2603.02998 2026-03-04 cs.IT eess.SP math.IT

An Optimization-Based User Scheduling Framework for Multiuser MIMO Systems

Victoria Palhares, Christoph Studer

Comments Submitted to a journal

2603.02975 2026-03-04 eess.SY cs.SY

Grid-Forming Control with Assignable Voltage Regulation Guarantees and Safety-Critical Current Limiting

Bhathiya Rathnayake, Sijia Geng

English abstract

This paper develops a nonlinear grid-forming (GFM) controller with provable voltage-formation guarantees, with over-current limiting enforced via a control-barrier-function (CBF)-based safety filter. The nominal controller follows a droop-based inner-outer architecture, in which voltage references and frequency are generated by droop laws, an outer-loop voltage controller produces current references using backstepping (BS), and an inner-loop current controller synthesizes the terminal voltage. The grid voltage is treated as an unknown bounded disturbance, without requiring knowledge of its bound, and the controller design does not rely on any network parameters beyond the point of common coupling (PCC). To robustify voltage formation against the grid voltage, a deadzone-adapted disturbance suppression (DADS) framework is incorporated, yielding practical voltage regulation characterized by asymptotic convergence of the PCC voltage errors to an assignably small and known residual set. Furthermore, the closed-loop system is proven to be globally well posed, with all physical and adaptive states bounded and voltage error transients (due to initial conditions) decaying exponentially at an assignable rate. On top of the nominal controller, hard over-current protection is achieved through a minimally invasive CBF-based safety filter that enforces strict current limits via a single-constraint quadratic program. The safety filter is compatible with any locally Lipschitz nominal controller. Rigorous analysis establishes forward invariance of the safe-current set and boundedness of all states under current limiting. Numerical results demonstrate improved transient performance and faster recovery during current-limiting events when the proposed DADS-BS controller is used as the nominal control law, compared with conventional PI-based GFM control.
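A quadratic program with a single affine constraint, as in the safety filter the abstract mentions, has a closed-form solution: keep the nominal input when it already satisfies the constraint, otherwise project it onto the constraint boundary. The sketch below shows that projection on a toy model. The single-integrator current dynamics `di/dt = u`, the barrier `h(i) = I_max^2 - ||i||^2`, the gain `alpha`, and all names are assumptions for illustration and are not the paper's converter model or controller.

```python
import numpy as np

def safety_filter(u_nom, a, b):
    """Solve min ||u - u_nom||^2 subject to a.u + b >= 0 in closed form."""
    slack = float(a @ u_nom) + b
    if slack >= 0.0:
        return u_nom                      # nominal input is already safe
    denom = float(a @ a)
    if denom == 0.0:
        return u_nom                      # constraint does not depend on u
    return u_nom - (slack / denom) * a    # projection onto a.u + b = 0

# Toy model (assumed): di/dt = u, with safe set ||i|| <= I_MAX.
I_MAX, ALPHA, DT = 1.0, 10.0, 1e-3
i = np.zeros(2)
u_nom = np.array([5.0, 0.0])              # nominal input that would exceed the limit
peak = 0.0
for _ in range(5000):
    h = I_MAX**2 - float(i @ i)           # barrier value, >= 0 inside the safe set
    # CBF condition dh/dt + ALPHA*h >= 0 becomes (-2 i).u + ALPHA*h >= 0
    u = safety_filter(u_nom, -2.0 * i, ALPHA * h)
    i = i + DT * u
    peak = max(peak, float(np.linalg.norm(i)))
```

Without the filter the toy current would grow to 25 over the same horizon; with it, the magnitude approaches `I_MAX` and stays below it, and the nominal input is left unchanged whenever the constraint is inactive.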

2603.02937 2026-03-04 eess.AS cs.LG

Bias and Fairness in Self-Supervised Acoustic Representations for Cognitive Impairment Detection

Kashaf Gulzar, Korbinian Riedhammer, Elmar Nöth, Andreas K. Maier, Paula Andrea Pérez-Toro

Comments 12 pages, 4 figures, 6 tables, Journal paper

2603.02914 2026-03-04 eess.AS

Does Fine-tuning by Reinforcement Learning Improve Generalization in Binary Speech Deepfake Detection?

Xin Wang, Ge Wanying, Junichi Yamagishi

Comments Submitted to Interspeech 2026; put on arxiv based on requirement of paper open-access rule; quote from Interspeech: "Interspeech no longer enforces an anonymity period for submissions. While uploading a version online is permitted, your official submission to Interspeech must not contain any author-identifying information"

2603.02877 2026-03-04 eess.AS

DBMIF: a deep balanced multimodal iterative fusion framework for air- and bone-conduction speech enhancement

Yilei Wu, Changyan Zheng, Xingyu Zhang, Yakun Zhang, Chengshi Zheng, Shuang Yang, Ye Yan, Erwei Yin

Comments 10 pages, 7 figures, Applied Intelligence

2603.02832 2026-03-04 eess.SP

Exploiting Double-Bounce Paths in Snapshot Radio SLAM: Bounds, Algorithms and Experiments

Xi Zhang, Yu Ge, Ossi Kaltiokallio, Musa Furkan Keskin, Henk Wymeersch, Mikko Valkama

2603.02794 2026-03-04 cs.SD cs.AI cs.LG eess.AS

Differentiable Time-Varying IIR Filtering for Real-Time Speech Denoising

Riccardo Rota, Kiril Ratmanski, Jozef Coldenhoff, Milos Cernak

Comments Submitted to Interspeech 2026

2603.02768 2026-03-04 eess.SP

Enhancing AAV-Enabled Secure Communications via Synthetic Aperture Beamforming

Bin Qiu, Wenchi Cheng, Hongxiang He, Jiangzhou Wang

2603.00728 2026-03-04 cs.LO cs.SE cs.SY eess.SY

Quantitative Monitoring of Signal First-Order Logic

Marek Chalupa, Thomas A. Henzinger, N. Ege Saraç, Emily Yu

Comments Full version of the FM 2026 paper

2603.00572 2026-03-04 physics.optics cs.SY eess.SY

Depth-adapted adaptive optics for three-photon microscopy

Qi Hu, Jingyu Wang, Huriye Atilgan, Armin Lak, Martin J. Booth

2512.22901 2026-03-04 eess.SY cs.AI cs.LG cs.SY eess.SP

A Neural Network-Based Real-time Casing Collar Recognition System for Downhole Instruments

Si-Yu Xiao, Xin-Di Zhao, Xiang-Zhan Wang, Tian-Hao Mao, Ying-Kai Liao, Xing-Yu Liao, Yu-Qiao Chen, Jun-Jie Wang, Shuang Liu, Tu-Pei Chen, Yang Liu

2511.14065 2026-03-04 q-bio.NC cs.SY eess.SY

Intrinsic Resonance depends on Network Size of Coupled-Delayed Interacting Oscillators

Felipe A. Torres, Alejandro Weinstein, Jesus M. Cortes, Wael El-Deredy

Comments 16 pages, 3 figures: 2 figures in the main text and 1 figure in the appendix

2510.17270 2026-03-04 cs.RO cs.SY eess.SY

Floating-Base Deep Lagrangian Networks

Lucas Schulze, Juliano Decico Negri, Victor Barasuol, Vivian Suzano Medeiros, Marcelo Becker, Jan Peters, Oleg Arenz

2510.16953 2026-03-04 eess.SY cs.RO cs.SY

Safe Payload Transfer with Ship-Mounted Cranes: A Robust Model Predictive Control Approach

Ersin Das, William A. Welch, Patrick Spieler, Keenan Albee, Aurelio Noca, Jeffrey Edlund, Jonathan Becktor, Thomas Touma, Jessica Todd, Sriramya Bhamidipati, Stella Kombo, Maira Saboia, Anna Sabel, Grace Lim, Rohan Thakker, Amir Rahmani, Joel W. Burdick

2510.00256 2026-03-04 eess.AS cs.SD

Subjective quality evaluation of personalized own voice reconstruction systems

Mattes Ohlenbusch, Christian Rollwage, Simon Doclo, Jan Rennies

Comments Submitted to Acta Acustica

2507.16733 2026-03-04 eess.SP

Generative Diffusion Models for Wireless Networks: Fundamental, Architecture, and State-of-the-Art

Dayu Fan, Rui Meng, Xiaodong Xu, Yiming Liu, Guoshun Nan, Chenyuan Feng, Shujun Han, Song Gao, Bingxuan Xu, Dusit Niyato, Tony Q. S. Quek, Ping Zhang

Comments 46 pages, 10 figures

2506.23569 2026-03-04 quant-ph cs.SY eess.SY

Alleviating CoD in Renewable Energy Profile Clustering Using an Optical Quantum Computer

Chengjun Liu, Yijun Xu, Wei Gu, Bo Sun, Kai Wen, Shuai Lu, Lamine Mili

2412.09646 2026-03-04 eess.IV cs.CV cs.GR cs.LG

RealOSR: Latent Guidance Boosts Diffusion-based Real-world Omnidirectional Image Super-Resolutions

Xuhan Sheng, Runyi Li, Bin Chen, Weiqi Li, Xu Jiang, Jian Zhang

2401.01255 2026-03-04 eess.AS cs.AI cs.MM eess.SP

On the Parameter Estimation of Sinusoidal Models for Speech and Audio Signals

George P. Kafentzis

2307.04842 2026-03-04 eess.AS cs.AI

Predicting Tuberculosis from Real-World Cough Audio Recordings and Metadata

George P. Kafentzis, Stephane Tetsing, Joe Brew, Lola Jover, Mindaugas Galvosas, Carlos Chaccour, Peter M. Small

2110.03427 2026-03-04 cs.LG cs.CL cs.SD eess.AS eess.SP

Is Attention always needed? A Case Study on Language Identification from Speech

Atanu Mandal, Santanu Pal, Indranil Dutta, Mahidas Bhattacharya, Sudip Kumar Naskar

Comments Accepted for publication in Natural Language Engineering

2603.02721 2026-03-04 eess.SP

Doppler Shift Keying Modulation for Uplink Multiple Access over Doubly-Dispersive Channels

Xuehan Wang, Jintao Wang, Hai Lin, Jinhong Yuan, Xu Shi, Hengyu Zhang, Jian Song

Comments This paper has been accepted by IEEE Transactions on Vehicular Technology (TVT)