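arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.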

2603.24578 2026-03-26 cs.CV eess.IV

Vision-Language Models vs Human: Perceptual Image Quality Assessment

Imran Mehmood, Imad Ali Shah, Ming Ronnier Luo, Brian Deegan

Details

English abstract

Psychophysical experiments remain the most reliable approach for perceptual image quality assessment (IQA), yet their cost and limited scalability encourage automated approaches. We investigate whether Vision Language Models (VLMs) can approximate human perceptual judgments across three image quality scales: contrast, colorfulness, and overall preference. Six VLMs (four proprietary and two open-weight models) are benchmarked against psychophysical data. This work presents a systematic benchmark of VLMs for perceptual IQA through comparison with human psychophysical data. The results reveal strong attribute-dependent variability: models with high human alignment for colorfulness (ρ up to 0.93) underperform on contrast, and vice versa. Attribute weighting analysis further shows that most VLMs assign higher weights to colorfulness than to contrast when evaluating overall preference, similar to the psychophysical data. Intra-model consistency analysis reveals a counterintuitive tradeoff: the most self-consistent models are not necessarily the most human-aligned, suggesting that response variability reflects sensitivity to scene-dependent perceptual cues. Furthermore, human-VLM agreement increases with perceptual separability, indicating that VLMs are more reliable when stimulus differences are clearly expressed.
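
The abstract reports human alignment as a correlation ρ. As a rough illustration of how such a figure could be obtained, the sketch below computes Spearman's rank correlation between human and model scores. It assumes ρ is a rank correlation, which the abstract does not state; the function names and the scores are hypothetical and are not taken from the paper.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: the Pearson correlation of the two rank vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one input is constant")
    return cov / (var_x * var_y) ** 0.5


# Hypothetical scores for five images (made-up numbers, not data from the paper).
human_scores = [1.2, 2.5, 3.1, 4.0, 4.8]
vlm_scores = [1.0, 3.0, 2.0, 4.5, 5.0]
rho = spearman_rho(human_scores, vlm_scores)
print(f"Spearman rho = {rho:.2f}")
```

A rank-based measure only checks whether the model orders the images as the observers do, so it is insensitive to differences in the scale on which the scores are expressed.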

2603.24566 2026-03-26 eess.SY cs.SY

Integral Control Barrier Functions with Input Delay: Prediction, Feasibility, and Robustness

Adam K. Kiss, Ersin Das, Tamas G. Molnar, Aaron D. Ames

2603.24540 2026-03-26 eess.SY cs.SY

A Modular Platooning and Vehicle Coordination Simulator for Research and Education

Kevin Jamsahar, Adrian Wiltz, Maria Charitidou, Dimos V. Dimarogonas

Comments 6 pages

2603.24509 2026-03-26 eess.SY cs.SY

Communication-Aware Dissipative Output Feedback Control

Ingyu Jang, Leila J. Bridgeman

Comments 6 pages, 2 figures, Submitted to IEEE Control Systems Letters (LCSS)

2603.24505 2026-03-26 eess.SP

JSSAnet: Theory-Guided Subchannel Partitioning and Joint Spatial Attention for Near-Field Channel Estimation

Zhiming Zhu, Shu Xu, Chunguo Li, Yongming Huang, Luxi Yang

2603.24503 2026-03-26 cs.LG cs.RO cs.SY eess.SY

Towards Safe Learning-Based Non-Linear Model Predictive Control through Recurrent Neural Network Modeling

Mihaela-Larisa Clement, Mónika Farsang, Agnes Poks, Johannes Edelmann, Manfred Plöchl, Radu Grosu, Ezio Bartocci

2603.24475 2026-03-26 cs.LG cs.SY eess.SY

Conformalized Transfer Learning for Li-ion Battery State of Health Forecasting under Manufacturing and Usage Variability

Samuel Filgueira da Silva, Mehmet Fatih Ozkan, Faissal El Idrissi, Marcello Canova

Comments Submitted to the 2026 American Control Conference (ACC)

2603.24419 2026-03-26 eess.SY cs.SY

Robust Optimal Operation of Virtual Power Plants Under Decision-Dependent Uncertainty of Price Elasticity

Tao Tan, Rui Xie, Meng Yang, Yue Chen

Comments 9 pages, 9 figures

2603.24385 2026-03-26 eess.AS

ArrayDPS-Refine: Generative Refinement of Discriminative Multi-Channel Speech Enhancement

Zhongweiyang Xu, Ashutosh Pandey, Juan Azcarreta, Zhaoheng Ni, Sanjeel Parekh, Buye Xu

Comments Accepted to ICASSP 2026

2603.24381 2026-03-26 physics.soc-ph cs.SY eess.SY

On a Co-evolving Opinion-Leadership Model in Social Networks

Martina Alutto, Lorenzo Zino, Karl H. Johansson, Angela Fontan

Comments 8 pages, 6 figures

2603.24328 2026-03-26 eess.SP

Towards Semantic-based Agent Communication Networks: Vision, Technologies, and Challenges

Ping Zhang, Rui Meng, Xiaodong Xu, Yaheng Wang, Zixuan Huang, Yiming Liu, Ruichen Zhang, Yinqiu Liu, Haonan Tong, Huishi Song, Gang Wu, Zhaoming Lu, Jiawen Kang, Geng Sun, Qinghe Du, Zhaohui Yang, Jingxuan Zhang, Han Meng, Lexi Xu, Haitao Zhao, Zesong Fei, Yiqing Zhou, Pei Xiao, Meixia Tao, Qinyu Zhang, Shuguang Cui, Rahim Tafazolli

Comments 46 pages, 15 figures

Details

English abstract

The International Telecommunication Union (ITU) identifies "Artificial Intelligence (AI) and Communication" as one of six key usage scenarios for 6G. Agentic AI, characterized by its capabilities in multi-modal environmental sensing, complex task coordination, and continuous self-optimization, is anticipated to drive the evolution toward agent-based communication networks. Semantic communication (SemCom), in turn, has emerged as a transformative paradigm that offers task-oriented efficiency, enhanced reliability in complex environments, and dynamic adaptation in resource allocation. However, comprehensive reviews that trace their technological evolution in the contexts of agent communications remain scarce. Addressing this gap, this paper systematically explores the role of semantics in agent communication networks. We first propose a novel architecture for semantic-based agent communication networks, structured into three layers, four entities, and four stages. Three wireless agent network layers define the logical structure and organization of entity interactions: the intention extraction and understanding layer, the semantic encoding and processing layer, and the distributed autonomy and collaboration layer. Across these layers, four AI agent entities, namely embodied agents, communication agents, network agents, and application agents, coexist and perform distinct tasks. Furthermore, four operational stages of semantic-enhanced agentic AI systems, namely perception, memory, reasoning, and action, form a cognitive cycle guiding agent behavior. Based on the proposed architecture, we provide a comprehensive review of the state-of-the-art on how semantics enhance agent communication networks. Finally, we identify key challenges and present potential solutions to offer directional guidance for future research in this emerging field.

2603.24268 2026-03-26 eess.SP

Incremental Learning-Based Open-Set Classification of Unknown UAVs via RF Signal Semantics

Julie Liu, Irshad A. Meer, Cicek Cavdar, Mustafa Ozger

Comments Accepted in ICC 2026

2603.24251 2026-03-26 eess.SY cs.SY

Spatial Correlation, Non-Stationarity, and Degrees of Freedom of Holographic Curvature-Reconfigurable Apertures

Liuxun Xue, Shu Sun, Ruifeng Gao, Xiaoqian Yi

Comments 16 pages, 14 figures

2603.24241 2026-03-26 eess.SY cs.LG cs.SY

C-STEP: Continuous Space-Time Empowerment for Physics-informed Safe Reinforcement Learning of Mobile Agents

Guihlerme Daubt, Adrian Redder

2603.24180 2026-03-26 cs.IT eess.SP math.IT

RIS-Assisted D-MIMO for Energy-Efficient 6G Indoor Networks

Akshay Vayal Parambath, Jose Flordelis, Venkatesh Tentu, Charitha Madapatha, Fredrik Rusek, Erik Bengtsson, Tommy Svensson

Comments 6 pages, 5 figures, Accepted to the IEEE International Conference on Communications (ICC) 2026

2603.22554 2026-03-26 eess.SY cs.SY

A Model Predictive Control Approach to Dual-Axis Agrivoltaic Panel Tracking

Anna Stuhlmacher, Panupong Srisuthankul, Johanna L. Mathieu, Peter Seiler

Comments 10 pages

2603.19995 2026-03-26 eess.IV

Goal-Oriented Framework for Optical Flow-based Multi-User Multi-Task Video Transmission

Yujie Xu, Shutong Chen, Nan Li, Yansha Deng, Jinhong Yuan, Robert Schober

2603.15934 2026-03-26 math.OC cs.MS cs.SY eess.SY

Fast Relax-and-Round Unit Commitment with Economic Horizons

Shaked Regev, Eve Tsybina, Slaven Peles

Comments 6 pages (journal limit), 6 figures

2602.11842 2026-03-26 eess.SY cs.SY

A day-ahead market model for power systems: benchmarking and security implications

Andrej Stankovski, Blazhe Gjorgiev, James Ciyu Qin, Giovanni Sansavini

2601.15368 2026-03-26 cs.CV eess.IV

Aligned Stable Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency

Yikai Wang, Junqiu Yu, Chenjie Cao, Xiangyang Xue, Yanwei Fu

Comments Extension of our CVPR 2025 highlight paper: arXiv:2312.04831. The paper was submitted to cs.CV but was classified under eess.IV. The authors made an appeal but have not received a response for one month. Therefore, we update the comment to clarify the category

2510.12684 2026-03-26 cs.RO cs.SY eess.SY

Autonomous Legged Mobile Manipulation for Lunar Surface Operations via Constrained Reinforcement Learning

Alvaro Belmonte-Baeza, Miguel Cazorla, Gabriel J. García, Carlos J. Pérez-Del-Pulgar, Jorge Pomares

Comments This is the authors' version of the paper accepted for publication in The IEEE International Conference on Space Robotics 2025. The final version link will be added here after conference proceedings are published

Details

DOI: 10.1109/iSpaRo66239.2025.11437097
Journal ref: 2025 International Conference on Space Robotics (iSpaRo)

English abstract

Robotics plays a pivotal role in planetary science and exploration, where autonomous and reliable systems are crucial due to the risks and challenges inherent to space environments. The establishment of permanent lunar bases demands robotic platforms capable of navigating and manipulating in the harsh lunar terrain. While wheeled rovers have been the mainstay for planetary exploration, their limitations in unstructured and steep terrains motivate the adoption of legged robots, which offer superior mobility and adaptability. This paper introduces a constrained reinforcement learning framework designed for autonomous quadrupedal mobile manipulators operating in lunar environments. The proposed framework integrates whole-body locomotion and manipulation capabilities while explicitly addressing critical safety constraints, including collision avoidance, dynamic stability, and power efficiency, in order to ensure robust performance under lunar-specific conditions, such as reduced gravity and irregular terrain. Experimental results demonstrate the framework's effectiveness in achieving precise 6D task-space end-effector pose tracking, achieving an average positional accuracy of 4 cm and orientation accuracy of 8.1 degrees. The system consistently respects both soft and hard constraints, exhibiting adaptive behaviors optimized for lunar gravity conditions. This work effectively bridges adaptive learning with essential mission-critical safety requirements, paving the way for advanced autonomous robotic explorers for future lunar missions.

2510.08161 2026-03-26 eess.SP

Attitude and Heading Estimation in Symmetrical Inertial Arrays

Yaakov Libero, Itzik Klein

Details

DOI: 10.1109/TIM.2026.3676210

English abstract

Attitude and heading reference systems (AHRS) play a central role in autonomous navigation systems on land, air, and maritime platforms. AHRS utilize inertial sensor measurements to estimate platform orientation. In recent years, there has been increasing interest in multiple inertial measurement unit (MIMU) arrays to improve navigation accuracy and robustness. A particularly challenging MIMU implementation is the gyro-free (GF) configuration, in which angular velocity is derived solely from accelerometer measurements. While GF configurations have multiple benefits, including outlier detection and angular acceleration measurement, their main drawbacks are inherent instability and an increased divergence rate. To address these shortcomings, we introduce a novel symmetrical MIMU formulation, in which the IMUs are arranged in symmetric diagonal pairs to decouple linear and rotational acceleration components. To this end, we derive the theoretical foundations for the symmetrical MIMU formulation of the GF equations, develop a nonlinear least squares estimation process, and integrate statistical hypothesis testing into an AHRS error-state extended Kalman filter. We validate our approach using real-world datasets containing 85 minutes of navigation data recorded on both airborne and land platforms. Our results demonstrated a 30% average reduction in attitude estimation errors, rotation detection accuracy exceeding 95% improvement, and significantly improved stability compared to a standard GF implementation. These results enable reliable GF navigation in applications where gyroscopes are unavailable, unreliable, or energy-constrained. Common examples include miniature platforms, computationally constrained platforms, and long-endurance marine platforms.
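
To see why a symmetric pair can separate linear from rotational acceleration, the sketch below uses the textbook rigid-body relation a(r) = a0 + α × r + ω × (ω × r) for two ideal accelerometers mounted at r and -r. This is an illustration of the general idea only, not the paper's formulation: it ignores noise, bias, and misalignment, and the function names and numeric values are hypothetical.

```python
def cross(u, v):
    """Cross product of two 3-vectors given as tuples."""
    return (
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    )


def accel_at(a0, omega, alpha, r):
    """Acceleration sensed at offset r on a rigid body (ideal, noise-free)."""
    tangential = cross(alpha, r)                 # alpha x r
    centripetal = cross(omega, cross(omega, r))  # omega x (omega x r)
    return tuple(a + t + c for a, t, c in zip(a0, tangential, centripetal))


def split_pair(a_plus, a_minus):
    """Split readings taken at +r and -r into linear and rotational parts."""
    # The rotational terms are odd in r, so they cancel in the mean...
    linear = tuple((p + m) / 2 for p, m in zip(a_plus, a_minus))
    # ...while the common linear term cancels in the half-difference.
    rotational = tuple((p - m) / 2 for p, m in zip(a_plus, a_minus))
    return linear, rotational


# Hypothetical body state and mounting offset (illustrative values only).
a0 = (0.5, -0.2, 9.8)     # linear acceleration at the array centre, m/s^2
omega = (0.1, 0.0, 0.3)   # angular velocity, rad/s
alpha = (0.0, 0.2, -0.1)  # angular acceleration, rad/s^2
r = (0.05, 0.05, 0.0)     # sensor offset from the centre, m

a_plus = accel_at(a0, omega, alpha, r)
a_minus = accel_at(a0, omega, alpha, tuple(-c for c in r))
linear, rotational = split_pair(a_plus, a_minus)
print("linear part:    ", linear)
print("rotational part:", rotational)
```

In this idealized setting the mean of the pair recovers a0 and the half-difference isolates α × r + ω × (ω × r). Estimating ω and α from that term with real sensors is the harder step, which the abstract says is handled by nonlinear least squares and an error-state extended Kalman filter.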

2506.20334 2026-03-26 eess.SY cs.LG cs.SY

Recurrent neural network-based robust control systems with regional properties and application to MPC design

Daniele Ravasio, Alessio La Bella, Marcello Farina, Andrea Ballarino

Comments 27 pages, 5 figures

2406.03138 2026-03-26 cs.SD eess.AS

An interpretable speech foundation model for depression detection by revealing prediction-relevant acoustic features from long speech

Qingkun Deng, Saturnino Luz, Sofia de la Fuente Garcia

Comments 5 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2309.13476

2312.00357 2026-03-26 eess.IV cs.CV cs.LG

A Generalizable Deep Learning System for Cardiac MRI

Rohan Shad, Cyril Zakka, Dhamanpreet Kaur, Mrudang Mathur, Robyn Fong, Joseph Cho, Ross Warren Filice, John Mongan, Kimberly Kalianos, Nishith Khandwala, David Eng, Matthew Leipzig, Walter R. Witschey, Alejandro de Feria, Victor A. Ferrari, Euan A. Ashley, Michael A. Acker, Curtis Langlotz, William Hiesinger

Comments Published in Nature Biomedical Engineering; Supplementary Appendix available on publisher website. Code: https://github.com/rohanshad/cmr_transformer

2306.17466 2026-03-26 eess.IV cs.CV

MedAugment: Universal Automatic Data Augmentation Plug-in for Medical Image Analysis

Zhaoshan Liu, Qiujie Lv, Yifan Li, Ziduo Yang, Lei Shen

Comments Accepted by Knowledge-Based Systems

2603.24144 2026-03-26 cs.SD eess.AS

Semantic-Aware Interruption Detection in Spoken Dialogue Systems: Benchmark, Metric, and Model

Kangxiang Xia, Bingshen Mu, Xian Shi, Jin Xu, Lei Xie

Comments Accepted by ICME 2026

2603.24138 2026-03-26 cs.LG cs.SY eess.SY

Efficient Controller Learning from Human Preferences and Numerical Data Via Multi-Modal Surrogate Models

Lukas Theiner, Maik Pfefferkorn, Yongpeng Zhao, Sebastian Hirt, Rolf Findeisen

Comments 8 pages, 4 figures, accepted for ECC 2026

2603.24130 2026-03-26 cs.RO cs.SY eess.SY

Equivariant Filter Transformations for Consistent and Efficient Visual-Inertial Navigation

Chungeng Tian, Fenghua He, Ning Hao

Comments 28 pages, 11 figures

2603.24109 2026-03-26 eess.IV cs.AI cs.CV

Comparative analysis of dual-form networks for live land monitoring using multi-modal satellite image time series

Iris Dumeur, Jérémy Anger, Gabriele Facciolo