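arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.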

2603.14074 2026-03-17 cs.CV cs.LG

Self-Supervised Uncertainty Estimation For Super-Resolution of Satellite Images

Zhe Zheng, Valéry Dewil, Pablo Arias

Comments Conference submission

Abstract

Super-resolution (SR) of satellite imagery is challenging due to the lack of paired low-/high-resolution data. Recent self-supervised SR methods overcome this limitation by exploiting the temporal redundancy in burst observations, but they lack a mechanism to quantify uncertainty in the reconstruction. In this work, we introduce a novel self-supervised loss that makes it possible to estimate uncertainty in image super-resolution without ever accessing the ground-truth high-resolution data. We adopt a decision-theoretic perspective and show that minimizing the corresponding Bayesian risk yields the posterior mean and variance as optimal estimators. We validate our approach on a synthetic SkySat L1B dataset and demonstrate that it produces calibrated uncertainty estimates comparable to supervised methods. Our work bridges self-supervised restoration with uncertainty quantification, providing a practical framework for uncertainty-aware image reconstruction.
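
For intuition on the decision-theoretic claim, a minimal worked example follows. It assumes the standard Gaussian negative log-likelihood as the loss, which may differ from the self-supervised loss the paper proposes. The symbols x (a high-resolution pixel), y (the observations), and μ, σ² (the predicted mean and variance) are notation introduced here for illustration only.

```latex
% Illustrative assumption: Gaussian negative log-likelihood loss,
% not necessarily the loss used in the paper.
\begin{align}
\ell(x;\mu,\sigma^2) &= \frac{(x-\mu)^2}{2\sigma^2} + \frac{1}{2}\log\sigma^2, \\
R(\mu,\sigma^2) &= \mathbb{E}\!\left[\ell(x;\mu,\sigma^2)\mid y\right], \\
\frac{\partial R}{\partial \mu} = -\frac{\mathbb{E}[x-\mu\mid y]}{\sigma^2} = 0
  &\;\Longrightarrow\; \mu^\star = \mathbb{E}[x\mid y], \\
\frac{\partial R}{\partial \sigma^2} = -\frac{\mathbb{E}[(x-\mu^\star)^2\mid y]}{2\sigma^4} + \frac{1}{2\sigma^2} = 0
  &\;\Longrightarrow\; \sigma^{2\star} = \operatorname{Var}(x\mid y).
\end{align}
```

Under this assumed loss, the risk minimizers are the posterior mean and the posterior variance, which is the kind of optimality property the abstract describes.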

2603.14073 2026-03-17 cs.CV cs.AI cs.LG

MotionCFG: Boosting Motion Dynamics via Stochastic Concept Perturbation

Byungjun Kim, Soobin Um, Jong Chul Ye

2603.14069 2026-03-17 cs.LG

Gated Graph Attention Networks for Predicting Duration of Large Scale Power Outages Induced by Natural Disasters

Chenghao Duan, Chuanyi Ji, Anwar Walid, Scott Ganz

2603.14068 2026-03-17 cs.RO

Stiffness Copilot: An Impedance Policy for Contact-Rich Teleoperation

Yeping Wang, Zhengtong Xu, Pornthep Preechayasomboon, Ben Abbatematteo, Amirhossein H. Memar, Nick Colonnese, Sonny Chan

Comments Project website: https://stiffness-copilot.github.io

2603.14062 2026-03-17 cs.CV cs.LG

TMPDiff: Temporal Mixed-Precision for Diffusion Models

Basile Lewandowski, Simon Kurz, Aditya Shankar, Robert Birke, Jian-Jia Chen, Lydia Y. Chen

2603.14057 2026-03-17 cs.AI

Demand-Driven Context: A Methodology for Building Enterprise Knowledge Bases Through Agent Failure

Raj Navakoti, Saideep Navakoti

Comments 18 pages, 5 figures, 1 algorithm. Preprint

2603.14056 2026-03-17 cs.RO cs.SY eess.SY

Amortizing Trajectory Diffusion with Keyed Drift Fields

Gokul Puthumanaillam, Melkior Ornik

2603.14053 2026-03-17 cs.CL cs.AI cs.LG

NepTam: A Nepali-Tamang Parallel Corpus and Baseline Machine Translation Experiments

Rupak Raj Ghimire, Bipesh Subedi, Balaram Prasain, Prakash Poudyal, Praveen Acharya, Nischal Karki, Rupak Tiwari, Rishikesh Kumar Sharma, Jenny Poudel, Bal Krishna Bal

Comments Accepted in LREC 2026

2603.14041 2026-03-17 cs.AI

GRPO and Reflection Reward for Mathematical Reasoning in Large Language Models

Zhijie Wang

2603.14039 2026-03-17 cs.CV

EyeWorld: A Generative World Model of Ocular State and Dynamics

Ziyu Gao, Xinyuan Wu, Xiaolan Chen, Zhuoran Liu, Ruoyu Chen, Bowen Liu, Bingjie Yan, Zhenhan Wang, Kai Jin, Jiancheng Yang, Yih Chung Tham, Mingguang He, Danli Shi

Comments 38 pages, 8 figures

2603.14035 2026-03-17 cs.SD cs.CL

Probing neural audio codecs for distinctions among English nuclear tunes

Juan Pablo Vigneaux, Jennifer Cole

Comments 5 pages; 1 table; 3 figures. Accepted as conference paper at Speech Prosody 2026

2603.14033 2026-03-17 cs.SD cs.AI cs.LG eess.AS

What Counts as Real? Speech Restoration and Voice Quality Conversion Pose New Challenges to Deepfake Detection

Shree Harsha Bokkahalli Satish, Harm Lameris, Joakim Gustafson, Éva Székely

Comments 5 pages, 4 figures, 3 tables. Submitted to Interspeech 2026

2603.14031 2026-03-17 cs.CV

Intrinsic Tolerance in C-Arm Imaging: How Extrinsic Re-optimization Preserves 3D Reconstruction Accuracy

Lin Li, Benjamin Aubert, Paul Kemper, Aric Plumley

2603.14030 2026-03-17 cs.LG

Benchmarking Open-Source PPG Foundation Models for Biological Age Prediction

N. Brag

Comments 11 pages, 4 figures, 3 tables. Code available at https://github.com/Misterbra/ppg-age-benchmark

2603.14028 2026-03-17 cs.AI cs.ET

Traffic and weather driven hybrid digital twin for bridge monitoring

Phani Raja Bharath Balijepalli, Bulent Soykan, Veeraraghava Raju Hasti

Comments 8 pages, 4 Figures, International Association for Bridge Maintenance and Safety IABMAS 2026

2603.14021 2026-03-17 cs.CV cs.AI

EI-Part: Explode for Completion and Implode for Refinement

Wanhu Sun, Zhongjin Luo, Heliang Zheng, Jiahao Chang, Chongjie Ye, Huiang He, Shengchu Zhao, Rongfei Jia, Xiaoguang Han

2603.14012 2026-03-17 cs.CV

Multi-Grained Vision-Language Alignment for Domain Generalized Person Re-Identification

Jiachen Li, Xiaojin Gong, Dongping Zhang

2603.14007 2026-03-17 cs.AI

Formal Abductive Explanations for Navigating Mental Health Help-Seeking and Diversity in Tech Workplaces

Belona Sonna, Alain Momo, Alban Grastien

Comments Appeared in the Proceedings of the Empowering Women of Colour in AI-Driven Mental Health Research at IJCAI 2025

2603.14006 2026-03-17 cs.CL

Beyond Explicit Edges: Robust Reasoning over Noisy and Sparse Knowledge Graphs

Hang Gao, Dimitris N. Metaxas

2603.14005 2026-03-17 cs.CV

Towards Generalizable Deepfake Detection via Real Distribution Bias Correction

Ming-Hui Liu, Harry Cheng, Xin Luo, Xin-Shun Xu, Mohan S. Kankanhalli

Comments First Version

2603.14004 2026-03-17 cs.CV cs.AI

U-Face: An Efficient and Generalizable Framework for Unsupervised Facial Attribute Editing via Subspace Learning

Bo Liu, Xuan Cui, Run Zeng, Wei Duan, Chongwen Liu, Jinrui Qian, Lianggui Tang, Hongping Gan

Abstract

Latent space-based facial attribute editing methods have gained popularity in applications such as digital entertainment, virtual avatar creation, and human-computer interaction systems due to their potential for efficient and flexible attribute manipulation, particularly for continuous edits. Among these, unsupervised latent space-based methods, which discover effective semantic vectors without relying on labeled data, have attracted considerable attention in the research community. However, existing methods still encounter difficulties in disentanglement, as manipulating a specific facial attribute may unintentionally affect other attributes, complicating fine-grained controllability. To address these challenges, we propose a novel framework designed to offer an effective and adaptable solution for unsupervised facial attribute editing, called Unsupervised Facial Attribute Controllable Editing (U-Face). The proposed method frames semantic vector learning as a subspace learning problem, where latent vectors are approximated within a lower-dimensional semantic subspace spanned by a semantic vector matrix. This formulation can also be equivalently interpreted from a projection-reconstruction perspective and further generalized into an autoencoder framework, providing a foundation that can support disentangled representation learning in a flexible manner. To improve disentanglement and controllability, we impose orthogonal non-negative constraints on the semantic vectors and incorporate attribute boundary vectors to reduce entanglement in the learned directions. Although these constraints make the optimization problem challenging, we design an alternating iterative algorithm, called Alternating Iterative Disentanglement and Controllability (AIDC), with closed-form updates and provable convergence under specific conditions.
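
The projection-reconstruction view of a semantic subspace can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the U-Face method: it orthonormalizes hand-picked direction vectors with Gram-Schmidt and omits the non-negativity constraint, the attribute boundary vectors, and the AIDC algorithm. All function names and the toy vectors are hypothetical.

```python
import math

def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def orthonormalize(vectors):
    """Gram-Schmidt: turn linearly independent vectors into an orthonormal basis."""
    basis = []
    for v in vectors:
        w = list(v)
        for b in basis:
            c = dot(w, b)
            w = [wi - c * bi for wi, bi in zip(w, b)]
        norm = math.sqrt(dot(w, w))
        if norm < 1e-12:
            raise ValueError("vectors are linearly dependent")
        basis.append([wi / norm for wi in w])
    return basis

def project(latent, basis):
    """Coefficients of the latent vector along each semantic direction."""
    return [dot(latent, b) for b in basis]

def reconstruct(coeffs, basis):
    """Map subspace coefficients back to the latent space."""
    dim = len(basis[0])
    return [sum(c * b[i] for c, b in zip(coeffs, basis)) for i in range(dim)]

def edit(latent, basis, index, strength):
    """Move a latent vector along one semantic direction."""
    return [x + strength * d for x, d in zip(latent, basis[index])]

# Toy 4-dimensional latent space with a 2-dimensional semantic subspace.
basis = orthonormalize([[1.0, 1.0, 0.0, 0.0], [0.0, 1.0, 1.0, 0.0]])
latent = [2.0, 0.0, 1.0, 3.0]
coeffs = project(latent, basis)
approx = reconstruct(coeffs, basis)
edited = edit(latent, basis, 1, 0.5)
```

Because the directions are orthonormal, editing along one direction changes only that direction's coefficient, which is the intuition behind using orthogonality to reduce entanglement.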

2603.14001 2026-03-17 cs.CV

PhyGaP: Physically-Grounded Gaussians with Polarization Cues

Jiale Wu, Xiaoyang Bai, Zongqi He, Weiwei Xu, Yifan Peng

Comments The paper is accepted by CVPR 2026

2603.13998 2026-03-17 cs.AI cs.LG

A Systematic Evaluation Protocol of Graph-Derived Signals for Tabular Machine Learning

Mario Heidrich, Jeffrey Heidemann, Rüdiger Buchkremer, Gonzalo Wandosell Fernández de Bobadilla

2603.13994 2026-03-17 cs.CV cs.AI q-bio.NC

Human-like Object Grouping in Self-supervised Vision Transformers

Hossein Adeli, Seoyoung Ahn, Andrew Luo, Mengmi Zhang, Nikolaus Kriegeskorte, Gregory Zelinsky

2603.13993 2026-03-17 cs.CV cs.AI

VAD4Space: Visual Anomaly Detection for Planetary Surface Imagery

Fabrizio Genilotti, Arianna Stropeni, Francesco Borsatti, Manuel Barusco, Davide Dalle Pezze, Gian Antonio Susto

2603.13987 2026-03-17 cs.RO

Vision-guided Autonomous Dual-arm Extraction Robot for Bell Pepper Harvesting

Kshitij Madhav Bhat, Tom Gao, Abhishek Mathur, Rohit Satishkumar, Francisco Yandun, Dominik Bauer, Nancy Pollard

Comments 9 pages; first four authors have equal contribution

2603.13985 2026-03-17 cs.AI cs.CL

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

Haitao Jiang, Wenbo Zhang, Jiarui Yao, Hengrui Cai, Sheng Wang, Rui Song

Comments 26 pages

2603.13978 2026-03-17 cs.CV

When Visual Privacy Protection Meets Multimodal Large Language Models

Xiaofei Hui, Qian Wu, Haoxuan Qu, Majid Mirmehdi, Hossein Rahmani, Jun Liu

2603.13972 2026-03-17 cs.CL cs.AI

FLUX: Data Worth Training On

Gowtham, Sai Rupesh, Sanjay Kumar, Saravanan, Venkata Chaithanya

2603.13971 2026-03-17 cs.LG cs.AI

Chunk-Guided Q-Learning

Gwanwoo Song, Kwanyoung Park, Youngwoon Lee

Comments Project page: https://gwanwoosong.github.io/cgq