arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.19036 2026-03-20 cs.CV

FUMO: Prior-Modulated Diffusion for Single Image Reflection Removal

Telang Xu, Chaoyang Zhang, Guangtao Zhai, Xiaohong Liu

详情

英文摘要

Single image reflection removal (SIRR) is challenging in real scenes, where reflection strength varies spatially and reflection patterns are tightly entangled with transmission structures. This paper presents a diffusion model with prior modulation framework (FUMO) that introduces explicit guidance signals to improve spatial controllability and structural faithfulness. Two priors are extracted directly from the mixed image, an intensity prior that estimates spatial reflection severity and a high-frequency prior that captures detail-sensitive responses via multi-scale residual aggregation. We propose a coarse-to-fine training paradigm. In the first stage, these cues are combined to gate the conditional residual injections, focusing the conditioning on regions that are both reflection-dominant and structure-sensitive. In the second stage, a fine-grained refinement network corrects local misalignment and sharpens fine details in the image space. Experiments conducted on both standard benchmarks and challenging images in the wild demonstrate competitive quantitative results and consistently improved perceptual quality. The code is released at https://github.com/Lucious-Desmon/FUMO.

URL PDF HTML ☆

赞 0 踩 0

2603.19029 2026-03-20 cs.RO

ATG-MoE: Autoregressive trajectory generation with mixture-of-experts for assembly skill learning

Weihang Huang, Chaoran Zhang, Xiaoxin Deng, Hao Zhou, Zhaobo Xu, Shubo Cui, Long Zeng

Comments 32 pages, 13 figures

2603.19028 2026-03-20 cs.CV cs.AI cs.LG

SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models

Quentin Guimard, Federico Bartsch, Simone Caldarella, Rahaf Aljundi, Elisa Ricci, Massimiliano Mancini

Comments CVPR Findings 2026. Project website: https://sparse-embedding-modulation.github.io/

2603.19026 2026-03-20 cs.CV

Rethinking MLLM Itself as a Segmenter with a Single Segmentation Token

Anqi Zhang, Xiaokang Ji, Guangyu Gao, Jianbo Jiao, Chi Harold Liu, Yunchao Wei

Comments Paper is accepted by CVPR 2026

2603.19022 2026-03-20 cs.AI

Behavioral Fingerprints for LLM Endpoint Stability and Identity

Jonah Leshin, Manish Shah, Ian Timmis, Daniel Kang

Comments 4 pages, 1 figure, submitted to CAIS 2026 System Demonstrations

2603.19017 2026-03-20 cs.CL cs.AI

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Gagan Bhatia, Ahmad Muhammad Isa, Maxime Peyrard, Wei Zhao

2603.19008 2026-03-20 cs.CL cs.AI cs.LG

Hypothesis-Conditioned Query Rewriting for Decision-Useful Retrieval

Hangeol Chang, Changsun Lee, Seungjoon Rho, Junho Yeo, Jong Chul Ye

2603.19004 2026-03-20 cs.CV

Unleashing the Power of Simplicity: A Minimalist Strategy for State-of-the-Art Fingerprint Enhancement

Raffaele Cappelli

2603.19002 2026-03-20 cs.CL

RADIUS: Ranking, Distribution, and Significance - A Comprehensive Alignment Suite for Survey Simulation

Weronika Łajewska, Paul Missault, George Davidson, Saab Mansour

2603.18999 2026-03-20 cs.AI cs.DS cs.GT cs.LG

Regret Bounds for Competitive Resource Allocation with Endogenous Costs

Rui Chai

Comments This is Paper 7 in a 9-paper series on Super-Alignment via Wuxing Institutional Architecture. The series explores resource competition and institutional design for human-aligned AI systems

2603.18992 2026-03-20 cs.LG cs.AI

Foundations of Schrödinger Bridges for Generative Modeling

Sophia Tang

Comments 220 pages, 24 figures

2603.18991 2026-03-20 cs.CV cs.LG

CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think

Zening Sun, Zhengpeng Xie, Lichen Bai, Shitong Shao, Shuo Yang, Zeke Xie

Comments CVPR2026

2603.18988 2026-03-20 cs.RO

MERGE: Guided Vision-Language Models for Multi-Actor Event Reasoning and Grounding in Human-Robot Interaction

Joerg Deigmoeller, Nakul Agarwal, Stephan Hasler, Daniel Tanneberg, Anna Belardinelli, Reza Ghoddoosian, Chao Wang, Felix Ocker, Fan Zhang, Behzad Dariush, Michael Gienger

2603.18981 2026-03-20 cs.LG cs.HC

Book your room in the Turing Hotel! A symmetric and distributed Turing Test with multiple AIs and humans

Christian Di Maio, Tommaso Guidi, Luigi Quarantiello, Jack Bell, Marco Gori, Stefano Melacci, Vincenzo Lomonaco

2603.18979 2026-03-20 cs.RO cs.AI

PRIOR: Perceptive Learning for Humanoid Locomotion with Reference Gait Priors

Chenxi Han, Shilu He, Yi Cheng, Linqi Ye, Houde Liu

Comments https://prior-iros2026.github.io/

2603.18976 2026-03-20 cs.AI

Evaluating 5W3H Structured Prompting for Intent Alignment in Human-AI Interaction

Peng Gang

Comments 27 pages, figures, tables, and appendix. Primary category: human-computer interaction / human-AI interaction. Public artifact repository and implementation resources are referenced in the manuscript

2603.18968 2026-03-20 cs.AI

Teleological Inference in Structural Causal Models via Intentional Interventions

Dario Compagno, Fabio Massimo Zennaro

Comments 29 pages, 3 figures

2603.18965 2026-03-20 cs.LG stat.ML

Maximum-Entropy Exploration with Future State-Action Visitation Measures

Adrien Bolland, Gaspard Lambrechts, Damien Ernst

Comments arXiv admin note: substantial text overlap with arXiv:2412.06655

2603.18957 2026-03-20 cs.LG stat.ME

BVSIMC: Bayesian Variable Selection-Guided Inductive Matrix Completion for Improved and Interpretable Drug Discovery

Sijian Fan, Liyan Xiong, Dayuan Wang, Guoshuai Cai, Ray Bai

2603.18954 2026-03-20 cs.LG

Balancing Performance and Fairness in Explainable AI for Anomaly Detection in Distributed Power Plants Monitoring

Corneille Niyonkuru, Marcellin Atemkeng, Gabin Maxime Nguegnang, Arnaud Nguembang Fadja

详情

英文摘要

Reliable anomaly detection in distributed power plant monitoring systems is essential for ensuring operational continuity and reducing maintenance costs, particularly in regions where telecom operators heavily rely on diesel generators. However, this task is challenged by extreme class imbalance, lack of interpretability, and potential fairness issues across regional clusters. In this work, we propose a supervised ML framework that integrates ensemble methods (LightGBM, XGBoost, Random Forest, CatBoost, GBDT, AdaBoost) and baseline models (Support Vector Machine, K-Nearrest Neighbors, Multilayer Perceptrons, and Logistic Regression) with advanced resampling techniques (SMOTE with Tomek Links and ENN) to address imbalance in a dataset of diesel generator operations in Cameroon. Interpretability is achieved through SHAP (SHapley Additive exPlanations), while fairness is quantified using the Disparate Impact Ratio (DIR) across operational clusters. We further evaluate model generalization using Maximum Mean Discrepancy (MMD) to capture domain shifts between regions. Experimental results show that ensemble models consistently outperform baselines, with LightGBM achieving an F1-score of 0.99 and minimal bias across clusters (DIR $\approx 0.95$). SHAP analysis highlights fuel consumption rate and runtime per day as dominant predictors, providing actionable insights for operators. Our findings demonstrate that it is possible to balance performance, interpretability, and fairness in anomaly detection, paving the way for more equitable and explainable AI systems in industrial power management. {\color{black} Finally, beyond offline evaluation, we also discuss how the trained models can be deployed in practice for real-time monitoring. We show how containerized services can process in real-time, deliver low-latency predictions, and provide interpretable outputs for operators.

URL PDF HTML ☆

赞 0 踩 0

2603.18953 2026-03-20 cs.LG

Context Bootstrapped Reinforcement Learning

Saaket Agashe, Jayanth Srinivasa, Gaowen Liu, Ramana Kompella, Xin Eric Wang

2603.18927 2026-03-20 cs.LG

An Optimised Greedy-Weighted Ensemble Framework for Financial Loan Default Prediction

Ezekiel Nii Noye Nortey, Jones Asante-Koranteng, Marcellin Atemkeng, Theophilus Ansah-Narh, David Mensah, Rebecca Davis, Ravenhill Adjetey Laryea

详情

英文摘要

Accurate prediction of loan defaults is a central challenge in credit risk management, particularly in modern financial datasets characterised by nonlinear relationships, class imbalance, and evolving borrower behaviour. Traditional statistical models and static ensemble methods often struggle to maintain reliable performance under such conditions. This study proposes an Optimised Greedy-Weighted Ensemble framework for loan default prediction that dynamically allocates model weights based on empirical predictive performance. The framework integrates multiple machine learning classifiers, with their hyperparameters first optimised using Particle Swarm Optimisation. Model predictions are then combined via a regularised greedy weighting mechanism. At the same time, a neural-network-based meta-learner is employed within stacked-ensemble to capture higher-order relationships among model outputs. Experiments conducted on the Lending Club dataset demonstrate that the proposed framework improves predictive performance compared with individual classifiers. The BlendNet ensemble achieved the strongest results with an AUC of 0.80, a macro-average F1-score of 0.73, and a default recall of 0.81. Calibration analysis further shows that tree-based ensembles such as Extra Trees and Gradient Boosting provide the most reliable probability estimates, while the stacked ensemble offers superior ranking capability. Feature analysis using Recursive Feature Elimination identifies revolving utilisation, annual income, and debt-to-income ratio as the most influential predictors of loan default. These findings demonstrate that performance-driven ensemble weighting can improve both predictive accuracy and interpretability in credit risk modelling. The proposed framework provides a scalable data-driven approach to support institutional credit assessment, risk monitoring, and financial decision-making.

URL PDF HTML ☆

赞 0 踩 0

2603.18924 2026-03-20 cs.CV

Unsupervised Contrastive Learning for Efficient and Robust Spectral Shape Matching

Feifan Luo, Hongyang Chen

2603.18921 2026-03-20 cs.RO cs.SY eess.SY

Lightweight Model Predictive Control for Spacecraft Rendezvous Attitude Synchronization

Peter Stadler, Alexander Meinert, Niklas Baldauf, Alen Turnwald

Comments Accepted at European Control Conference (ECC 2026)

2603.18912 2026-03-20 cs.CV

GHOST: Fast Category-agnostic Hand-Object Interaction Reconstruction from RGB Videos using Gaussian Splatting

Ahmed Tawfik Aboukhadra, Marcel Rogge, Nadia Robertini, Abdalla Arafa, Jameel Malik, Ahmed Elhayek, Didier Stricker

2603.18911 2026-03-20 cs.CL cs.AI

Progressive Training for Explainable Citation-Grounded Dialogue: Reducing Hallucination to Zero in English-Hindi LLMs

Vedant Pandya

Comments 30 pages, 15 figures, 11 tables. Comprehensive study across 6 LLMs (250M-7B parameters) with explainability analysis. Code and data available upon request

2603.18910 2026-03-20 cs.RO cs.SY eess.SY

Safety-Guaranteed Imitation Learning from Nonlinear Model Predictive Control for Spacecraft Close Proximity Operations

Alexander Meinert, Niklas Baldauf, Peter Stadler, Alen Turnwald

Comments Accepted at European Control Conference (ECC 2026)

2603.18907 2026-03-20 cs.LG cs.NA math.NA

Neural Galerkin Normalizing Flow for Transition Probability Density Functions of Diffusion Models

Riccardo Saporiti, Fabio Nobile

Comments 12 pages, 4 figures

2603.18899 2026-03-20 cs.LG math.OC

Uniform a priori bounds and error analysis for the Adam stochastic gradient descent optimization method

Steffen Dereich, Thang Do, Arnulf Jentzen

Comments 34 pages

2603.18896 2026-03-20 cs.CV cs.AI

Translating MRI to PET through Conditional Diffusion Models with Enhanced Pathology Awareness

Yitong Li, Igor Yakushev, Dennis M. Hedderich, Christian Wachinger

Comments Accepted by Medical Image Analysis