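arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.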

2604.18032 2026-04-21 cs.CV

CFSR: Geometry-Conditioned Shadow Removal via Physical Disentanglement

Pan Wang, Yihao Hu, Xiujin Liu, Hang Wang

Abstract

Traditional shadow removal networks often treat image restoration as an unconstrained mapping, lacking the physical interpretability required to balance localized texture recovery with global illumination consistency. To address this, we propose CFSR, a multi-modal prior-driven framework that reframes shadow removal as a physics-constrained restoration process. By seamlessly integrating 3D geometric cues with large-scale foundation model semantics, CFSR effectively bridges the 2D-3D domain gap. Specifically, we first map observations into a custom HVI color space to suppress shadow-induced noise and robustly fuse RGB data with estimated depth priors. At its core, our Geometric & Semantic Dual Explicit Guided Attention mechanism utilizes DINO features and 3D surface normals to directly modulate the attention affinity matrix, structurally enforcing physical lighting constraints. To recover severely degraded regions, we inject holistic priors via a frozen CLIP encoder. Finally, our Frequency Collaborative Reconstruction Module (FCRM) achieves an optimal synthesis by decoupling the decoding process. Conditioned on geometric priors, FCRM seamlessly harmonizes the reconstruction of sharp high-frequency occlusion boundaries with the restoration of low-frequency global illumination. Extensive experiments demonstrate that CFSR achieves state-of-the-art performance across multiple challenging benchmarks.
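The abstract says that DINO features and 3D surface normals "directly modulate the attention affinity matrix" but does not give the formula. The following is only a minimal sketch of one way such modulation could look, assuming an additive bias on the attention logits built from cosine similarity of per-token normals and semantic features. The function name `guided_attention`, the weights `w_geo` and `w_sem`, and the additive form itself are hypothetical and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu > 0 and nv > 0 else 0.0


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def guided_attention(queries, keys, values, normals, semantics,
                     w_geo=1.0, w_sem=1.0):
    """Scaled dot-product attention with an assumed additive bias.

    normals[i] stands in for a per-token surface normal and semantics[i]
    for a per-token semantic feature (e.g. from a frozen backbone).
    The bias form is an illustrative assumption, not the paper's method.
    """
    d = len(queries[0])
    outputs, weights = [], []
    for i, q in enumerate(queries):
        logits = []
        for j, k in enumerate(keys):
            affinity = sum(a * b for a, b in zip(q, k)) / math.sqrt(d)
            affinity += w_geo * cosine(normals[i], normals[j])
            affinity += w_sem * cosine(semantics[i], semantics[j])
            logits.append(affinity)
        row = softmax(logits)
        weights.append(row)
        # Weighted sum of value vectors for token i.
        outputs.append([
            sum(row[j] * values[j][c] for j in range(len(values)))
            for c in range(len(values[0]))
        ])
    return outputs, weights
```

Under this assumption, tokens that share surface orientation and semantics attend to each other more strongly than the plain dot product alone would allow; setting both weights to zero recovers ordinary attention.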


2604.18031 2026-04-21 cs.CL cs.LG q-bio.BM

How Creative Are Large Language Models in Generating Molecules?

Wen Tao, Yiwei Wang, Peng Zhou, Bryan Hooi, Wanlong Fang, Tianle Zhang, Xiao Luo, Yuansheng Liu, Alvin Chan

2604.18026 2026-04-21 cs.LG cs.AI

RASP-Tuner: Retrieval-Augmented Soft Prompts for Context-Aware Black-Box Optimization in Non-Stationary Environments

Enze Pan

Comments Withdrawn from ICML; in preparation for NeurIPS or ICLR

2604.18024 2026-04-21 cs.LG

Clusterability-Based Assessment of Potentially Noisy Views for Multi-View Clustering

Mudi Jiang, Jiahui Zhou, Xinying Liu, Zengyou He, Zhikui Chen

2604.18019 2026-04-21 cs.CV

Multi-View Hierarchical Graph Neural Network for Sketch-Based 3D Shape Retrieval

Hang Cheng, Muyan He, Mingyu Fan, Chengfeng Xie, Xi Cheng, Long Zeng

2604.18012 2026-04-21 cs.LG cs.NA math.NA

Neural Shape Operator Surrogates -- Expression Rate Bounds

Helmut Harbrecht, Christoph Schwab

2604.18003 2026-04-21 cs.AI

SELF-EMO: Emotional Self-Evolution from Recognition to Consistent Expression

Shaowei Zhang, Faqiang Qian, Yan Chen, Ziliang Wang, Kang An, Yong Dai, Mengya Gao, Yichao Wu

2604.18002 2026-04-21 cs.LG

Neural Garbage Collection: Learning to Forget while Learning to Reason

Michael Y. Li, Jubayer Ibn Hamid, Emily B. Fox, Noah D. Goodman

2604.18001 2026-04-21 cs.CV

Trustworthy Endoscopic Super-Resolution

Julio Silva-Rodríguez, Ender Konukoglu

Comments Code: https://github.com/jusiro/Endoscopic-CFM

2604.18000 2026-04-21 cs.RO

Unmasking the Illusion of Embodied Reasoning in Vision-Language-Action Models

Haiweng Xu, Sipeng Zheng, Hao Luo, Wanpeng Zhang, Ziheng Xi, Zongqing Lu

2604.17998 2026-04-21 cs.LG

Causally-Constrained Probabilistic Forecasting for Time-Series Anomaly Detection

Pooyan Khosravinia, João Gama, Bruno Veloso

Comments This work is currently under review for possible publication in the IEEE Access journal. All intellectual property rights are retained by IEEE.

2604.17989 2026-04-21 cs.AI

AIT Academy: Cultivating the Complete Agent with a Confucian Three-Domain Curriculum

Jiaqi Li, Lvyang Zhang, Yang Zhao, Wen Lu, Lidong Zhai

Comments 11 pages, 5 figures

2604.17988 2026-04-21 cs.CL

Employing General-Purpose and Biomedical Large Language Models with Advanced Prompt Engineering for Pharmacoepidemiologic Study Design

Xinyao Zhang, Nicole Sonne Heckmann, Manuela Del Castillo Suero, Francesco Paolo Speca, Maurizio Sessa

2604.17986 2026-04-21 cs.SD cs.AI

Latent Fourier Transform

Mason Wang, Cheng-Zhi Anna Huang

Comments ICLR 2026 Oral

2604.17984 2026-04-21 cs.LG stat.ML

Online Conformal Prediction with Adversarial Semi-bandit Feedback via Regret Minimization

Junyoung Yang, Kyungmin Kim, Sangdon Park

2604.17982 2026-04-21 cs.CV cs.CL

Mitigating Multimodal Hallucination via Phase-wise Self-reward

Yu Zhang, Chuyang Sun, Kehai Chen, Xuefeng Bai, Yang Xiang, Min Zhang

Comments Self-reward for vision hallucination mitigation

2604.17976 2026-04-21 cs.CL

ltzGLUE: Luxembourgish General Language Understanding Evaluation

Alistair Plum, Felicia Körner, Anne-Marie Lutgen, Laura Bernardy, Fred Philippy, Emilia Milano, Nils Rehlinger, Cédric Lothritz, Tharindu Ranasinghe, Barbara Plank, Christoph Purschke

Comments Accepted at ACL Findings 2026

2604.17972 2026-04-21 cs.CL

Modeling Multiple Support Strategies within a Single Turn for Emotional Support Conversations

Jie Zhu, Huaixia Dou, Junhui Li, Lifan Guo, Feng Chen, Jinsong Su, Chi Zhang, Fang Kong

2604.17971 2026-04-21 cs.CV

Identifying Ethical Biases in Action Recognition Models

Ana Baltaretu, Pascal Benschop, Jan van Gemert

2604.17968 2026-04-21 cs.AI cs.CL

From Fallback to Frontline: When Can LLMs be Superior Annotators of Human Perspectives?

Hasan Amin, Harry Yizhou Tian, Xiaoni Duan, Chien-Ju Ho, Rajiv Khanna, Ming Yin

Comments ACL 2026

2604.17967 2026-04-21 cs.AI cs.LG

A Sugeno Integral View of Binarized Neural Network Inference

Ismaïl Baaj, Henri Prade

2604.17966 2026-04-21 cs.AI

TPS-CalcBench: A Benchmark and Diagnostic Evaluation Framework for LLM Analytical Calculation Competence in Hypersonic Thermal Protection System Engineering

Jinglai Zheng, Chuhan Qiao, Haiming Huang

2604.17965 2026-04-21 cs.CV

MU-GeNeRF: Multi-view Uncertainty-guided Generalizable Neural Radiance Fields for Distractor-aware Scene

Wenjie Mu, Zhan Li, Chuanzhou Su, Xuanyi Shen, Ziniu Liu, Fan Lu, Yujian Mo, Junqiao Zhao, Tiantian Feng, Chen Ye, Guang Chen

Comments Accepted by CVPR 2026

2604.17961 2026-04-21 cs.CV

DifFoundMAD: Foundation Models meet Differential Morphing Attack Detection

Lazaro J. Gonzalez-Soler, André Dörsch, Christian Rathgeb, Christoph Busch

2604.17959 2026-04-21 cs.CV cs.GR

Chatting about Upper-Body Expressive Human Pose and Shape Estimation

Yuxiang Zhao, Wei Huang, Yujie Song, Liu Wang, Huan Zhao

2604.17957 2026-04-21 cs.CL

Process Reward Models Meet Planning: Generating Precise and Scalable Datasets for Step-Level Rewards

Raffaele Pisano, Roberto Navigli

Comments Accepted to ACL 2026 (main conference)

2604.17956 2026-04-21 cs.LG stat.ME

Federated Rule Ensemble Method in Medical Data

Ke Wan, Kensuke Tanioka, Toshio Shimokawa

2604.17950 2026-04-21 cs.AI

CADMAS-CTX: Contextual Capability Calibration for Multi-Agent Delegation

Chuhan Qiao

Abstract

We revisit multi-agent delegation under a stronger and more realistic assumption: an agent's capability is not fixed at the skill level, but depends on task context. A coding agent may excel at short standalone edits yet fail on long-horizon debugging; a planner may perform well on shallow tasks yet degrade on chained dependencies. Static skill-level capability profiles therefore average over heterogeneous situations and can induce systematic misdelegation. We propose CADMAS-CTX, a framework for contextual capability calibration. For each agent, skill, and coarse context bucket, CADMAS-CTX maintains a Beta posterior that captures stable experience in that part of the task space. Delegation is then made by a risk-aware score that combines the posterior mean with an uncertainty penalty, so that agents delegate only when a peer appears better and that assessment is sufficiently well supported by evidence. This paper makes three contributions. First, a hierarchical contextual capability profile replaces static skill-level confidence with context-conditioned posteriors. Second, based on contextual bandit theory, we formally prove context-aware routing achieves lower cumulative regret than static routing under sufficient context heterogeneity, formalizing the bias-variance tradeoff. Third, we empirically validate our method on GAIA and SWE-bench benchmarks. On GAIA with GPT-4o agents, CADMAS-CTX achieves 0.442 accuracy, outperforming static baseline 0.381 and AutoGen 0.354 with non-overlapping 95% confidence intervals. On SWE-bench Lite, it improves resolve rate from 22.3% to 31.4%. Ablations show the uncertainty penalty improves robustness against context tagging noise. Our results demonstrate contextual calibration and risk-aware delegation significantly improve multi-agent teamwork compared with static global skill assignments.
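The abstract describes a Beta posterior per agent, skill, and context bucket, plus a risk-aware score that combines the posterior mean with an uncertainty penalty. A minimal sketch of that idea follows, assuming a uniform Beta(1, 1) prior, a penalty equal to a multiple of the posterior standard deviation, and the same penalized score for the delegating agent and its peers. The class name `ContextualCapabilityProfile`, the parameter `penalty`, and the exact score form are hypothetical; the paper's actual scoring rule is not given in the abstract.

```python
import math
from collections import defaultdict


class ContextualCapabilityProfile:
    """Beta posterior over success rate per (agent, skill, context bucket)."""

    def __init__(self, prior_alpha=1.0, prior_beta=1.0):
        # Each key maps to [alpha, beta]; unseen keys start at the prior.
        self.params = defaultdict(lambda: [prior_alpha, prior_beta])

    def update(self, agent, skill, bucket, success):
        """Record one observed task outcome."""
        ab = self.params[(agent, skill, bucket)]
        ab[0 if success else 1] += 1.0

    def score(self, agent, skill, bucket, penalty=1.0):
        """Posterior mean minus penalty times posterior std (assumed form)."""
        a, b = self.params[(agent, skill, bucket)]
        mean = a / (a + b)
        var = a * b / ((a + b) ** 2 * (a + b + 1.0))
        return mean - penalty * math.sqrt(var)

    def delegate(self, me, peers, skill, bucket, penalty=1.0):
        """Return the best peer only if its penalized score beats mine."""
        own = self.score(me, skill, bucket, penalty)
        best, best_score = me, own
        for peer in peers:
            s = self.score(peer, skill, bucket, penalty)
            if s > best_score:
                best, best_score = peer, s
        return best


# Usage: a well-evidenced peer is chosen, a barely observed one is not.
profile = ContextualCapabilityProfile()
for ok in [True] * 6 + [False] * 4:
    profile.update("self", "debug", "long", ok)
for ok in [True] * 8 + [False] * 2:
    profile.update("planner", "debug", "long", ok)
profile.update("newbie", "debug", "long", True)

choice_with_planner = profile.delegate("self", ["planner", "newbie"], "debug", "long")
choice_newbie_only = profile.delegate("self", ["newbie"], "debug", "long")
```

In this sketch the peer with a single recorded success has a higher posterior mean than the delegating agent, but its wide posterior lowers its penalized score, so the task is kept rather than handed off. That is the behavior the abstract attributes to the uncertainty penalty.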


2604.17949 2026-04-21 cs.CV

ZSG-IAD: A Multimodal Framework for Zero-Shot Grounded Industrial Anomaly Detection

Qiuhui Chen, Jiaxiang Song, Shuai Tan, Weimin Zhong

2604.17944 2026-04-21 cs.CL

ReCoQA: A Benchmark for Tool-Augmented and Multi-Step Reasoning in Real Estate Question and Answering

Yindong Zhang, Wenmian Yang, Yiquan Zhang, Weijia Jia

Comments Accepted by ACL 2026