arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.15310 2026-04-20 cs.CV cs.GR

TokenLight: Precise Lighting Control in Images using Attribute Tokens

Sumit Chaturvedi, Yannick Hold-Geoffroy, Mengwei Ren, Jingyuan Liu, He Zhang, Yiqun Mei, Julie Dorsey, Zhixin Shu

Comments 32 pages, CVPR 2026, Project Page: https://vrroom.github.io/tokenlight/

详情

英文摘要

This paper presents a method for image relighting that enables precise and continuous control over multiple illumination attributes in a photograph. We formulate relighting as a conditional image generation task and introduce attribute tokens to encode distinct lighting factors such as intensity, color, ambient illumination, diffuse level, and 3D light positions. The model is trained on a large-scale synthetic dataset with ground-truth lighting annotations, supplemented by a small set of real captures to enhance realism and generalization. We validate our approach across a variety of relighting tasks, including controlling in-scene lighting fixtures and editing environment illumination using virtual light sources, on synthetic and real images. Our method achieves state-of-the-art quantitative and qualitative performance compared to prior work. Remarkably, without explicit inverse rendering supervision, the model exhibits an inherent understanding of how light interacts with scene geometry, occlusion, and materials, yielding convincing lighting effects even in traditionally challenging scenarios such as placing lights within objects or relighting transparent materials plausibly. Project page: vrroom.github.io/tokenlight/

URL PDF HTML ☆

赞 0 踩 0

2604.15297 2026-04-20 cs.LG

Benchmarking Optimizers for MLPs in Tabular Deep Learning

Yury Gorishniy, Ivan Rubachev, Dmitrii Feoktistov, Artem Babenko

Comments Code: https://github.com/yandex-research/tabular-dl-optimizers

2604.15237 2026-04-20 cs.CV

StreamCacheVGGT: Streaming Visual Geometry Transformers with Robust Scoring and Hybrid Cache Compression

Xuanyi Liu, Chunan Yu, Deyi Ji, Qi Zhu, Lingyun Sun, Xuanfu Li, Jin Ma, Tianrun Chen, Lanyun Zhu

2604.14928 2026-04-20 cs.CV cs.GR

Hybrid Latents: Geometry-Appearance-Aware Surfel Splatting

Neel Kelkar, Simon Niedermayr, Klaus Engel, Rüdiger Westermann

Comments 22 pages, 9 figures

2604.14489 2026-04-20 cs.CL

CobwebTM: Probabilistic Concept Formation for Lifelong and Hierarchical Topic Modeling

Karthik Singaravadivelan, Anant Gupta, Zekun Wang, Christopher J. MacLellan

Comments 16 pages, 8 figures, 11 tables

2604.14333 2026-04-20 cs.LG

When Missing Becomes Structure: Intent-Preserving Policy Completion from Financial KOL Discourse

Yuncong Liu, Yuan Wan, Zhou Jiang, Yao Lu

Comments Main paper with supplementary material included

2604.14243 2026-04-20 cs.LG cs.AI

Optimistic Policy Learning under Pessimistic Adversaries with Regret and Violation Guarantees

Sourav Ganguly, Kartik Pandit, Arnob Ghosh

2604.13061 2026-04-20 cs.CL cs.AI

Token Statistics Reveal Conversational Drift in Multi-turn LLM Interaction

Wael Hafez, Amir Nazeri

Comments 13 Pages, 3 Figures

2604.08809 2026-04-20 cs.LG stat.AP

Structural Evaluation Metrics for SVG Generation via Leave-One-Out Analysis

Haonan Zhu, Adrienne Deganutti, Elad Hirsch, Purvanshi Mehta

2604.07055 2026-04-20 cs.LG

AdaBoost Does Not Always Cycle: A Computer-Assisted Counterexample

Erik Y. Wang

2604.04101 2026-04-20 cs.LG

Restless Bandits with Individual Penalty Constraints: Near-Optimal Indices and Deep Reinforcement Learning

Nida Zamir, I-Hong Hou

2603.26062 2026-04-20 cs.CL cs.CY cs.SI

Measuring the Semantic Structure and Evolution of Conspiracy Theories

Manisha Keim, Sarmad Chandio, Osama Khalid, Rishab Nithyanand

2603.20210 2026-04-20 cs.CL cs.AI

CRoCoDiL: Continuous and Robust Conditioned Diffusion for Language

Roy Uziel, Omer Belhasin, Itay Levy, Akhiad Bercovich, Ran El-Yaniv, Ran Zilberstein, Michael Elad

2603.05719 2026-04-20 cs.LG

Unsupervised domain adaptation for radioisotope identification in gamma spectroscopy

Peter Lalor, Ayush Panigrahy, Alex Hagen

Comments 38 pages, 5 figures, and 14 tables

2603.01098 2026-04-20 cs.CV cs.AI cs.LG

Differential privacy representation geometry for medical image analysis

Soroosh Tayebi Arasteh, Marziyeh Mohammadi, Sven Nebelung, Daniel Truhn

2602.22479 2026-04-20 cs.LG

Efficient Continual Learning in Language Models via Thalamically Routed Cortical Columns

Afshin Khadangi

2602.09953 2026-04-20 cs.CL

ATTNPO: Attention-Guided Process Supervision for Efficient Reasoning

Shuaiyi Nie, Siyu Ding, Wenyuan Zhang, Linhao Yu, Tianmeng Yang, Yao Chen, Weichong Yin, Yu Sun, Hua Wu, Tingwen Liu

Comments Accepted by ACL 2026 Main

2601.12193 2026-04-20 cs.CV

VeRVE: Versatile Retrieval for Videos via Unified Embeddings

Shaunak Halbe, Bhagyashree Puranik, Jayakrishnan Unnikrishnan, Kushan Thakkar, Vimal Bhat, Toufiq Parag

2601.10198 2026-04-20 cs.CL

HumanLLM: Benchmarking and Improving LLM Anthropomorphism via Human Cognitive Patterns

Xintao Wang, Jian Yang, Weiyuan Li, Rui Xie, Jen-tse Huang, Jun Gao, Shuai Huang, Yueping Kang, Yuanli Gou, Hongwei Feng, Yanghua Xiao

Comments Accepted to ACL 2026 Main Conference

2601.05858 2026-04-20 cs.CL cs.AI cs.LG

CLewR: Curriculum Learning with Restarts for Machine Translation Preference Learning

Alexandra Dragomir, Florin Brad, Radu Tudor Ionescu

Comments Accepted at ACL 2026

2601.05808 2026-04-20 cs.CL cs.AI cs.LG

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Xiaoshuai Song, Haofei Chang, Guanting Dong, Yutao Zhu, Ji-Rong Wen, Zhicheng Dou

Comments Add some experiments

2601.05201 2026-04-20 cs.CV cs.AI cs.CL

Mechanisms of Prompt-Induced Hallucination in Vision-Language Models

William Rudman, Michal Golovanevsky, Dana Arad, Yonatan Belinkov, Ritambhara Singh, Carsten Eickhoff, Kyle Mahowald

Comments ACL 2026 Main

2512.17052 2026-04-20 cs.LG

Dynamic Tool Dependency Retrieval for Lightweight Function Calling

Bhrij Patel, Davide Belli, Amir Jalalirad, Maximilian Arnold, Aleksandr Ermolov, Bence Major

Comments 24 pages, 6 figures, 8 tables

2512.01099 2026-04-20 cs.AI

Cost-Aware Model Orchestration for LLM-based Systems

Daria Smirnova, Hamid Nasiri, Marta Adamska, Zhengxin Yu, Peter Garraghan

Comments 9 pages, 5 figures. Accepted at EuroMLSys '26, Edinburgh, Scotland UK

2511.02626 2026-04-20 cs.CL

Understanding New-Knowledge-Induced Factual Hallucinations in LLMs: Analysis and Interpretation

Renfei Dang, Peng Hu, Zhejian Lai, Changjiang Gao, Min Zhang, Shujian Huang

Comments ACL 2026 Findings

2510.23536 2026-04-20 cs.CL

IPQA: A Benchmark for Core Intent Identification in Personalized Question Answering

Jieyong Kim, Maryam Amirizaniani, Soojin Yoon, Dongha Lee

2510.22977 2026-04-20 cs.LG cs.AI

The Reasoning Trap: How Enhancing LLM Reasoning Amplifies Tool Hallucination

Chenlong Yin, Zeyang Sha, Shiwen Cui, Changhua Meng, Zechao Li

Comments Accepted to ACL 2026 Main

详情

英文摘要

Enhancing the reasoning capabilities of Large Language Models (LLMs) is a key strategy for building Agents that "think then act." However, recent observations, like OpenAI's o3, suggest a paradox: stronger reasoning often coincides with increased hallucination, yet no prior work has systematically examined whether reasoning enhancement itself causes tool hallucination. To address this gap, we pose the central question: Does strengthening reasoning increase tool hallucination? To answer this, we introduce SimpleToolHalluBench, a diagnostic benchmark measuring tool hallucination in two failure modes: (i) no tool available, and (ii) only distractor tools available. Through controlled experiments, we establish three key findings. First, we demonstrate a causal relationship: progressively enhancing reasoning through RL increases tool hallucination proportionally with task performance gains. Second, this effect transcends overfitting - training on non-tool tasks (e.g., mathematics) still amplifies subsequent tool hallucination. Third, the effect is method-agnostic, appearing when reasoning is instilled via supervised fine-tuning and when it is merely elicited at inference by switching from direct answers to step-by-step thinking. We also evaluate mitigation strategies including Prompt Engineering and Direct Preference Optimization (DPO), revealing a fundamental reliability-capability trade-off: reducing hallucination consistently degrades utility. Mechanistically, Reasoning RL disproportionately collapses tool-reliability-related representations, and hallucinations surface as amplified divergences concentrated in late-layer residual streams. These findings reveal that current reasoning enhancement methods inherently amplify tool hallucination, highlighting the need for new training objectives that jointly optimize for capability and reliability.

URL PDF HTML ☆

赞 0 踩 0

2510.21783 2026-04-20 cs.CV cs.AI cs.CR

Noise Aggregation Analysis Driven by Small-Noise Injection: Efficient Membership Inference for Diffusion Models

Guo Li, Weihong Chen, Yongfu Fan

2510.09065 2026-04-20 cs.SD cs.CV cs.LG eess.AS

MMAudioSep: Taming Video-to-Audio Generative Model Towards Video/Text-Queried Sound Separation

Akira Takahashi, Shusuke Takahashi, Yuki Mitsufuji

Comments Accepted to ICASSP 2026. 4 pages, 4 figures, 2 tables

2510.09033 2026-04-20 cs.CL

Do LLMs Really Know What They Don't Know? Internal States Mainly Reflect Knowledge Recall Rather Than Truthfulness

Chi Seng Cheang, Hou Pong Chan, Wenxuan Zhang, Yang Deng