arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.19747 2026-04-22 cs.CV

AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

Yutian Chen, Shi Guo, Renbiao Jin, Tianshuo Yang, Xin Cai, Yawen Luo, Mingxin Yang, Mulin Yu, Linning Xu, Tianfan Xue

Comments Webpage: https://yutian10.github.io/AnyRecon/

详情

英文摘要

Sparse-view 3D reconstruction is essential for modeling scenes from casual captures, but remain challenging for non-generative reconstruction. Existing diffusion-based approaches mitigates this issues by synthesizing novel views, but they often condition on only one or two capture frames, which restricts geometric consistency and limits scalability to large or diverse scenes. We propose AnyRecon, a scalable framework for reconstruction from arbitrary and unordered sparse inputs that preserves explicit geometric control while supporting flexible conditioning cardinality. To support long-range conditioning, our method constructs a persistent global scene memory via a prepended capture view cache, and removes temporal compression to maintain frame-level correspondence under large viewpoint changes. Beyond better generative model, we also find that the interplay between generation and reconstruction is crucial for large-scale 3D scenes. Thus, we introduce a geometry-aware conditioning strategy that couples generation and reconstruction through an explicit 3D geometric memory and geometry-driven capture-view retrieval. To ensure efficiency, we combine 4-step diffusion distillation with context-window sparse attention to reduce quadratic complexity. Extensive experiments demonstrate robust and scalable reconstruction across irregular inputs, large viewpoint gaps, and long trajectories.

URL PDF HTML ☆

赞 0 踩 0

2604.19742 2026-04-22 cs.SE

PlayCoder: Making LLM-Generated GUI Code Playable

Zhiyuan Peng, Wei Tao, Xin Yin, Chenhao Ying, Yuan Luo, Yiwen Guo

Comments September 11, 2025 Submitted to FSE2026

详情

英文摘要

Large language models (LLMs) have achieved strong results in code generation, but their ability to generate GUI applications, especially games, remains insufficiently studied. Existing benchmarks mainly evaluate correctness through test cases, which are inadequate for GUI applications because these systems are interactive, event-driven, and require correct state transitions across sequences of user actions. Their evaluation therefore should consider interaction flows and UI logic rather than only pass/fail outcomes. To study this problem, we introduce PlayEval, a repository-aware benchmark built from 43 multilingual GUI applications in Python, TypeScript, and JavaScript. Unlike prior GUI benchmarks that are difficult to adapt to desktop environments, PlayEval covers six major GUI application categories and directly supports code-generation evaluation. We further propose Play@k, a metric that measures whether at least one of *k* generated candidates can be played end-to-end without logical errors. To support reliable evaluation, we develop PlayTester, an LLM-based agent that performs task-oriented GUI playthroughs and detects logic violations automatically. Experiments on 10 state-of-the-art code LLMs show that, despite high compilation rates, they achieve near-zero Play@3, revealing major weaknesses in generating logically correct GUI applications. To address this limitation, we present PlayCoder, a multi-agent, repository-aware framework that generates, evaluates, and iteratively repairs GUI application code in a closed loop. PlayCoder substantially improves both functional correctness and semantic alignment for open-source and closed-source models, reaching up to 38.1% Exec@3 and 20.3% Play@3. Case studies further show that it can uncover silent logic bugs missed by traditional metrics and fix them through targeted edits.

URL PDF HTML ☆

赞 0 踩 0

2604.19740 2026-04-22 cs.LG cs.AI cs.CV stat.ML

Generalization at the Edge of Stability

Mario Tuci, Caner Korkmaz, Umut Şimşekli, Tolga Birdal

Comments Project page: https://circle-group.github.io/research/GATES

2604.19737 2026-04-22 cs.LG

Safe Continual Reinforcement Learning in Non-stationary Environments

Austin Coursey, Abel Diaz-Gonzalez, Marcos Quinones-Grueiro, Gautam Biswas

2604.19736 2026-04-22 cs.CV

Generative Drifting for Conditional Medical Image Generation

Zirong Li, Siyuan Mei, Weiwen Wu, Andreas Maier, Lina Gölz, Yan Xia

2604.19734 2026-04-22 cs.RO cs.AI

UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling

Boyu Chen, Yi Chen, Lu Qiu, Jerry Bai, Yuying Ge, Yixiao Ge

Comments Project page: https://xpeng-robotics.github.io/unit/

2604.19733 2026-04-22 math.CO cs.DS cs.NI cs.SI

Greedy Routing in a Sequentially Grown One-Dimensional Random Graph

Alexander Ponomarenko

2604.19730 2026-04-22 cs.LG cs.AI

FASTER: Value-Guided Sampling for Fast RL

Perry Dong, Alexander Swerdlow, Dorsa Sadigh, Chelsea Finn

2604.19729 2026-04-22 cs.LG cs.IT eess.SP math.IT

FB-NLL: A Feature-Based Approach to Tackle Noisy Labels in Personalized Federated Learning

Abdulmoneam Ali, Ahmed Arafa

Comments Submitted for journal publication

2604.19728 2026-04-22 cs.RO cs.AI cs.CV cs.LG cs.SE

VLA Foundry: A Unified Framework for Training Vision-Language-Action Models

Jean Mercat, Sedrick Keh, Kushal Arora, Isabella Huang, Paarth Shah, Haruki Nishimura, Shun Iwase, Katherine Liu

Comments 32 pages, 16 figures, technical report

2604.19724 2026-04-22 cs.LG cs.AI

Benign Overfitting in Adversarial Training for Vision Transformers

Jiaming Zhang, Meng Ding, Shaopeng Fu, Jingfeng Zhang, Di Wang

Comments arXiv admin note: text overlap with arXiv:2409.19345 by other authors

2604.19722 2026-04-22 cs.LG cs.AI

Adaptive MSD-Splitting: Enhancing C4.5 and Random Forests for Skewed Continuous Attributes

Jake Lee

2604.19720 2026-04-22 cs.CV

ReImagine: Rethinking Controllable High-Quality Human Video Generation via Image-First Synthesis

Zhengwentai Sun, Keru Zheng, Chenghong Li, Hongjie Liao, Xihe Yang, Heyuan Li, Yihao Zhi, Shuliang Ning, Shuguang Cui, Xiaoguang Han

2604.19719 2026-04-22 cs.FL

On Languages Describing Large Graph Classes

Henning Fernau, Pamela Fleischmann, Kevin Mann, Silas Cato Sacher

Comments arXiv admin note: substantial text overlap with arXiv:2411.03274

2604.19717 2026-04-22 quant-ph cs.CC

Qubit Routing for (Almost) Free

Arianne Meijer-van de Griend

Comments 14 pages, rough draft

2604.19716 2026-04-22 cs.CL

Discovering a Shared Logical Subspace: Steering LLM Logical Reasoning via Alignment of Natural-Language and Symbolic Views

Feihao Fang, My T. Thai, Yuanyuan Lei

Comments Accepted to ACL 2026

2604.19715 2026-04-22 cs.CV cs.SY eess.SY

A Network-Aware Evaluation of Distributed Energy Resource Control in Smart Distribution Systems

Houchao Gan

2604.19712 2026-04-22 cs.LG cond-mat.dis-nn cs.IT math.IT math.PR stat.ML

Ultrametric OGP - parametric RDT \emph{symmetric} binary perceptron connection

Mihailo Stojnic

2604.19711 2026-04-22 cs.CR cs.CY cs.HC

"We are currently clean on OPSEC": Why JD Can't Encrypt

Maurice Chiodo, Toni Erskine, Dennis Müller, James G. Wright

Comments 31 pages

2604.19710 2026-04-22 cs.CV

SpanVLA: Efficient Action Bridging and Learning from Negative-Recovery Samples for Vision-Language-Action Model

Zewei Zhou, Ruining Yang, Xuewei, Qi, Yiluan Guo, Sherry X. Chen, Tao Feng, Kateryna Pistunova, Yishan Shen, Lili Su, Jiaqi Ma

Comments Project page: https://spanvla.github.io/

2604.19709 2026-04-22 eess.SP cs.IT math.IT

Networked Tracking of Multiple Moving Targets in 6G Network

Yanmo Hu, Weifeng Zhu, Chenshu Wu, Shuowen Zhang, J. Andrew Zhang, Liang Liu

2604.19702 2026-04-22 cs.CV

Face Anything: 4D Face Reconstruction from Any Image Sequence

Umut Kocasari, Simon Giebenhain, Richard Shaw, Matthias Nießner

Comments Project website: https://kocasariumut.github.io/FaceAnything/ , Video: https://www.youtube.com/watch?v=wSGHpAscp0Y

2604.19698 2026-04-22 cs.LG math.ST stat.TH

On two ways to use determinantal point processes for Monte Carlo integration

Guillaume Gautier, Rémi Bardenet, Michal Valko

Comments NeurIPS 2019

2604.19695 2026-04-22 cs.LG

Planning in entropy-regularized Markov decision processes and games

Jean-Bastien Grill, Omar Darwiche Domingues, Pierre Ménard, Rémi Munos, Michal Valko

Comments NeurIPS 2019

2604.19689 2026-04-22 cs.AI

A-MAR: Agent-based Multimodal Art Retrieval for Fine-Grained Artwork Understanding

Shuai Wang, Hongyi Zhu, Jia-Hong Huang, Yixian Shen, Chengxi Zeng, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring

2604.19686 2026-04-22 eess.SY cs.SY

Towards Reproducible Test Annotation for Cyber-Physical Energy Systems using Ontology-driven Dataspaces

Kai Heussen, Jawad Kazmi, Narges Mehran, Artjoms Obushevs, Terence O'Donnell, Thomas I. Strasser

Comments 2026 Open Source Modelling and Simulation of Energy Systems (OSMSES)

2604.19685 2026-04-22 cs.CL

An Answer is just the Start: Related Insight Generation for Open-Ended Document-Grounded QA

Saransh Sharma, Pritika Ramu, Aparna Garimella, Koyel Mukherjee

Comments Paper accepted at ACL Findings 2026

2604.19684 2026-04-22 cs.LG

PREF-XAI: Preference-Based Personalized Rule Explanations of Black-Box Machine Learning Models

Salvatore Greco, Jacek Karolczak, Roman Słowiński, Jerzy Stefanowski

2604.19680 2026-04-22 cs.CV

IR-Flow: Bridging Discriminative and Generative Image Restoration via Rectified Flow

Zihao Fan, Xin Lu, Jie Xiao, Dong Li, Jie Huang, Xueyang Fu

2604.19678 2026-04-22 cs.CL

Exploring Language-Agnosticity in Function Vectors: A Case Study in Machine Translation

Nurkhan Laiyk, Gerard I. Gállego, Javier Ferrando, Fajri Koto