arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.04934 2026-04-07 cs.CV

Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision

Hyunsoo Cha, Wonjung Woo, Byungjun Kim, Hanbyul Joo

Comments Accepted to CVPR 2026, Project Page: https://hyunsoocha.github.io/vanast/

详情

英文摘要

We present Vanast, a unified framework that generates garment-transferred human animation videos directly from a single human image, garment images, and a pose guidance video. Conventional two-stage pipelines treat image-based virtual try-on and pose-driven animation as separate processes, which often results in identity drift, garment distortion, and front-back inconsistency. Our model addresses these issues by performing the entire process in a single unified step to achieve coherent synthesis. To enable this setting, we construct large-scale triplet supervision. Our data generation pipeline includes generating identity-preserving human images in alternative outfits that differ from garment catalog images, capturing full upper and lower garment triplets to overcome the single-garment-posed video pair limitation, and assembling diverse in-the-wild triplets without requiring garment catalog images. We further introduce a Dual Module architecture for video diffusion transformers to stabilize training, preserve pretrained generative quality, and improve garment accuracy, pose adherence, and identity preservation while supporting zero-shot garment interpolation. Together, these contributions allow Vanast to produce high-fidelity, identity-consistent animation across a wide range of garment types.

URL PDF HTML ☆

赞 0 踩 0

2604.04933 2026-04-07 cs.CV

PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding

Siyuan Liu, Chaoqun Zheng, Xin Zhou, Tianrui Feng, Dingkang Liang, Xiang Bai

Comments Accepted by CVPR 2026. The code is available at https://github.com/H-EmbodVis/PointTPA

2604.04931 2026-04-07 cs.CV

LoMa: Local Feature Matching Revisited

David Nordström, Johan Edstedt, Georg Bökman, Jonathan Astermark, Anders Heyden, Viktor Larsson, Mårten Wadenbäck, Michael Felsberg, Fredrik Kahl

2604.04930 2026-04-07 cs.CL cs.AI cs.LG

Early Stopping for Large Reasoning Models via Confidence Dynamics

Parsa Hosseini, Sumit Nawathe, Mahdi Salmani, Meisam Razaviyayn, Soheil Feizi

2604.04929 2026-04-07 cs.CV

Rethinking Model Efficiency: Multi-Agent Inference with Large Models

Sixun Dong, Juhua Hu, Steven Li, Wei Wen, Qi Qian

2604.04926 2026-04-07 cs.CR cs.HC

Comprehensive List of User Deception Techniques in Emails

Maxime Veit, Mattia Mossano, Tobias Länge, Melanie Volkamer

2604.04924 2026-04-07 cs.CV cs.AI

Your Pre-trained Diffusion Model Secretly Knows Restoration

Sudarshan Rajagopalan, Vishal M. Patel

Comments Project page: https://sudraj2002.github.io/yptpage/

2604.04923 2026-04-07 cs.LG cs.LO cs.SY eess.SY math.AT

Stratifying Reinforcement Learning with Signal Temporal Logic

Justin Curry, Alberto Speranzon

Comments 8 pages, 13 figures

2604.04921 2026-04-07 cs.CL cs.CV

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Weian Mao, Xi Lin, Wei Huang, Yuxin Xie, Tianfu Fu, Bohan Zhuang, Song Han, Yukang Chen

Comments Code is available at https://github.com/WeianMao/triattention

2604.04920 2026-04-07 math.OC cs.LG

PINNs in PDE Constrained Optimal Control Problems: Direct vs Indirect Methods

Zhen Zhang, Shanqing Liu, Alessandro Alla, Jerome Darbon, George Em Karniadakis

Comments 8 pages, 3 figures

2604.04918 2026-04-07 cs.HC

Comparing Human Oversight Strategies for Computer-Use Agents

Chaoran Chen, Zhiping Zhang, Zeya Chen, Eryue Xu, Yinuo Yang, Ibrahim Khalilov, Simret A Gebreegziabher, Yanfang Ye, Ziang Xiao, Yaxing Yao, Tianshi Li, Toby Jia-Jun Li

2604.04916 2026-04-07 cs.LG

Empowering Power Outage Prediction with Spatially Aware Hybrid Graph Neural Networks and Contrastive Learning

Xuyang Shen, Zijie Pan, Diego Cerrai, Xinxuan Zhang, Christopher Colorio, Emmanouil N. Anagnostou, Dongjin Song

2604.04915 2026-04-07 cs.HC

Exploring Expert Perspectives on Wearable-Triggered LLM Conversational Support for Daily Stress Management

Poorvesh Dongre, Sameer Neupane, Priyanka Jadhav, Nikitha Donekal Chandrashekar, Christian Webb, Denis Gračanin

2604.04914 2026-04-07 cs.NI cs.AI cs.LG

Analyzing Symbolic Properties for DRL Agents in Systems and Networking

Mohammad Zangooei, Jannis Weil, Amr Rizk, Mina Tahmasbi Arashloo, Raouf Boutaba

Comments Accepted in ACM SIGMETRICS'26

详情

英文摘要

Deep reinforcement learning (DRL) has shown remarkable performance on complex control problems in systems and networking, including adaptive video streaming, wireless resource management, and congestion control. For safe deployment, however, it is critical to reason about how agents behave across the range of system states they encounter in practice. Existing verification-based methods in this domain primarily focus on point properties, defined around fixed input states, which offer limited coverage and require substantial manual effort to identify relevant input-output pairs for analysis. In this paper, we study symbolic properties, that specify expected behavior over ranges of input states, for DRL agents in systems and networking. We present a generic formulation for symbolic properties, with monotonicity and robustness as concrete examples, and show how they can be analyzed using existing DNN verification engines. Our approach encodes symbolic properties as comparisons between related executions of the same policy and decomposes them into practically tractable sub-properties. These techniques serve as practical enablers for applying existing verification tools to symbolic analysis. Using our framework, diffRL, we conduct an extensive empirical study across three DRL-based control systems, adaptive video streaming, wireless resource management, and congestion control. Through these case studies, we analyze symbolic properties over broad input ranges, examine how property satisfaction evolves during training, study the impact of model size on verifiability, and compare multiple verification backends. Our results show that symbolic properties provide substantially broader coverage than point properties and can uncover non-obvious, operationally meaningful counterexamples, while also revealing practical solver trade-offs and limitations.

URL PDF HTML ☆

赞 0 踩 0

2604.04913 2026-04-07 cs.CV

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Tommie Kerssies, Gabriele Berton, Ju He, Qihang Yu, Wufei Ma, Daan de Geus, Gijs Dubbelman, Liang-Chieh Chen

Comments CVPR 2026. Code and weights: https://deltatok.github.io

2604.04912 2026-04-07 cs.DS

Dominating Set with Quotas: Balancing Coverage and Constraints

Sobyasachi Chatterjee, Sushmita Gupta, Saket Saurabh, Sanjay Seetharaman, Anannya Upasana

Comments 24 pages; full version of the paper to appear in IWOCA 2026

2604.04909 2026-04-07 cond-mat.other cs.NA math.NA physics.chem-ph

Weak Solutions to the Bloch Equations with Distant Dipolar Field

Louis-S. Bouchard

Comments 28 pages, 9 figures, 3 tables

详情

DOI: 10.1063/5.0325917

英文摘要

The distant dipolar field (DDF) is a long-range, nonlocal contribution to liquid-state spin dynamics that arises from intermolecular dipolar couplings and can generate multiple-quantum coherences and novel MRI contrast. Its sign-changing kernel makes Bloch-DDF dynamics strongly geometry dependent, and FFT-based dipolar convolutions naturally assume periodic or padded Cartesian domains rather than bounded samples with reflective diffusion boundaries. We study the Bloch equations with the DDF on bounded domains under homogeneous Neumann diffusion conditions. We derive a finite-element weak formulation that supports spatially varying diffusion and relaxation parameters and uses a short-distance regularization of the secular DDF kernel with length a>0. For fixed a we prove boundedness of the DDF operator, establish an L2 energy balance in which precession is neutral while diffusion and transverse relaxation are dissipative, and obtain local well-posedness with continuous dependence on the data, with global existence under energy-neutral transport. For the Galerkin semi-discretization we show a discrete energy identity mirroring the continuum estimate. For computation, we evaluate the DDF in real space with a matrix-free near/far scheme and advance in time using a second-order IMEX splitting method that treats diffusion and relaxation implicitly and precession explicitly. The explicit stage applies a Rodrigues rotation at DDF quadrature points followed by an L2 projection, enabling stable multi-cycle lab-frame simulations. We validate against three closed-form benchmarks and quantify curved-boundary effects by comparing mapped finite elements with a voxel-mask finite-difference baseline on spherical Neumann eigenmode decay. These results provide an analyzable and reproducible route for Bloch-DDF dynamics on bounded domains with complex geometry.

URL PDF HTML ☆

赞 0 踩 0

2604.04908 2026-04-07 cs.LG

HI-MoE: Hierarchical Instance-Conditioned Mixture-of-Experts for Object Detection

Vadim Vashkelis, Natalia Trukhina

2604.04906 2026-04-07 econ.TH cs.AI cs.CY cs.SI

How AI Aggregation Affects Knowledge

Daron Acemoglu, Tianyi Lin, Asuman Ozdaglar, James Siderius

Comments 45 pages

2604.04905 2026-04-07 cs.CV cs.GR cs.HC

ClickAIXR: On-Device Multimodal Vision-Language Interaction with Real-World Objects in Extended Reality

Dawar Khan, Alexandre Kouyoumdjian, Xinyu Liu, Omar Mena, Dominik Engel, Ivan Viola

2604.04904 2026-04-07 cs.HC cs.CY

Demonstrating SIMA-Play: A Serious Game for Forest Management Decision-Making through Board Game and Digital Simulation

Arka Majhi, Daniel Fernández Galeote, Timo Nummenmaa, Juho Hamari, Aaron Petty, Jari Vauhkonen, Heli Peltola

Comments Accepted to the GamiFIN 2026 conference

2604.04902 2026-04-07 cs.LG

Are Latent Reasoning Models Easily Interpretable?

Connor Dilgren, Sarah Wiegreffe

Comments Preprint

2604.04901 2026-04-07 cs.CV cs.AI

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Shuai Liu, Shulin Tian, Kairui Hu, Yuhao Dong, Zhe Yang, Bo Li, Jingkang Yang, Chen Change Loy, Ziwei Liu

Comments Project Page: https://filegram.choiszt.com, Code: https://github.com/synvo-ai/FileGram

2604.04898 2026-04-07 cs.AI cs.CL cs.LG

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

LM-Provers, Yuxiao Qu, Amrith Setlur, Jasper Dekoninck, Edward Beeching, Jia Li, Ian Wu, Lewis Tunstall, Aviral Kumar

2604.04896 2026-04-07 math.CO cs.DM

Measuring Depth of Matroids

Jakub Balabán, Petr Hliněný, Jan Jedelský, Kristýna Pekárková

2604.04895 2026-04-07 cs.MA cs.AI

Agentic Federated Learning: The Future of Distributed Training Orchestration

Rafael O. Jarczewski, Gabriel U. Talasso, Leandro Villas, Allan M. de Souza

2604.04893 2026-04-07 cs.DB cs.IT math.IT

Query Optimization and Evaluation via Information Theory: A Tutorial

Mahmoud Abo Khamis, Hung Q. Ngo, Dan Suciu

2604.04892 2026-04-07 cs.LG

Data Attribution in Adaptive Learning

Amit Kiran Rege

Comments Work in progress

2604.04890 2026-04-07 cs.DC cs.NI

Towards Policy-Enabled Multi-Hop Routing for Cross-Chain Message Delivery

Amin Rezaei, Solomon L. Davidson, Bernard Wong

Comments 11 pages, 8 figures

2604.04887 2026-04-07 cs.CV

HorizonWeaver: Generalizable Multi-Level Semantic Editing for Driving Scenes

Mauricio Soroco, Francesco Pittaluga, Zaid Tasneem, Abhishek Aich, Bingbing Zhuang, Wuyang Chen, Manmohan Chandraker, Ziyu Jiang

Comments CVPR Findings 2026