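arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.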

2604.26951 2026-04-30 cs.CL cs.AI cs.LG

Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models

Gongbo Zhang, Wen Wang, Ye Tian, Li Yuan

Comments 15 pages, 3 figures. Code: https://github.com/PKU-YuanGroup/TIDE

Abstract

Diffusion large language models (dLLMs) offer parallel decoding and bidirectional context, but state-of-the-art dLLMs require billions of parameters for competitive performance. While existing distillation methods for dLLMs reduce inference steps within a single architecture, none address cross-architecture knowledge transfer, in which the teacher and student differ in architecture, attention mechanism, and tokenizer. We present TIDE, the first framework for cross-architecture dLLM distillation, comprising three modular components: (1) TIDAL, which jointly modulates distillation strength across training progress and diffusion timestep to account for the teacher's noise-dependent reliability; (2) CompDemo, which enriches the teacher's context via complementary mask splitting to improve predictions under heavy masking; and (3) Reverse CALM, a cross-tokenizer objective that inverts chunk-level likelihood matching, yielding bounded gradients and dual-end noise filtering. Distilling 8B dense and 16B MoE teachers into a 0.6B student via two heterogeneous pipelines outperforms the baseline by an average of 1.53 points across eight benchmarks, yielding notable gains in code generation, where HumanEval scores reach 48.78 compared to 32.3 for the AR baseline.

2604.26946 2026-04-30 cs.CV cs.RO

Three-Step Nav: A Hierarchical Global-Local Planner for Zero-Shot Vision-and-Language Navigation

Wanrong Zheng, Yunhao Ge, Laurent Itti

Comments Accepted to AISTATS 2026. Code: https://github.com/ZoeyZheng0/3-step-Nav

2604.26944 2026-04-30 math.CA cs.SC

Fractions of Recurrence Operators for Generalized Fourier Series in Classical Orthogonal Polynomials

Alexandre Benoit, Nicolas Brisebarre, Bruno Salvy

2604.26943 2026-04-30 cs.CV

ProcFunc: Function-Oriented Abstractions for Procedural 3D Generation in Python

Alexander Raistrick, Karhan Kayan, Jack Nugent, David Yan, Lingjie Mei, Meenal Parakh, Hongyu Wen, Dylan Li, Yiming Zuo, Erich Liang, Jia Deng

2604.26942 2026-04-30 cs.LG math.ST q-bio.GN stat.ME stat.ML stat.TH

Hyper Input Convex Neural Networks for Shape Constrained Learning and Optimal Transport

Shayan Hundrieser, Insung Kong, Johannes Schmidt-Hieber

Comments 65 pages, 13 figures, the first two authors contributed equally

2604.26939 2026-04-30 math.PR cs.SI q-bio.PE

Degree-dependent and distance-dependent contact rates interpolate between explosive, exponential and polynomial epidemic growth

Zylan Benjert, Júlia Komjáthy, Johannes Lengler, John Lapinskas, Ulysse Schaller

2604.26935 2026-04-30 cs.HC

Artistic Practice Opportunities in CST Evaluations: A Longitudinal Group Deployment of ArtKrit

Catherine Liu, Tao Long, Asya Vaisberg, Chau Vu, Jiaju Ma, Jingyi Li

Comments 17 pages, 8 figures. Accepted to DIS 2026

2604.26934 2026-04-30 cs.CV

World2VLM: Distilling World Model Imagination into VLMs for Dynamic Spatial Reasoning

Wanyue Zhang, Wenxiang Wu, Wang Xu, Jiaxin Luo, Helu Zhi, Yibin Huang, Shuo Ren, Zitao Liu, Jiajun Zhang

Comments The code is available at https://github.com/WanyueZhang-ai/World2VLM. The dataset is available at https://huggingface.co/datasets/WanyueZhang/World2VLM

2604.26932 2026-04-30 math.OC cs.LG

Learning Over-Relaxation Policies for ADMM with Convergence Guarantees

Junan Lin, Paul J. Goulart, Luca Furieri

2604.26931 2026-04-30 cs.DC

Adaptive Self-Organization in Anonymous Dynamic Networks

Garrett Parzych, Joshua J. Daymude

Comments 30 pages, 1 figure, 1 table, 1 algorithm. To appear as a brief announcement at SAND 2026

2604.26926 2026-04-30 cs.LG math.OC stat.ML

A Note on How to Remove the $\ln\ln T$ Term from the Squint Bound

Francesco Orabona

2604.26923 2026-04-30 cs.SE cs.CL

ClassEval-Pro: A Cross-Domain Benchmark for Class-Level Code Generation

Yeheng Chen, Chaoxiang Xie, Yuling Shi, Wenhao Zeng, Yongpan Wang, Hongyu Zhang, Xiaodong Gu

Comments Accepted to AIware 2026. Code and data available at https://github.com/ian-Kappa/ClassEval-Pro

2604.26922 2026-04-30 cs.LG cs.DS cs.GT stat.ML

On the Learning Curves of Revenue Maximization

Steve Hanneke, Alkis Kalavasis, Shay Moran, Grigoris Velegkas

Comments To appear in the 58th ACM Symposium on Theory of Computing (STOC 2026)

2604.26921 2026-04-30 quant-ph cs.CC

En Route to a Standard QMA1 vs. QCMA Oracle Separation

David Miloschewsky, Supartha Podder, Dorian Rudolph

Comments 25 pages

2604.26920 2026-04-30 cs.CV

Color-Encoded Illumination for High-Speed Volumetric Scene Reconstruction

David Novikov, Eilon Vaknin, Narek Tumanyan, Mark Sheinin

Comments accepted to IEEE CVPR 2026 as a highlight

2604.26919 2026-04-30 cs.LG cs.AI cs.NE

Causal Learning with Neural Assemblies

Evangelia Kopadi, Dimitris Kalles

Comments 8 pages, 11 figures

2604.26917 2026-04-30 cs.CV

AnimateAnyMesh++: A Flexible 4D Foundation Model for High-Fidelity Text-Driven Mesh Animation

Zijie Wu, Chaohui Yu, Fan Wang, Xiang Bai

Comments 14 pages, TPAMI submission, code url: https://github.com/JarrentWu1031/AnimateAnyMesh-pp

2604.26913 2026-04-30 math.OC cs.NA math.NA math.PR

Generalization of Zeroth-Order Method for Quotients of Quadratic Functions

Jonas Bresch

2604.26910 2026-04-30 cs.RO

Bi-Level Optimization for Contact and Motion Planning in Rope-Assisted Legged Robots

Ruben Malacarne, Ioannis Tsikelis, Enrico Mingo Hoffman, Michele Focchi

2604.26903 2026-04-30 eess.SP cs.AI cs.AR cs.ET cs.SY eess.SY

Recent Advances in mm-Wave and Sub-THz/THz Oscillators for FutureG Technologies

Baktash Behmanesh, Ahmad Rezvanitabar

2604.26900 2026-04-30 quant-ph cs.CC cs.DS

Strict Hierarchy for Quantum Channel Certification to Unitary

Kean Chen, Qisheng Wang, Zhicheng Zhang

Comments 13 pages, 3 algorithms

2604.26899 2026-04-30 eess.SY cs.RO cs.SY

Safe Navigation using Neural Radiance Fields via Reachable Sets

Omanshu Thapliyal, Malarvizhi Sankaranarayanasamy, Ravigopal Vennelakanti

Comments 5 pages, 8 figures, 2026 4th International Conference on Mechatronics, Control and Robotics (ICMCR)

2604.26898 2026-04-30 math.PR cs.LG stat.ML

Stochastic Scaling Limits and Synchronization by Noise in Deep Transformer Models

Andrea Agazzi, Giuseppe Bruno, Eloy Mosig García, Samuele Saviozzi, Marco Romito

Comments 55 pages, 6 figures

2604.26897 2026-04-30 cs.RO cs.SY eess.SY

Stochastic Entanglement of Deterministic Origami Tentacles For Universal Robotic Gripping

Alec Boron, Bokun Zheng, Ziyang Zhou, Noel Naughton, Suyi Li

2604.26896 2026-04-30 math.NA cs.NA physics.flu-dyn

Data assimilation for slightly compressible flow

Aytekin Çıbık, Rui Fang

2604.26893 2026-04-30 cs.CV

Graph-based Semantic Calibration Network for Unaligned UAV RGBT Image Semantic Segmentation and A Large-scale Benchmark

Fangqiang Fan, Zhicheng Zhao, Xiaoliang Ma, Chenglong Li, Jin Tang

Comments 13 pages, 13 figures

2604.26892 2026-04-30 cs.SE

Hot Fixing in the Wild

Carol Hanna, Karine Even-Mendoza, W. B. Langdon, Mar Zamorano López, Justyna Petke, Federica Sarro

2604.26889 2026-04-30 cs.PF

Revealing NVIDIA Closed-Source Driver Command Streams for CPU-GPU Runtime Behavior Insight

Yuang Yan, Ian Karlin, Ryan Grant

2604.26888 2026-04-30 cs.LG

Multiple Additive Neural Networks for Structured and Unstructured Data

Janis Mohr, Jörg Frochte

Comments Accepted author manuscript; page layout differs from the published Springer version

2604.26883 2026-04-30 cs.CV

SEAL: Semantic-aware Single-image Sticker Personalization with a Large-scale Sticker-tag Dataset

Changhyun Roh, Yonghyun Jeong, Jonghyun Lee, Chanho Eom, Jihyong Oh

Comments The last two authors are co-corresponding authors. Please visit our project page at https://cmlab-korea.github.io/SEAL