arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.26951 2026-04-30 cs.CL cs.AI cs.LG

Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models

Gongbo Zhang, Wen Wang, Ye Tian, Li Yuan

Comments 15 pages, 3 figures. Code: https://github.com/PKU-YuanGroup/TIDE

详情

英文摘要

Diffusion large language models (dLLMs) offer parallel decoding and bidirectional context, but state-of-the-art dLLMs require billions of parameters for competitive performance. While existing distillation methods for dLLMs reduce inference steps within a single architecture, none address cross-architecture knowledge transfer, in which the teacher and student differ in architecture, attention mechanism, and tokenizer. We present TIDE, the first framework for cross-architecture dLLM distillation, comprising three modular components: (1) TIDAL, which jointly modulates distillation strength across training progress and diffusion timestep to account for the teacher's noise-dependent reliability; (2) CompDemo, which enriches the teacher's context via complementary mask splitting to improve predictions under heavy masking; and (3) Reverse CALM, a cross-tokenizer objective that inverts chunk-level likelihood matching, yielding bounded gradients and dual-end noise filtering. Distilling 8B dense and 16B MoE teachers into a 0.6B student via two heterogeneous pipelines outperforms the baseline by an average of 1.53 points across eight benchmarks, yielding notable gains in code generation, where HumanEval scores reach 48.78 compared to 32.3 for the AR baseline.

URL PDF HTML ☆

赞 0 踩 0

2604.26946 2026-04-30 cs.CV cs.RO

Three-Step Nav: A Hierarchical Global-Local Planner for Zero-Shot Vision-and-Language Navigation

Wanrong Zheng, Yunhao Ge, Laurent Itti

Comments Accepted to AISTATS 2026. Code: https://github.com/ZoeyZheng0/3-step-Nav

2604.26943 2026-04-30 cs.CV

ProcFunc: Function-Oriented Abstractions for Procedural 3D Generation in Python

Alexander Raistrick, Karhan Kayan, Jack Nugent, David Yan, Lingjie Mei, Meenal Parakh, Hongyu Wen, Dylan Li, Yiming Zuo, Erich Liang, Jia Deng

2604.26942 2026-04-30 cs.LG math.ST q-bio.GN stat.ME stat.ML stat.TH

Hyper Input Convex Neural Networks for Shape Constrained Learning and Optimal Transport

Shayan Hundrieser, Insung Kong, Johannes Schmidt-Hieber

Comments 65 pages, 13 figures, the first two authors contributed equally

2604.26934 2026-04-30 cs.CV

World2VLM: Distilling World Model Imagination into VLMs for Dynamic Spatial Reasoning

Wanyue Zhang, Wenxiang Wu, Wang Xu, Jiaxin Luo, Helu Zhi, Yibin Huang, Shuo Ren, Zitao Liu, Jiajun Zhang

Comments The code is available at https://github.com/WanyueZhang-ai/World2VLM. The dataset is available at https://huggingface.co/datasets/WanyueZhang/World2VLM

2604.26926 2026-04-30 cs.LG math.OC stat.ML

A Note on How to Remove the $\ln\ln T$ Term from the Squint Bound

Francesco Orabona

2604.26922 2026-04-30 cs.LG cs.DS cs.GT stat.ML

On the Learning Curves of Revenue Maximization

Steve Hanneke, Alkis Kalavasis, Shay Moran, Grigoris Velegkas

Comments To appear in the 58th ACM Symposium on Theory of Computing (STOC 2026)

2604.26920 2026-04-30 cs.CV

Color-Encoded Illumination for High-Speed Volumetric Scene Reconstruction

David Novikov, Eilon Vaknin, Narek Tumanyan, Mark Sheinin

Comments accepted to IEEE CVPR 2026 as a highlight

2604.26919 2026-04-30 cs.LG cs.AI cs.NE

Causal Learning with Neural Assemblies

Evangelia Kopadi, Dimitris Kalles

Comments 8 pages, 11 figures

2604.26917 2026-04-30 cs.CV

AnimateAnyMesh++: A Flexible 4D Foundation Model for High-Fidelity Text-Driven Mesh Animation

Zijie Wu, Chaohui Yu, Fan Wang, Xiang Bai

Comments 14 pages, TPAMI submission, code url: https://github.com/JarrentWu1031/AnimateAnyMesh-pp

2604.26910 2026-04-30 cs.RO

Bi-Level Optimization for Contact and Motion Planning in Rope-Assisted Legged Robots

Ruben Malacarne, Ioannis Tsikelis, Enrico Mingo Hoffman, Michele Focchi

2604.26897 2026-04-30 cs.RO cs.SY eess.SY

Stochastic Entanglement of Deterministic Origami Tentacles For Universal Robotic Gripping

Alec Boron, Bokun Zheng, Ziyang Zhou, Noel Naughton, Suyi Li

2604.26893 2026-04-30 cs.CV

Graph-based Semantic Calibration Network for Unaligned UAV RGBT Image Semantic Segmentation and A Large-scale Benchmark

Fangqiang Fan, Zhicheng Zhao, Xiaoliang Ma, Chenglong Li, Jin Tang

Comments 13 pages,13 figures

2604.26888 2026-04-30 cs.LG

Multiple Additive Neural Networks for Structured and Unstructured Data

Janis Mohr, Jörg Frochte

Comments Accepted author manuscript; page layout differs from the published Springer version

2604.26883 2026-04-30 cs.CV

SEAL: Semantic-aware Single-image Sticker Personalization with a Large-scale Sticker-tag Dataset

Changhyun Roh, Yonghyun Jeong, Jonghyun Lee, Chanho Eom, Jihyong Oh

Comments The last two authors are co-corresponding authors. Please visit our project page at https://cmlab-korea.github.io/SEAL

2604.26880 2026-04-30 cs.CL cs.LG

HealthNLP_Retrievers at ArchEHR-QA 2026: Cascaded LLM Pipeline for Grounded Clinical Question Answering

Md Biplob Hosen, Md Alomgeer Hussein, Md Akmol Masud, Omar Faruque, Tera L Reynolds, Lujie Karen Chen

2604.26873 2026-04-30 cs.CV

Uncertainty-Aware Pedestrian Attribute Recognition via Evidential Deep Learning

Zhuofan Lou, Shihang Zhang, Fangle Zhu, Shengjie Ye, Pingyu Wang

Comments 11 pages, 6 figures, 5 tables

2604.26869 2026-04-30 cs.LG cs.CV

KAYRA: A Microservice Architecture for AI-Assisted Karyotyping with Cloud and On-Premise Deployment

Attila Pintér, Javier Rico, Attila Répai, Jalal Al-Afandi, Adrienn Éva Borsy, András Kozma, Hajnalka Andrikovics, György Cserey

2604.26868 2026-04-30 cs.CV

Breaking the Rigid Prior: Towards Articulated 3D Anomaly Detection

Jinye Gan, Bozhong Zheng, Xiaohao Xu, Junye Ren, Zixuan Zhang, Na Ni, Yingna Wu

2604.26866 2026-04-30 cs.CL cs.LG

MoRFI: Monotonic Sparse Autoencoder Feature Identification

Dimitris Dimakopoulos, Shay B. Cohen, Ioannis Konstas

2604.26857 2026-04-30 cs.CV cs.LG cs.RO eess.IV

Edge AI for Automotive Vulnerable Road User Safety: Deployable Detection via Knowledge Distillation

Akshay Karjol, Darrin M. Hanna

Comments 6 pages, 3 figures

2604.26844 2026-04-30 cs.CL

What Kind of Language is Easy to Language-Model Under Curriculum Learning?

Nadine El-Naggar, Tatsuki Kuribayashi, Ted Briscoe

Comments The 15th edition of the Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2026)

2604.26841 2026-04-30 cs.LG cs.AI cs.CL

Language Diffusion Models are Associative Memories Capable of Retrieving Unseen Data

Bao Pham, Mohammed J. Zaki, Luca Ambrogioni, Dmitry Krotov, Matteo Negri

Comments Also see arXiv:2505.21777 for a related work

2604.26839 2026-04-30 cs.RO

Walk With Me: Long-Horizon Social Navigation for Human-Centric Outdoor Assistance

Lingfeng Zhang, Xiaoshuai Hao, Xizhou Bu, Yingbo Tang, Hongsheng Li, Jinghui Lu, Xiu-shen Wei, Jiayi Ma, Yu Liu, Jing Zhang, Hangjun Ye, Xiaojun Liang, Long Chen, Wenbo Ding

2604.26837 2026-04-30 cs.LG

Unifying Sparse Attention with Hierarchical Memory for Scalable Long-Context LLM Serving

Zihan Zhao, Baotong Lu, Shengjie Lin, Yizou Chen, Jing Liu, Yanqi Zhang, Ziming Miao, Ming-Chang Yang, Haiying Shen, Qi Chen, Fan Yang

Comments 15 pages

2604.26835 2026-04-30 cs.CL cs.AI cs.DL

HalluCiteChecker: A Lightweight Toolkit for Hallucinated Citation Detection and Verification in the Era of AI Scientists

Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe

Comments Work In Progress

2604.26833 2026-04-30 cs.RO cs.AI cs.LG

Rule-based High-Level Coaching for Goal-Conditioned Reinforcement Learning in Search-and-Rescue UAV Missions Under Limited-Simulation Training

Mahya Ramezani, Holger Voos

2604.26830 2026-04-30 cs.LG cs.AI

Random Cloud: Finding Minimal Neural Architectures Without Training

Javier Gil Blázquez

2604.26820 2026-04-30 cs.CV

Bridge: Basis-Driven Causal Inference Marries VFMs for Domain Generalization

Mingbo Hong, Feng Liu, Caroline Gevaert, George Vosselman, Hao Cheng

Comments Accepted by CVPR 2026

2604.26818 2026-04-30 cs.LG

Semi-supervised learning with max-margin graph cuts

Branislav Kveton, Michal Valko, Ali Rahimi, Ling Huang

Comments Published at AISTATS 2010 (13th International Conference on Artificial Intelligence and Statistics)