arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.23792 2026-03-26 cs.LG stat.ML

Manifold Generalization Provably Proceeds Memorization in Diffusion Models

Zebang Shen, Ya-Ping Hsieh, Niao He

Comments The first two authors contributed equally

详情

英文摘要

Diffusion models often generate novel samples even when the learned score is only \emph{coarse} -- a phenomenon not accounted for by the standard view of diffusion training as density estimation. In this paper, we show that, under the \emph{manifold hypothesis}, this behavior can instead be explained by coarse scores capturing the \emph{geometry} of the data while discarding the fine-scale distributional structure of the population measure~$μ_{\scriptscriptstyle\mathrm{data}}$. Concretely, whereas estimating the full data distribution $μ_{\scriptscriptstyle\mathrm{data}}$ supported on a $k$-dimensional manifold is known to require the classical minimax rate $\tilde{\mathcal{O}}(N^{-1/k})$, we prove that diffusion models trained with coarse scores can exploit the \emph{regularity of the manifold support} and attain a near-parametric rate toward a \emph{different} target distribution. This target distribution has density uniformly comparable to that of~$μ_{\scriptscriptstyle\mathrm{data}}$ throughout any $\tilde{\mathcal{O}}\bigl(N^{-β/(4k)}\bigr)$-neighborhood of the manifold, where $β$ denotes the manifold regularity. Our guarantees therefore depend only on the smoothness of the underlying support, and are especially favorable when the data density itself is irregular, for instance non-differentiable. In particular, when the manifold is sufficiently smooth, we obtain that \emph{generalization} -- formalized as the ability to generate novel, high-fidelity samples -- occurs at a statistical rate strictly faster than that required to estimate the full population distribution~$μ_{\scriptscriptstyle\mathrm{data}}$.

URL PDF HTML ☆

赞 0 踩 0

2603.23788 2026-03-26 cs.CV

Re-Prompting SAM 3 via Object Retrieval: 3rd of the 5th PVUW MOSE Track

Mingqi Gao, Sijie Li, Jungong Han

2603.23785 2026-03-26 cs.CV cs.LG

Retinal Disease Classification from Fundus Images using CNN Transfer Learning

Ali Akram

Comments 4 figures

2603.23784 2026-03-26 cs.LG

Latent Algorithmic Structure Precedes Grokking: A Mechanistic Study of ReLU MLPs on Modular Arithmetic

Anand Swaroop

Comments 9 pages, 5 figures

2603.23780 2026-03-26 cs.LG

Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters

Nan Cui, Wendy Hui Wang, Yue Ning

2603.23757 2026-03-26 cs.CV

Learning Cross-Joint Attention for Generalizable Video-Based Seizure Detection

Omar Zamzam, Takfarinas Medani, Chinmay Chinara, Richard Leahy

2603.23755 2026-03-26 cs.LG cs.AI

Self Paced Gaussian Contextual Reinforcement Learning

Mohsen Sahraei Ardakani, Rui Song

Comments 16 pages, 10 figures

2603.23754 2026-03-26 cs.CV

IJmond Industrial Smoke Segmentation Dataset

Yen-Chia Hsu, Despoina Touska

2603.23753 2026-03-26 cs.RO

Task-Space Singularity Avoidance for Control Affine Systems Using Control Barrier Functions

Kimia Forghani, Suraj Raval, Lamar Mair, Axel Krieger, Yancy Diaz-Mercado

2603.23749 2026-03-26 cs.AI

Efficient Benchmarking of AI Agents

Franck Ndzomga

Comments 22 pages, 7 figures, 5 tables

2603.23746 2026-03-26 cs.LG

Kronecker-Structured Nonparametric Spatiotemporal Point Processes

Zhitong Xu, Qiwei Yuan, Yinghao Chen, Yan Sun, Bin Shen, Shandian Zhe

2603.23742 2026-03-26 cs.CV

Detection and Classification of (Pre)Cancerous Cells in Pap Smears: An Ensemble Strategy for the RIVA Cervical Cytology Challenge

Lautaro Kogan, María Victoria Ríos

Comments Accepted for Poster Presentation at the RIVA Cervical Cytology Challenge, IEEE ISBI 2026. 4 pages, 2 figures

2603.23738 2026-03-26 cs.LG

BXRL: Behavior-Explainable Reinforcement Learning

Ram Rachum, Yotam Amitai, Yonatan Nakar, Reuth Mirsky, Cameron Allen

2603.23730 2026-03-26 cs.CV

An Adapter-free Fine-tuning Approach for Tuning 3D Foundation Models

Sneha Paul, Zachary Patterson, Nizar Bouguila

Comments Accepted at The Fifth International Conference on Pattern Recognition and Artificial Intelligence (ICPRAI 2026)

2603.23729 2026-03-26 cs.CV

Bi-CRCL: Bidirectional Conservative-Radical Complementary Learning with Pre-trained Foundation Models for Class-incremental Medical Image Analysis

Xinyao Wu, Zhe Xu, Cheng Chen, Jiawei Ma, Yefeng Zheng, Raymond Kai-yu Tong

Comments preprint; under review

2603.23725 2026-03-26 cs.RO

Form-Fitting, Large-Area Sensor Mounting for Obstacle Detection

Anna Soukhovei, Carson Kohlbrenner, Caleb Escobedo, Alexander Gholmieh, Alexander Dickhans, Alessandro Roncone

Comments Accepted at 2025 Humanoids Workshop on Advances in Contact-Rich Robotics: Rich Tactile-Based Physical Interaction [ConRich]

2603.23719 2026-03-26 cs.LG cs.AI

CDMT-EHR: A Continuous-Time Diffusion Framework for Generating Mixed-Type Time-Series Electronic Health Records

Shaonan Liu, Yuichiro Iwashita, Soichiro Nakako, Masakazu Iwamura, Koichi Kise

2603.23714 2026-03-26 cs.AI cs.CL

LLMs Do Not Grade Essays Like Humans

Jerin George Mathew, Sumayya Taher, Anindita Kundu, Denilson Barbosa

2603.23701 2026-03-26 cs.CL cs.AI

The Diminishing Returns of Early-Exit Decoding in Modern LLMs

Rui Wei, Rui Du, Hanfei Yu, Devesh Tiwari, Jian Li, Zhaozhuo Xu, Hao Wang

2603.23690 2026-03-26 cs.RO

ROSCell: A ROS2-Based Framework for Automated Formation and Orchestration of Multi-Robot Systems

Jiangtao Shuai, Marvin Carl May, Sonja Schimmler, Manfred Hauswirth

2603.23686 2026-03-26 cs.CV

AdvSplat: Adversarial Attacks on Feed-Forward Gaussian Splatting Models

Yiran Qiao, Yiren Lu, Yunlai Zhou, Rui Yang, Linlin Hou, Yu Yin, Jing Ma

2603.23678 2026-03-26 cs.CL cs.AI

PLACID: Privacy-preserving Large language models for Acronym Clinical Inference and Disambiguation

Manjushree B. Aithal, Ph. D., Alexander Kotz, James Mitchell, Ph. D

Comments 10 pages, 2 figures, Under review AMIA Symposium

2603.23676 2026-03-26 cs.AI cs.RO

Grounding Vision and Language to 3D Masks for Long-Horizon Box Rearrangement

Ashish Malik, Caleb Lowe, Aayam Shrestha, Stefan Lee, Fuxin Li, Alan Fern

2603.23669 2026-03-26 cs.CV cs.AI cs.LG

Estimating Individual Tree Height and Species from UAV Imagery

Jannik Endres, Etienne Laliberté, David Rolnick, Arthur Ouaknine

Comments Project page: https://RolnickLab.github.io/DINOvTree

2603.23667 2026-03-26 cs.SD cs.AI eess.AS

Echoes: A semantically-aligned music deepfake detection dataset

Octavian Pascu, Dan Oneata, Horia Cucu, Nicolas M. Muller

2603.23666 2026-03-26 cs.RO physics.app-ph

Quadrature Oscillation System for Coordinated Motion in Crawling Origami Robot

Sean Liu, Ankur Mehta, Wenzhong Yan

Comments 8 pages, 11 figures, Accepted to ICRA 2026

2603.23660 2026-03-26 cs.AI

GTO Wizard Benchmark

Marc-Antoine Provost, Nejc Ilenic, Christopher Solinas, Philippe Beardsell

2603.23659 2026-03-26 cs.CL cs.AI

Probing Ethical Framework Representations in Large Language Models: Structure, Entanglement, and Methodological Challenges

Weilun Xu, Alexander Rusnak, Frederic Kaplan

2603.23658 2026-03-26 cs.LG cs.NA math.NA math.OC

Boost Like a (Var)Pro: Trust-Region Gradient Boosting via Variable Projection

Abhijit Chowdhary, Elizabeth Newman, Deepanshu Verma

Comments 55 pages, 14 figures

2603.23654 2026-03-26 cs.CL

Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages

Badr M. Abdullah, Israel Abebe Azime, Atnafu Lambebo Tonja, Jesujoba O. Alabi, Abel Mulat Alemu, Eyob G. Hagos, Bontu Fufa Balcha, Mulubrhan A. Nerea, Debela Desalegn Yadeta, Dagnachew Mekonnen Marilign, Amanuel Temesgen Fentahun, Tadesse Kebede, Israel D. Gebru, Michael Melese Woldeyohannis, Walelign Tewabe Sewunetie, Bernd Möbius, Dietrich Klakow

Comments Preprint (under review)