arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2509.15517 2026-04-02 cs.LG stat.AP

A Survey and Comparative Evaluation of Intrinsic Dimension Estimators under the Manifold Hypothesis

Zelong Bi, Pierre Lafaye de Micheaux

详情

英文摘要

The manifold hypothesis suggests that high-dimensional data often lie on or near a low-dimensional manifold. Estimating the dimension of this manifold is essential for leveraging its structure, yet existing work on dimension estimation is fragmented and lacks systematic evaluation. This article provides a comprehensive survey for both researchers and practitioners. We review often-overlooked theoretical foundations and present eight representative estimators. Through controlled experiments, we analyze how individual factors, such as noise, curvature, and sample size, affect performance. We also compare the estimators on diverse synthetic and real-world datasets, introducing a principled approach to dataset-specific hyperparameter tuning. Our results offer practical guidance for estimator selection and yield insights that will inform future estimator design.

URL PDF HTML ☆

赞 0 踩 0

2508.09281 2026-04-02 cs.LG

Pattern-based Knowledge Component Extraction from Student Code Using Representation Learning

Muntasir Hoq, Griffin Pitts, Tirth Bhatt, Aum Pandya, Andrew Lan, Peter Brusilovsky, Bita Akram

Comments In Proceedings of the 19th International Conference on Educational Data Mining (EDM), 2026

2506.13841 2026-04-02 cs.AI

LocationReasoner: Evaluating LLMs on Real-World Site Selection Reasoning

Miho Koda, Yu Zheng, Ruixian Ma, Mingyang Sun, Devesh Pansare, Fabio Duarte, Paolo Santi

Comments ICLR 2026 Workshop on Efficient Spatial Reasoning

2505.10913 2026-04-02 cs.LG

Automated Identification of Logical Errors in Programs: Advancing Scalable Analysis of Student Misconceptions

Muntasir Hoq, Ananya Rao, Reisha Jaishankar, Krish Piryani, Nithya Janapati, Jessica Vandenberg, Bradford Mott, Narges Norouzi, James Lester, Bita Akram

Comments Accepted for publication at the 18th International Conference on Educational Data Mining (EDM), 2025

2504.13129 2026-04-02 cs.CV cs.AI cs.LG

Science-T2I: Addressing Scientific Illusions in Image Synthesis

Jialuo Li, Wenhao Chai, Xingyu Fu, Haiyang Xu, Saining Xie

Comments Accepted to CVPR 2025. Code, docs, weight, benchmark and training data are all avaliable at https://jialuo-li.github.io/Science-T2I-Web

2503.19851 2026-04-02 cs.CV

Towards Online Multi-Modal Social Interaction Understanding

Xinpeng Li, Shijian Deng, Bolin Lai, Weiguo Pian, James M. Rehg, Yapeng Tian

Comments Accepted to Transactions on Machine Learning Research (TMLR). Project page: https://sampson-lee.github.io/online-mmsi-project-page

2503.02976 2026-04-02 cs.AI

Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment

Matthew DosSantos DiSorbo, Harang Ju, Sinan Aral

2411.08687 2026-04-02 cs.LG

Diagnosing Neural Convergence with Topological Alignment Spectra

Tiago F. Tavares, Fabio Ayres, Paris Smaragdis

2406.08097 2026-04-02 cs.LG stat.AP stat.ME

Inductive Global and Local Manifold Approximation and Projection

Jungeum Kim, Xiao Wang

Comments Accepted at TMLR (2024)

2306.14052 2026-04-02 cs.LG cs.AR cs.DC

A Survey on Graph Neural Network Acceleration: Algorithms, Systems, and Customized Hardware

Shichang Zhang, Atefeh Sohrabizadeh, Cheng Wan, Zijie Huang, Ziniu Hu, Yewen Wang, Yingyan, Lin, Jason Cong, Yizhou Sun

2604.01181 2026-04-02 cs.HC cs.CL cs.CV

True (VIS) Lies: Analyzing How Generative AI Recognizes Intentionality, Rhetoric, and Misleadingness in Visualization Lies

Graziano Blasilli, Marco Angelini

详情

英文摘要

This study investigates the ability of multimodal Large Language Models (LLMs) to identify and interpret misleading visualizations, and recognize these observations along with their underlying causes and potential intentionality. Our analysis leverages concepts from visualization rhetoric and a newly developed taxonomy of authorial intents as explanatory lenses. We formulated three research questions and addressed them experimentally using a dataset of 2,336 COVID-19-related tweets, half of which contain misleading visualizations, and supplemented it with real-world examples of perceptual, cognitive, and conceptual errors drawn from VisLies, the IEEE VIS community event dedicated to showcasing deceptive and misleading visualizations. To ensure broad coverage of the current LLM landscape, we evaluated 16 state-of-the-art models. Among them, 15 are open-weight models, spanning a wide range of model sizes, architectural families, and reasoning capabilities. The selection comprises small models, namely Nemotron-Nano-V2-VL (12B parameters), Mistral-Small-3.2 (24B), DeepSeek-VL2 (27B), Gemma3 (27B), and GTA1 (32B); medium-sized models, namely Qianfan-VL (70B), Molmo (72B), GLM-4.5V (108B), LLaVA-NeXT (110B), and Pixtral-Large (124B); and large models, namely Qwen3-VL (235B), InternVL3.5 (241B), Step3 (321B), Llama-4-Maverick (400B), and Kimi-K2.5 (1000B). In addition, we employed OpenAI GPT-5.4, a frontier proprietary model. To establish a human perspective on these tasks, we also conducted a user study with visualization experts to assess how people perceive rhetorical techniques and the authorial intentions behind the same misleading visualizations. This allows comparison between model and expert behavior, revealing similarities and differences that provide insights into where LLMs align with human judgment and where they diverge.

URL PDF HTML ☆

赞 0 踩 0

2604.01173 2026-04-02 eess.SY cs.LG cs.SY math.OC

Safe learning-based control via function-based uncertainty quantification

Abdullah Tokmak, Toni Karvonen, Thomas B. Schön, Dominik Baumann

Comments Under review for CDC 2026

2604.01167 2026-04-02 eess.IV cs.AI cs.CV

AdaLoRA-QAT: Adaptive Low-Rank and Quantization-Aware Segmentation

Prantik Deb, Srimanth Dhondy, N. Ramakrishna, Anu Kapoor, Raju S. Bapi, Tapabrata Chakraborti

Comments Accepted to ISBI 2026(Oral Presentation)

2604.01106 2026-04-02 physics.optics cs.LG

Inverse Design of Optical Multilayer Thin Films using Robust Masked Diffusion Models

Jonas Schaible, Asena Karolin Özdemir, Charlotte Debus, Sven Burger, Achim Streit, Christiane Becker, Klaus Jäger, Markus Götz

Comments 24 pages, 14 Figures

2604.01052 2026-04-02 cs.CR cs.AI

VibeGuard: A Security Gate Framework for AI-Generated Code

Ying Xie

2604.01049 2026-04-02 cs.NI cs.AI

Adversarial Attacks in AI-Driven RAN Slicing: SLA Violations and Recovery

Deemah H. Tashman, Soumaya Cherkaoui

2604.01036 2026-04-02 cs.IR cs.AI cs.CY

Aligning Recommendations with User Popularity Preferences

Mona Schirmer, Anton Thielmann, Pola Schwöbel, Thomas Martynec, Giuseppe Di Benedetto, Ben London, Yannik Stein

Comments Accepted at FAccT 2026

2604.01029 2026-04-02 cs.SE cs.AI cs.CL

Revision or Re-Solving? Decomposing Second-Pass Gains in Multi-LLM Pipelines

Jingjie Ning, Xueqi Li, Chengyu Yu

2604.01020 2026-04-02 cs.MA cs.AI

OrgAgent: Organize Your Multi-Agent System like a Company

Yiru Wang, Xinyue Shen, Yaohui Han, Michael Backes, Pin-Yu Chen, Tsung-Yi Ho

2604.01014 2026-04-02 cs.CR cs.CV

AutoMIA: Improved Baselines for Membership Inference Attack via Agentic Self-Exploration

Ruhao Liu, Weiqi Huang, Qi Li, Xinchao Wang

2604.00987 2026-04-02 stat.ML cs.AI cs.LG

Bridging Structured Knowledge and Data: A Unified Framework with Finance Applications

Yi Cao, Zexun Chen, Lin William Cong, Heqing Shi

2604.00917 2026-04-02 cs.SE cs.AI cs.LG

Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time

Razvan Mihai Popescu, David Gros, Andrei Botocan, Rahul Pandita, Prem Devanbu, Maliheh Izadi

Comments MSR 2026 Technical Track

2604.00868 2026-04-02 cs.DB cs.LG

Accurate and Scalable Matrix Mechanisms via Divide and Conquer

Guanlin He, Yingtai Xiao, Jiamu Bai, Xin Gu, Zeyu Ding, Wenpeng Yin, Daniel Kifer

Comments 17 pages

2604.00811 2026-04-02 stat.ML cs.LG stat.ME

Deconfounding Scores and Representation Learning for Causal Effect Estimation with Weak Overlap

Oscar Clivio, Alexander D'Amour, Alexander Franks, David Bruns-Smith, Chris Holmes, Avi Feller

Comments To appear at AISTATS 2026

2604.00717 2026-04-02 cs.MA cs.AI

GRASP: Gradient Realignment via Active Shared Perception for Multi-Agent Collaborative Optimization

Sihan Zhou, Tiantian He, Yifan Lu, Yaqing Hou, Yew-Soon Ong

2604.00704 2026-04-02 cs.CR cs.AI cs.SE

AutoEG: Exploiting Known Third-Party Vulnerabilities in Black-Box Web Applications

Ruozhao Yang, Mingfei Cheng, Gelei Deng, Junjie Wang, Tianwei Zhang, Xiaofei Xie

Comments 21 pages, 18 figures

2604.00697 2026-04-02 stat.ML cs.LG

Inverse-Free Sparse Variational Gaussian Processes

Stefano Cortinovis, Laurence Aitchison, Stefanos Eleftheriadis, Mark van der Wilk

Comments Accepted to AISTATS 2026. 20 pages, 3 figures, 2 tables

2604.00694 2026-04-02 cs.ET cs.AI

Internal APIs Are All You Need: Shadow APIs, Shared Discovery, and the Case Against Browser-First Agent Architectures

Lewis Tham, Nicholas Mac Gregor Garcia, Jungpil Hahn

Comments 17 pages, 2 figures, 5 tables

2604.00675 2026-04-02 physics.comp-ph cs.AI cs.CE

Procela: Epistemic Governance in Mechanistic Simulations Under Structural Uncertainty

Kinson Vernet

2604.00660 2026-04-02 cs.DB cs.AI

Streaming Model Cascades for Semantic SQL

Paweł Liskowski, Kyle Schmaus