arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2508.09346 2026-03-16 cs.RO

How Safe Will I Be Given What I Saw? Calibrated Prediction of Safety Chances for Image-Controlled Autonomy

Zhenjiang Mao, Mrinall Eashaan Umasudhan, Ivan Ruchkin

Comments arXiv admin note: text overlap with arXiv:2308.12252

详情

英文摘要

Autonomous robots that rely on deep neural network controllers pose critical challenges for safety prediction, especially under partial observability and distribution shift. Traditional model-based verification techniques are limited in scalability and require access to low-dimensional state models, while model-free methods often lack reliability guarantees. This paper addresses these limitations by introducing a framework for calibrated safety prediction in end-to-end vision-controlled systems, where neither the state-transition model nor the observation model is accessible. Building on the foundation of world models, we leverage variational autoencoders and recurrent predictors to forecast future latent trajectories from raw image sequences and estimate the probability of satisfying safety properties. We distinguish between monolithic and composite prediction pipelines and introduce a calibration mechanism to quantify prediction confidence. In long-horizon predictions from high-dimensional observations, the forecasted inputs to the safety evaluator can deviate significantly from the training distribution due to compounding prediction errors and changing environmental conditions, leading to miscalibrated risk estimates. To address this, we incorporate unsupervised domain adaptation to ensure robustness of safety evaluation under distribution shift in predictions without requiring manual labels. Our formulation provides theoretical calibration guarantees and supports practical evaluation across long prediction horizons. Experimental results on three benchmarks show that our UDA-equipped evaluators maintain high accuracy and substantially lower false positive rates under distribution shift. Similarly, world model-based composite predictors outperform their monolithic counterparts on long-horizon tasks, and our conformal calibration provides reliable statistical bounds.

URL PDF HTML ☆

赞 0 踩 0

2508.07370 2026-03-16 cs.LG

Intrinsic training dynamics of deep neural networks

Sibylle Marcotte, Gabriel Peyré, Rémi Gribonval

Comments Accepted at ICLR 2026

2508.01587 2026-03-16 cs.CV

Distilling the Past: Information-Dense and Style-Aware Replay for Lifelong Person Re-Identification

Mingyu Wang, Wei Jiang, Haojie Liu, Zhiyong Li, Q. M. Jonathan Wu

Comments 21 pages, 11 figures

2507.18059 2026-03-16 cs.AI cs.MA

Multi-Agent Guided Policy Optimization

Yueheng Li, Guangming Xie, Zongqing Lu

2507.17288 2026-03-16 cs.CL eess.AS

Triple X: A LLM-Based Multilingual Speech Recognition System for the INTERSPEECH2025 MLC-SLM Challenge

Miaomiao Gao, Xiaoxiao Xiang, Yiwen Guo

Comments Accepted By Interspeech 2025 MLC-SLM workshop

2507.06119 2026-03-16 cs.CV

Omni-Video: Democratizing Unified Video Understanding and Generation

Zhiyu Tan, Hao Yang, Luozheng Qin, Jia Gong, Mengping Yang, Hao Li

Comments Technical report, project page: https://howellyoung-s.github.io/OmniVideo_project/

2507.05914 2026-03-16 cs.LG

Accelerating Diffusion Model Training under Minimal Budgets: A Condensation-Based Perspective

Rui Huang, Shitong Shao, Zikai Zhou, Pukun Zhao, Hangyu Guo, Tian Ye, Lichen Bai, Shuo Yang, Zeke Xie

Comments CVPR 2026 camera-ready version. Introduces D2C, a framework for efficient diffusion model training

2507.03633 2026-03-16 cs.CV cs.AI cs.LG

From Video to EEG: Adapting Joint Embedding Predictive Architecture to Uncover Saptiotemporal Dynamics in Brain Signal Analysis

Amirabbas Hojjati, Lu Li, Ibrahim Hameed, Anis Yazidi, Pedro G. Lind, Rabindra Khadka

2506.18248 2026-03-16 cs.CV cs.AI

Improving Black-Box Generative Attacks via Generator Semantic Consistency

Jongoh Jeong, Hunmin Yang, Jaeseok Jeong, Kuk-Jin Yoon

Comments Accepted for publication at ICLR 2026

2506.16225 2026-03-16 cs.SD eess.AS

AeroGPT: Leveraging Large-Scale Audio Model for Aero-Engine Bearing Fault Diagnosis

Jiale Liu, Dandan Peng, Huan Wang, Chenyu Liu, Yan-Fu Li, Min Xie

2506.04586 2026-03-16 cs.CL cs.SD eess.AS

LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models Using in-the-wild Data

Wen Ding, Fan Qian

Comments Accepted by ICASSP 2026

2505.22954 2026-03-16 cs.AI

Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

Jenny Zhang, Shengran Hu, Cong Lu, Robert Lange, Jeff Clune

Comments Code at https://github.com/jennyzzt/dgm

详情

英文摘要

Today's AI systems have human-designed, fixed architectures and cannot autonomously and continuously improve themselves. The advance of AI could itself be automated. If done safely, that would accelerate AI development and allow us to reap its benefits much sooner. Meta-learning can automate the discovery of novel algorithms, but is limited by first-order improvements and the human design of a suitable search space. The Gödel machine proposed a theoretical alternative: a self-improving AI that repeatedly modifies itself in a provably beneficial manner. Unfortunately, proving that most changes are net beneficial is impossible in practice. We introduce the Darwin Gödel Machine (DGM), a self-improving system that iteratively modifies its own code (thereby also improving its ability to modify its own codebase) and empirically validates each change using coding benchmarks. Inspired by Darwinian evolution and open-endedness research, the DGM maintains an archive of generated coding agents. It grows the archive by sampling an agent from it and using a foundation model to create a new, interesting, version of the sampled agent. This open-ended exploration forms a growing tree of diverse, high-quality agents and allows the parallel exploration of many different paths through the search space. Empirically, the DGM automatically improves its coding capabilities (e.g., better code editing tools, long-context window management, peer-review mechanisms), increasing performance on SWE-bench from 20.0% to 50.0%, and on Polyglot from 14.2% to 30.7%. Furthermore, the DGM significantly outperforms baselines without self-improvement or open-ended exploration. All experiments were done with safety precautions (e.g., sandboxing, human oversight). The DGM is a significant step toward self-improving AI, capable of gathering its own stepping stones along paths that unfold into endless innovation.

URL PDF HTML ☆

赞 0 踩 0

2505.20343 2026-03-16 cs.CL cs.AI

Do LLMs have a Gender (Entropy) Bias?

Sonal Prabhune, Balaji Padmanabhan, Kaushik Dutta

Comments 18 pages, 4 figures

详情

Journal ref: IEEE ICDM 2025 Workshops Proceedings

英文摘要

We investigate the existence and persistence of a specific type of gender bias in some of the popular LLMs and contribute a new benchmark dataset, RealWorldQuestioning (released on HuggingFace ), developed from real-world questions across four key domains in business and health contexts: education, jobs, personal financial management, and general health. We define and study entropy bias, which we define as a discrepancy in the amount of information generated by an LLM in response to real questions users have asked. We tested this using four different LLMs and evaluated the generated responses both qualitatively and quantitatively by using ChatGPT-4o (as "LLM-as-judge"). Our analyses (metric-based comparisons and "LLM-as-judge" evaluation) suggest that there is no significant bias in LLM responses for men and women at a category level. However, at a finer granularity (the individual question level), there are substantial differences in LLM responses for men and women in the majority of cases, which "cancel" each other out often due to some responses being better for males and vice versa. This is still a concern since typical users of these tools often ask a specific question (only) as opposed to several varied ones in each of these common yet important areas of life. We suggest a simple debiasing approach that iteratively merges the responses for the two genders to produce a final result. Our approach demonstrates that a simple, prompt-based debiasing strategy can effectively debias LLM outputs, thus producing responses with higher information content than both gendered variants in 78% of the cases, and consistently achieving a balanced integration in the remaining cases.

URL PDF HTML ☆

赞 0 踩 0

2505.20133 2026-03-16 cs.CL cs.LG

Token Distillation: Attention-aware Input Embeddings For New Tokens

Konstantin Dobler, Desmond Elliott, Gerard de Melo

Comments ICLR 2026 camera-ready

2505.17815 2026-03-16 cs.AI

Evaluation Faking: Unveiling Observer Effects in Safety Evaluation of Frontier AI Systems

Yihe Fan, Wenqi Zhang, Xudong Pan, Min Yang

2505.17476 2026-03-16 cs.CV

The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts

Yuchen Zhang, Yaxiong Wang, Yujiao Wu, Lianwei Wu, Li Zhu, Zhedong Zheng

Comments Accepted to CVPR 2026 main track

2505.16736 2026-03-16 cs.LG

Backward Oversmoothing: why is it hard to train deep Graph Neural Networks?

Nicolas Keriven

2505.15418 2026-03-16 cs.LG cs.AI cs.RO

Guided Policy Optimization under Partial Observability

Yueheng Li, Guangming Xie, Zongqing Lu

2505.07920 2026-03-16 cs.CL cs.AI cs.LG

Re2: A Consistency-ensured Dataset for Full-stage Peer Review and Multi-turn Rebuttal Discussions

Daoze Zhang, Zhijian Bao, Sihang Du, Zhiyi Zhao, Kuangling Zhang, Dezheng Bao, Yang Yang

Comments 2 figures, 5 tables

2505.04984 2026-03-16 cs.CL

Rethinking the Relationship between the Power Law and Hierarchical Structures

Kai Nakaishi, Ryo Yoshida, Kohei Kajikawa, Koji Hukushima, Yohei Oseki

Comments Accepted for publication in Transactions of the Association for Computational Linguistics (TACL). This is a pre-MIT Press publication version

2505.00818 2026-03-16 cs.LG cs.SY eess.SY math.PR

Dual Filter: A Transformer-like Inference Architecture for Hidden Markov Models

Heng-Sheng Chang, Prashant G. Mehta

Comments 50 pages, 9 figures

2503.04979 2026-03-16 cs.CV

HyDA: Hypernetworks for Test Time Domain Adaptation in Medical Imaging Analysis

Doron Serebro, Tammy Riklin-Raviv

Comments submitted to MICCAI 2025

2503.03222 2026-03-16 cs.CV

Mocap-2-to-3: Multi-view Lifting for Monocular Motion Recovery with 2D Pretraining

Zhumei Wang, Zechen Hu, Ruoxi Guo, Huaijin Pi, Ziyong Feng, Liang Zhang, Mingtao Pei, Siyuan Huang

Comments Project page: https://wangzhumei.github.io/mocap-2-to-3/

2503.01212 2026-03-16 cs.CV cs.LG

Understanding Dataset Distillation via Spectral Filtering

Deyu Bo, Songhua Liu, Xinchao Wang

Comments Accepted by ICLR 2026. Code is available at https://github.com/bdy9527/UniDD

2502.07490 2026-03-16 cs.CL cs.LG

Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More

Xialie Zhuang, Zhikai Jia, Jianjin Li, Zhenyu Zhang, Li Shen, Zheng Cao, Shiwei Liu

Comments 17 pages,7 figures

2412.13852 2026-03-16 cs.LG physics.comp-ph

RadField3D: A Data Generator and Data Format for Deep Learning in Radiation-Protection Dosimetry for Medical Applications

Felix Lehner, Pasquale Lombardo, Susana Castillo, Oliver Hupe, Marcus Magnor

2412.00547 2026-03-16 cs.CV cs.AI

Motion Dreamer: Boundary Conditional Motion Reasoning for Physically Coherent Video Generation

Tianshuo Xu, Zhifei Chen, Leyi Wu, Hao Lu, Yuying Chen, Lihui Jiang, Bingbing Liu, Yingcong Chen

Comments The authors have decided to withdraw this article due to the following reasons identified after publication: Experimental Errors: Significant inaccuracies were discovered in the experimental results concerning segmentation and depth estimation. Authorship Disputes: In addition to the technical issues, there are unresolved disagreements regarding the author sequence and contributions

2410.18613 2026-03-16 cs.LG cs.CV stat.ML

Rethinking Attention: Polynomial Alternatives to Softmax in Transformers

Hemanth Saratchandran, Jianqiao Zheng, Yiping Ji, Wenbo Zhang, Simon Lucey

2410.10234 2026-03-16 cs.CV

LADMIM: Logical Anomaly Detection with Masked Image Modeling in Discrete Latent Space

Shunsuke Sakai, Tatushito Hasegawa, Makoto Koshino

Comments Accepted at TMLR2025. Code is available at https://github.com/SkyShunsuke/LADMIM

2410.01647 2026-03-16 cs.CV

3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for Indoor 3D Object Detection

Yang Cao, Yuanliang Ju, Dan Xu

Comments The code and models will be made publicly available upon acceptance at: \href{https://github.com/yangcaoai/3DGS-DET}{https://github.com/yangcaoai/3DGS-DET}