arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.24594 2026-03-26 cs.LG cs.NA math.NA stat.ML

Polynomial Speedup in Diffusion Models with the Multilevel Euler-Maruyama Method

Arthur Jacot

详情

英文摘要

We introduce the Multilevel Euler-Maruyama (ML-EM) method compute solutions of SDEs and ODEs using a range of approximators $f^1,\dots,f^k$ to the drift $f$ with increasing accuracy and computational cost, only requiring a few evaluations of the most accurate $f^k$ and many evaluations of the less costly $f^1,\dots,f^{k-1}$. If the drift lies in the so-called Harder than Monte Carlo (HTMC) regime, i.e. it requires $ε^{-γ}$ compute to be $ε$-approximated for some $γ>2$, then ML-EM $ε$-approximates the solution of the SDE with $ε^{-γ}$ compute, improving over the traditional EM rate of $ε^{-γ-1}$. In other terms it allows us to solve the SDE at the same cost as a single evaluation of the drift. In the context of diffusion models, the different levels $f^{1},\dots,f^{k}$ are obtained by training UNets of increasing sizes, and ML-EM allows us to perform sampling with the equivalent of a single evaluation of the largest UNet. Our numerical experiments confirm our theory: we obtain up to fourfold speedups for image generation on the CelebA dataset downscaled to 64x64, where we measure a $γ\approx2.5$. Given that this is a polynomial speedup, we expect even stronger speedups in practical applications which involve orders of magnitude larger networks.

URL PDF HTML ☆

赞 0 踩 0

2603.24584 2026-03-26 cs.CV cs.RO

TAG: Target-Agnostic Guidance for Stable Object-Centric Inference in Vision-Language-Action Models

Jiaying Zhou, Zhihao Zhan, Ruifeng Zhai, Qinhan Lyu, Hao Liu, Keze Wang, Liang Lin, Guangrun Wang

2603.24582 2026-03-26 cs.AI

The Stochastic Gap: A Markovian Framework for Pre-Deployment Reliability and Oversight-Cost Auditing in Agentic Artificial Intelligence

Biplab Pal, Santanu Bhattacharya

Comments 22 pages, 5 figures, submitted to Engineering Applications of Artificial Intelligence

2603.24581 2026-03-26 cs.CV cs.RO

Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving

Linbo Wang, Yupeng Zheng, Qiang Chen, Shiwei Li, Yichen Zhang, Zebin Xing, Qichao Zhang, Xiang Li, Deheng Qian, Pengxuan Yang, Yihang Dong, Ce Hao, Xiaoqing Ye, Junyu han, Yifeng Pan, Dongbin Zhao

2603.24580 2026-03-26 cs.CL cs.AI cs.CY cs.IR cs.LG

Retrieval Improvements Do Not Guarantee Better Answers: A Study of RAG for AI Policy QA

Saahil Mathur, Ryan David Rittner, Vedant Ajit Thakur, Daniel Stuart Schiff, Tunazzina Islam

2603.24579 2026-03-26 cs.CL

MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination

Zhuo Li, Yupeng Zhang, Pengyu Cheng, Jiajun Song, Mengyu Zhou, Hao Li, Shujie Hu, Yu Qin, Erchao Zhao, Xiaoxi Jiang, Guanjun Jiang

2603.24578 2026-03-26 cs.CV eess.IV

Vision-Language Models vs Human: Perceptual Image Quality Assessment

Imran Mehmood, Imad Ali Shah, Ming Ronnier Luo, Brian Deegan

2603.24575 2026-03-26 cs.CV cs.AI

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models

Qijia He, Xunmei Liu, Hammaad Memon, Ziang Li, Zixian Ma, Jaemin Cho, Jason Ren, Daniel S Weld, Ranjay Krishna

2603.24574 2026-03-26 cs.DS math.OC

Coordinating Spot and Contract Supply in Freight Marketplaces

Philip Kaminsky, Rachitesh Kumar, Roger Lederman

2603.24572 2026-03-26 cs.AI

Completeness of Unbounded Best-First Minimax and Descent Minimax

Quentin Cohen-Solal

2603.24571 2026-03-26 cs.CV

Towards Training-Free Scene Text Editing

Yubo Li, Xugong Qin, Peng Zhang, Hailun Lin, Gangyan Zeng, Kexin Zhang

Comments Accepted by CVPR 2026

2603.24570 2026-03-26 cs.CV cs.AI

Anti-I2V: Safeguarding your photos from malicious image-to-video generation

Duc Vu, Anh Nguyen, Chi Tran, Anh Tran

Comments Accepted to CVPR 2026 (Main Conference)

2603.24567 2026-03-26 stat.ML cs.LG

Trust Region Constrained Bayesian Optimization with Penalized Constraint Handling

Raju Chowdhury, Tanmay Sen, Prajamitra Bhuyan, Biswabrata Pradhan

2603.24566 2026-03-26 eess.SY cs.SY

Integral Control Barrier Functions with Input Delay: Prediction, Feasibility, and Robustness

Adam K. Kiss, Ersin Das, Tamas G. Molnar, Aaron D. Ames

2603.24564 2026-03-26 cs.CR cs.CY

Infrastructure for Valuable, Tradable, and Verifiable Agent Memory

Mengyuan Li, Lei Gao, Haoxuan Xu, Jiate Li, Potung Yu, Lingke Cheng, Yue Zhao, Murali Annavaram

2603.24560 2026-03-26 cs.SE

Boosting LLMs for Mutation Generation

Bo Wang, Ming Deng, Mingda Chen, Chengran Yang, Youfang Lin, Mark Harman, Mike Papadakis, Jie M. Zhang

Comments to be published in the collection of FSE 2026

2603.24559 2026-03-26 cs.NE cs.AI cs.MA

The Free-Market Algorithm: Self-Organizing Optimization for Open-Ended Complex Systems

Martin Jaraiz

Comments 26 pages, 3 figures, 2 tables, draft

详情

英文摘要

We introduce the Free-Market Algorithm (FMA), a novel metaheuristic inspired by free-market economics. Unlike Genetic Algorithms, Particle Swarm Optimization, and Simulated Annealing -- which require prescribed fitness functions and fixed search spaces -- FMA uses distributed supply-and-demand dynamics where fitness is emergent, the search space is open-ended, and solutions take the form of hierarchical pathway networks. Autonomous agents discover rules, trade goods, open and close firms, and compete for demand with no centralized controller. FMA operates through a three-layer architecture: a universal market mechanism (supply, demand, competition, selection), pluggable domain-specific behavioral rules, and domain-specific observation. The market mechanism is identical across applications; only the behavioral rules change. Validated in two unrelated domains. In prebiotic chemistry, starting from 900 bare atoms (C, H, O, N), FMA discovers all 12 feasible amino acid formulas, all 5 nucleobases, the formose sugar chain, and Krebs cycle intermediates in under 5 minutes on a laptop -- with up to 240 independent synthesis routes per product. In macroeconomic forecasting, reading a single input-output table with zero estimated parameters, FMA achieves Mean Absolute Error of 0.42 percentage points for non-crisis GDP prediction, comparable to professional forecasters, portable to 33 countries. Assembly Theory alignment shows that FMA provides the first explicit, tunable mechanism for the selection signatures described by Sharma et al. (Nature, 2023). The event-driven assembly dynamics resonate with foundational programs in physics -- causal set theory, relational quantum mechanics, constructor theory -- suggesting that Darwinian market dynamics may reflect a deeper organizational principle that lead to the unfolding of Nature itself.

URL PDF HTML ☆

赞 0 踩 0

2603.24558 2026-03-26 cs.CV cs.AI

LensWalk: Agentic Video Understanding by Planning How You See in Videos

Keliang Li, Yansong Li, Hongze Shen, Mengdi Liu, Hong Chang, Shiguang Shan

Comments To be published in CVPR 2026

2603.24556 2026-03-26 cs.IR cs.AI

Evaluating Chunking Strategies For Retrieval-Augmented Generation in Oil and Gas Enterprise Documents

Samuel Taiwo, Mohd Amaluddin Yusoff

Comments Presented at CCSEIT 2026. This version matches the published proceedings

2603.24552 2026-03-26 cs.CV

The role of spatial context and multitask learning in the detection of organic and conventional farming systems based on Sentinel-2 time series

Jan Hemmerling, Marcel Schwieder, Philippe Rufin, Leon-Friedrich Thomas, Mirela Tulbure, Patrick Hostert, Stefan Erasmi

详情

英文摘要

Organic farming is a key element in achieving more sustainable agriculture. For a better understanding of the development and impact of organic farming, comprehensive, spatially explicit information is needed. This study presents an approach for the discrimination of organic and conventional farming systems using intra-annual Sentinel-2 time series. In addition, it examines two factors influencing this discrimination: the joint learning of crop type information in a concurrent task and the role of spatial context. A Vision Transformer model based on the Temporo-Spatial Vision Transformer (TSViT) architecture was used to construct a classification model for the two farming systems. The model was extended for simultaneous learning of the crop type, creating a multitask learning setting. By varying the patch size presented to the model, we tested the influence of spatial context on the classification accuracy of both tasks. We show that discrimination between organic and conventional farming systems using multispectral remote sensing data is feasible. However, classification performance varies substantially across crop types. For several crops, such as winter rye, winter wheat, and winter oat, F1 scores of 0.8 or higher can be achieved. In contrast, other agricultural land use classes, such as permanent grassland, orchards, grapevines, and hops, cannot be reliably distinguished, with F1 scores for the organic management class of 0.4 or lower. Joint learning of farming system and crop type provides only limited additional benefits over single-task learning. In contrast, incorporating wider spatial context improves the performance of both farming system and crop type classification. Overall, we demonstrate that a classification of agricultural farming systems is possible in a diverse agricultural region using multispectral remote sensing data.

URL PDF HTML ☆

赞 0 踩 0

2603.24549 2026-03-26 cs.CL cs.AI cs.CV cs.SD

A Sociolinguistic Analysis of Automatic Speech Recognition Bias in Newcastle English

Dana Serditova, Kevin Tang

Comments 54 pages, 11 figures

2603.24546 2026-03-26 cs.IT math.IT

Optimal Multidimensional Convolutional Codes

Z. Abreu, J. Lieb, R. Pinto, R. Simoes

2603.24545 2026-03-26 math.ST cs.CC cs.DS math.PR stat.ML stat.TH

Detection of local geometry in random graphs: information-theoretic and computational limits

Jinho Bok, Shuangping Li, Sophie H. Yu

Comments 68 pages

2603.24543 2026-03-26 cs.CR cs.CL

Analysing the Safety Pitfalls of Steering Vectors

Yuxiao Li, Alina Fastowski, Efstratios Zaradoukas, Bardh Prenkaj, Gjergji Kasneci

2603.24542 2026-03-26 math.NA cs.NA

Two-level nonlinear Schwarz methods - a parallel implementation with application to nonlinear elasticity and incompressible flow problems

Kyrill Ho, Axel Klawonn, Martin Lanser

2603.24541 2026-03-26 cs.CV cs.AI

SEGAR: Selective Enhancement for Generative Augmented Reality

Fanjun Bu, Chenyang Yuan, Hiroshi Yasuda

2603.24540 2026-03-26 eess.SY cs.SY

A Modular Platooning and Vehicle Coordination Simulator for Research and Education

Kevin Jamsahar, Adrian Wiltz, Maria Charitidou, Dimos V. Dimarogonas

Comments 6 pages

2603.24539 2026-03-26 cs.CV cs.AI

CliPPER: Contextual Video-Language Pretraining on Long-form Intraoperative Surgical Procedures for Event Recognition

Florian Stilz, Vinkle Srivastav, Nassir Navab, Nicolas Padoy

2603.24535 2026-03-26 cs.CL cs.CY

Representation Learning to Study Temporal Dynamics in Tutorial Scaffolding

Conrad Borchers, Jiayi Zhang, Ashish Gurung

Comments Accepted as short paper to the 27th International Conference on Artificial Intelligence in Education (AIED 2026)

2603.24533 2026-03-26 cs.LG cs.AI cs.CV

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Zichuan Lin, Feiyu Liu, Yijun Yang, Jiafei Lyu, Yiming Gao, Yicheng Liu, Zhicong Lu, Yangbin Yu, Mingyu Yang, Junyou Li, Deheng Ye, Jie Jiang

Comments Code and models are available at https://github.com/ui-voyager/UI-Voyager