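arXivDaily daily academic digest: synchronized with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and other fields.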

2603.26940 2026-03-31 stat.ML cs.LG math.PR

Static and Dynamic Approaches to Computing Barycenters of Probability Measures on Graphs

David Gentile, James M. Murphy

Comments 31 pages, 17 figures, 1 table

Details

Abstract

The optimal transportation problem defines a geometry of probability measures which leads to a definition for weighted averages (barycenters) of measures, finding application in the machine learning and computer vision communities as a signal processing tool. Here, we implement a barycentric coding model for measures which are supported on a graph, a context in which the classical optimal transport geometry becomes degenerate, by leveraging a Riemannian structure on the simplex induced by a dynamic formulation of the optimal transport problem. We approximate the exponential mapping associated to the Riemannian structure, as well as its inverse, by utilizing past approaches which compute action minimizing curves in order to numerically approximate transport distances for measures supported on discrete spaces. Intrinsic gradient descent is then used to synthesize barycenters, wherein gradients of a variance functional are computed by approximating geodesic curves between the current iterate and the reference measures; iterates are then pushed forward via a discretization of the continuity equation. Analysis of measures with respect to a given dictionary of references is performed by solving a quadratic program formed by computing geodesics between target and reference measures. We compare our novel approach to one based on entropic regularization of the static formulation of the optimal transport problem where the graph structure is encoded via graph distance functions, we present numerical experiments validating our approach, and we conclude that intrinsic gradient descent on the probability simplex provides a coherent framework for the synthesis and analysis of measures supported on graphs.
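The static baseline mentioned in the abstract (entropic regularization with the graph encoded through graph distances) can be illustrated with a small sketch. This is not the authors' code: it assumes a connected, unweighted graph, a squared hop-distance cost, and the standard iterative Bregman projection scheme for entropic barycenters; the function names, `eps`, and the 5-node path graph are all hypothetical choices for illustration.

```python
import math
from collections import deque

def graph_distances(adj):
    """All-pairs hop distances by breadth-first search (graph assumed connected)."""
    n = len(adj)
    dist = [[math.inf] * n for _ in range(n)]
    for s in range(n):
        dist[s][s] = 0
        queue = deque([s])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if dist[s][v] == math.inf:
                    dist[s][v] = dist[s][u] + 1
                    queue.append(v)
    return dist

def entropic_barycenter(measures, weights, adj, eps=1.0, iters=100):
    """Entropic barycenter via iterative Bregman projections (illustrative sketch)."""
    n = len(adj)
    dist = graph_distances(adj)
    # Gibbs kernel built from the squared graph distance (an assumed cost).
    K = [[math.exp(-dist[i][j] ** 2 / eps) for j in range(n)] for i in range(n)]

    def matvec(x):
        return [sum(K[i][j] * x[j] for j in range(n)) for i in range(n)]

    v = [[1.0] * n for _ in measures]
    b = [1.0 / n] * n
    for _ in range(iters):
        Ku = []
        for k, p in enumerate(measures):
            Kv = matvec(v[k])
            u = [p[i] / Kv[i] for i in range(n)]
            Ku.append(matvec(u))  # K is symmetric, so K^T u equals K u
        # Barycenter is the weighted geometric mean of the projected marginals.
        b = [math.prod(Ku[k][i] ** weights[k] for k in range(len(measures)))
             for i in range(n)]
        v = [[b[i] / Ku[k][i] for i in range(n)] for k in range(len(measures))]
    return b

# Hypothetical example: two point masses at the ends of a 5-node path graph.
path = [[1], [0, 2], [1, 3], [2, 4], [3]]
mu_a = [1.0, 0.0, 0.0, 0.0, 0.0]
mu_b = [0.0, 0.0, 0.0, 0.0, 1.0]
bary = entropic_barycenter([mu_a, mu_b], [0.5, 0.5], path)
print([round(x, 4) for x in bary])
```

With equal weights the mass concentrates around the middle node, blurred by the entropic term; shrinking `eps` sharpens the peak at the cost of a less well-conditioned kernel.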


2603.13846 2026-03-31 cs.HC cs.AI

Is Seeing Believing? Evaluating Human Sensitivity to Synthetic Video

David Wegmann, Emil Stevnsborg, Søren Knudsen, Luca Rossi, Aske Mottelson

2603.13294 2026-03-31 cs.CY cs.AI

Real-World AI Evaluation: How FRAME Generates Systematic Evidence to Resolve the Decision-Maker's Dilemma

Reva Schwartz, Gabriella Waters

Comments 19 pages, 4 tables, 5 figures

2602.14477 2026-03-31 cs.HC cs.AI cs.CY cs.SI

When AI Agents Teach Each Other: Discourse Patterns Resembling Peer Learning in the Moltbook Community

Eason Chen, Ce Guan, A Elshafiey, Zhonghao Zhao, Joshua Zekeri, Afeez Edeifo Shaibu, Emmanuel Osadebe Prince

Comments 7 pages, 1 figure. Revised version addressing reviewer feedback: added statistical inference, human baselines, redefined design principles as hypotheses, clarified anti-anthropomorphization stance

2602.10655 2026-03-31 cs.SE cs.RO

Assessing Vision-Language Models for Perception in Autonomous Underwater Robotic Software

Muhammad Yousaf, Aitor Arrieta, Shaukat Ali, Paolo Arcaini, Shuai Wang

Comments 16 pages, 5 figures

2602.08482 2026-03-31 cs.DB cs.AI

CLEAR: A Knowledge-Centric Vessel Trajectory Analysis Platform

Hengyu Liu, Tianyi Li, Haoyu Wang, Kristian Torp, Yushuai Li, Tiancheng Zhang, Torben Bach Pedersen, Christian S. Jensen

Comments 4 pages, 5 figures. Accepted at SIGMOD 2026 Demo Track

2601.01331 2026-03-31 cs.CY cs.CL cs.LG

AppellateGen: A Benchmark for Appellate Legal Judgment Generation

Hongkun Yang, Lionel Z. Wang, Wei Fan, Yiran Hu, Lixu Wang, Chenyu Liu, Yu Zeng, Shenghong Fu, Lei Gong, Zhengxin Zhang, Haoyang Li, Jiexin Zheng, Xin Xu

Comments 15 pages, 4 figures, 3 tables

2512.04653 2026-03-31 cs.MA cs.AI cs.LG

A Semi Centralized Training Decentralized Execution Architecture for Multi Agent Deep Reinforcement Learning in Traffic Signal Control

Arash Rezaali, Pouria Yazdani, Monireh Abdoos

Comments Co-first authors: Arash Rezaali and Pouria Yazdani

2511.18151 2026-03-31 cs.DC cs.AR cs.CV cs.LG cs.NI

AVERY: Intent-Driven Adaptive VLM Split Computing via Embodied Self-Awareness for Efficient Disaster Response Systems

Rajat Bhattacharjya, Sing-Yao Wu, Hyunwoo Oh, Chaewon Nam, Suyeon Koo, Mohsen Imani, Elaheh Bozorgzadeh, Nikil Dutt

Comments Paper is currently under review. Authors' version posted for personal use and not for redistribution. Previous version of the preprint was titled: 'AVERY: Adaptive VLM Split Computing through Embodied Self-Awareness for Efficient Disaster Response Systems'

Details

Abstract

Unmanned Aerial Vehicles (UAVs) in disaster response require complex, queryable intelligence that onboard CNNs cannot provide. While Vision-Language Models (VLMs) offer this semantic reasoning, their high resource demands make on-device deployment infeasible, and naive cloud offloading fails under the low-bandwidth, unstable networks endemic to disaster zones. We present AVERY, an intent-driven adaptive split computing framework for efficient VLM deployment on resource-constrained platforms. AVERY is motivated by the observation that operator intent must be treated as a first-class system objective, since missions such as broad situational monitoring and precise, spatially grounded investigation require different semantic products, latency targets, and resource allocations. To reflect this, AVERY advances split computing beyond traditional depth-wise partitioning through a functional, cognitive-inspired dual-stream split: a high-frequency, low-resolution Context stream for real-time awareness, and a low-frequency, high-fidelity Insight stream for deep analysis. This design enables a hierarchical split strategy: computation is first separated by function, then partitioned depth-wise across edge and cloud when the Insight stream is required. A lightweight, self-aware onboard controller monitors network conditions and operator intent to select from pre-trained compression models, navigating the accuracy-throughput trade-off at runtime. Evaluated using LISA-7B in an edge-cloud setting under fluctuating network conditions, AVERY achieves 11.2% higher accuracy than raw image compression, 93.98% lower energy consumption than full-edge execution, and average accuracy within 0.75% of the static High-Accuracy baseline during dynamic adaptation. Overall, AVERY enhances mission efficiency and enables real-time, queryable intelligence in dynamic disaster environments.


2511.07014 2026-03-31 cs.CE cs.AI econ.EM q-fin.PM

Diffolio: A Diffusion Model for Multivariate Probabilistic Financial Time-Series Forecasting and Portfolio Construction

So-Yoon Cho, Jin-Young Kim, Kayoung Ban, Hyeng Keun Koo, Hyun-Gyoon Kim

Comments 41 pages, 11 figures. Replacement to match the version accepted for publication in Information Fusion (Vol. 133, 104286, 2026). Significant updates have been made from the initial draft to reflect the final accepted manuscript (AAM)

Details

DOI: 10.1016/j.inffus.2026.104286
Journal ref: Information Fusion, Vol. 133, 104286 (2026)

Abstract

Probabilistic forecasting is crucial in multivariate financial time-series for constructing efficient portfolios that account for complex cross-sectional dependencies. In this paper, we propose Diffolio, a diffusion model designed for multivariate financial time-series forecasting and portfolio construction. Diffolio employs a denoising network with a hierarchical attention architecture, comprising both asset-level and market-level layers. Furthermore, to better reflect cross-sectional correlations, we introduce a correlation-guided regularizer informed by a stable estimate of the target correlation matrix. This structure effectively extracts salient features not only from historical returns but also from asset-specific and systematic covariates, significantly enhancing the performance of forecasts and portfolios. Experimental results on the daily excess returns of 12 industry portfolios show that Diffolio outperforms various probabilistic forecasting baselines in multivariate forecasting accuracy and portfolio performance. Moreover, in portfolio experiments, portfolios constructed from Diffolio's forecasts show consistently robust performance, thereby outperforming those from benchmarks by achieving higher Sharpe ratios for the mean-variance tangency portfolio and higher certainty equivalents for the growth-optimal portfolio. These results demonstrate the superiority of our proposed Diffolio in terms of not only statistical accuracy but also economic significance.
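The mean-variance tangency portfolio and its Sharpe ratio, used above as an evaluation criterion, can be sketched in a few lines. This is a generic textbook computation, not Diffolio itself: the three-asset excess-return vector and covariance matrix are made-up numbers, and in the paper's setting they would presumably come from the model's forecasts.

```python
import math

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def tangency_weights(mu, cov):
    """Weights proportional to inverse(cov) @ mu, normalized to sum to one."""
    raw = solve(cov, mu)
    total = sum(raw)
    return [w / total for w in raw]

def sharpe(weights, mu, cov):
    """Sharpe ratio of a portfolio, with mu given as excess returns."""
    n = len(mu)
    mean = sum(w * m for w, m in zip(weights, mu))
    var = sum(weights[i] * cov[i][j] * weights[j]
              for i in range(n) for j in range(n))
    return mean / math.sqrt(var)

# Hypothetical forecast: expected excess returns and covariance for 3 assets.
mu = [0.08, 0.05, 0.03]
cov = [[0.04, 0.006, 0.0],
       [0.006, 0.0225, 0.002],
       [0.0, 0.002, 0.01]]

w_tan = tangency_weights(mu, cov)
w_eq = [1.0 / 3.0] * 3
print("tangency weights:", [round(w, 3) for w in w_tan])
print("Sharpe (tangency):", round(sharpe(w_tan, mu, cov), 3))
print("Sharpe (equal weight):", round(sharpe(w_eq, mu, cov), 3))
```

For the inputs it is given, the tangency portfolio attains the highest Sharpe ratio of any fully invested portfolio, so differences in realized Sharpe ratio between methods reflect the quality of the forecasted mean and covariance.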


2510.19372 2026-03-31 stat.ML cs.LG

On the Hardness of Reinforcement Learning with Transition Look-Ahead

Corentin Pla, Hugo Richard, Marc Abeille, Nadav Merlis, Vianney Perchet

2510.15681 2026-03-31 cs.LO cs.AI

ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings

Prithwish Jana, Kaan Kale, Ahmet Ege Tanriverdi, Cruise Song, Sriram Vishwanath, Vijay Ganesh

Comments Published as a conference paper at the 14th International Conference on Learning Representations (ICLR 2026), Rio de Janeiro, Brazil, April 23-27, 2026

2510.05145 2026-03-31 cs.DC cs.AI cs.MA

Efficient Tree-Structured Deep Research with Adaptive Resource Allocation

Lunyiu Nie, Nedim Lipka, Ryan A. Rossi, Swarat Chaudhuri

Comments ICLR 2026 Workshop on Agents in the Wild (Spotlight)

2509.25311 2026-03-31 hep-th cs.LG physics.comp-ph

Aspects of holographic entanglement using physics-informed-neural-networks

Anirudh Deb, Yaman Sanghavi

Comments 19 pages, 14 figures. v2: minor corrections, references added, revised figure captions for clarity

2509.22059 2026-03-31 hep-ph cs.LG hep-ex

Stable and Interpretable Jet Physics with IRC-Safe Equivariant Feature Extraction

Partha Konar, Vishal S. Ngairangbam, Michael Spannowsky, Deepanshu Srivastava

Comments 30 pages, 3 tables, 7 figures

2509.13688 2026-03-31 cs.GR cs.AI

CraftMesh: High-Fidelity Generative Mesh Manipulation via Poisson Seamless Fusion

James Jincheng, Yuxiao Wu, Youcheng Cai, Ligang Liu

2508.01321 2026-03-31 stat.ML cs.LG

Flow IV: Counterfactual Inference In Nonseparable Outcome Models Using Instrumental Variables

Marc Braun, Jose M. Peña, Adel Daoud

2507.06876 2026-03-31 cs.CY cs.AI

Winning and losing with Artificial Intelligence: What public discourse about ChatGPT tells us about how societies make sense of technological change

Adrian Rauchfleisch, Joshua Philip Suarez, Nikka Marie Sales, Andreas Jungherr

Details

DOI: 10.1016/j.tele.2025.102344
Journal ref: Telematics and Informatics, 103, 102344 (2025)

Abstract

Public product launches in Artificial Intelligence can serve as focusing events for collective attention, surfacing how societies react to technological change. Social media provide a window into the sensemaking around these events, surfacing hopes and fears and showing who chooses to engage in the discourse and when. We demonstrate that public sensemaking about AI is shaped by economic interests and cultural values of those involved. We analyze 3.8 million tweets posted by 1.6 million users across 117 countries in response to the public launch of ChatGPT in 2022. Our analysis shows how economic self-interest, proxied by occupational skill types in writing, programming, and mathematics, and national cultural orientations, as measured by Hofstede's individualism, uncertainty avoidance, and power distance dimensions, shape who speaks, when they speak, and their stance towards ChatGPT. Roles requiring more technical skills, such as programming and mathematics, tend to engage earlier and express more positive stances, whereas writing-centric occupations join later with greater skepticism. At the cultural level, individualism predicts both earlier engagement and a more negative stance, and uncertainty avoidance reduces the prevalence of positive stances but does not delay when users first engage with ChatGPT. Aggregate sentiment trends mask the dynamics observed in our study. The shift toward a more critical stance towards ChatGPT over time stems primarily from the entry of more skeptical voices rather than a change of heart among early adopters. Our findings underscore the importance of both the occupational background and cultural context in understanding public reactions to AI.


2506.23836 2026-03-31 math.OC cs.DC cs.LG

Proving the Limited Scalability of Centralized Distributed Optimization via a New Lower Bound Construction

Alexander Tyurin

Details

Abstract

We consider centralized distributed optimization in the classical federated learning setup, where $n$ workers jointly find an $\varepsilon$-stationary point of an $L$-smooth, $d$-dimensional nonconvex function $f$, having access only to unbiased stochastic gradients with variance $\sigma^2$. Each worker requires at most $h$ seconds to compute a stochastic gradient, and the communication times from the server to the workers and from the workers to the server are $\tau_{s}$ and $\tau_{w}$ seconds per coordinate, respectively. One of the main motivations for distributed optimization is to achieve scalability with respect to $n$. For instance, it is well known that the distributed version of SGD has a variance-dependent runtime term $\frac{h \sigma^2 L \Delta}{n \varepsilon^2}$, which improves with the number of workers $n$, where $\Delta = f(x^0) - f^*$, and $x^0 \in \mathbb{R}^d$ is the starting point. Similarly, using unbiased sparsification compressors, it is possible to reduce both the variance-dependent runtime term and the communication runtime term. However, once we account for the communication from the server to the workers $\tau_{s}$, we prove that it becomes infeasible to design a method using unbiased random sparsification compressors that scales both the server-side communication runtime term $\tau_{s} d \frac{L \Delta}{\varepsilon}$ and the variance-dependent runtime term $\frac{h \sigma^2 L \Delta}{\varepsilon^2}$ better than poly-logarithmically in $n$, even in the homogeneous (i.i.d.) case, where all workers access the same distribution. To establish this result, we construct a new "worst-case" function and develop a new lower bound framework that reduces the analysis to the concentration of a random sum, for which we prove a concentration bound. These results reveal fundamental limitations in scaling distributed optimization, even under the homogeneous assumption.
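To make the two runtime terms in the abstract concrete, the sketch below evaluates them for a few worker counts. It only transcribes the formulas as quoted; the function names and all parameter values are hypothetical, and constants and logarithmic factors are ignored.

```python
def variance_term(h, sigma2, L, delta, n, eps):
    """Variance-dependent runtime term of distributed SGD: h*sigma^2*L*Delta / (n*eps^2)."""
    return h * sigma2 * L * delta / (n * eps ** 2)

def server_comm_term(tau_s, d, L, delta, eps):
    """Server-to-worker communication runtime term: tau_s * d * L*Delta / eps."""
    return tau_s * d * L * delta / eps

# Hypothetical problem parameters, chosen only for illustration.
params = dict(h=0.01, sigma2=4.0, L=1.0, delta=10.0, eps=1e-2)

for n in (1, 8, 64):
    t_var = variance_term(n=n, **params)
    print(f"n={n:>3}  variance term = {t_var:.1f} s")

# The server-side term as written has no dependence on n at all.
t_comm = server_comm_term(tau_s=1e-6, d=10**6, L=1.0, delta=10.0, eps=1e-2)
print(f"server communication term = {t_comm:.1f} s")
```

The variance term shrinks as 1/n while the server-side term stays fixed. The abstract's lower bound says that, with unbiased random sparsification compressors, no method can improve both terms by more than poly-logarithmic factors in n at once.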


2506.21138 2026-03-31 cs.SE cs.AI

Multi-Sample Prompting and Actor-Critic Prompt Optimization for Diverse Synthetic Data Generation

Abdelkarim El-Hajjami, Camille Salinesi

2505.17288 2026-03-31 stat.ML cs.LG

Learning to Choose or Choosing to Learn: Best-of-N vs. Supervised Fine-Tuning for Bit String Generation

Seamus Somerstep, Vinod Raman, Unique Subedi, Yuekai Sun

Comments AISTATS 2026 Camera Ready

2505.13213 2026-03-31 stat.ML cs.LG

Diffusion Models with Double Guidance: Generate with aggregated datasets

Yanfeng Yang, Kenji Fukumizu

2505.12412 2026-03-31 stat.ML cs.LG

Training Latent Diffusion Models with Interacting Particle Algorithms

Tim Y. J. Wang, Juan Kuntz, O. Deniz Akyildiz

Comments Camera Ready version for AISTATS 2026

2505.07372 2026-03-31 cs.SE cs.AI

Self-Bootstrapping Automated Program Repair: Using LLMs to Generate and Evaluate Synthetic Training Data for Bug Repair

David de-Fitero-Dominguez, Antonio Garcia-Cabot, Eva Garcia-Lopez

Details

DOI: 10.1016/j.eswa.2026.132154
Journal ref: Expert Systems with Applications 319 (2026)

Abstract

This paper presents a novel methodology for enhancing Automated Program Repair (APR) through synthetic data generation utilizing Large Language Models (LLMs). Current APR systems are constrained by the limited availability of high-quality training data encompassing diverse bug types across multiple programming languages. The proposed approach addresses this limitation through a two-phase process: synthetic sample generation followed by a rigorous quality assessment. Multiple state-of-the-art LLMs were employed to generate approximately 30,000 paired examples of buggy and fixed code across 12 programming languages and 13 bug categories. Subsequently, these samples underwent cross-model evaluation against five criteria: correctness, code quality, security, performance, and completeness. Experimental evaluation on the VulRepair test set showed statistically significant improvements in Perfect Prediction rates, with the quality-filtered synthetic dataset achieving 17.18% (Top@1) and 23.00% (Top@5) compared to the baseline's 11.68% and 18.88%, respectively, representing a 47% relative improvement in Top@1 and 22% in Top@5. The methodology was validated through rigorous statistical testing, including ANOVA and post-hoc Tukey's Honest Significant Difference analysis. Furthermore, the best-performing configurations surpassed existing systems despite using a less computationally intensive decoding strategy. This research establishes a self-bootstrapping paradigm in which LLMs generate and evaluate their own training data, suggesting promising directions for addressing data scarcity in similar software engineering tasks and advancing the development of robust, adaptable tools for automated code maintenance.
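The relative improvements quoted in the abstract follow directly from the reported Perfect Prediction rates. A quick check, using only the four percentages stated above (the helper name is hypothetical):

```python
def relative_improvement(new: float, baseline: float) -> float:
    """Relative gain of `new` over `baseline`, expressed as a percentage."""
    return (new - baseline) / baseline * 100.0

# Perfect Prediction rates (in percent) as reported in the abstract.
top1 = relative_improvement(17.18, 11.68)
top5 = relative_improvement(23.00, 18.88)
print(f"Top@1: {top1:.1f}%  Top@5: {top5:.1f}%")
```

Rounded to whole percentages, these agree with the 47% and 22% figures in the abstract.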


2505.04880 2026-03-31 quant-ph cs.AI cs.LG

Symbolic Analysis of Grover Search Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization

Min Chen, Jinglei Cheng, Pingzhi Li, Haoran Wang, Tianlong Chen, Junyu Liu

Comments 33 pages, 14 figures

Details

DOI: 10.1038/s41534-026-01195-1
Journal ref: npj Quantum Information 12, 48 (2026)

Abstract

Understanding the high-level conceptual structure of quantum algorithms from their low-level circuit representations is a critical task for verification, debugging, and education. While traditional numerical simulators can calculate output probabilities, they do not explicitly surface the underlying algorithmic logic, such as the function of an oracle or embedded symmetries. In this work, we shift the focus from numerical simulation to symbolic analysis, investigating whether Large Language Models (LLMs) can automatically interpret quantum circuits and articulate their logic in a human-readable format. We introduce GroverGPT+, a model that leverages Chain-of-Thought reasoning and quantum-native tokenization to analyze Grover's search algorithm. We use Grover's algorithm as a controlled testbed, as its well-defined analytical properties allow for rigorous verification of the model's reasoning process. Our primary finding is that GroverGPT+ successfully identifies the oracle and its marked states directly from circuit representations. The model's key output is not a final probability, but a structured, interpretable reasoning trace that mirrors human expert analysis, effectively translating procedural circuit steps into conceptual insights. Furthermore, we establish a structured benchmark for this symbolic analysis task and explore its empirical extrapolation describing the model's performance as the number of qubits increases. These findings position LLMs as powerful tools for automated quantum algorithm analysis and verification. More fundamentally, this work offers a first step towards using such models as scientific probes, suggesting that an algorithm's "learnability" by a classical model can provide a new, complementary perspective on its conceptual complexity, a topic of core interest to quantum information science.
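For context on why Grover's algorithm makes a convenient testbed, the sketch below shows the kind of numerical simulation the abstract contrasts with symbolic analysis, alongside the closed-form success probability that makes results easy to verify. It is a plain state-vector simulation with real amplitudes, not GroverGPT+; the 3-qubit instance and marked state are arbitrary illustrative choices.

```python
import math

def grover_probabilities(n_qubits, marked, iterations):
    """Measurement probabilities after the given number of Grover iterations."""
    size = 2 ** n_qubits
    amps = [1.0 / math.sqrt(size)] * size  # uniform superposition
    for _ in range(iterations):
        # Oracle: flip the sign of every marked basis state.
        amps = [-a if i in marked else a for i, a in enumerate(amps)]
        # Diffusion: reflect every amplitude about the mean amplitude.
        mean = sum(amps) / size
        amps = [2.0 * mean - a for a in amps]
    return [a * a for a in amps]

def optimal_iterations(n_qubits, n_marked):
    """Standard iteration count: floor(pi/4 * sqrt(N / M))."""
    return math.floor(math.pi / 4 * math.sqrt(2 ** n_qubits / n_marked))

# Hypothetical instance: 3 qubits, single marked state |101> (index 5).
n_qubits, marked = 3, {5}
k = optimal_iterations(n_qubits, len(marked))
probs = grover_probabilities(n_qubits, marked, k)

# Closed form: success probability is sin^2((2k + 1) * theta), sin(theta) = sqrt(M / N).
theta = math.asin(math.sqrt(len(marked) / 2 ** n_qubits))
analytic = math.sin((2 * k + 1) * theta) ** 2
print(k, round(probs[5], 4), round(analytic, 4))
```

The simulation reports how likely the marked state is, but nothing in its output says which states the oracle marks or why; that gap is what the symbolic reasoning trace described in the abstract is meant to fill.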


2504.16474 2026-03-31 cs.CR cs.LG

Seeking Flat Minima over Diverse Surrogates for Improved Adversarial Transferability: A Theoretical Framework and Algorithmic Instantiation

Meixi Zheng, Kehan Wu, Yanbo Fan, Rui Huang, Baoyuan Wu

Comments 32 pages, 9 figures

2503.08915 2026-03-31 eess.IV cs.CV

Reconstruct Anything Model: a lightweight general model for computational imaging

Matthieu Terris, Samuel Hurault, Maxime Song, Julian Tachella

2502.18535 2026-03-31 cs.CR cs.AI cs.LG

A Survey of Zero-Knowledge Proof Based Verifiable Machine Learning

Zhizhi Peng, Chonghe Zhao, Taotao Wang, Guofu Liao, Zibin Lin, Yifeng Liu, Bin Cao, Long Shi, Qing Yang, Shengli Zhang

Comments This manuscript has been accepted for publication in Artificial Intelligence Review

2502.07754 2026-03-31 cs.GR cs.CV

MeshSplats: Mesh-Based Rendering with Gaussian Splatting Initialization

Rafał Tobiasz, Grzegorz Wilczyński, Marcin Mazur, Sławomir Tadeja, Weronika Smolak-Dyżewska, Przemysław Spurek

2502.03330 2026-03-31 cs.HC cs.AI cs.CV cs.GR

ControlGUI: Guiding Generative GUI Exploration through Perceptual Visual Flow

Aryan Garg, Yue Jiang, Antti Oulasvirta