arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.00825 2026-05-04 cs.CV

Posterior Augmented Flow Matching

George Stoica, Sayak Paul, Matthew Wallingford, Vivek Ramanujan, Abhay Nori, Winson Han, Ali Farhadi, Ranjay Krishna, Judy Hoffman

详情

英文摘要

Flow matching (FM) trains a time-dependent vector field that transports samples from a simple prior to a complex data distribution. However, for high-dimensional images, each training sample supervises only a single trajectory and intermediate point, yielding an extremely sparse and high-variance training signal. This under-constrained supervision can cause flow collapse, where the learned dynamics memorize specific source-target pairings, mapping diverse inputs to overly similar outputs, failing to generalize. We introduce Posterior-Augmented Flow Matching (PAFM), a theoretically grounded generalization of FM that replaces single-target supervision with an expectation over an approximate posterior of valid target completions for a given intermediate state and condition. PAFM factorizes this intractable posterior into (i) the likelihood of the intermediate under a hypothesized endpoint and (ii) the prior probability of that endpoint under the condition, and uses an importance sampling scheme to construct a mixture over multiple candidate targets. We prove that PAFM yields an unbiased estimator of the original FM objective while substantially reducing gradient variance during training by aggregating information from many plausible continuation trajectories per intermediate. Finally, we show that PAFM improves over FM by up to 3.4 FID50K across different model scales (SiT-B/2 and SiT-XL/2), different architectures (SiT and MMDiT), and in both class and text conditioned benchmarks (ImageNet and CC12M), with a negligible increase in the compute overhead. Code: https://github.com/gstoica27/PAFM.git.

URL PDF HTML ☆

赞 0 踩 0

2605.00824 2026-05-04 cs.MM

CustomDancer: Customized Dance Recommendation by Text-Dance Retrieval

Yawen Qin, Ke Qiu, Qin Zhang

2605.00823 2026-05-04 math.NA cs.NA

Reliability, Robustness, and Resilience Modeling for Surveillance System in Advanced Air Mobility Operations

Esrat Farhana Dulia, Caleb Adams, Syed Arbab Mohd Shihab, Ruben Del Rosario

2605.00820 2026-05-04 cs.CE cs.LG cs.NA math.NA

HyCOP: Hybrid Composition Operators for Interpretable Learning of PDEs

Jinpai Zhao, Nishant Panda, Yen Ting Lin, Eirik Valseth, Diane Oyen, Clint Dawson

2605.00812 2026-05-04 cs.LO math.LO

Univalence without function extensionality

Evan Cavallo, Jonas Höfer

Comments 20 pages

2605.00804 2026-05-04 cs.HC

Prop-Chromeleon: Adaptive Haptic Props in Mixed Reality through Generative Artificial Intelligence

Haoyu Wang, Fengyuan Zhu, Bingjian Huang, Zhecheng Wang, Ludwig Sidenmark

Comments Accepted to ACM DIS 2026

2605.00803 2026-05-04 cs.SE cs.AI cs.CL

Can Coding Agents Reproduce Findings in Computational Materials Science?

Ziyang Huang, Yi Cao, Ali K. Shargh, Jing Luo, Ruidong Mei, Mohd Zaki, Zhan Liu, Wyatt Bunstine, William Jurayj, Somdatta Goswami, Tyrel McQueen, Michael Shields, Jaafar El-Awady, Paulette Clancy, Benjamin Van Durme, Nicholas Andrews, William Walden, Daniel Khashabi

2605.00800 2026-05-04 cs.LG

Generating Statistical Charts with Validation-Driven LLM Workflows

Pavlin G. Poličar, Andraž Pevcin, Blaž Zupan

2605.00799 2026-05-04 cs.CV

GMGaze: MoE-Based Context-Aware Gaze Estimation with CLIP and Multiscale Transformer

Xinyuan Zhao, Yihang Wu, Ahmad Chaddad, Sarah A. Alkhodair, Reem Kateb

Comments Accepted in KBS

2605.00798 2026-05-04 cs.LG cs.CL cs.MA

RunAgent: Interpreting Natural-Language Plans with Constraint-Guided Execution

Arunabh Srivastava, Mohammad A., Khojastepour, Srimat Chakradhar, Sennur Ulukus

2605.00797 2026-05-04 cs.DS

A Faster Deterministic Algorithm for Fully Dynamic Maximal Matching

Julia Chuzhoy, Sanjeev Khanna, Junkai Song

详情

英文摘要

In the fully dynamic maximal matching problem, the goal is to maintain a maximal matching in a graph undergoing an online sequence of edge insertions and deletions. The problem has been studied extensively in the oblivious-adversary setting, where randomized algorithms with polylogarithmic worst-case and constant amortized update time have been known for some time. A major challenge in this area has been designing an algorithm with non-trivial update time against an adaptive adversary. In a recent breakthrough, Bernstein, Bhattacharya, Kiss, and Saranurak (STOC 2025; hereafter, BBKS25) obtained the first algorithms with sublinear update time for this setting: namely, a randomized algorithm with $\tilde{O}(n^{3/4})$ amortized update time, and a deterministic algorithm with $\tilde{O}(n^{8/9})$ amortized update time. Our main result is a deterministic algorithm for fully dynamic maximal matching with amortized update time $n^{1/2+o(1)}$. A powerful tool in dynamic matching is the use of matching sparsifiers: sparse subgraphs that preserve enough information to recover matchings with desired properties. Sparsifiers, such as the EDCS data structure, have been successfully used for approximate maximum matching. For maximal matching, however, this paradigm is not as natural, since maximality must hold with respect to the entire graph. Nevertheless, BBKS25 showed that EDCS can be repurposed as a verification-and-repair mechanism for fully dynamic maximal matching against adaptive adversaries. We introduce a new deterministic framework, referred to as the subgraph system, which, in contrast to EDCS, is purpose-built for verification and maintenance of maximality. It is also designed to allow efficient recursive refinements leading to stronger and stronger parameters, that yield our deterministic algorithm with $n^{1/2+o(1)}$ amortized update time.

URL PDF HTML ☆

赞 0 踩 0

2605.00796 2026-05-04 cs.CR cs.AI cs.CL

When RAG Chatbots Expose Their Backend: An Anonymized Case Study of Privacy and Security Risks in Patient-Facing Medical AI

Alfredo Madrid-García, Miguel Rujas

详情

英文摘要

Background: Patient-facing medical chatbots based on retrieval-augmented generation (RAG) are increasingly promoted to deliver accessible, grounded health information. AI-assisted development lowers the barrier to building them, but they still demand rigorous security, privacy, and governance controls. Objective: To report an anonymized, non-destructive security assessment of a publicly accessible patient-facing medical RAG chatbot and identify governance lessons for safe deployment of generative AI in health. Methods: We used a two-stage strategy. First, Claude Opus 4.6 supported exploratory prompt-based testing and structured vulnerability hypotheses. Second, candidate findings were manually verified using Chrome Developer Tools, inspecting browser-visible network traffic, payloads, API schemas, configuration objects, and stored interaction data. Results: The LLM-assisted phase identified a critical vulnerability: sensitive system and RAG configuration appeared exposed through client-server communication rather than restricted server-side. Manual verification confirmed that ordinary browser inspection allowed collection of the system prompt, model and embedding configuration, retrieval parameters, backend endpoints, API schema, document and chunk metadata, knowledge-base content, and the 1,000 most recent patient-chatbot conversations. The deployment also contradicted its privacy assurances: full conversation records, including health-related queries, were retrievable without authentication. Conclusions: Serious privacy and security failures in patient-facing RAG chatbots can be identified with standard browser tools, without specialist skills or authentication; independent review should be a prerequisite for deployment. Commercial LLMs accelerated this assessment, including under a false developer persona; assistance available to auditors is equally available to adversaries.

URL PDF HTML ☆

赞 0 踩 0

2605.00789 2026-05-04 cs.CV cs.AI cs.LG

Make Your LVLM KV Cache More Lightweight

Xihao Chen, Yangyang Guo, Roger Zimmermann

Comments Accepted to Transactions on Machine Learning Research (TMLR), 2026

2605.00788 2026-05-04 cs.CR

Repurposing Image Diffusion Models for Adversarial Synthetic Structured Data: A Case Study of Ground Truth Drift

Adam Arthur, Christopher Schwartz

Comments 2 figures

2605.00787 2026-05-04 cs.LG

SAVGO: Learning State-Action Value Geometry with Cosine Similarity for Continuous Control

Stavros Orfanoudakis, Pedro P. Vergara

Comments Reinforcement Learning

2605.00782 2026-05-04 cs.SE cs.AI

GeoContra: From Fluent GIS Code to Verifiable Spatial Analysis with Geography-Grounded Repair

Yinhao Xiao, Rongbo Xiao, Yihan Zhang

2605.00781 2026-05-04 cs.CV

Map2World: Segment Map Conditioned Text to 3D World Generation

Jaeyoung Chung, Suyoung Lee, Jianfeng Xiang, Jiaolong Yang, Kyoung Mu Lee

Comments project page: https://robot0321.github.io/Map2World/index.html

2605.00778 2026-05-04 cs.LG q-bio.NC

Observable Performance Does Not Fully Reflect System Organization: A Multi-Level Analysis of Gait Dynamics Under Occlusal Constraint

Jacques Raynal, Pierre Slangen, Jacques Margerit

Comments 1 table, 4 figures. Exploratory single-case study

2605.00777 2026-05-04 cs.SD cs.CL eess.AS

LASE: Language-Adversarial Speaker Encoding for Indic Cross-Script Identity Preservation

Venkata Pushpak Teja Menta

Comments 7 pages, 2 figures, 2 tables. Code, model, and datasets at https://github.com/praxelhq/lase

2605.00776 2026-05-04 cs.CL cs.AI

Directed Social Regard: Surfacing Targeted Advocacy, Opposition, Aid, Harms, and Victimization in Online Media

Scott Friedman, Ruta Wheelock, Sonja Schmer-Galunder, Drisana Iverson, Jake Vasilakes, Joan Zheng, Jeffrey Rye, Vasanth Sarathy, Christopher Miller

Comments 32 pages, 12 figures, 7 tables

2605.00773 2026-05-04 math.CT cs.LO

The Synthetic Sierpiński Cone

Fredrik Bakke, Jonathan Sterling, Mark Damuni Williams, Lingyuan Ye

2605.00769 2026-05-04 eess.SY cs.SY

Voltage Ride-Through in Large Loads- A Dual PQ Approach

Amir Norouzi, Michael Morel

Comments 10 pages

2605.00764 2026-05-04 cs.CV cs.HC

Modeling Subjective Urban Perception with Human Gaze

Lin Che, Xi Wang, Marc Pollefeys, Konrad Schindler, Martin Raubal, Peter Kiefer

2605.00762 2026-05-04 cs.LG cs.AI cs.MA

Meritocratic Fairness in Budgeted Combinatorial Multi-armed Bandits via Shapley Values

Shradha Sharma, Swapnil Dhamal, Shweta Jain

2605.00761 2026-05-04 cs.IT eess.SP math.IT

The Benefit of Decoder-Provided Pilots in Highly Dynamic Channels

Duschia Bodet, Muriel Médard, Muralidhar Rangaswamy, Ken Duffy

Comments This work has been submitted to the IEEE for possible publication

2605.00760 2026-05-04 cs.LG

Learning the Helmholtz equation operator with DeepONet for non-parametric 2D geometries

Rodolphe Barlogis, Ferhat Tamssaouet, Quentin Falcoz, Stéphane Grieu

Comments 24 pages, 16 figures

2605.00758 2026-05-04 physics.soc-ph cs.GT

Optimal network structure for collective performance with strategic information sharing

Ye Wang, Andrea Civilini, Anzhi Sheng, Xiaojie Chen, Long Wang, Vito Latora

2605.00755 2026-05-04 cs.NI

AdvNet: Revealing Performance Issues in Network Protocols by Generating Adversarial Environments

Shehab Sarar Ahmed, William Sentosa, Yinjie Zhang, Yoav Lebendiker, Michael Shnaiderman, Tomer Gilad, Nathan H. Jay, Brighten Godfrey, Michael Schapira

Comments 18 pages, 8 figures

2605.00752 2026-05-04 eess.SY cs.LO cs.SY

HyperCertificates: Verification of Discrete-time Dynamical Systems against HyperLTL Specifications

Vishnu Murali, Amin Falah, Ashutosh Trivedi, Majid Zamani

Comments 24 pages, 3 figures, 1 table

2605.00751 2026-05-04 cs.LG

NonZero: Interaction-Guided Exploration for Multi-Agent Monte Carlo Tree Search

Sizhe Tang, Zuyuan Zhang, Mahdi Imani, Tian Lan

Comments Accepted by ICML 2026 as Spotlight