arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.06421 2026-05-08 cs.CV cs.LG

FREPix: Frequency-Heterogeneous Flow Matching for Pixel-Space Image Generation

Mingfeng Lin, Jiakun Chen, Liang Han, Liqiang Nie

详情

英文摘要

Pixel-space diffusion has re-emerged as a promising alternative to latent-space generation because it avoids the representation bottleneck introduced by VAEs. Yet most existing methods still treat image generation as a frequency-homogeneous process, overlooking the distinct roles and learning dynamics of low- and high-frequency components. To address this, we propose FREPix, a FREquency-heterogeneous flow matching framework for Pixel-space image generation. FREPix explicitly decomposes generation into low- and high-frequency components, assigns them separate transport paths, predicts them with a factorized network, and trains them with a frequency-aware objective. In this way, coarse-to-fine generation becomes an explicit design principle rather than an implicit behavior. On ImageNet class-to-image generation, FREPix achieves competitive results among pixel-space generation models, reaching 1.91 FID at $256\times256$ and 2.38 FID at $512\times512$, with particularly strong behavior in the low-NFE regime.

URL PDF HTML ☆

赞 0 踩 0

2605.06416 2026-05-08 cs.CL

MiA-Signature: Approximating Global Activation for Long-Context Understanding

Yuqing Li, Jiangnan Li, Mo Yu, Zheng Lin, Weiping Wang, Jie Zhou

Comments This is a work in progress; we will continue to revise and improve the manuscript

2605.06404 2026-05-08 cs.LG

FRInGe: Distribution-Space Integrated Gradients with Fisher--Rao Geometry

Gabriele Martino, Sebastian Tschiatschek

2605.06403 2026-05-08 cs.CL cs.IR

GATHER: Convergence-Centric Hyper-Entity Retrieval for Zero-Shot Cell-Type Annotation

Zhonghui Zhang, Feng Jiang, Shaowei Qin, Jiahao Zhao, Min Yang

Comments Accepted to SIGIR 2026. 2 figures, 3 tables

详情

DOI: 10.1145/3805712.3809935

英文摘要

Zero-shot single-cell cell-type annotation aims to determine a cell's type from a given set of expressed genes without any training. Existing knowledge-graph-based RAG approaches retrieve evidence by expanding from source entities and relying on iterative LLM reasoning. However, in this setting each query contains tens to hundreds of genes, where no single gene is decisive and the label emerges only from their collective co-occurrence. Such hyper-entity queries fundamentally challenge local, entity-wise exploration strategies, which reason from individual genes, leading to poor scalability and substantial LLM cost. We propose GATHER (Graph-Aware Traversal with Hyper-Entity Retrieval), a convergence-centric retriever tailored to hyper-entity queries. It performs global multi-source graph traversal and identifies topological convergence points -- nodes jointly reachable from many input genes. These convergence nodes act as high-information hyper-entities that capture entity synergy. By incorporating node- and path-importance scoring, GATHER selects informative evidence entirely without LLM involvement during retrieval. Instantiated on a self-constructed cell-centric biological knowledge graph (VCKG), GATHER outperforms strong KG-RAG baselines (ToG, ToG-2, RoG, PoG) on two datasets (Immune and Lung), achieving the highest exact-match accuracy (27.45% and 59.64%) with only a single LLM call per sample, compared to 2--61 calls for KG-RAG baselines. Our results demonstrate that convergence nodes compress multi-entity signals into compact, high-information evidence that conveys more per item than multi-hop paths, providing an efficient global alternative to local entity-wise reasoning.

URL PDF HTML ☆

赞 0 踩 0

2605.06402 2026-05-08 cs.LG

SparseForge: Efficient Semi-Structured LLM Sparsification via Annealing of Hessian-Guided Soft-Mask

Liu Hanzuo, Chaofan Lin, Weixuan Sun, Yulong Wang, Key, Rayying, Mingyu Gao

2605.06388 2026-05-08 cs.CV cs.LG cs.RO

Reconstruction or Semantics? What Makes a Latent Space Useful for Robotic World Models

Nilaksh, Saurav Jha, Artem Zholus, Sarath Chandar

Comments 9 pages

2605.06385 2026-05-08 cs.LG

Data-Driven Covariate Selection for Nonparametric and Cycle-Agnostic Causal Effect Estimation

Ana Leticia Garcez Vicente, Gijs van Seeventer, Saber Salehkaleybar

2605.06382 2026-05-08 cs.AI

Rethinking Vacuity for OOD Detection in Evidential Deep Learning

Claire McNamara

2605.06380 2026-05-08 cs.CV cs.LG

Empirical Evidence for Simply Connected Decision Regions in Image Classifiers

Arjhun Swaminathan, Mete Akgün

2605.06376 2026-05-08 cs.CV cs.AI

Continuous-Time Distribution Matching for Few-Step Diffusion Distillation

Tao Liu, Hao Yan, Mengting Chen, Taihang Hu, Zhengrong Yue, Zihao Pan, Jinsong Lan, Xiaoyong Zhu, Ming-Ming Cheng, Bo Zheng, Yaxing Wang

Comments 22pages, 9 figures

2605.06371 2026-05-08 cs.AI

Debiased Multimodal Personality Understanding through Dual Causal Intervention

Yangfu Zhu, Zitong Han, Nianwen Ning, Yuting Wei, Yuandong Wang, Hang Feng, Zhenzhou Shao

2605.06368 2026-05-08 cs.CV cs.AI cs.LG

eXplaining to Learn (eX2L): Regularization Using Contrastive Visual Explanation Pairs for Distribution Shifts

Paulo Mario P. Medina, Jose Marie Antonio Miñoza, Sebastian C. Ibañez

2605.06365 2026-05-08 cs.AI cs.MA cs.SE

From Agent Loops to Deterministic Graphs: Execution Lineage for Reproducible AI-Native Work

Josh Rosen, Seth Rosen

Comments 16 pages, 1 figure

详情

英文摘要

Large language model systems are increasingly deployed as agentic workflows that interleave reasoning, tool use, memory, and iterative refinement. These systems are effective at producing answers, but they often rely on implicit conversational state, making it difficult to preserve stable work products, isolate irrelevant updates, or propagate changes through intermediate artifacts. We introduce execution lineage: an execution model in which AI-native work is represented as a directed acyclic graph (DAG) of artifact-producing computations with explicit dependencies, stable intermediate boundaries, and identity-based replay. The goal is not to make the model a better one-shot writer, but to make evolving AI-generated work maintainable under change. We compare execution-lineage replay against loop-centric update baselines on two controlled policy-memo update tasks. In an unrelated-branch update, DAG replay preserved the final memo exactly in all runs, with zero churn and zero unrelated-branch contamination, while loop baselines regenerated the memo and frequently imported unrelated context. In an intermediate-artifact edit, all systems reflected the new constraint in the final memo, but only DAG replay achieved perfect upstream preservation, downstream propagation, unaffected-artifact preservation, and cross-artifact consistency. These results show that final answer quality and maintained-state quality are distinct. Strong loop baselines can remain competitive at producing polished final outputs when the task is a bounded synthesis/update problem and all current sources fit in context, but immediate task success can mask partial state inconsistency that may compound over future revisions. Execution lineage provides stronger guarantees about what should change, what should remain stable, and how work evolves across revisions.

URL PDF HTML ☆

赞 0 踩 0

2605.06364 2026-05-08 cs.LG cs.AI

Flow Matching with Arbitrary Auxiliary Paths

Xin Peng, Ang Gao

2605.06361 2026-05-08 cs.LG

Preliminary Insights in Chronos Frequency Data Understanding and Reconstruction

Alessandro Pagani, Marco Cominelli, Liying Han, Gaofeng Dong, Sergio Benini, Francesco Gringoli, Mattia Savardi, Mani B. Srivastava, Trevor Bihl, Erik P. Blasch, Daniel O. Brigham, Kara Combs, Lance M. Kaplan, Federico Cerutti

2605.06357 2026-05-08 cs.LG cs.AI cs.CV

Memory Efficient Full-gradient Attacks (MEFA) Framework for Adversarial Defense Evaluations

Yuan Du, Mitchel Hill, HanQin Cai

2605.06352 2026-05-08 cs.LG cs.AI stat.ML

Topological Signatures of Grokking

Yifan Tang, Qiquan Wang, Inés García-Redondo, Anthea Monod

Comments 19 pages, 14 figures, 2 tables

2605.06350 2026-05-08 cs.LG cs.AI cs.CL

Is Escalation Worth It? A Decision-Theoretic Characterization of LLM Cascades

Dylan Bouchard

2605.06346 2026-05-08 cs.AI

Prediction and Empowerment: A Theory of Agency through Bridge Interfaces

Richard Csaky

Comments This is a working draft: feedback and criticism is most welcome

2605.06345 2026-05-08 cs.AI

More Than Can Be Said: A Benchmark and Framework for Pre-Question Scientific Ideation

Jie Yu, Song Qiu

Comments 19 pages, 2 figures; Code is available at https://github.com/Paradoxtcal/InciteResearch.git

2605.06343 2026-05-08 cs.AI

Mind the Gap? A Distributional Comparison of Real and Synthetic Priors for Tabular Foundation Models

Alex O. Davies, Telmo de Menezes e Silva Filho, Nirav Ajmeri

2605.06342 2026-05-08 cs.CL

Don't Lose Focus: Activation Steering via Key-Orthogonal Projections

Haoyan Luo, Mateo Espinosa Zarlenga, Mateja Jamnik

2605.06339 2026-05-08 cs.AI

A Regime Theory of Controller Class Selection for LLM Action Decisions

Zhaoyang Jiang, Zhizhong Fu, Yunsoo Kim, Jiacong Mi, Zicheng Li, Xuanqi Peng, Honghan Wu

2605.06337 2026-05-08 cs.CV

Earth-o1: A Grid-free Observation-native Atmospheric World Model

Junchao Gong, Kaiyi Xu, Wangxu Wei, Siwei Tu, Jingyi Xu, Zili Liu, Hang Fan, Zhiwang Zhou, Tao Han, Yi Xiao, Xinyu Gu, Zhangrui Li, Wenlong Zhang, Hao Chen, Xiaokang Yang, Yaqiang Wang, Lijing Cheng, Pierre Gentine, Wanli Ouyang, Feng Zhang, Zhe-Min Tan, Bowen Zhou, Fenghua Ling, Ben Fei, Lei Bai

2605.06335 2026-05-08 cs.LG

Eliciting associations between clinical variables from LLMs via comparison questions across populations

Fabian Kabus, Kian Kordtomeikel, Thomas Brox, Heinz Wiendl, Daiana Stolz, Harald Binder

2605.06334 2026-05-08 cs.CL cs.LG cs.LO

MANTRA: Synthesizing SMT-Validated Compliance Benchmarks for Tool-Using LLM Agents

Ashwani Anand, Ivi Chatzi, Ritam Raha, Anne-Kathrin Schmuck

2605.06333 2026-05-08 cs.CV cs.AI cs.LG stat.AP stat.ML

TinyBayes: Closed-Form Bayesian Inference via Jacobi Prior for Real-Time Image Classification on Edge Devices

Shouvik Sardar, Sourish Das

Comments 14 Pages, 1 Figure, 4 Tables

2605.06332 2026-05-08 cs.LG

LINC: Decoupling Local Consequence Scoring from Hidden Matching in Constructive Neural Routing

Shaofeng Qin, Li Wang

Comments 21 pages, 10 figures, 10 tables. Code: https://github.com/Elaina10172004/LINC

2605.06327 2026-05-08 cs.CL cs.AI cs.LG

Measuring Evaluation-Context Divergence in Open-Weight LLMs: A Paired-Prompt Protocol with Pilot Evidence of Alignment-Pipeline-Specific Heterogeneity

Florian A. D. Burnat, Brittany I. Davidson

2605.06326 2026-05-08 cs.CL

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Qianjia Cheng, Yuchen Zhang, Zhilin Wang, Yuxin Zuo, Shunkai Zhang, Yuchen Fan, Yu Qiao, Bowen Zhou, Ning Ding, Yu Cheng, Yun Luo, Ganqu Cui