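arXivDaily daily academic digest: synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and related fields.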

Xinrui Zhou, Yuhao Huang, Haoran Dou, Shijing Chen, Ao Chang, Jia Liu, Weiran Long, Jian Zheng, Erjiao Xu, Jie Ren, Alejandro F. Frangi, Ruobing Huang, Jun Cheng, Xiaomeng Li, Wufeng Xue, Dong Ni

Comments Accepted by International Journal of Computer Vision, 30 pages, 11 figures, 11 tables

2409.04332 2026-02-19 cs.LG stat.ML

Amortized Bayesian Workflow

Chengkun Li, Aki Vehtari, Paul-Christian Bürkner, Stefan T. Radev, Luigi Acerbi, Marvin Schmitt

Comments Accepted in Transactions on Machine Learning Research

2402.00468 2026-02-19 cs.AI

RadDQN: a Deep Q Learning-based Architecture for Finding Time-efficient Minimum Radiation Exposure Pathway

Biswajit Sadhu, Trijit Sadhu, S. Anand

Comments 12 pages, 7 main figures, code link (GitHub)

Journal ref IEEE Transactions on Neural Networks and Learning Systems (Volume: 36, Issue: 9, September 2025), Page(s): 15951-15962

2308.00010 2026-02-19 cs.SD cs.LG eess.AS

Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model

S. Rijal, R. Neupane, S. P. Mainali, S. K. Regmi, S. Maharjan

Comments The paper doesn't qualify for replication: there are no clear instructions for data preparation that would allow the results to be replicated. It contains multiple grammar mistakes and needs a thorough review prior to publication

2305.05418 2026-02-19 cs.AI cs.LO

Measuring Rule-based LTLf Process Specifications: A Probabilistic Data-driven Approach

Alessio Cecconi, Luca Barbaro, Claudio Di Ciccio, Arik Senderovich

2208.14153 2026-02-19 cs.LG stat.ML

Identifying Weight-Variant Latent Causal Models

Yuhang Liu, Zhen Zhang, Dong Gong, Mingming Gong, Biwei Huang, Anton van den Hengel, Kun Zhang, Javen Qinfeng Shi

1907.06386 2026-02-19 cs.AI

Comprehensive Process Drift Detection with Visual Analytics

Anton Yeshchenko, Claudio Di Ciccio, Jan Mendling, Artem Polyvyanyy

Comments Accepted for publication at the 38th International Conference on Conceptual Modeling (ER 2019), http://www.inf.ufrgs.br/er2019/

2602.16703 2026-02-19 cs.CY cs.AI

Measuring Mid-2025 LLM-Assistance on Novice Performance in Biology

Shen Zhou Hong, Alex Kleinman, Alyssa Mathiowetz, Adam Howes, Julian Cohen, Suveer Ganta, Alex Letizia, Dora Liao, Deepika Pahari, Xavier Roberts-Gaal, Luca Righetti, Joe Torres

2602.16696 2026-02-19 q-bio.GN cs.LG q-bio.QM

Parameter-free representations outperform single-cell foundation models on downstream benchmarks

Huan Souza, Pankaj Mehta

2602.16690 2026-02-19 stat.ME cs.LG stat.ML

Synthetic-Powered Multiple Testing with FDR Control

Yonghoon Lee, Meshi Bashari, Edgar Dobriban, Yaniv Romano

2602.16671 2026-02-19 cs.SE cs.AI

SPARC: Scenario Planning and Reasoning for Automated C Unit Test Generation

Jaid Monwar Chowdhury, Chi-An Fu, Reyhaneh Jabbarvand

Comments 9 pages, 6 figures, 4 tables

2602.16650 2026-02-19 cs.CE cs.AI

Retrieval Augmented Generation of Literature-derived Polymer Knowledge: The Example of a Biodegradable Polymer Expert System

Sonakshi Gupta, Akhlak Mahmood, Wei Xiong, Rampi Ramprasad

Abstract

Polymer literature contains a large and growing body of experimental knowledge, yet much of it is buried in unstructured text and inconsistent terminology, making systematic retrieval and reasoning difficult. Existing tools typically extract narrow, study-specific facts in isolation, failing to preserve the cross-study context required to answer broader scientific questions. Retrieval-augmented generation (RAG) offers a promising way to overcome this limitation by combining large language models (LLMs) with external retrieval, but its effectiveness depends strongly on how domain knowledge is represented. In this work, we develop two retrieval pipelines: a dense semantic vector-based approach (VectorRAG) and a graph-based approach (GraphRAG). Using over 1,000 polyhydroxyalkanoate (PHA) papers, we construct context-preserving paragraph embeddings and a canonicalized structured knowledge graph supporting entity disambiguation and multi-hop reasoning. We evaluate these pipelines through standard retrieval metrics, comparisons with general state-of-the-art systems such as GPT and Gemini, and qualitative validation by a domain chemist. The results show that GraphRAG achieves higher precision and interpretability, while VectorRAG provides broader recall, highlighting complementary trade-offs. Expert validation further confirms that the tailored pipelines, particularly GraphRAG, produce well-grounded, citation-reliable responses with strong domain relevance. By grounding every statement in evidence, these systems enable researchers to navigate the literature, compare findings across studies, and uncover patterns that are difficult to extract manually. More broadly, this work establishes a practical framework for building materials science assistants using curated corpora and retrieval design, reducing reliance on proprietary models while enabling trustworthy literature analysis at scale.

2602.16634 2026-02-19 stat.ML cs.AI cs.LG physics.bio-ph physics.chem-ph

Enhanced Diffusion Sampling: Efficient Rare Event Sampling and Free Energy Calculation with Diffusion Models

Yu Xie, Ludwig Winkler, Lixin Sun, Sarah Lewis, Adam E. Foster, José Jiménez Luna, Tim Hempel, Michael Gastegger, Yaoyi Chen, Iryna Zaporozhets, Cecilia Clementi, Christopher M. Bishop, Frank Noé

2602.16612 2026-02-19 cs.LO cs.AI math.CT quant-ph

Causal and Compositional Abstraction

Robin Lorenz, Sean Tull

Abstract

Abstracting from a low level to a more explanatory high level of description, and ideally while preserving causal structure, is fundamental to scientific practice, to causal inference problems, and to robust, efficient and interpretable AI. We present a general account of abstractions between low and high level models as natural transformations, focusing on the case of causal models. This provides a new formalisation of causal abstraction, unifying several notions in the literature, including constructive causal abstraction, Q-$τ$ consistency, abstractions based on interchange interventions, and `distributed' causal abstractions. Our approach is formalised in terms of category theory, and uses the general notion of a compositional model with a given set of queries and semantics in a monoidal, cd- or Markov category; causal models and their queries such as interventions being special cases. We identify two basic notions of abstraction: downward abstractions mapping queries from high to low level; and upward abstractions, mapping concrete queries such as Do-interventions from low to high. Although usually presented as the latter, we show how common causal abstractions may, more fundamentally, be understood in terms of the former. Our approach also leads us to consider a new stronger notion of `component-level' abstraction, applying to the individual components of a model. In particular, this yields a novel, strengthened form of constructive causal abstraction at the mechanism-level, for which we prove characterisation results. Finally, we show that abstraction can be generalised to further compositional models, including those with a quantum semantics implemented by quantum circuits, and we take first steps in exploring abstractions between quantum compositional circuit models and high-level classical causal models as a means to explainable quantum AI.

2602.16603 2026-02-19 cs.DC cs.AI

FlowPrefill: Decoupling Preemption from Prefill Scheduling Granularity to Mitigate Head-of-Line Blocking in LLM Serving

Chia-chi Hsieh, Zan Zong, Xinyang Chen, Jianjiang Li, Jidong Zhai, Lijie Wen

Comments 13 pages

2602.16585 2026-02-19 cs.DB cs.AI

DataJoint 2.0: A Computational Substrate for Agentic Scientific Workflows

Dimitri Yatsenko, Thinh T. Nguyen

Comments 20 pages, 2 figures, 1 table

2602.16568 2026-02-19 math.ST cs.DS cs.LG math.OC stat.ML stat.TH

Separating Oblivious and Adaptive Models of Variable Selection

Ziyun Chen, Jerry Li, Kevin Tian, Yusong Zhu

Comments 40 pages

2602.16555 2026-02-19 math.OC cs.LG math.PR

Learning Distributed Equilibria in Linear-Quadratic Stochastic Differential Games: An $α$-Potential Approach

Philipp Plank, Yufei Zhang

2602.16554 2026-02-19 cs.LO cs.AI cs.ET quant-ph

MerLean: An Agentic Framework for Autoformalization in Quantum Computation

Yuanjie Ren, Jinzheng Li, Yidi Qi

2602.16520 2026-02-19 cs.CR cs.AI

Recursive language models for jailbreak detection: a procedural defense for tool-augmented agents

Doron Shavit

Comments 5 pages and 1 figure. Appendix: an additional 5 pages

2602.16505 2026-02-19 stat.ML cs.LG

Functional Decomposition and Shapley Interactions for Interpreting Survival Models

Sophie Hanna Langbein, Hubert Baniecki, Fabian Fumagalli, Niklas Koenen, Marvin N. Wright, Julia Herbinger

2602.16476 2026-02-19 stat.ML cs.LG

Learning Preference from Observed Rankings

Yu-Chang Chen, Chen Chian Fuh, Shang En Tsai

2602.16422 2026-02-19 eess.IV cs.AI cs.CV

Automated Histopathology Report Generation via Pyramidal Feature Extraction and the UNI Foundation Model

Ahmet Halici, Ece Tugba Cebeci, Musa Balci, Mustafa Cini, Serkan Sokmen

Comments 9 pages. Equal contribution: Ahmet Halici, Ece Tugba Cebeci, Musa Balci

2602.16421 2026-02-19 eess.AS cs.SD

SELEBI: Percussion-aware Time Stretching via Selective Magnitude Spectrogram Compression by Nonstationary Gabor Transform

Natsuki Akaishi, Nicki Holighaus, Kohei Yatabe

Comments This work has been submitted to the IEEE for possible publication

2602.16375 2026-02-19 cs.IR cs.CL cs.LG

Variable-Length Semantic IDs for Recommender Systems

Kirill Khrylchenko

2602.12207 2026-02-19 cs.HC cs.AI cs.SI

VIRENA: Virtual Arena for Research, Education, and Democratic Innovation

Emma Hoes, K. Jonathan Klueser, Fabrizio Gilardi

Comments VIRENA is under active development and currently in use at the University of Zurich. This preprint will be updated as new features are released. For the latest version and to inquire about demos or pilot collaborations, contact the authors

2602.05298 2026-02-19 stat.ML cs.LG math.OC

Logarithmic-time Schedules for Scaling Language Models with Momentum

Damien Ferbach, Courtney Paquette, Gauthier Gidel, Katie Everett, Elliot Paquette

2512.17322 2026-02-19 eess.IV cs.CV

Rotterdam artery-vein segmentation (RAV) dataset

Jose Vargas Quiros, Bart Liefers, Karin van Garderen, Jeroen Vermeulen, Eyened Reading Center, Caroline Klaver

2512.13532 2026-02-19 physics.flu-dyn cs.LG

Adaptive Sampling for Hydrodynamic Stability

Anshima Singh, David J. Silvester

2512.09530 2026-02-19 stat.ML cs.LG

Transformers for Tabular Data: A Training Perspective of Self-Attention via Optimal Transport

Alessandro Quadrio, Antonio Candelieri

Ctrl-GenAug: Controllable Generative Augmentation for Medical Sequence Classification