arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.08590 2026-04-13 cs.LG cs.AI

AlphaLab: Autonomous Multi-Agent Research Across Optimization Domains with Frontier LLMs

Brendan R. Hogan, Xiwen Chen, James T. Wilson, Kashif Rasul, Adel Boyarsky, Thomas Kamei, Anderson Schneider, Yuriy Nevmyvaka

Comments 43 pages, 12 figures

详情

英文摘要

We present AlphaLab, an autonomous research harness that leverages frontier LLM agentic capabilities to automate the full experimental cycle in quantitative, computation-intensive domains. Given only a dataset and a natural-language objective, AlphaLab proceeds through three phases without human intervention: (1) it adapts to the domain and explores the data, writing analysis code and producing a research report; (2) it constructs and adversarially validates its own evaluation framework; and (3) it runs large-scale GPU experiments via a Strategist/Worker loop, accumulating domain knowledge in a persistent playbook that functions as a form of online prompt optimization. All domain-specific behavior is factored into adapters generated by the model itself, so the same pipeline handles qualitatively different tasks without modification. We evaluate AlphaLab with two frontier LLMs (GPT-5.2 and Claude Opus 4.6) on three domains: CUDA kernel optimization, where it writes GPU kernels that run 4.4x faster than torch.compile on average (up to 91x); LLM pretraining, where the full system achieves 22% lower validation loss than a single-shot baseline using the same model; and traffic forecasting, where it beats standard baselines by 23-25% after researching and implementing published model families from the literature. The two models discover qualitatively different solutions in every domain (neither dominates uniformly), suggesting that multi-model campaigns provide complementary search coverage. We additionally report results on financial time series forecasting in the appendix, and release all code at https://brendanhogan.github.io/alphalab-paper/.

URL PDF HTML ☆

赞 0 踩 0

2604.08589 2026-04-13 cs.LG

EngageTriBoost: Predictive Modeling of User Engagement in Digital Mental Health Intervention Using Explainable Machine Learning

Ha Na Cho, Daniel Eisenberg, Cheryl King, Kai Zheng

2604.08588 2026-04-13 cs.LG cs.AI

Act or Escalate? Evaluating Escalation Behavior in Automation with Language Models

Matthew DosSantos DiSorbo, Harang Ju

2604.08586 2026-04-13 cs.LG cs.AI physics.flu-dyn

FluidFlow: a flow-matching generative model for fluid dynamics surrogates on unstructured meshes

David Ramos, Lucas Lacasa, Fermín Gutiérrez, Eusebio Valero, Gonzalo Rubio

Comments 17 pages, 6 figures

详情

英文摘要

Computational fluid dynamics (CFD) provides high-fidelity simulations of fluid flows but remains computationally expensive for many-query applications. In recent years deep learning (DL) has been used to construct data-driven fluid-dynamic surrogate models. In this work we consider a different learning paradigm and embrace generative modelling as a framework for constructing scalable fluid-dynamics surrogate models. We introduce FluidFlow, a generative model based on conditional flow-matching, a recent alternative to diffusion models that learns deterministic transport maps between noise and data distributions. FluidFlow is specifically designed to operate directly on CFD data defined on both structured and unstructured meshes alike, without the needs to perform any mesh interpolation pre-processing and preserving geometric fidelity. We assess the capabilities of FluidFlow using two different core neural network architectures, a U-Net and diffusion transformer (DiT), and condition their learning on physically meaningful parameters. The methodology is validated on two benchmark problems of increasing complexity: prediction of pressure coefficients along an airfoil boundary across different operating conditions, and prediction of pressure and friction coefficients over a full three-dimensional aircraft geometry discretized on a large unstructured mesh. In both cases, FluidFlow outperform strong multilayer perceptron baselines, achieving significantly lower error metrics and improved generalisation across operating conditions. Notably, the transformer-based architecture enables scalable learning on large unstructured datasets while maintaining high predictive accuracy. These results demonstrate that flow-matching generative models provide an effective and flexible framework for surrogate modelling in fluid dynamics, with potential for realistic engineering and scientific applications.

URL PDF HTML ☆

赞 0 踩 0

2604.08584 2026-04-13 cs.LG cs.AI

CSAttention: Centroid-Scoring Attention for Accelerating LLM Inference

Chuxu Song, Zhencan Peng, Jiuqi Wei, Chuanhui Yang

2604.08582 2026-04-13 cs.LG cs.AI

Multivariate Time Series Anomaly Detection via Dual-Branch Reconstruction and Autoregressive Flow-based Residual Density Estimation

Jun Liu, Ying Chen, Ziqian Lu, Qinyue Tong, Jun Tang

Comments 12 pages, 3 figures,

2604.08579 2026-04-13 cs.LG cs.AI

On the Spectral Geometry of Cross-Modal Representations: A Functional Map Diagnostic for Multimodal Alignment

Krisanu Sarkar

Comments Under review at ACMMM Brave New Ideas Track

2604.08578 2026-04-13 cs.LG cs.AI

Structured Exploration and Exploitation of Label Functions for Automated Data Annotation

Phong Lam, Ha-Linh Nguyen, Thu-Trang Nguyen, Son Nguyen, Hieu Dinh Vo

Comments Accepted by KBS Journal

2604.08575 2026-04-13 cs.LG cs.AI

MolPaQ: Modular Quantum-Classical Patch Learning for Interpretable Molecular Generation

Syed Rameez Naqvi, Lu Peng

2604.08574 2026-04-13 cs.LG cs.AI

Distilling Genomic Models for Efficient mRNA Representation Learning via Embedding Matching

Rasched Haidari, Sam Martin, Maxime Allard

Comments Accepted at the Tiny Papers Track for the Machine Learning for Genomics Explorations Workshop at ICLR 2026 an the Gen2 Workshop at ICLR 2026

2604.08573 2026-04-13 cs.LG cs.AI cs.CV

Silhouette Loss: Differentiable Global Structure Learning for Deep Representations

Matheus Vinícius Todescato, Joel Luís Carbonera

详情

英文摘要

Learning discriminative representations is a central goal of supervised deep learning. While cross-entropy (CE) remains the dominant objective for classification, it does not explicitly enforce desirable geometric properties in the embedding space, such as intra-class compactness and inter-class separation. Existing metric learning approaches, including supervised contrastive learning (SupCon) and proxy-based methods, address this limitation by operating on pairwise or proxy-based relationships, but often increase computational cost and complexity. In this work, we introduce Soft Silhouette Loss, a novel differentiable objective inspired by the classical silhouette coefficient from clustering analysis. Unlike pairwise objectives, our formulation evaluates each sample against all classes in the batch, providing a batch-level notion of global structure. The proposed loss directly encourages samples to be closer to their own class than to competing classes, while remaining lightweight. Soft Silhouette Loss can be seamlessly combined with cross-entropy, and is also complementary to supervised contrastive learning. We propose a hybrid objective that integrates them, jointly optimizing local pairwise consistency and global cluster structure. Extensive experiments on seven diverse datasets demonstrate that: (i) augmenting CE with Soft Silhouette Loss consistently improves over CE and other metric learning baselines; (ii) the hybrid formulation outperforms SupCon alone; and (iii) the combined method achieves the best performance, improving average top-1 accuracy from 36.71% (CE) and 37.85% (SupCon2) to 39.08%, while incurring substantially lower computational overhead. These results suggest that classical clustering principles can be reinterpreted as differentiable objectives for deep learning, enabling efficient optimization of both local and global structure in representation spaces.

URL PDF HTML ☆

赞 0 踩 0

2604.08572 2026-04-13 cs.LG cs.CV

Ranked Activation Shift for Post-Hoc Out-of-Distribution Detection

Gianluca Guglielmo, Marc Masana

Comments Code is available at https://github.com/gigug/RAS

2604.08569 2026-04-13 cs.LG

Memory-Guided Trust-Region Bayesian Optimization (MG-TuRBO) for High Dimensions

Abhilasha Saroj, Shaked Regev, Guanhao Xu, Jinghui Yuan, Roy Luo, Ross Wang

2604.08566 2026-04-13 cs.CL cs.LG

Sentiment Classification of Gaza War Headlines: A Comparative Analysis of Large Language Models and Arabic Fine-Tuned BERT Models

Amr Eleraqi, Hager H. Mustafa, Abdul Hadi N. Ahmed

Comments 45 pages, 6 figures (including diagrams), 8 tables. Dataset available at this https URL . Previously posted at https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/FFENX3

详情

DOI: 10.2139/ssrn.6332158
Journal ref: SSRN (2026)

英文摘要

This study examines how different artificial intelligence architectures interpret sentiment in conflict-related media discourse, using the 2023 Gaza War as a case study. Drawing on a corpus of 10,990 Arabic news headlines (Eleraqi 2026), the research conducts a comparative analysis between three large language models and six fine-tuned Arabic BERT models. Rather than evaluating accuracy against a single human-annotated gold standard, the study adopts an epistemological approach that treats sentiment classification as an interpretive act produced by model architectures. To quantify systematic differences across models, the analysis employs information-theoretic and distributional metrics, including Shannon Entropy, Jensen-Shannon Distance, and a Variance Score measuring deviation from aggregate model behavior. The results reveal pronounced and non-random divergence in sentiment distributions. Fine-tuned BERT models, particularly MARBERT, exhibit a strong bias toward neutral classifications, while LLMs consistently amplify negative sentiment, with LLaMA-3.1-8B showing near-total collapse into negativity. Frame-conditioned analysis further demonstrates that GPT-4.1 adjusts sentiment judgments in line with narrative frames (e.g., humanitarian, legal, security), whereas other LLMs display limited contextual modulation. These findings suggest that the choice of model constitutes a choice of interpretive lens, shaping how conflict narratives are algorithmically framed and emotionally evaluated. The study contributes to media studies and computational social science by foregrounding algorithmic discrepancy as an object of analysis and by highlighting the risks of treating automated sentiment outputs as neutral or interchangeable measures of media tone in contexts of war and crisis.

URL PDF HTML ☆

赞 0 踩 0

2604.08565 2026-04-13 cs.CL cs.AI cs.LG

Dynamic sparsity in tree-structured feed-forward layers at scale

Reza Sedghi, Robin Schiewer, Anand Subramoney, David Kappel

2604.08563 2026-04-13 cs.CL cs.AI cs.LG

Temperature-Dependent Performance of Prompting Strategies in Extended Reasoning Large Language Models

Mousa Salah, Amgad Muneer

Comments 3 Figures, 2 Tables

2604.08562 2026-04-13 cs.CL cs.AI cs.SD eess.AS

Neural networks for Text-to-Speech evaluation

Ilya Trofimenko, David Kocharyan, Aleksandr Zaitsev, Pavel Repnikov, Mark Levin, Nikita Shevtsov

2604.08561 2026-04-13 cs.CL cs.LG

A Representation-Level Assessment of Bias Mitigation in Foundation Models

Svetoslav Nizhnichenkov, Rahul Nair, Elizabeth Daly, Brian Mac Namee

Comments Accepted at ECML-PKDD 2025 (5th Workshop on Bias and Fairness in AI)

2604.08560 2026-04-13 cs.CL cs.AI

Uncertainty Estimation for the Open-Set Text Classification systems

Leonid Erlygin, Alexey Zaytsev

2604.08559 2026-04-13 cs.CL cs.AI

Medical Reasoning with Large Language Models: A Survey and MR-Bench

Xiaohan Ren, Chenxiao Fan, Wenyin Ma, Hongliang He, Chongming Gao, Xiaoyan Zhao, Fuli Feng

2604.08558 2026-04-13 cs.CL cs.AI

WAND: Windowed Attention and Knowledge Distillation for Efficient Autoregressive Text-to-Speech Models

Hanna Lee, Tan Dat Nguyen, Jaehoon Kang, Kyuhong Shim

Comments Submitted to Interspeech 2026

2604.08556 2026-04-13 cs.CL cs.AI

EMA Is Not All You Need: Mapping the Boundary Between Structure and Content in Recurrent Context

Arth Singh

Comments 10 pages, 1 figure, 7 tables

2604.08555 2026-04-13 cs.CL

SynDocDis: A Metadata-Driven Framework for Generating Synthetic Physician Discussions Using Large Language Models

Beny Rubinstein, Sergio Matos

详情

DOI: 10.1007/978-3-032-05176-9_24
Journal ref: In: Valente de Oliveira, J., Leite, J., Rodrigues, J., Dias, J., Cardoso, P. (eds) Progress in Artificial Intelligence. EPIA 2025. Lecture Notes in Computer Science(), vol 16121. Springer, Cham

英文摘要

Physician-physician discussions of patient cases represent a rich source of clinical knowledge and reasoning that could feed AI agents to enrich and even participate in subsequent interactions. However, privacy regulations and ethical considerations severely restrict access to such data. While synthetic data generation using Large Language Models offers a promising alternative, existing approaches primarily focus on patient-physician interactions or structured medical records, leaving a significant gap in physician-to-physician communication synthesis. We present SynDocDis, a novel framework that combines structured prompting techniques with privacy-preserving de-identified case metadata to generate clinically accurate physician-to-physician dialogues. Evaluation by five practicing physicians in nine oncology and hepatology scenarios demonstrated exceptional communication effectiveness (mean 4.4/5) and strong medical content quality (mean 4.1/5), with substantial interrater reliability (kappa = 0.70, 95% CI: 0.67-0.73). The framework achieved 91% clinical relevance ratings while maintaining doctors' and patients' privacy. These results place SynDocDis as a promising framework for advancing medical AI research ethically and responsibly through privacy-compliant synthetic physician dialogue generation with direct applications in medical education and clinical decision support.

URL PDF HTML ☆

赞 0 踩 0

2604.08554 2026-04-13 cs.CL cs.AI

Drift and selection in LLM text ecosystems

Søren Riis

2604.08553 2026-04-13 cs.LG cs.AI cs.CL

GNN-as-Judge: Unleashing the Power of LLMs for Graph Learning with GNN Feedback

Ruiyao Xu, Kaize Ding

Comments ICLR 2026

2604.08548 2026-04-13 cs.CV

ETCH-X: Robustify Expressive Body Fitting to Clothed Humans with Composable Datasets

Xiaoben Li, Jingyi Wu, Zeyu Cai, Siyuan Yu, Boqian Li, Yuliang Xiu

Comments Page: https://xiaobenli00.github.io/ETCH-X/, Code: https://github.com/XiaobenLi00/ETCH-X

2604.08544 2026-04-13 cs.RO cs.AI cs.CV

SIM1: Physics-Aligned Simulator as Zero-Shot Data Scaler in Deformable Worlds

Yunsong Zhou, Hangxu Liu, Xuekun Jiang, Xing Shen, Yuanzhen Zhou, Hui Wang, Baole Fang, Yang Tian, Mulin Yu, Qiaojun Yu, Li Ma, Hengjie Li, Hanqing Wang, Jia Zeng, Jiangmiao Pang

Comments Website: https://internrobotics.github.io/sim1.github.io/

2604.08357 2026-04-13 cs.LG

Bias-Constrained Diffusion Schedules for PDE Emulations: Reconstruction Error Minimization and Efficient Unrolled Training

Constantin Le Cleï, Nils Thuerey, Xiaoxiang Zhu

2604.08355 2026-04-13 cs.AI

ASPECT:Analogical Semantic Policy Execution via Language Conditioned Transfer

Ajsal Shereef Palattuparambil, Thommen George Karimpanal, Santu Rana

2604.08287 2026-04-13 cs.CV

CAMotion: A High-Quality Benchmark for Camouflaged Moving Object Detection in the Wild

Siyuan Yao, Hao Sun, Ruiqi Yu, Xiwei Jiang, Wenqi Ren, Xiaochun Cao

Comments Under review