arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.06594 2026-05-08 cs.CL

Automated Clinical Report Generation for Remote Cognitive Remediation: Comparing Knowledge-Engineered Templates and LLMs in Low-Resource Settings

Yongxin Zhou, Fabien Ringeval, François Portet

详情

英文摘要

The growing demand for cognitive remediation therapy, combined with limited speech therapist availability, has accelerated the adoption of remote rehabilitation tools. These systems generate large volumes of interaction data that are difficult for clinicians to review efficiently. This paper investigates automated clinical report generation for avatar-guided, home-based cognitive remediation sessions in a low-resource setting with no reference reports. We present and compare two approaches: (1) a rule-based template system encoding speech therapy domain knowledge as explicit decision rules and validated templates, ensuring clinical reliability and traceability; and (2) a zero-shot LLM-based approach (GPT-4) aimed at more fluent and concise output. Both systems use identical pre-extracted, expert-validated structured variables, enabling a controlled factual comparison. Outputs were evaluated by eight speech therapists and final-year students using a nine-criterion questionnaire. Results reveal a clear trade-off between clinical reliability and linguistic quality. The template-based system scored higher on fluidity, coherence, and results presentation, while GPT-4 produced more concise output. Directional differences are consistent across evaluation dimensions, though no comparison reached statistical significance after correction, reflecting the scale constraints of expert clinical evaluation. Based on evaluator feedback, we derive eight design recommendations for clinical reporting systems in remote rehabilitation settings. More broadly, this work contributes a replicable methodology combining expert elicitation, taxonomy-driven generation, and multi-dimensional human evaluation for clinical NLG in low-resource settings, and illustrates how controlled comparisons can inform the responsible adoption of generative AI in healthcare.

URL PDF HTML ☆

赞 0 踩 0

2605.06593 2026-05-08 cs.RO cs.GR cs.LG

ReActor: Reinforcement Learning for Physics-Aware Motion Retargeting

David Müller, Agon Serifi, Sammy Christen, Ruben Grandia, Espen Knoop, Moritz Bächer

Comments SIGGRAPH 2026

2605.06592 2026-05-08 cs.CV cs.AI cs.LG

DINORANKCLIP: DINOv3 Distillation and Injection for Vision-Language Pretraining with High-Order Ranking Consistency

Shuyang Jiang, Nan Yu, Yiming Zhang, Zenghui Ding, Zhenyu Wu

Comments 18 pages, 7 figures, 9 tables. Code will be made publicly available upon acceptance

2605.06591 2026-05-08 cs.LG hep-ph

BRICKS: Compositional Neural Markov Kernels for Zero-Shot Radiation-Matter Simulation

Richard Hildebrandt, Evangelos Kourlitis, Baran Hashemi, Manuel Bünstorf, Thierry Meyer, Nikola Boskov, Michael Kagan, Dan Rosenbaum, Sanmay Ganguly, Lukas Heinrich

Comments 10 pages, 5 figures

2605.06588 2026-05-08 cs.LG cs.AI

Towards Metric-Faithful Neural Graph Matching

Jyotirmaya Shivottam, Subhankar Mishra

2605.06585 2026-05-08 cs.LG math.OC

Distributionally-Robust Learning to Optimize

Vinit Ranjan, Jisun Park, Bartolomeo Stellato

2605.06584 2026-05-08 cs.AI

NeuroAgent: LLM Agents for Multimodal Neuroimaging Analysis and Research

Lujia Zhong, Yihao Xia, Jianwei Zhang, Shuo huang, Jiaxin Yue, Mingyang Xia, Yonggang Shi

详情

英文摘要

Multimodal neuroimaging analysis often involves complex, modality-specific preprocessing workflows that require careful configuration, quality control, and coordination across heterogeneous toolchains. Beyond preprocessing, downstream statistical analysis and disease classification commonly require task-specific code, evaluation protocols, and data-format conventions, creating additional barriers between raw acquisitions and reproducible scientific analysis. We present NeuroAgent, an LLM-driven agentic framework that automates key preprocessing and analysis steps for heterogeneous neuroimaging data, including sMRI, fMRI, dMRI, and PET, and supports interactive downstream analysis through natural-language queries. NeuroAgent employs a hierarchical multi-agent architecture with a feedback-driven Generate-Execute-Validate engine: agents autonomously generate executable preprocessing code, detect and recover from runtime errors, and validate output integrity. We evaluate the system on 1,470 subjects pooled across all ADNI phases (CN=1,000, AD=470), where all subjects have sMRI and tabular data, with subsets also having Tau-PET (n=469), fMRI (n=278), and DTI ($n=620$). Pipeline ablation studies across multiple LLM backends show that capable models reach up to 100% intent-parsing accuracy, with the strongest backend (Qwen3.5-27B) reaching 84.8% end-to-end preprocessing step correctness. Automated recovery limits manual intervention to edge cases where human review is required via the Human-In-The-Loop interface. For Alzheimer's Disease classification using automatically preprocessed multimodal data, our agent ensemble achieves an AUC of 0.9518 with four modalities, outperforming all single-modality baselines. These results show that NeuroAgent can reduce the manual effort required for neuroimaging preprocessing and enable end-to-end automated analysis pipelines for neuroimaging research.

URL PDF HTML ☆

赞 0 踩 0

2605.06583 2026-05-08 cs.AI

Improved techniques for fine-tuning flow models via adjoint matching: a deterministic control pipeline

Zhengyi Guo, Jiayuan Sheng, David D. Yao, Wenpin Tang

2605.06576 2026-05-08 cs.LG

On the Safety of Graph Representation Learning

Xiaoguang Guo, Zehong Wang, Ziming Li, Shawn Spitzel, Soonwoo Kwon, Tianyi Ma, Yanfang Ye, Chuxu Zhang

Comments Preprint. 10 pages main text, appendices included

2605.06575 2026-05-08 cs.LG cs.AI

Directional Consistency as a Complementary Optimization Signal: The GONO Framework

Victor Daniel Gera

2605.06572 2026-05-08 cs.CV cs.NA math.NA

Solving Minimal Problems Without Matrix Inversion Using FFT-Based Interpolation

Haidong Wu, Snehal Bhayani, Janne Heikkilä

Comments Accepted to CVPR 2026

2605.06571 2026-05-08 cs.LG cs.CR cs.DC cs.NI

CLAD: A Clustered Label-Agnostic Federated Learning Framework for Joint Anomaly Detection and Attack Classification

Iason Ofeidis, Nikos Papadis, Randeep Bhatia, Leandros Tassiulas, TV Lakshman

Comments 12 pages, 7 figures, 5 tables

2605.06570 2026-05-08 cs.LG math.OC q-fin.CP q-fin.MF q-fin.RM

SNAPO: Smooth Neural Adjoint Policy Optimization for Optimal Control via Differentiable Simulation

Dmitri Goloubentsev, Natalija Karpichina

Comments 27 pages, 8 tables. Three domains: natural gas storage, pension fund ALM, pharmaceutical manufacturing. Benchmark code and trained policies available on request

2605.06562 2026-05-08 cs.LG q-bio.GN

Feature Dimensionality Outweighs Model Complexity in Breast Cancer Subtype Classification Using TCGA-BRCA Gene Expression Data

Meena Al Hasani

Comments 8 pages, 4 figures, 3 tables. Independent research study using TCGA-BRCA RNA-seq data

2605.06561 2026-05-08 cs.LG

Optimal Counterfactual Search in Tree Ensembles: A Study Across Modeling and Solution Paradigms

Awa Khouna, Youssouf Emine, Julien Ferry, Thibaut Vidal

详情

英文摘要

Trust in counterfactual explanations depends critically on whether their recommended changes are truly minimal: suboptimal explanations may vastly overshoot the actual changes needed to alter a decision, and heuristic errors can affect individuals unevenly, giving some users relevant recourse while assigning others unnecessarily costly recommendations. Consequently, we study the problem of computing optimal counterfactual explanations for tree ensembles under plausibility and actionability constraints. This is a combinatorial problem: for a fixed model, counterfactual search boils down to selecting consistent branching decisions and threshold-defined regions under a distance objective. We exploit this structure through CPCF, a constraint programming (CP) formulation in which numerical features are encoded as interval domains induced by split thresholds, while discrete features retain native finite-domain representations. This yields a compact finite-domain formulation that supports multiple distance objectives without continuous split-boundary search. We then place CPCF in a broader comparison across mathematical programming paradigms: we extend a maximum Boolean satisfiability (MaxSAT) formulation, originally designed for hard-voting random forests, to soft-voting ensembles, and compare against the current state-of-the-art mixed-integer linear programming (MILP) optimal approach. Across ten datasets and three types of tree ensembles, we analyze scalability, anytime performance, and sensitivity to distance metrics. We observe that CP achieves the best overall performance. More importantly, our results identify regimes in which the specific strengths of each paradigm make it best suited: CP is most versatile overall, MaxSAT handles hard-voting ensembles particularly well, and MILP remains competitive in amortized inference settings with a moderate number of split levels.

URL PDF HTML ☆

赞 0 踩 0

2605.06554 2026-05-08 cs.CL

Long Context Pre-Training with Lighthouse Attention

Bowen Peng, Subho Ghosh, Jeffrey Quesnelle

Comments 18 pages, 4 figures, 4 tables

2605.06553 2026-05-08 cs.LG

Diverse Sampling in Diffusion Models with Marginal Preserving Particle Guidance

Gal Vinograd, Idan Achituve, Ethan Fetaya

Comments 9 pages, 4 figures

2605.06552 2026-05-08 cs.LG

Sequential Design of Genetic Circuits Under Uncertainty With Reinforcement Learning

Michal Kobiela, Diego A. Oyarzún, Michael U. Gutmann

2605.06548 2026-05-08 cs.CL cs.AI cs.CV

Continuous Latent Diffusion Language Model

Hongcan Guo, Qinyu Zhao, Yian Zhao, Shen Nie, Rui Zhu, Qiushan Guo, Feng Wang, Tao Yang, Hengshuang Zhao, Guoqiang Wei, Yan Zeng

Comments 99 pages, 31 figures, 9 tables. Project page: https://hongcanguo.github.io/Cola-DLM/

2605.06541 2026-05-08 cs.LG stat.ML

Hedging Memory Horizons for Non-Stationary Prediction via Online Aggregation

Yutong Wang, Yannig Goude, Qiwei Yao

Comments Preprint

2605.06540 2026-05-08 cs.AI cs.GT

Ex Ante Evaluation of AI-Induced Idea Diversity Collapse

Nafis Saami Azad, Raiyan Abdul Baten

2605.06538 2026-05-08 cs.LG

Diffusion-Based Posterior Sampling: A Feynman-Kac Analysis of Bias and Stability

Matias G. Delgadino, Sebastien Motsch, Advait Parulekar, William Porteous, Sanjay Shakkottai

2605.06537 2026-05-08 cs.CV

MedHorizon: Towards Long-context Medical Video Understanding in the Wild

Bodong Du, Bowen Liu, Yang Yu, Xinpeng Ding, Zhiheng Wu, Shuning Wang, Shuo Nie, Naiming Liu, Qifeng Chen, Yangqiu Song, Xiaomeng Li

2605.06535 2026-05-08 cs.CV cs.AI

Sparkle: Realizing Lively Instruction-Guided Video Background Replacement via Decoupled Guidance

Ziyun Zeng, Yiqi Lin, Guoqiang Liang, Mike Zheng Shou

Comments Tech Report. Project Page: https://showlab.github.io/Sparkle/

2605.06530 2026-05-08 cs.AI

SpatialEpiBench: Benchmarking Spatial Information and Epidemic Priors in Forecasting

Ruiqi Lyu, Alistair Turcan, Bryan Wilder

2605.06529 2026-05-08 cs.AI cs.LG

Market-Alignment Risk in Pricing Agents: Trace Diagnostics and Trace-Prior RL under Hidden Competitor State

Peiying Zhu, Sidi Chang

Comments 7 pages

2605.06527 2026-05-08 cs.CL

STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

Hanxiang Chao, Yihan Bai, Rui Sheng, Tianle Li, Yushi Sun

2605.06523 2026-05-08 cs.LG cs.AI

On the Implicit Reward Overfitting and the Low-rank Dynamics in RLVR

Hao Ye, Jisheng Dang, Junfeng Fang, Bimei Wang, Yizhou Zhang, Ning Lv, Wencan Zhang, Hong Peng, Bin Hu, Tat-Seng Chua

2605.06522 2026-05-08 cs.LG cs.CV

Agentic AIs Are the Missing Paradigm for Out-of-Distribution Generalization in Foundation Models

Xin Wang, Haibo Chen, Wenxuan Liu, Wenwu Zhu

Comments 13 pages, 2 figures

2605.06519 2026-05-08 cs.LG

Efficient Techniques for Data Reconstruction, with Finite-Width Recovery Guarantees

Edward Tansley, Roy Makhlouf, Estelle Massart, Coralia Cartis