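arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems science, and other fields.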

2509.09936 2026-04-15 cs.LG cs.NA math.NA

SciML Agents: Write the Solver, Not the Solution

Saarth Gaonkar, Xiang Zheng, Haocheng Xi, Rishabh Tiwari, Kurt Keutzer, Dmitriy Morozov, Michael W. Mahoney, Amir Gholami

Journal ref: NeurIPS 2025 Math-AI Workshop

English abstract

Recent work in scientific machine learning aims to tackle scientific tasks directly by predicting target values with neural networks (e.g., physics-informed neural networks, neural ODEs, neural operators, etc.), but attaining high accuracy and robustness has been challenging. We explore an alternative view: use LLMs to write code that leverages decades of numerical algorithms. This shifts the burden from learning a solution function to making domain-aware numerical choices. We ask whether LLMs can act as SciML agents that, given a natural-language ODE description, generate runnable code that is scientifically appropriate, selecting suitable solvers (stiff vs. non-stiff), and enforcing stability checks. There is currently no benchmark to measure this kind of capability for scientific computing tasks. As such, we first introduce two new datasets: a diagnostic dataset of adversarial "misleading" problems; and a large-scale benchmark of 1,000 diverse ODE tasks. The diagnostic set contains problems whose superficial appearance suggests stiffness, and that require algebraic simplification to demonstrate non-stiffness; and the large-scale benchmark spans stiff and non-stiff ODE regimes. We evaluate open- and closed-source LLM models along two axes: (i) unguided versus guided prompting with domain-specific knowledge; and (ii) off-the-shelf versus fine-tuned variants. Our evaluation measures both executability and numerical validity against reference solutions. We find that with sufficient context and guided prompts, newer instruction-following models achieve high accuracy on both criteria. In many cases, recent open-source systems perform strongly without fine-tuning, while older or smaller models still benefit from fine-tuning. Overall, our preliminary results indicate that careful prompting and fine-tuning can yield a specialized LLM agent capable of reliably solving simple ODE problems.
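The stiff versus non-stiff solver choice that the abstract describes can be illustrated with a minimal sketch on the linear test equation y' = λy. This is not the paper's agent or benchmark code: the function names, the test equation, and the selection rule (use forward Euler only when its stability condition |1 + hλ| ≤ 1 holds, otherwise fall back to backward Euler) are assumptions made purely for illustration.

```python
import math

def explicit_euler(lam, y0, h, steps):
    """Forward Euler for y' = lam * y."""
    y = y0
    for _ in range(steps):
        y = y + h * lam * y
    return y

def implicit_euler(lam, y0, h, steps):
    """Backward Euler for y' = lam * y (closed-form update in the linear case)."""
    y = y0
    for _ in range(steps):
        y = y / (1.0 - h * lam)
    return y

def choose_method(lam, h):
    """Hypothetical rule: forward Euler is stable only if |1 + h*lam| <= 1."""
    return "explicit" if abs(1.0 + h * lam) <= 1.0 else "implicit"

def solve(lam, y0, h, steps):
    """Pick a stepper with the stability check, then integrate."""
    method = choose_method(lam, h)
    stepper = explicit_euler if method == "explicit" else implicit_euler
    return method, stepper(lam, y0, h, steps)

if __name__ == "__main__":
    for lam in (-1.0, -1000.0):
        method, y_end = solve(lam, y0=1.0, h=0.01, steps=100)
        print(lam, method, y_end, math.exp(lam * 1.0))
```

With step size 0.01, the mild case λ = -1 passes the stability check and uses the explicit scheme, while the stiff case λ = -1000 is routed to the implicit scheme; forcing forward Euler on the stiff case makes the iterate grow without bound instead of decaying.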

2509.05288 2026-04-15 cs.LG math.OC

Learning to accelerate distributed ADMM using graph neural networks

Henri Doerks, Paul Häusner, Daniel Hernández Escobar, Jens Sjölund

Comments Learning for Dynamics and Control Conference (L4DC), the first two authors contributed equally

2508.18187 2026-04-15 cs.CV cs.AI

BRAIN: Bias-Mitigation Continual Learning Approach to Vision-Brain Understanding

Xuan-Bac Nguyen, Thanh-Dat Truong, Pawan Sinha, Khoa Luu

2508.17403 2026-04-15 cs.LG stat.AP

Mutual Information Surprise: Rethinking Unexpectedness in Autonomous Systems

Yinsong Wang, Quan Zeng, Xiao Liu, Yu Ding

Comments Pre-Submission Version

2508.12260 2026-04-15 cs.AI q-bio.QM

Mantis: A Foundation Model for Mechanistic Disease Forecasting

Carson Dudley, Reiden Magdaleno, Christopher Harding, Ananya Sharma, Emily Martin, Marisa Eisenberg

Comments 11 pages, 4 figures

2508.05461 2026-04-15 cs.CV

Time-reversed Flow Matching with Worst Transport in High-dimensional Latent Space for Image Anomaly Detection

Liangwei Li, Lin Liu, Hanzhe Liang, Juanxiu Liu, Jing Zhang, Ruqian Hao, Xiaohui Du, Yong Liu, Pan Li

2508.01620 2026-04-15 cs.LG cs.CR cs.CV

IMU: Influence-guided Machine Unlearning

Xindi Fan, Jing Wu, Mingyi Zhou, Pengwei Liang, Mehrtash Harandi, Dinh Phung

2507.22767 2026-04-15 cs.LG cs.AI

Teaching the Teacher: The Role of Teacher-Student Smoothness Alignment in Genetic Programming-based Symbolic Distillation

Soumyadeep Dhar, Kei Sen Fong, Mehul Motani

Comments camera-ready version, accepted at GECCO 2026

2507.13647 2026-04-15 cs.RO cs.AI

Improved particle swarm optimization algorithm: multi-target trajectory optimization for swarm drones

Minze Li, Wei Zhao, Ran Chen, Mingqiang Wei

Comments New experiments have revealed systematic errors in the original data

2507.11081 2026-04-15 cs.CV cs.AI

Automatic Road Subsurface Distress Recognition from Ground Penetrating Radar Images using Deep Learning-based Cross-verification

Chang Peng, Bao Yang, Meiqi Li, Ge Zhang, Hui Sun, Zhenyu Jiang

2507.01041 2026-04-15 cs.LG cs.AI

Fast AI Model Partition for Split Learning over Edge Networks

Zuguang Li, Wen Wu, Shaohua Wu, Xuemin Shen

Comments This version lacks sufficient detail in key technical parts, including the equivalence proof for the s-t cut transformation and the computational complexity analysis (Sections VI-D). We are withdrawing it to prepare a revised, more complete manuscript

2506.23104 2026-04-15 cs.CV

DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation

Jihun Kim, Hoyong Kwon, Hyeokjun Kweon, Wooseong Jeong, Kuk-Jin Yoon

Comments accepted at ICCV 2025

2506.14512 2026-04-15 cs.CV

SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks

Zijian Song, Xiaoxin Lin, Qiuming Huang, Sihan Qin, Guangrun Wang, Liang Lin

Comments 20 pages, 11 figures

2506.14092 2026-04-15 cs.AI

Fragile Preferences: A Deep Dive Into Order Effects in Large Language Models

Haonan Yin, Shai Vardi, Vidyanand Choudhary

2506.00239 2026-04-15 cs.AI

SmellNet: A Large-scale Dataset for Real-world Smell Recognition

Dewei Feng, Wei Dai, Carol Li, Alistair Pernigo, Yunge Wen, Paul Pu Liang

Comments Accepted to ICLR 2026; published as a conference paper at ICLR 2026. 32 pages; 21 figures

2505.19261 2026-04-15 cs.CV cs.AI

Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning

Yu Zhang, Jialei Zhou, Xinchen Li, Qi Zhang, Zhongwei Wan, Tianyu Wang, Duoqian Miao, Changwei Wang, Longbing Cao

Comments NeurIPS 2025

2505.17086 2026-04-15 cs.CL

Advancing Multi-Agent RAG Systems with Minimalist Reinforcement Learning

Yihong Wu, Liheng Ma, Muzhi Li, Jiaming Zhou, Lei Ding, Jianye Hao, Ho-fung Leung, Irwin King, Yingxue Zhang, Jian-Yun Nie

Comments AAMAS 2026

2505.15467 2026-04-15 cs.CL cs.AI

Joint Flashback Adaptation for Forgetting-Resistant Instruction Tuning

Yukun Zhao, Lingyong Yan, Zhenyang Li, Shuaiqiang Wang, Zhumin Chen, Zhaochun Ren, Dawei Yin

Comments The experimental setting is wrong, i.e., not a real continual learning setting

2505.14264 2026-04-15 cs.LG cs.CL

AAPO: Enhancing the Reasoning Capabilities of LLMs with Advantage Margin

Jian Xiong, Jingbo Zhou, Jingyong Ye, Qiang Huang, Dejing Dou

Comments Accepted to ACL 2026 Main Conference

2505.14129 2026-04-15 cs.RO

Unconventional Hexacopters via Evolution and Learning: Performance Gains and New Insights

Jed Muff, Keiichi Ito, Elijah H. W. Ang, Karine Miras, A. E. Eiben

Comments 16 pages, 14 figures. Published in EvoStar 2026. Code: https://github.com/JedMuff/airevolve. Videos: https://www.youtube.com/watch?list=PL5oQiyJFx4qM9Hzs2asyoGbJo9TuO4sPS&v=playlist&feature=youtu.be

2504.02169 2026-04-15 cs.LG cs.AI math.ST stat.ML stat.TH

On the Geometry of Receiver Operating Characteristic and Precision-Recall Curves

Reza Sameni

2503.24135 2026-04-15 cs.CV

PixelCAM: Pixel Class Activation Mapping for Histology Image Classification and ROI Localization

Alexis Guichemerre, Soufiane Belharbi, Mohammadhadi Shateri, Luke McCaffrey, Eric Granger

Comments 43 pages, 24 figures, Medical Imaging with Deep Learning (MIDL 2025)

English abstract

Weakly supervised object localization (WSOL) methods allow training models to classify images and localize ROIs. WSOL only requires low-cost image-class annotations yet provides a visually interpretable classifier. Standard WSOL methods rely on class activation mapping (CAM) methods to produce spatial localization maps according to a single- or two-step strategy. While both strategies have made significant progress, they still face several limitations with histology images. Single-step methods can easily result in under- or over-activation due to the limited visual ROI saliency in histology images and scarce localization cues. They also face the well-known issue of asynchronous convergence between classification and localization tasks. The two-step approach is sub-optimal because it is constrained to a frozen classifier, limiting the capacity for localization. Moreover, these methods also struggle when applied to out-of-distribution (OOD) datasets. In this paper, a multi-task approach for WSOL is introduced for simultaneous training of both tasks to address the asynchronous convergence problem. In particular, localization is performed in the pixel-feature space of an image encoder that is shared with classification. This allows learning discriminant features and accurate delineation of foreground/background regions to support ROI localization and image classification. We propose PixelCAM, a cost-effective foreground/background pixel-wise classifier in the pixel-feature space that allows for spatial object localization. Using partial-cross entropy, PixelCAM is trained using pixel pseudo-labels collected from a pretrained WSOL model. Both image and pixel-wise classifiers are trained simultaneously using standard gradient descent. In addition, our pixel classifier can easily be integrated into CNN- and transformer-based architectures without any modifications.
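The partial cross-entropy mentioned in the abstract can be sketched as a cross-entropy averaged only over pixels that carry a pseudo-label, with unlabeled pixels skipped. This is a generic illustration, not the authors' implementation: the `IGNORE` marker, the label convention (0 = background, 1 = foreground), and the list-based data layout are all assumptions.

```python
import math

IGNORE = -1  # hypothetical marker for pixels without a pseudo-label

def softmax(logits):
    """Numerically stable softmax over one pixel's class logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def partial_cross_entropy(pixel_logits, pseudo_labels):
    """Mean cross-entropy over pseudo-labeled pixels only."""
    losses = []
    for logits, label in zip(pixel_logits, pseudo_labels):
        if label == IGNORE:
            continue  # unlabeled pixels contribute nothing to the loss
        probs = softmax(logits)
        losses.append(-math.log(probs[label]))
    return sum(losses) / len(losses) if losses else 0.0

if __name__ == "__main__":
    # Three pixels, two classes (assumed: 0 = background, 1 = foreground).
    logits = [[0.0, 0.0], [5.0, -5.0], [2.0, 1.0]]
    labels = [0, IGNORE, 1]
    print(partial_cross_entropy(logits, labels))
```

Only the first and third pixels enter the average here; the second pixel's logits have no effect on the loss because it has no pseudo-label.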

2503.14333 2026-04-15 cs.LG cs.AI q-bio.NC

Characterizing higher-order representations through generative diffusion models explains human decoded neurofeedback performance

Hojjat Azimi Asrari, Megan A. K. Peters

Comments 25 pages, 7 figures

2503.10676 2026-04-15 cs.CL cs.AI cs.LG

Fine-Tuning LLMs for Report Summarization: Analysis on Supervised and Unsupervised Data

Swati Rallapalli, Shannon Gallagher, Andrew O. Mellinger, Jasmine Ratchford, Anusha Sinha, Tyler Brooks, William R. Nichols, Nick Winski, Bryan Brown

2502.18321 2026-04-15 cs.LG

Global-Decision-Focused Neural ODEs for Proactive Grid Resilience Management

Shuyi Chen, Ferdinando Fioretto, Feng Qiu, Shixiang Zhu

2502.17403 2026-04-15 cs.LG cs.AI cs.CL

Large Language Models are Powerful Electronic Health Record Encoders

Stefan Hegselmann, Georg von Arnim, Tillmann Rheude, Noel Kronenberg, David Sontag, Gerhard Hindricks, Roland Eils, Benjamin Wild

2502.11271 2026-04-15 cs.LG cs.CL cs.CV cs.MA

OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning

Pan Lu, Bowen Chen, Sheng Liu, Rahul Thapa, Joseph Boen, James Zou

Comments 88 pages, 18 figures. Accepted to ACL 2026

2501.17518 2026-04-15 cs.LG cs.AI

RegD: Hierarchical Embeddings via Dissimilarity between Arbitrary Euclidean Regions

Hui Yang, Jiaoyan Chen

2501.16154 2026-04-15 cs.CL cs.AI

AdaMCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Multilingual Chain-of-Thought

Weihua Zheng, Xin Huang, Zhengyuan Liu, Tarun Kumar Vangani, Bowei Zou, Xiyan Tao, Yuhao Wu, Ai Ti Aw, Nancy F. Chen, Roy Ka-Wei Lee

Comments AAAI 2026

2501.06268 2026-04-15 cs.LG stat.ME stat.ML

Clustering with Uniformity- and Neighbor-Based Random Geometric Graphs

Rui Shi, Elvan Ceyhan, Nedret Billor