arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.14010 2026-04-16 cs.LG cs.CL

Parameter Importance is Not Static: Evolving Parameter Isolation for Supervised Fine-Tuning

Zekai Lin, Chao Xue, Di Liang, Xingsheng Han, Peiyang Liu, Xianjie Wu, Lei Jiang, Yu Lu, Haibo Shi, Shuang Liang, Minlong Peng

详情

英文摘要

Supervised Fine-Tuning (SFT) of large language models often suffers from task interference and catastrophic forgetting. Recent approaches alleviate this issue by isolating task-critical parameters during training. However, these methods represent a static solution to a dynamic problem, assuming that parameter importance remains fixed once identified. In this work, we empirically demonstrate that parameter importance exhibits temporal drift over the course of training. To address this, we propose Evolving Parameter Isolation (EPI), a fine-tuning framework that adapts isolation decisions based on online estimates of parameter importance. Instead of freezing a fixed subset of parameters, EPI periodically updates isolation masks using gradient-based signals, enabling the model to protect emerging task-critical parameters while releasing outdated ones to recover plasticity. Experiments on diverse multi-task benchmarks demonstrate that EPI consistently reduces interference and forgetting compared to static isolation and standard fine-tuning, while improving overall generalization. Our analysis highlights the necessity of synchronizing isolation mechanisms with the evolving dynamics of learning diverse abilities.

URL PDF HTML ☆

赞 0 踩 0

2604.14004 2026-04-16 cs.AI cs.CL

Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents

Kangsan Kim, Minki Kang, Taeil Kim, Yanlai Yang, Mengye Ren, Sung Ju Hwang

Comments Preprint

2604.13995 2026-04-16 cs.CV

Depth-Aware Image and Video Orientation Estimation

Muhammad Z. Alam, Larry Stetsiuk, M. Umair Mukati, Zeeshan Kaleem

Comments 13 pages, 8 figures

2604.13994 2026-04-16 cs.CV

Remote Sensing Image Super-Resolution for Imbalanced Textures: A Texture-Aware Diffusion Framework

Enzhuo Zhang, Sijie Zhao, Dilxat Muhtar, Zhenshi Li, Xueliang Zhang, Pengfeng Xiao

Comments 10 pages, 5 figures, 9 Tables

2604.13993 2026-04-16 cs.AI cs.CL cs.CV

Reward Design for Physical Reasoning in Vision-Language Models

Derek Lilienthal, Manisha Mukherjee, Sameera Horawalavithana

详情

英文摘要

Physical reasoning over visual inputs demands tight integration of visual perception, domain knowledge, and multi-step symbolic inference. Yet even state-of-the-art Vision Language Models (VLMs) fall far short of human performance on physics benchmarks. While post-training algorithms such as Supervised Fine-Tuning (SFT) and Group Relative Policy Optimization (GRPO) have demonstrated strong reasoning gains in language models, how reward design shapes VLM physical reasoning behavior remains poorly understood. We present a systematic reward ablation study for GRPO-based VLM training on physical reasoning. We compare four reward signals of increasing semantic richness: format compliance, answer accuracy, a composite rubric reward (answer correctness, physics principle identification, and unit consistency), and a novel internal reward derived from model attention weights over input image regions. We evaluate on PhyX, a 3,000-problem benchmark spanning six physics domains and six reasoning types across multiple-choice and open-ended formats, using IBM Granite Vision 3.3 (2B). Across both formats, GRPO with accuracy-based rewards outperforms SFT on most domains, though gains vary substantially by reward type and domain. Reward design does not uniformly improve performance. Instead, it induces domain-specific reasoning behaviors. Accuracy-based rewards provide the strongest overall gains. Rubric rewards improve structured reasoning quality without consistent accuracy improvements. Attention-based rewards enhance spatial reasoning while degrading performance in symbolic domains. Our internal attention-weight reward requires no spatial annotations and improves spatial relation accuracy from 0.27 to 0.50, suggesting that supervising where the model attends during generation is a promising direction for visually grounded physical reasoning.

URL PDF HTML ☆

赞 0 踩 0

2604.13992 2026-04-16 cs.LG

Physics-Informed Neural Networks for Methane Sorption: Cross-Gas Transfer Learning, Ensemble Collapse Under Physics Constraints, and Monte Carlo Dropout Uncertainty Quantification

Mohammad Nooraiepour, Zezhang Song, Wei Li, Sarah Perez

详情

英文摘要

Accurate methane sorption prediction across heterogeneous coal ranks requires models that combine thermodynamic consistency, efficient knowledge transfer across data-scarce geological systems, and calibrated uncertainty estimates, capabilities that are rarely addressed together in existing frameworks. We present a physics-informed transfer learning framework that adapts a hydrogen sorption PINN to methane sorption prediction via Elastic Weight Consolidation, coal-specific feature engineering, and a three-phase curriculum that progressively balances transfer preservation with thermodynamic fine-tuning. Trained on 993 equilibrium measurements from 114 independent coal experiments spanning lignite to anthracite, the framework achieves R2 = 0.932 on held-out coal samples, a 227% improvement over pressure-only classical isotherms, while hydrogen pre-training delivers 18.9% lower RMSE and 19.4% faster convergence than random initialization. Five Bayesian uncertainty quantification approaches reveal a systematic divergence in performance across physics-constrained architectures. Monte Carlo Dropout achieves well-calibrated uncertainty at minimal overhead, while deep ensembles, regardless of architectural diversity or initialization strategy, exhibit performance degradation because shared physics constraints narrow the admissible solution manifold. SHAP and ALE analyses confirm that learned representations remain physically interpretable and aligned with established coal sorption mechanisms: moisture-volatile interactions are most influential, pressure-temperature coupling captures thermodynamic co-dependence, and features exhibit non-monotonic effects. These results identify Monte Carlo Dropout as the best-performing UQ method in this physics-constrained transfer learning framework, and demonstrate cross-gas transfer learning as a data-efficient strategy for geological material modeling.

URL PDF HTML ☆

赞 0 踩 0

2604.13991 2026-04-16 cs.CL cs.AI cs.LG

Adaptive Conformal Prediction for Improving Factuality of Generations by Large Language Models

Aleksandr Rubashevskii, Dzianis Piatrashyn, Preslav Nakov, Maxim Panov

2604.13988 2026-04-16 cs.LG cs.NA math.NA

Unsupervised domain transfer: Overcoming signal degradation in sleep monitoring by increasing scoring realism

Mohammad Ahangarkiasari, Andreas Tind Damgaard, Casper Haurum, Kaare B. Mikkelsen

2604.13981 2026-04-16 cs.CV

HiProto: Hierarchical Prototype Learning for Interpretable Object Detection Under Low-quality Conditions

Jianlin Xiang, Linhui Dai, Xue Yang, Chaolei Yang, Yanshan Li

Comments 9 pages, 9 figures

2604.13980 2026-04-16 cs.LG q-bio.QM stat.ML

BOAT: Navigating the Sea of In Silico Predictors for Antibody Design via Multi-Objective Bayesian Optimization

Jackie Rao, Ferran Gonzalez Hernandez, Leon Gerard, Alexandra Gessner

Comments Proceedings of the 29th International Conference on Artificial Intelligence and Statistics (AISTATS) 2026

2604.13979 2026-04-16 cs.CL cs.AI cs.DB

Leveraging LLM-GNN Integration for Open-World Question Answering over Knowledge Graphs

Hussein Abdallah, Ibrahim Abdelaziz, Panos Kalnis, Essam Mansour

Comments 18 pages,6 figures,10 tables. https://aclanthology.org/2026.eacl-long.26/

2604.13977 2026-04-16 cs.CL cs.AI cs.LG

How Can We Synthesize High-Quality Pretraining Data? A Systematic Study of Prompt Design, Generator Model, and Source Data

Joel Niklaus, Atsuki Yamaguchi, Michal Štefánik, Guilherme Penedo, Hynek Kydlíček, Elie Bakouch, Lewis Tunstall, Edward Emanuel Beeching, Thibaud Frere, Colin Raffel, Leandro von Werra, Thomas Wolf

2604.13970 2026-04-16 cs.CV

MApLe: Multi-instance Alignment of Diagnostic Reports and Large Medical Images

Felicia Bader, Philipp Seeböck, Anastasia Bartashova, Ulrike Attenberger, Georg Langs

Comments Accepted for MIDL 2026; Reviews available at https://openreview.net/forum?id=M8OO3CRbL9#discussion

2604.13966 2026-04-16 cs.LG

Provably Efficient Offline-to-Online Value Adaptation with General Function Approximation

Shangzhe Li, Weitong Zhang

Comments 44 pages, 2 tables

2604.13959 2026-04-16 cs.AI

[Emerging Ideas] Artificial Tripartite Intelligence: A Bio-Inspired, Sensor-First Architecture for Physical AI

You Rim Choi, Subeom Park, Hyung-Sin Kim

2604.13954 2026-04-16 cs.LG cs.AI

HINTBench: Horizon-agent Intrinsic Non-attack Trajectory Benchmark

Jiacheng Wang, Jinchang Hou, Fabian Wang, Ping Jian, Chenfu Bao, Zhonghou Lv

2604.13951 2026-04-16 cs.LG quant-ph

Quantum Machine Learning for Colorectal Cancer Data: Anastomotic Leak Classification and Risk Factors

Vojtěch Novák, Ivan Zelinka, Lenka Přibylová, Lubomír Martínek, Vladimír Benčurík, Martin Beseda

2604.13950 2026-04-16 cs.CL

Causal Drawbridges: Characterizing Gradient Blocking of Syntactic Islands in Transformer LMs

Sasha Boguraev, Kyle Mahowald

Comments 19 pages, 7 figures, 3 tables

2604.13947 2026-04-16 cs.CV

Heuristic Style Transfer for Real-Time, Efficient Weather Attribute Detection

Hamed Ouattara, Pierre Duthon, Pascal Houssam Salmane, Frédéric Bernardin, Omar Ait Aider

Comments 32 pages, 18 figures

2604.13942 2026-04-16 cs.RO

Goal2Skill: Long-Horizon Manipulation with Adaptive Planning and Reflection

Zhen Liu, Xinyu Ning, Zhe Hu, Xinxin Xie, Weize Li, Zhipeng Tang, Chongyu Wang, Zejun Yang, Hanlin Wang, Yitong Liu, Zhongzhu Pu

2604.13941 2026-04-16 cs.CV

SceneGlue: Scene-Aware Transformer for Feature Matching without Scene-Level Annotation

Songlin Du, Xiaoyong Lu, Yaping Yan, Guobao Xiao, Xiaobo Lu, Takeshi Ikenaga

2604.13940 2026-04-16 cs.AI

AI-Assisted Peer Review at Scale: The AAAI-26 AI Review Pilot

Joydeep Biswas, Sheila Schoepp, Gautham Vasan, Anthony Opipari, Arthur Zhang, Zichao Hu, Sebastian Joseph, Matthew Lease, Junyi Jessy Li, Peter Stone, Kiri L. Wagstaff, Matthew E. Taylor, Odest Chadwicke Jenkins

2604.13939 2026-04-16 cs.CV

A Multi-Stage Optimization Pipeline for Bethesda Cell Detection in Pap Smear Cytology

Martin Amster, Camila María Polotto

Comments ISBI 2026 Accepted Paper & Second Place Solution for the RIVA Cervical Cytology Challenge Track B

2604.13938 2026-04-16 cs.CV

ASTRA: Enhancing Multi-Subject Generation with Retrieval-Augmented Pose Guidance and Disentangled Position Embedding

Tianze Xia, Zijian Ning, Zonglin Zhao, Mingjia Wang

2604.13928 2026-04-16 cs.LG

Unsupervised Anomaly Detection in Process-Complex Industrial Time Series: A Real-World Case Study

Sergej Krasnikov, Lukas Meitz, Samineh Bagheri, Michael Heider, Thorsten Schöler, Jörg Hähner

2604.13918 2026-04-16 cs.CV

PartNerFace: Part-based Neural Radiance Fields for Animatable Facial Avatar Reconstruction

Xianggang Yu, Lingteng Qiu, Xiaohang Ren, Guanying Chen, Shuguang Cui, Xiaoguang Han, Baoyuan Wang

2604.13906 2026-04-16 cs.CV

Blind Bitstream-corrupted Video Recovery via Metadata-guided Diffusion Model

Shuyun Wang, Hu Zhang, Xin Shen, Dadong Wang, Xin Yu

Comments CVPR 2025

2604.13905 2026-04-16 cs.CV

Rethinking Image-to-3D Generation with Sparse Queries: Efficiency, Capacity, and Input-View Bias

Zhiyuan Xu, Jiuming Liu, Yuxin Chen, Masayoshi Tomizuka, Chenfeng Xu, Chensheng Peng

Comments Code is available at https://github.com/Pixtella/SparseGen

2604.13902 2026-04-16 cs.LG

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Xiaofan Li, Ming Yang, Zhiyuan Ma, Shichao Ma, Jintao Du, Yu Cheng, Weiqiang Wang, Zhizhong Zhang, Xin Tan, Yanyun Qu, Lizhuang Ma, Yuan Xie

Comments LLM Reinforce Learning

2604.13897 2026-04-16 cs.LG physics.comp-ph

MolCryst-MLIPs: A Machine-Learned Interatomic Potentials Database for Molecular Crystals

Adam Lahouari, Shen Ai, Jihye Han, Jillian Hoffstadt, Philipp Hoellmer, Charlotte Infante, Pulkita Jain, Sangram Kadam, Maya M. Martirossyan, Amara McCune, Hypatia Newton, Shlok J. Paul, Willmor Pena, Jonathan Raghoonanan, Sumon Sahu, Oliver Tan, Andrea Vergara, Jutta Rogal, Mark E. Tuckerman