arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.17191 2026-03-19 cs.CL cs.LG q-bio.QM

Tabular LLMs for Interpretable Few-Shot Alzheimer's Disease Prediction with Multimodal Biomedical Data

Sophie Kearney, Shu Yang, Zixuan Wen, Weimin Lyu, Bojian Hou, Duy Duong-Tran, Tianlong Chen, Jason H. Moore, Marylyn D. Ritchie, Chao Chen, Li Shen

详情

英文摘要

Accurate diagnosis of Alzheimer's disease (AD) requires handling tabular biomarker data, yet such data are often small and incomplete, where deep learning models frequently fail to outperform classical methods. Pretrained large language models (LLMs) offer few-shot generalization, structured reasoning, and interpretable outputs, providing a powerful paradigm shift for clinical prediction. We propose TAP-GPT Tabular Alzheimer's Prediction GPT, a domain-adapted tabular LLM framework built on TableGPT2 and fine-tuned for few-shot AD classification using tabular prompts rather than plain texts. We evaluate TAP-GPT across four ADNI-derived datasets, including QT-PAD biomarkers and region-level structural MRI, amyloid PET, and tau PET for binary AD classification. Across multimodal and unimodal settings, TAP-GPT improves upon its backbone models and outperforms traditional machine learning baselines in the few-shot setting while remaining competitive with state-of-the-art general-purpose LLMs. We show that feature selection mitigates degradation in high-dimensional inputs and that TAP-GPT maintains stable performance under simulated and real-world missingness without imputation. Additionally, TAP-GPT produces structured, modality-aware reasoning aligned with established AD biology and shows greater stability under self-reflection, supporting its use in iterative multi-agent systems. To our knowledge, this is the first systematic application of a tabular-specialized LLM to multimodal biomarker-based AD prediction, demonstrating that such pretrained models can effectively address structured clinical prediction tasks and laying the foundation for tabular LLM-driven multi-agent clinical decision-support systems. The source code is publicly available on GitHub: https://github.com/sophie-kearney/TAP-GPT.

URL PDF HTML ☆

赞 0 踩 0

2603.17189 2026-03-19 cs.RO

Influence of Gripper Design on Human Demonstration Quality for Robot Learning

Gina L. Georgadarellis, Natalija Beslic, Seonhun Lee, Frank C. Sup, Meghan E. Huber

Comments To be published in proceedings of 2026 IEEE International Conference on Robotics & Automation

2603.17187 2026-03-19 cs.LG

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Peng Xia, Jianwen Chen, Xinyu Yang, Haoqin Tu, Jiaqi Liu, Kaiwen Xiong, Siwei Han, Shi Qiu, Haonian Ji, Yuyin Zhou, Zeyu Zheng, Cihang Xie, Huaxiu Yao

详情

英文摘要

Large language model (LLM) agents are increasingly used for complex tasks, yet deployed agents often remain static, failing to adapt as user needs evolve. This creates a tension between the need for continuous service and the necessity of updating capabilities to match shifting task distributions. On platforms like OpenClaw, which handle diverse workloads across 20+ channels, existing methods either store raw trajectories without distilling knowledge, maintain static skill libraries, or require disruptive downtime for retraining. We present MetaClaw, a continual meta-learning framework that jointly evolves a base LLM policy and a library of reusable behavioral skills. MetaClaw employs two complementary mechanisms. Skill-driven fast adaptation analyzes failure trajectories via an LLM evolver to synthesize new skills, enabling immediate improvement with zero downtime. Opportunistic policy optimization performs gradient-based updates via cloud LoRA fine-tuning and Reinforcement Learning with a Process Reward Model (RL-PRM). This is triggered during user-inactive windows by the Opportunistic Meta-Learning Scheduler (OMLS), which monitors system inactivity and calendar data. These mechanisms are mutually reinforcing: a refined policy generates better trajectories for skill synthesis, while richer skills provide higher-quality data for policy optimization. To prevent data contamination, a versioning mechanism separates support and query data. Built on a proxy-based architecture, MetaClaw scales to production-size LLMs without local GPUs. Experiments on MetaClaw-Bench and AutoResearchClaw show that skill-driven adaptation improves accuracy by up to 32% relative. The full pipeline advances Kimi-K2.5 accuracy from 21.4% to 40.6% and increases composite robustness by 18.3%. Code is available at https://github.com/aiming-lab/MetaClaw.

URL PDF HTML ☆

赞 0 踩 0

2603.17186 2026-03-19 cs.CV cs.IR

Visual Product Search Benchmark

Karthik Sulthanpete Govindappa

Comments 21 pages

2603.17178 2026-03-19 cs.CV

Patient4D: Temporally Consistent Patient Body Mesh Recovery from Monocular Operating Room Video

Mingxiao Tu, Hoijoon Jung, Alireza Moghadam, Andre Kyme, Jinman Kim

2603.17175 2026-03-19 cs.LG physics.geo-ph

Domain-informed explainable boosting machines for trustworthy lateral spread predictions

Cheng-Hsi Hsiao, Krishna Kumar, Ellen M. Rathje

Comments 33 pages, 16 figures

2603.17173 2026-03-19 cs.CV cs.AI

Generalist Multimodal LLMs Gain Biometric Expertise via Human Salience

Jacob Piland, Byron Dowling, Christopher Sweet, Adam Czajka

2603.17172 2026-03-19 cs.LG

Noise-Response Calibration: A Causal Intervention Protocol for LLM-Judges

Maxim Khomiakov, Jes Frellsen

Comments Published as a conference paper at CAO Workshop at ICLR 2026

2603.17171 2026-03-19 cs.CL

Exploiting the English Grammar Profile for L2 grammatical analysis with LLMs

Stefano Bannò, Penny Karanasou, Kate Knill, Mark Gales

详情

英文摘要

Evaluating the grammatical competence of second language (L2) learners is essential both for providing targeted feedback and for assessing proficiency. To achieve this, we propose a novel framework leveraging the English Grammar Profile (EGP), a taxonomy of grammatical constructs mapped to the proficiency levels of the Common European Framework of Reference (CEFR), to detect learners' attempts at grammatical constructs and classify them as successful or unsuccessful. This detection can then be used to provide fine-grained feedback. Moreover, the grammatical constructs are used as predictors of proficiency assessment by using automatically detected attempts as predictors of holistic CEFR proficiency. For the selection of grammatical constructs derived from the EGP, rule-based and LLM-based classifiers are compared. We show that LLMs outperform rule-based methods on semantically and pragmatically nuanced constructs, while rule-based approaches remain competitive for constructs that rely purely on morphological or syntactic features and do not require semantic interpretation. For proficiency assessment, we evaluate both rule-based and hybrid pipelines and show that a hybrid approach combining a rule-based pre-filter with an LLM consistently yields the strongest performance. Since our framework operates on pairs of original learner sentences and their corrected counterparts, we also evaluate a fully automated pipeline using automatic grammatical error correction. This pipeline closely approaches the performance of semi-automated systems based on manual corrections, particularly for the detection of successful attempts at grammatical constructs. Overall, our framework emphasises learners' successful attempts in addition to unsuccessful ones, enabling positive, formative feedback and providing actionable insights into grammatical development.

URL PDF HTML ☆

赞 0 踩 0

2603.17169 2026-03-19 cs.AI cs.CL

How Clued up are LLMs? Evaluating Multi-Step Deductive Reasoning in a Text-Based Game Environment

Rebecca Ansell, Autumn Toney-Wails

2603.17165 2026-03-19 cs.RO cs.CV

SLAM Adversarial Lab: An Extensible Framework for Visual SLAM Robustness Evaluation under Adverse Conditions

Mohamed Hefny, Karthik Dantu, Steven Y. Ko

Comments 8 pages, 4 figures

2603.17161 2026-03-19 cs.CV

GazeOnce360: Fisheye-Based 360° Multi-Person Gaze Estimation with Global-Local Feature Fusion

Zhuojiang Cai, Zhenghui Sun, Feng Lu

Comments Accepted to CVPR 2026

2603.17159 2026-03-19 cs.CV cs.RO

BEV-SLD: Self-Supervised Scene Landmark Detection for Global Localization with LiDAR Bird's-Eye View Images

David Skuddis, Vincent Ress, Wei Zhang, Vincent Ofosu Nyako, Norbert Haala

Comments Accepted to CVPR 2026

2603.17152 2026-03-19 cs.RO cs.LG

Shielded Reinforcement Learning Under Dynamic Temporal Logic Constraints

Sadık Bera Yüksel, Ali Tevfik Buyukkocak, Derya Aksaray

Comments 7 pages, 3 figures, 2026 IEEE American Control Conference (ACC)

2603.17148 2026-03-19 cs.LG

Personalized Fall Detection by Balancing Data with Selective Feedback Using Contrastive Learning

Awatif Yasmin, Tarek Mahmud, Sana Alamgeer, Anne H. H. Ngu

2603.17139 2026-03-19 cs.LG stat.ML

Contextual Preference Distribution Learning

Benjamin Hudson, Laurent Charlin, Emma Frejinger

Comments In CPAIOR 2026 (23rd International Conference on the Integration of Constraint Programming, Artificial Intelligence, and Operations Research)

2603.17131 2026-03-19 cs.CV

SMAL-pets: SMAL Based Avatars of Pets from Single Image

Piotr Borycki, Joanna Waczyńska, Yizhe Zhu, Yongqiang Gao, Przemysław Spurek

2603.17126 2026-03-19 cs.LG cs.IT eess.IV math.IT

Topology-Preserving Deep Joint Source-Channel Coding for Semantic Communication

Omar Erak, Omar Alhussein, Fang Fang, Sami Muhaidat

Comments Submitted to IEEE Journals for possible publication

2603.17117 2026-03-19 cs.CV

MosaicMem: Hybrid Spatial Memory for Controllable Video World Models

Wei Yu, Runjia Qian, Yumeng Li, Liquan Wang, Songheng Yin, Sri Siddarth Chakaravarthy P, Dennis Anthony, Yang Ye, Yidi Li, Weiwei Wan, Animesh Garg

Comments Project Page: https://mosaicmem.github.io/mosaicmem/

2603.17111 2026-03-19 cs.CV cs.AI

Hidden Clones: Exposing and Fixing Family Bias in Vision-Language Model Ensembles

Zacharie Bugaud

Comments 15 pages, 6 figures, 11 tables

2603.17110 2026-03-19 cs.CV cs.LG

Pixel-level Counterfactual Contrastive Learning for Medical Image Segmentation

Marceau Lafargue-Hauret, Raghav Mehta, Fabio De Sousa Ribeiro, Mélanie Roschewitz, Ben Glocker

Comments Accepted at ISBI-2026 (oral presentation)

2603.17109 2026-03-19 cs.LG

SENSE: Efficient EEG-to-Text via Privacy-Preserving Semantic Retrieval

Akshaj Murhekar, Christina Liu, Abhijit Mishra, Shounak Roychowdhury, Jacek Gwizdka

2603.17102 2026-03-19 cs.CL cs.AI cs.LG

Knowledge Localization in Mixture-of-Experts LLMs Using Cross-Lingual Inconsistency

Lucas Bandarkar, Alan Ansell, Trevor Cohn

2603.17098 2026-03-19 cs.CV

Accurate Shift Invariant Convolutional Neural Networks Using Gaussian-Hermite Moments

Jaspreet Singh, Petra Bosilj, Grzegorz Cielniak

2603.17094 2026-03-19 cs.CL

Evaluating LLM-Simulated Conversations in Modeling Inconsistent and Uncollaborative Behaviors in Human Social Interaction

Ryo Kamoi, Ameya Godbole, Longqi Yang, Rui Zhang, Mengting Wan, Pei Zhou

2603.17092 2026-03-19 cs.RO

SLowRL: Safe Low-Rank Adaptation Reinforcement Learning for Locomotion

Elham Daneshmand, Shafeef Omar, Glen Berseth, Majid Khadiv, Hsiu-Chin Lin

2603.17087 2026-03-19 cs.CL cs.LG

Ensemble Self-Training for Unsupervised Machine Translation

Ido Aharon, Jonathan Shaki, Sarit Kraus

2603.17079 2026-03-19 cs.CV

ACE-LoRA: Graph-Attentive Context Enhancement for Parameter-Efficient Adaptation of Medical Vision-Language Models

M. Arda Aydın, Melih B. Yilmaz, Aykut Koç, Tolga Çukur

2603.17075 2026-03-19 cs.LG cs.AI cs.CC

CircuitBuilder: From Polynomials to Circuits via Reinforcement Learning

Weikun K. Zhang, Rohan Pandey, Bhaumik Mehta, Kaijie Jin, Naomi Morato, Archit Ganapule, Michael Ruofan Zeng, Jarod Alper

Comments ICLR 2026 Workshop on AI with Recursive Self-Improvement

2603.17070 2026-03-19 cs.CL cs.AI

Large Reasoning Models Struggle to Transfer Parametric Knowledge Across Scripts

Lucas Bandarkar, Alan Ansell, Trevor Cohn