arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.29914 2026-04-01 cs.LG

Task Scarcity and Label Leakage in Relational Transfer Learning

Francisco Galuppo Azevedo, Clarissa Lima Loures, Denis Oliveira Correa

Comments Accepted at the 3rd DATA-FM Workshop at ICLR 2026, Rio de Janeiro, Brazil. OpenReview: https://openreview.net/forum?id=nI2nsMMHXp

2603.29902 2026-04-01 cs.AI

ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation

Yinuo Liu, Zi Qian, Heng Zhou, Jiahao Zhang, Yajie Zhang, Zhihang Li, Mengyu Zhou, Erchao Zhao, Xiaoxi Jiang, Guanjun Jiang

2603.29901 2026-04-01 cs.CV cs.CL

Less Is More? Selective Visual Attention to High-Importance Regions for Multimodal Radiology Summarization

Mst. Fahmida Sultana Naznin, Adnan Ibney Faruq, Mushfiqur Rahman, Niloy Kumar Mondal, Md. Mehedi Hasan Shawon, Md Rakibul Hasan

2603.29895 2026-04-01 cs.AI cs.IT cs.LG math.IT

A Rational Account of Categorization Based on Information Theory

Christophe J. MacLellan, Karthik Singaravadivelan, Xin Lian, Zekun Wang, Pat Langley

Comments 6 pages, 5 figures, 2 tables

2603.29892 2026-04-01 cs.CL

FLEURS-Kobani: Extending the FLEURS Dataset for Northern Kurdish

Daban Q. Jaff, Mohammad Mohammadamini

2603.29882 2026-04-01 cs.RO cs.SY eess.SY

Passive iFIR filters for data-driven velocity control in robotics

Yi Zhang, Zixing Wang, Fulvio Forni

2603.29871 2026-04-01 cs.AI

ShapE-GRPO: Shapley-Enhanced Reward Allocation for Multi-Candidate LLM Training

Rui Ai, Yu Pan, David Simchi-Levi, Chonghuan Wang

2603.29861 2026-04-01 cs.CL cs.AI

Towards Empowering Consumers through Sentence-level Readability Scoring in German ESG Reports

Benjamin Josef Schüßler, Jakob Prange

Comments accepted to NLP4Ecology workshop at LREC 2026

2603.29848 2026-04-01 cs.AI cs.MA

AgentFixer: From Failure Detection to Fix Recommendations in LLM Agentic Systems

Hadar Mulian, Sergey Zeltyn, Ido Levy, Liane Galanti, Avi Yaeli, Segev Shlomov

2603.29846 2026-04-01 cs.CL

SNEAK: Evaluating Strategic Communication and Information Leakage in Large Language Models

Adar Avsian, Larry Heck

2603.29842 2026-04-01 cs.CV cs.LG

Toward Generalizable Whole Brain Representations with High-Resolution Light-Sheet Data

Minyoung E. Kim, Dae Hee Yun, Aditi V. Patel, Madeline Hon, Webster Guan, Taegeon Lee, Brian Nguyen

Comments 21 pages, 12 figures. Accepted at CVPR 2026

2603.29837 2026-04-01 cs.LG

DiSGMM: A Method for Time-varying Microscopic Weight Completion on Road Networks

Yan Lin, Jilin Hu, Shengnan Guo, Christian S. Jensen, Youfang Lin, Huaiyu Wan

2603.29832 2026-04-01 cs.CV

AutoFormBench: Benchmark Dataset for Automating Form Understanding

Gaurab Baral, Junxiu Zhou

Comments 9 pages, 3 figures, 2 tables

2603.29828 2026-04-01 cs.AI cs.CL

Owl-AuraID 1.0: An Intelligent System for Autonomous Scientific Instrumentation and Scientific Data Analysis

Han Deng, Anqi Zou, Hanling Zhang, Ben Fei, Chengyu Zhang, Haobo Wang, Xinru Guo, Zhenyu Li, Xuzhu Wang, Peng Yang, Fujian Zhang, Weiyu Guo, Xiaohong Shao, Zhaoyang Liu, Shixiang Tang, Zhihui Wang, Wanli Ouyang

Comments 17 pages

2603.29820 2026-04-01 cs.SD

SIREN: Spatially-Informed Reconstruction of Binaural Audio with Vision

Mingyeong Song, Seoyeon Ko, Junhyug Noh

Comments 5 pages, 1 figure, to appear in ICASSP 2026

2603.29818 2026-04-01 cs.LG

Loss Gap Parity for Fairness in Heterogeneous Federated Learning

Brahim Erraji, Michaël Perrot, Aurélien Bellet

Comments 9 Pages, Published to AISTATS 2026

2603.29812 2026-04-01 cs.LG cond-mat.mtrl-sci

AMShortcut: An Inference- and Training-Efficient Inverse Design Model for Amorphous Materials

Yan Lin, Jonas A. Finkler, Tao Du, Jilin Hu, Morten M. Smedskjaer

2603.29808 2026-04-01 cs.RO

Reconfiguration of supernumerary robotic limbs for human augmentation

Mustafa Mete, Anastasia Bolotnikova, Alexander Schuessler, Jamie Paik

2603.29801 2026-04-01 cs.CL

ENEIDE: A High Quality Silver Standard Dataset for Named Entity Recognition and Linking in Historical Italian

Cristian Santini, Sebastian Barzaghi, Paolo Sernani, Emanuele Frontoni, Laura Melosi, Mehwish Alam

2603.29798 2026-04-01 cs.CV

SceneTeract: Agentic Functional Affordances and VLM Grounding in 3D Scenes

Léopold Maillard, Francis Engelmann, Tom Durand, Boxiao Pan, Yang You, Or Litany, Leonidas Guibas, Maks Ovsjanikov

Comments Project page: https://sceneteract.github.io/

2603.29793 2026-04-01 cs.LG q-bio.QM

Multimodal Machine Learning for Early Prediction of Metastasis in a Swedish Multi-Cancer Cohort

Franco Rugolon, Korbinian Randl, Braslav Jovanovic, Ioanna Miliou, Panagiotis Papapetrou

详情

英文摘要

Multimodal Machine Learning offers a holistic view of a patient's status, integrating structured and unstructured data from electronic health records (EHR). We propose a framework to predict metastasis risk one month prior to diagnosis, using six months of clinical history from EHR data. Data from four cancer cohorts collected at Karolinska University Hospital (Stockholm, Sweden) were analyzed: breast (n = 743), colon (n = 387), lung (n = 870), and prostate (n = 1890). The dataset included demographics, comorbidities, laboratory results, medications, and clinical text. We compared traditional and deep learning classifiers across single modalities and multimodal combinations, using various fusion strategies and a Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) 2a design, with an 80-20 development-validation split to ensure a rigorous, repeatable evaluation. Performance was evaluated using AUROC, AUPRC, F1 score, sensitivity, and specificity. We then employed a multimodal adaptation of SHAP to analyze the classifiers' reasoning. Intermediate fusion achieved the highest F1 scores on breast (0.845), colon (0.786), and prostate cancer (0.845), demonstrating strong predictive performance. For lung cancer, the intermediate fusion achieved an F1 score of 0.819, while the text-only model achieved the highest, with an F1 score of 0.829. Deep learning classifiers consistently outperformed traditional models. Colon cancer, the smallest cohort, had the lowest performance, highlighting the importance of sufficient training data. SHAP analysis showed that the relative importance of modalities varied across cancer types. Fusion strategies offer distinct strengths and weaknesses. Intermediate fusion consistently delivered the best results, but strategy choices should align with data characteristics and organizational needs.

URL PDF HTML ☆

赞 0 踩 0

2603.29791 2026-04-01 cs.AI cs.CL cs.LG

Reasoning-Driven Synthetic Data Generation and Evaluation

Tim R. Davidson, Benoit Seguin, Enrico Bacis, Cesar Ilharco, Hamza Harkous

Comments Accepted to TMLR 2026, J2C Certification

2603.29788 2026-04-01 cs.CV

Multi-Feature Fusion Approach for Generative AI Images Detection

Abderrezzaq Sendjasni, Mohamed-Chaker Larabi

Comments This work has been submitted to IEEE Transactions for possible publication

2603.29784 2026-04-01 cs.CV

MAPLE: Multi-Path Adaptive Propagation with Level-Aware Embeddings for Hierarchical Multi-Label Image Classification

Boshko Koloski, Marjan Stoimchev, Jurica Levatić, Dragi Kocev, Sašo Džeroski

Comments REO: Advances in Representation Learning for Earth Observation, accepted workshow paper at EurIPS

2603.29777 2026-04-01 cs.CV cs.AI

From Skeletons to Semantics: Design and Deployment of a Hybrid Edge-Based Action Detection System for Public Safety

Ganen Sethupathy, Lalit Dumka, Jan Schagen

Comments Preprint version of a manuscript currently under review at IEEE Access

2603.29768 2026-04-01 cs.LG

Big2Small: A Unifying Neural Network Framework for Model Compression

Jing-Xiao Liao, Haoran Wang, Tao Li, Daoming Lyu, Yi Zhang, Chengjun Cai, Feng-Lei Fan

2603.29765 2026-04-01 cs.LG cs.CL

Training-Free Dynamic Upcycling of Expert Language Models

Eros Fanì, Oğuzhan Ersoy

Comments Accepted at the ICLR 2026 Workshop on Scaling Post-training for LLMs

详情

英文摘要

Large Language Models (LLMs) have achieved remarkable performance on a wide range of specialized tasks, exhibiting strong problem-solving capabilities. However, training these models is prohibitively expensive, and they often lack domain-specific expertise because they rely on general knowledge datasets. Expertise finetuning can address this issue; however, it often leads to overspecialization, and developing a single multi-domain expert remains difficult due to diverging objectives. Furthermore, multitask training is challenging due to interference and catastrophic forgetting. Existing work proposes combining the expertise of dense models within a Mixture of Experts (MoE) architecture, although this approach still requires multitask finetuning. To address these issues, we introduce Dynamic Upcycling MoE (DUME), a novel approach that reuses dense experts trained on different domains to construct a unified MoE model. Our method builds a single multitask model that preserves the capabilities of the original dense experts without requiring additional training. DUME is both cost-efficient and scalable: by leveraging the closed-form solution of ridge regression, it eliminates the need for further optimization and enables experts to be added dynamically while maintaining the model's original performance. We demonstrate that DUME consistently outperforms baseline approaches in both causal language modeling and reasoning settings. Finally, we also show that the DUME model can be fine-tuned to further improve performance. We show that, in the causal language modeling setting, DUME can retain up to 97.6% of a dense expert model specialized in one particular domain, and that it can also surpass it in the reasoning setting, where it can achieve 102.1% of the dense expert performance. Our code is available at: github.com/gensyn-ai/dume.

URL PDF HTML ☆

赞 0 踩 0

2603.29761 2026-04-01 cs.AI

Tracking vs. Deciding: The Dual-Capability Bottleneck in Searchless Chess Transformers

Quanhao Li, Wei Jiang

2603.29759 2026-04-01 cs.CV cs.AI

TSHA: A Benchmark for Visual Language Models in Trustworthy Safety Hazard Assessment Scenarios

Qiucheng Yu, Ruijie Xu, Mingang Chen, Xuequan Lu, Jianfeng Dong, Chaochao Lu, Xin Tan

2603.29756 2026-04-01 cs.LG

One-for-All: A Lightweight Stabilized and Parameter-Efficient Pre-trained LLM for Time Series Forecasting

Prasanjit Dey, Soumyabrata Dev, Bianca Schoen-Phelan

Comments This manuscript is currently under review at IEEE Transactions on Knowledge and Data Engineering (TKDE)