arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2512.01510 2026-04-29 cs.CV cs.LG

Semantic-aware Random Convolution and Source Matching for Domain Generalization in Medical Image Segmentation

Franz Thaler, Martin Urschler, Mateusz Kozinski, Matthias AF Gsell, Gernot Plank, Darko Stern

Comments Accepted for publication in IEEE Access

详情

DOI: 10.1109/ACCESS.2026.3687116

英文摘要

We tackle the challenging problem of single-source domain generalization (DG) for medical image segmentation, where we train a network on one domain (e.g., CT) and directly apply it to a different domain (e.g., MR) without adapting the model and without requiring images or annotations from the new domain during training. Our method diversifies the source domain through semantic-aware random convolution, where different regions of a source image are augmented differently at training-time, based on their annotation labels. At test-time, we complement the randomization of the training domain via mapping the intensity of target domain images, making them similar to source domain data. We perform a comprehensive evaluation on a variety of cross-modality and cross-center generalization settings for abdominal, whole-heart and prostate segmentation, where we outperform previous DG techniques in a vast majority of experiments. Additionally, we also investigate our method when training on whole-heart CT or MR data and testing on the diastolic and systolic phase of cine MR data captured with different scanner hardware. Overall, our evaluation shows that our method achieves new state-of-the-art performance in DG for medical image segmentation, even matching the performance of the in-domain baseline in several settings.

URL PDF HTML ☆

赞 0 踩 0

2511.20211 2026-04-29 cs.CV cs.AI

OmniAlpha: Aligning Transparency-Aware Generation via Multi-Task Unified Reinforcement Learning

Hao Yu, Jinglin Wang, Jiabo Zhan, Rui Chen, Zile Wang, Huaisong Zhang, Hongyu Li, Xinrui Chen, Yongxian Wei, Chun Yuan

2511.18871 2026-04-29 cs.LG cs.AI

Periodic Asynchrony: An On-Policy Approach for Accelerating LLM Reinforcement Learning

Jian Lu

2511.07743 2026-04-29 cs.CV cs.AI

UltraGS: Real-Time Physically-Decoupled Gaussian Splatting for Ultrasound Novel View Synthesis

Yuezhe Yang, Qingqing Ruan, Wenjie Cai, Yudang Dong, Dexin Yang, Xingbo Dong, Zhe Jin, Yong Dai

Comments Accepted by ICME 2026

2511.01490 2026-04-29 cs.CL

Synthetic Eggs in Many Baskets: The Impact of Synthetic Data Diversity on LLM Fine-Tuning

Max Schaffelder, Albert Gatt

Comments Accepted to Findings of the Association for Computational Linguistics: ACL 2026

2510.16340 2026-04-29 cs.CL cs.AI

Thinking About Thinking: Evaluating Reasoning in Post-Trained Language Models

Pratham Singla, Shivank Garg, Ayush Singh, Ishan Garg, Ketan Suhaas Saichandran

2510.09382 2026-04-29 cs.LG

CHUCKLE -- When Humans Teach AI To Learn Emotions The Easy Way

Ankush Pratap Singh, Houwei Cao, Yong Liu

2510.05432 2026-04-29 cs.AI

AInstein: Can LLMs Solve Research Problems From Parametric Memory Alone?

Shambhavi Mishra, Gaurav Sahu, Marco Pedersoli, Laurent Charlin, Jose Dolz, Christopher Pal

2510.02765 2026-04-29 cs.LG

Curl Descent: Non-Gradient Learning Dynamics with Sign-Diverse Plasticity

Hugo Ninou, Jonathan Kadmon, N. Alex Cayco-Gajic

2509.17387 2026-04-29 cs.RO

High-Precision and High-Efficiency Trajectory Tracking for Excavators Based on Closed-Loop Dynamics

Ziqing Zou, Cong Wang, Yue Hu, Xiao Liu, Bowen Xu, Rong Xiong, Changjie Fan, Yingfeng Chen, Yue Wang

2508.18473 2026-04-29 cs.CL cs.AI cs.LG

Principled Detection of Hallucinations in Large Language Models via Multiple Testing

Jiawei Li, Akshayaa Magesh, Venugopal V. Veeravalli

Comments 14 pages, 2 figures

2506.23773 2026-04-29 cs.AI cs.LO

BayesL: a Logical Framework for the Verification of Bayesian Networks

Stefano M. Nicoletti, E. Moritz Hahn, Mariëlle Stoelinga

2504.17364 2026-04-29 cs.CV

I-INR: Iterative Implicit Neural Representations

Ali Haider, Muhammad Salman Ali, Maryam Qamar, Tahir Khalil, Soo Ye Kim, Jihyong Oh, Enzo Tartaglione, Sung-Ho Bae

Comments Accepted at AAAI 2026

2504.11349 2026-04-29 cs.CV cs.AI cs.GR

Representation Paradigms in AI-based 3D Radiological Image Reconstruction: A Systematic Review

Yuezhe Yang, Lei Bi, Boyu Yang, Yaqian Wang, Yang He, Yige Peng, Zhe Jin, Xingbo Dong, Jinman Kim

Comments 58 pages, Under Reivew

2504.01919 2026-04-29 cs.CL cs.AI

Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation

Baban Gain, Dibyanayan Bandyopadhyay, Asif Ekbal, Trilok Nath Singh

详情

DOI: 10.1007/s10579-026-09919-7

英文摘要

Large Language Models (LLMs) are rapidly reshaping machine translation (MT), particularly by introducing instruction-following, in-context learning, and preference-based alignment into what has traditionally been a supervised encoder-decoder paradigm. This survey provides a comprehensive and up-to-date overview of how LLMs are being leveraged for MT across data regimes, languages, and application settings. We systematically analyze prompting-based methods, parameter-efficient and full fine-tuning strategies, synthetic data generation, preference-based optimization, and reinforcement learning with human and weakly supervised feedback. Special attention is given to low-resource translation, where we examine the roles of synthetic data quality, diversity, and preference signals, as well as the limitations of current RLHF pipelines. We further review recent advances in Mixture-of-Experts models, MT-focused LLMs, and multilingual alignment, highlighting trade-offs between scalability, specialization, and accessibility. Beyond sentence-level translation, we survey emerging document-level and discourse-aware MT methods with LLMs, showing that most approaches extend sentence-level pipelines through structured context selection, post-editing, or reranking rather than requiring fundamentally new data regimes or architectures. Finally, we discuss LLM-based evaluation, its strengths and biases, and its role alongside learned metrics. Overall, this survey positions LLM-based MT as an evolution of traditional MT systems, where gains increasingly depend on data quality, preference alignment, and context utilization rather than scale alone, and outlines open challenges for building robust, inclusive, and controllable translation systems.

URL PDF HTML ☆

赞 0 踩 0

2503.10210 2026-04-29 cs.CV

TARS: Traffic-Aware Radar Scene Flow Estimation

Jialong Wu, Marco Braun, Dominic Spata, Matthias Rottmann

2503.07768 2026-04-29 cs.CV

NimbleReg: A light-weight deep-learning framework for diffeomorphic image registration

Antoine Legouhy, Ross Callaghan, Nolah Mazet, Vivien Julienne, Hojjat Azadbakht, Hui Zhang

2502.02452 2026-04-29 cs.CV

Personalization Toolkit: Training Free Personalization of Large Vision Language Models

Soroush Seifi, Vaggelis Dorovatas, Matteo Cassinelli, Fabien Despinoy, Daniel Olmeda Reino, Rahaf Aljundi

Comments Accepted at Transactions on Machine Learning Research (TMLR) 2026

2501.17653 2026-04-29 cs.LG cs.CE eess.SP

Drivetrain simulation using variational autoencoders

Pallavi Sharma, Jorge-Humberto Urrea-Quintero, Bogdan Bogdan, Adrian-Dumitru Ciotec, Laura Vasilie, Henning Wessels, Matteo Skull

Comments 27 pages

2412.07584 2026-04-29 cs.CV cs.AI

Multimodal Contextualized Support for Enhancing Video Retrieval System

Quoc-Bao Nguyen-Le, Thanh-Huy Le-Nguyen

Comments This paper has been withdrawn by the author. After further review, the author believes that the current version does not meet the desired standards and plans to revise the work before any potential resubmission

2412.00167 2026-04-29 cs.LG cs.AI

Origin-Destination Demand Prediction: An Urban Radiation and Attraction Perspective

Xuan Ma, Zepeng Bao, Ming Zhong, Yuanyuan Zhu, Chenliang Li, Jiawei Jiang, Qing Li, Tieyun Qian

Comments Upon further internal review, we identified several issues that were not fully addressed in the current version. To ensure scientific rigor and avoid potential misinterpretation, we have decided to withdraw the paper for further refinement

2411.14721 2026-04-29 cs.CL cs.LG q-bio.QM

MolReFlect: Towards In-Context Fine-grained Alignments between Molecules and Texts

Jiatong Li, Yunqing Liu, Wei Liu, Jingdi Le, Di Zhang, Wenqi Fan, Dongzhan Zhou, Yuqiang Li, Qing Li

Comments Accepted by TKDE, To appear. Codes are available at: https://github.com/phenixace/MolReFlect

2410.09635 2026-04-29 cs.LG cs.AI

Use of What-if Scenarios to Help Explain Artificial Intelligence Models for Neonatal Health

Abdullah Mamun, Lawrence D. Devoe, Mark I. Evans, David W. Britt, Judith Klein-Seetharaman, Hassan Ghasemzadeh

Comments Accepted for publication in ACM Transactions on Computing for Healthcare (ACM HEALTH), April 2026. 26 pages, 9 figures

2410.04182 2026-04-29 cs.CV

PortraVec: Image-Based Portrait Vectorization with Text-Guided Manipulation

Yiqi Liang, Ying Liu, Dandan Long, Ruihui Li

Comments 6 pages, 9 figures

2211.12080 2026-04-29 cs.SD eess.AS

Robust Training for Speaker Verification against Noisy Labels

Zhihua Fang, Liang He, Hanhan Ma, Xiaochen Guo, Lin Li

Comments Accepted by INTERSPEECH 2023

2604.25534 2026-04-29 cs.AI

Sample-efficient Neuro-symbolic Proximal Policy Optimization

Simone Murari, Celeste Veronese, Daniele Meli

2604.25533 2026-04-29 cs.CV

DualGeo: A Dual-View Framework for Worldwide Image Geo-localization

Junchao Cui, Wenqi Shi, Shaoyong Du, Hang He, Xuanzi Ma, Hao Tang, Xiangyang Luo

Comments ICME2026 Accept

2604.25521 2026-04-29 cs.AI

Automated Adversarial Collaboration for Advancing Theory Building in the Cognitive Sciences

Suyog Chandramouli, George Kachergis, Akshay Jagadish

Comments 2 pages

2604.25512 2026-04-29 cs.AI

PHISHREV: A Hybrid Machine Learning and Post-Hoc Non-monotonic Reasoning Framework for Context-Aware Phishing Website Classification

Mainak Sen, Kumar Sankar Ray, Amlan Chakrabarti

2604.25508 2026-04-29 cs.LG

Dyna-Style Safety Augmented Reinforcement Learning: Staying Safe in the Face of Uncertainty

Artur Eisele, Bernd Frauenknecht, Friedrich Solowjow, Sebastian Trimpe