arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2602.17194 2026-04-03 cs.CL

What Makes a Good Doctor Response? A Study on Text-Based Telemedicine

Adrian Cosma, Cosmin Dumitrache, Emilian Radoi

Comments Accepted at CL4Health Workshop @ LREC 2026

详情

英文摘要

Text-based telemedicine has become an increasingly used mode of care, requiring clinicians to deliver medical advice clearly and effectively in writing. As platforms increasingly rely on patient ratings and feedback, clinicians face growing pressure to maintain satisfaction scores, even though these evaluations often reflect communication quality more than clinical accuracy. We analyse patient satisfaction signals in Romanian text-based telemedicine. Using a sample of anonymised text-based telemedicine consultations, we model feedback as a binary outcome, treating thumbs-up responses as positive and grouping negative or absent feedback into the other class. We extract from doctor responses interpretable, predominantly language-agnostic features (e.g., length, structural characteristics, readability proxies), along with Romanian LIWC psycholinguistic features and politeness/hedging markers where available. We train a classifier with a time-based split and perform SHAP-based analyses, which indicate that metadata dominates prediction, functioning as a strong prior, while characteristics of the response text provide a smaller but actionable signal. In subgroup correlation analyses, politeness and hedging are consistently associated with positive patient feedback, whereas lexical diversity shows a negative association.

URL PDF HTML ☆

赞 0 踩 0

2602.15997 2026-04-03 cs.LG cs.AI cs.CL

The Geometric Anatomy of Capability Acquisition in Transformers

Jayadev Billa

Comments 19 pages (13 pages main, 6 pages appendix), 13 tables, 8 figures. v4: significant rewrite with additional experiments

2602.11812 2026-04-03 cs.AI

Predicting LLM Output Length via Entropy-Guided Representations

Huanyi Xie, Yubin Chen, Liangyu Wang, Lijie Hu, Di Wang

2602.10259 2026-04-03 cs.CV

PMMA: The Polytechnique Montreal Mobility Aids Dataset

Qingwu Liu, Nicolas Saunier, Guillaume-Alexandre Bilodeau

Comments Submitted to the journal IEEE Open Journal Intelligent Transportation Systems, under review

2602.08821 2026-04-03 cs.RO

Multi-Staged Framework for Safety Analysis of Offloaded Services in Distributed Intelligent Transportation Systems

Robin Dehler, Oliver Schumann, Jona Ruof, Michael Buchholz

Comments 2025 IEEE International Conference on Intelligent Transportation Systems (ITSC)

2602.04160 2026-04-03 cs.SD

PFluxTTS: Hybrid Flow-Matching TTS with Robust Cross-Lingual Voice Cloning and Inference-Time Model Fusion

Vikentii Pankov, Artem Gribul, Oktai Tatanov, Vladislav Proskurov, Yuliya Korotkova, Darima Mylzenova, Dmitrii Vypirailenko

Comments Accepted at ICASSP 2026

2602.03396 2026-04-03 cs.CL

Towards Distillation-Resistant Large Language Models: An Information-Theoretic Perspective

Hao Fang, Tianyi Zhang, Tianqu Zhuang, Jiawei Kong, Kuofeng Gao, Bin Chen, Leqi Liang, Shu-Tao Xia, Ke Xu

2602.03380 2026-04-03 cs.CV

Seeing Through the Chain: Mitigate Hallucination in Multimodal Reasoning Models via CoT Compression and Contrastive Preference Optimization

Hao Fang, Jinyu Li, Jiawei Kong, Tianqu Zhuang, Kuofeng Gao, Bin Chen, Shu-Tao Xia

2601.21462 2026-04-03 cs.LG stat.ML

Partial Feedback Online Learning

Shihao Shao, Cong Fang, Zhouchen Lin, Dacheng Tao

Comments 40 pages. Fixed some typos in the proof and improved readability

2601.20666 2026-04-03 cs.LG cs.AI cs.SY eess.SY

Learning Contextual Runtime Monitors for Safe AI-Based Autonomy

Alejandro Luque-Cerpa, Mengyuan Wang, Emil Carlsson, Sanjit A. Seshia, Devdatt Dubhashi, Hazem Torfah

2601.20331 2026-04-03 cs.CV

GVGS: Gaussian Visibility-Aware Multi-View Geometry for Accurate Surface Reconstruction

Mai Su, Qihan Yu, Zhongtao Wang, Yilong Li, Chengwei Pan, Yisong Chen, Guoping Wang, Fei Zhu

2601.17641 2026-04-03 cs.LG eess.SP

RPNT: Robust Pre-trained Neural Transformer -- A Pathway for Generalized Motor Decoding

Hao Fang, Ryan A. Canfield, Tomohiro Ouchi, Beatrice Macagno, Eli Shlizerman, Amy L. Orsborn

详情

英文摘要

Brain motor decoding aims to interpret and translate neural activity into behaviors. Decoding models should generalize across variations, such as recordings from different brain sites, experimental sessions, behavior types, and subjects, will be critical for real-world applications. Current decoding models only partially address these challenges. In this work, we develop a pretrained neural transformer model, RPNT - Robust Pretrained Neural Transformer, designed to achieve robust generalization through pretraining, which in turn enables effective finetuning for downstream motor decoding tasks. We achieved the proposed RPNT architecture by systematically investigating which transformer building blocks could be suitable for neural spike activity modeling, since components from models developed for other modalities, such as text and images, do not transfer directly to neural data. The final RPNT architecture incorporates three unique enabling components: 1) Multidimensional rotary positional embedding to aggregate experimental metadata such as site coordinates, session ids and behavior types; 2) Context-based attention mechanism via convolution kernels operating on global attention to learn local temporal structures for handling non-stationarity of neural population activity; 3) Robust self-supervised learning objective with stochastic causal masking strategies and contrastive representations. We pretrained two versions of RPNT on distinct datasets that present significant generalization challenges: a) Multi-session, multi-task, and multi-subject microelectrode benchmark; b) Multi-site recordings using high-density Neuropixel 1.0 probes from many cortical locations. After pretraining, we evaluated RPNT generalization on cross-session, cross-type, cross-subject, and cross-site downstream behavior decoding tasks. Our RPNT consistently outperforms the existing decoding models on these tasks.

URL PDF HTML ☆

赞 0 踩 0

2601.17387 2026-04-03 cs.CL

Generation-Step-Aware Framework for Cross-Modal Representation and Control in Multilingual Speech-Text Models

Toshiki Nakai, Varsha Suresh, Vera Demberg

Comments 10 pages for the main text, 6 Figures, 5 Tables

2601.17192 2026-04-03 cs.LG

PUNCH: Physics-informed Uncertainty-aware Network for Coronary Hemodynamics

Sukirt Thakur, Marcus Roper, Yang Zhou, Dmitry Yu. Isaev, Reza Akbarian Bafghi, Brahmajee K. Nallamothu, C. Alberto Figueroa, Srinivas Paruchuri, Scott Burger, Carlos Collet, Maziar Raissi

2601.16885 2026-04-03 cs.CV cs.RO

GPA-VGGT:Adapting VGGT to Large Scale Localization by Self-Supervised Learning with Geometry and Physics Aware Loss

Yangfan Xu, Lilian Zhang, Xiaofeng He, Pengdong Wu, Wenqi Wu, Jun Mao

2601.16515 2026-04-03 cs.CV

SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer

Tongcheng Fang, Hanling Zhang, Ruiqi Xie, Zhuo Han, Xin Tao, Tianchen Zhao, Pengfei Wan, Wenbo Ding, Wanli Ouyang, Xuefei Ning, Yu Wang

2601.16514 2026-04-03 cs.LG cs.AI math.OC

Finite-Time Analysis of Gradient Descent for Shallow Transformers

Enes Arda, Semih Cayci, Atilla Eryilmaz

Comments AISTATS 2026 camera-ready version

2601.15475 2026-04-03 cs.CV

Seeing through Light and Darkness: Sensor-Physics Grounded Deblurring HDR NeRF from Single-Exposure Images and Events

Yunshan Qi, Lin Zhu, Nan Bao, Yifan Zhao, Jia Li

Comments Accepted by the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026. Project Page: https://icvteam.github.io/See-NeRF.html. Our code and datasets are publicly available at https://github.com/iCVTEAM/See-NeRF

2601.11508 2026-04-03 cs.CV

ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes

Emily Steiner, Jianhao Zheng, Henry Howard-Jenkins, Chris Xie, Iro Armeni

Comments CVPR 2026

2601.10779 2026-04-03 cs.LG cs.AI

Unified Optimization of Source Weights and Transfer Quantities in Multi-Source Transfer Learning: An Asymptotic Framework

Qingyue Zhang, Chang Chu, Haohao Fu, Tianren Peng, Yanru Wu, Guanbo Huang, Yang Li, Shao-Lun Huang

2601.09724 2026-04-03 cs.CL cs.AI

Syntactic Framing Fragility: An Audit of Robustness in LLM Ethical Decisions

Katherine Elkins, Jon Chun

Comments 23 pages, 14 figures

2601.08462 2026-04-03 cs.AI

M3-BENCH: Process-Aware Evaluation of LLM Agents' Social Behaviors in Mixed-Motive Games

Sixiong Xie, Zhuofan Shi, Haiyang Shen, Yun Ma, Xiang Jing

2601.06810 2026-04-03 cs.LG cs.AI math-ph math.MP

WFR-FM: Simulation-Free Dynamic Unbalanced Optimal Transport

Qiangwei Peng, Zihan Wang, Junda Ying, Yuhao Sun, Qing Nie, Lei Zhang, Tiejun Li, Peijie Zhou

2601.05500 2026-04-03 cs.AI

The Illusion of AI Expertise Under Uncertainty: Navigating Elusive Ground Truth via a Probabilistic Paradigm

Aparna Elangovan, Lei Xu, Mahsa Elyasi, Ismail Akdulum, Mehmet Aksakal, Enes Gurun, Brian Hur, Saab Mansour, Ravid Shwartz Ziv, Karin Verspoor, Dan Roth

2601.05352 2026-04-03 cs.LG cs.CR cs.IR cs.SI

When the Server Steps In: Calibrated Updates for Fair Federated Learning

Tianrun Yu, Kaixiang Zhao, Cheng Zhang, Anjun Gao, Yueyang Quan, Zhuqing Liu, Minghong Fang

Comments To appear in WiOpt 2026

2601.04823 2026-04-03 cs.AI cs.CL

DR-LoRA: Dynamic Rank LoRA for Fine-Tuning Mixture-of-Experts Models

Guanzhi Deng, Bo Li, Ronghao Chen, Xiujin Liu, Zhuo Han, Huacan Wang, Lijie Wen, Linqi Song

2601.02991 2026-04-03 cs.CV cs.AI

Towards Faithful Reasoning in Comics for Small MLLMs

Chengcheng Feng, Haojie Yin, Yucheng Jin, Kaizhu Huang

2601.02031 2026-04-03 cs.LG cs.AI cs.CL

Output Embedding Centering for Stable LLM Pretraining

Felix Stollenwerk, Anna Lokrantz, Niclas Hertzberg

Comments Additional experiments using logit soft-capping & weight tying

2601.00609 2026-04-03 cs.RO cs.SY eess.SY

NMPC-Augmented Visual Navigation and Safe Learning Control for Large-Scale Mobile Robots

Mehdi Heydari Shahna, Pauli Mustalahti, Jouni Mattila

详情

DOI: 10.1109/LRA.2026.3669802
Journal ref: M. H. Shahna, P. Mustalahti and J. Mattila, "NMPC-Augmented Visual Navigation and Safe Learning Control for Large-Scale Mobile Robots," in IEEE Robotics and Automation Letters, vol. 11, no. 4, pp. 5182-5189, April 2026

英文摘要

A large-scale mobile robot (LSMR) is a high-order multibody system that often operates on loose, unconsolidated terrain, which reduces traction. This paper presents a comprehensive navigation and control framework for an LSMR that ensures stability and safety-defined performance, delivering robust operation on slip-prone terrain by jointly leveraging high-performance techniques. The proposed architecture comprises four main modules: (1) a visual pose-estimation module that fuses onboard sensors and stereo cameras to provide an accurate, low-latency robot pose, (2) a high-level nonlinear model predictive control that updates the wheel motion commands to correct robot drift from the robot reference pose on slip-prone terrain, (3) a low-level deep neural network control policy that approximates the complex behavior of the wheel-driven actuation mechanism in LSMRs, augmented with robust adaptive control to handle out-of-distribution disturbances, ensuring that the wheels accurately track the updated commands issued by high-level control module, and (4) a logarithmic safety module to monitor the entire robot stack and guarantees safe operation. The proposed low-level control framework guarantees uniform exponential stability of the actuation subsystem, while the safety module ensures the whole system-level safety during operation. Comparative experiments on a 6,000 kg LSMR actuated by two complex electro-hydrostatic drives, while synchronizing modules operating at different frequencies.

URL PDF HTML ☆

赞 0 踩 0

2512.21106 2026-04-03 cs.CL cs.AI cs.LG

Semantic Refinement with LLMs for Graph Representations

Safal Thapaliya, Zehong Wang, Jiazheng Li, Ziming Li, Yanfang Ye, Chuxu Zhang