arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.07102 2026-04-09 cs.CL cs.AI

The Impact of Steering Large Language Models with Persona Vectors in Educational Applications

Yongchao Wu, Aron Henriksson

详情

英文摘要

Activation-based steering can personalize large language models at inference time, but its effects in educational settings remain unclear. We study persona vectors for seven character traits in short-answer generation and automated scoring on the ASAP-SAS benchmark across three models spanning two architectures. Persona steering lowers answer quality overall, with much larger effects on open-ended English Language Arts (ELA) prompts than on factual science prompts; interpretive and argumentative tasks are up to 11x more sensitive. On the scoring side, we observe predictable valence-aligned calibration shifts: evil and impolite scorers grade more harshly, while good and optimistic scorers grade more leniently. ELA tasks are 2.5-3x more susceptible to scorer personalization than science tasks, and the Mixture-of-Experts model shows roughly 6x larger calibration shifts than the dense models. To our knowledge, this is the first study to systematically examine the effects of activation-steered persona traits in educational generation and scoring, and the results highlight the need for task-aware and architecture-aware calibration when deploying steered models in educational settings.

URL PDF HTML ☆

赞 0 踩 0

2604.07101 2026-04-09 cs.CV cs.AI cs.MM eess.IV

SurFITR: A Dataset for Surveillance Image Forgery Detection and Localisation

Qizhou Wang, Guansong Pang, Christopher Leckie

2604.07097 2026-04-09 cs.CV

Novel Anomaly Detection Scenarios and Evaluation Metrics to Address the Ambiguity in the Definition of Normal Samples

Reiji Saito, Satoshi Kamiya, Kazuhiro Hotta

Comments Accepted by CVPR 2026 Workshop

2604.07095 2026-04-09 cs.CL

Multilingual Embedding Probes Fail to Generalize Across Learner Corpora

Laurits Lyngbaek, Ross Deans Kristensen-McLachlan

2604.07084 2026-04-09 cs.RO cs.AI

Flow Motion Policy: Manipulator Motion Planning with Flow Matching Models

Davood Soleymanzadeh, Xiao Liang, Minghui Zheng

2604.07082 2026-04-09 physics.comp-ph cs.NA math.NA

Granular mixing and flow dynamics in horizontal stirred bed reactors

Sahar Pourandi, Igor Ostanin, Thomas Weinhart

2604.07081 2026-04-09 eess.SY cs.SY

Small-gain analysis of exponential incremental input/output-to-state stability for large-scale distributed systems

Christian Gatke, Julian D. Schiller, Matthias A. Müller

Comments This work has been submitted to the IEEE for possible publication

2604.07079 2026-04-09 cs.IR

MARVEL: Multimodal Adaptive Reasoning-intensiVe Expand-rerank and retrievaL

Mahmoud SalahEldin Kasem, Mohamed Mahmoud, Mostafa Farouk Senussi, Mahmoud Abdalla, Abdelrahman Abdallah, Hyun-Soo Kang

2604.07072 2026-04-09 cs.LG

Epistemic Robust Offline Reinforcement Learning

Abhilash Reddy Chenreddy, Erick Delage

2604.07071 2026-04-09 cs.HC cs.CR

BioMoTouch: Touch-Based Behavioral Authentication via Biometric-Motion Interaction Modeling

Zijian Ling, Jianbang Chen, Hongwei Li, Hongda Zhai, Man Zhou, Jun Feng, Zhengxiong Li, Qi Li, Qian Wang

Comments 13 pages

2604.07069 2026-04-09 eess.SY cs.LG cs.SY math.DS

Controller Design for Structured State-space Models via Contraction Theory

Muhammad Zakwan, Vaibhav Gupta, Alireza Karimi, Efe C. Balta, Giancarlo Ferrari-Trecate

Comments The first and second authors contributed equally. The paper has been accepted in 24th European Control Conference (ECC) in Reykjavik, Iceland, 2026

2604.07067 2026-04-09 cs.CL

Is Cross-Lingual Transfer in Bilingual Models Human-Like? A Study with Overlapping Word Forms in Dutch and English

Iza Škrjanec, Irene Elisabeth Winther, Vera Demberg, Stefan L. Frank

2604.07066 2026-04-09 cs.CL

SemEval-2026 Task 3: Dimensional Aspect-Based Sentiment Analysis (DimABSA)

Liang-Chih Yu, Jonas Becker, Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Lung-Hao Lee, Ying-Lung Lin, Jin Wang, Jan Philip Wahle, Terry Ruas, Natalia Loukachevitch, Alexander Panchenko, Ilseyar Alimova, Lilian Wanzare, Nelson Odhiambo, Bela Gipp, Kai-Wei Chang, Saif M. Mohammad

2604.07065 2026-04-09 eess.SY cs.SY

Trust-as-a-Service: Task-Specific Orchestration for Effective Task Completion via Model Context Protocol-Aided Agentic AI

Botao Zhu, Xianbin Wang

2604.07064 2026-04-09 eess.SY cs.SY

TSO-DSO Coordinated Reactive Power Dispatch for Smart Inverters with Multiple Control Modes Real-Time Implementation

Mohammad Almomani, Ahmed Alkhonain, Venkataramana Ajjarapu

2604.07059 2026-04-09 cs.LG

Production-Ready Automated ECU Calibration using Residual Reinforcement Learning

Andreas Kampmeier, Kevin Badalian, Lucas Koch, Sung-Yong Lee, Jakob Andert

Comments This manuscript has been submitted to SAE as a conference paper for the 2026 Stuttgart International Symposium on Automotive and Powertrain Technology

2604.07058 2026-04-09 cs.FL

The Quadratic State Cost of Classical Simulation of One-Way Quantum Finite Automata

Zeyu Chen, Junde Wu

2604.07057 2026-04-09 cs.CL

IndoBERT-Sentiment: Context-Conditioned Sentiment Classification for Indonesian Text

Muhammad Apriandito Arya Saputra, Andry Alamsyah, Dian Puteri Ramadhani, Thomhert Suprapto Siadari, Hanif Fakhrurroja

Comments 8 pages, 5 tables, and 2 figures

2604.07051 2026-04-09 eess.SY cs.SY

Trajectory-Based Nonlinear Indices for Real-Time Monitoring and Quantification of Short-Term Voltage Stability

Mohammad Almomani, Muhammad Sarwar, Venkataramana Ajjarapu

2604.07041 2026-04-09 cs.DB cs.AI cs.ET cs.HC cs.IR

AV-SQL: Decomposing Complex Text-to-SQL Queries with Agentic Views

Minh Tam Pham, Trinh Pham, Tong Chen, Hongzhi Yin, Quoc Viet Hung Nguyen, Thanh Tam Nguyen

2604.07038 2026-04-09 cs.RO q-bio.NC

Exploring the proprioceptive potential of joint receptors using a biomimetic robotic joint

Akihiro Miki, Shun Hasegawa, Sota Yuzaki, Yuta Sahara, Yoshimoto Ribayashi, Kento Kawaharazuka, Kei Okada

Comments 26 pages including supplementary materials (17 pages main text), 6 main figures and 7 supplementary figures. Published in Scientific Reports

2604.07037 2026-04-09 hep-ex cs.CV

Towards foundation-style models for energy-frontier heterogeneous neutrino detectors via self-supervised pre-training

Saúl Alonso-Monsalve, Fabio Cufino, Umut Kose, Anna Mascellani, André Rubbia

Comments 18 pages, 6 figures

详情

英文摘要

Accelerator-based neutrino physics is entering an energy-frontier regime in which interactions reach the TeV scale and produce exceptionally dense, overlapping detector signatures. In this regime, event interpretation becomes impractical for conventional reconstruction approaches, particularly when labelled data are scarce and the analysis spans diverse downstream objectives. We present a sparse ViT framework for learning reusable representations from heterogeneous detector data. Self-supervised pre-training combines masked autoencoder reconstruction with relational voxel-level objectives for hierarchy, ghost and particle identification, and the resulting shared encoder is then jointly fine-tuned across classification and regression tasks. Evaluated on simulated events from the proposed FASERCal concept at the LHC, we find that pre-training consistently improves neutrino flavour and charm-quark identification, momentum regression, and vertex reconstruction over training from scratch, with the addition of relational objectives yielding further gains in the most topologically complex channels. Interpretability analyses further show that pre-training yields a more structured latent space, while detector-subsystem ablations recover physically plausible channel-dependent roles for the heterogeneous inputs. A data-efficiency study shows that, with roughly $10^3$ labelled events, the pre-trained encoder already matches the flavour-classification performance of a randomly initialised model trained on an order of magnitude more data. The learned representations also transfer effectively to publicly available benchmarks spanning different detector technologies and energy scales, matching or exceeding published baselines. These results support self-supervised pre-training on multimodal detector data as a scalable route towards reusable representations for neutrino and particle-detector analysis.

URL PDF HTML ☆

赞 0 踩 0

2604.07036 2026-04-09 cs.CL cs.LG cs.MA

ReDAct: Uncertainty-Aware Deferral for LLM Agents

Dzianis Piatrashyn, Nikita Kotelevskii, Kirill Grishchenkov, Nikita Glazkov, Ivan Nasonov, Ilya Makarov, Timothy Baldwin, Preslav Nakov, Roman Vashurin, Maxim Panov

2604.07034 2026-04-09 cs.RO cs.AI cs.CV

KITE: Keyframe-Indexed Tokenized Evidence for VLM-Based Robot Failure Analysis

Mehdi Hosseinzadeh, King Hang Wong, Feras Dayoub

Comments ICRA 2026; Project page: https://m80hz.github.io/kite/

2604.07030 2026-04-09 cs.LG

MoE Routing Testbed: Studying Expert Specialization and Routing Behavior at Small Scale

Tobias Falke, Nicolas Anastassacos, Samson Tan, Chankrisna Richy Meas, Chandana Satya Prakash, Nitesh Sekhar, M Saiful Bari, Krishna Kompella, Gamaleldin F. Elsayed

2604.07029 2026-04-09 physics.soc-ph cs.CY

Quality assessment of a country-wide bicycle node network with loop census analysis

Michael Szell, Anastassia Vybornova, Ane Rahbek Vierø

Comments Main text: 12 pages, 6 figures. SI: 10 pages, 8 figures

2604.07027 2026-04-09 cs.LG

Learning to Query History: Nonstationary Classification via Learned Retrieval

Jimmy Gammell, Bishal Thapaliya, Yoon Jung, Riyasat Ohib, Bilel Fehri, Deepayan Chakrabarti

Comments Accepted to ICLR 2026 Workshop on Time Series in the Age of Large Models (TSALM). 12 pages, 6 figures

2604.07026 2026-04-09 cs.CV

Not all tokens contribute equally to diffusion learning

Guoqing Zhang, Lu Shi, Wanru Xu, Linna Zhang, Sen Wang, Fangfang Wang, Yigang Cen

详情

英文摘要

With the rapid development of conditional diffusion models, significant progress has been made in text-to-video generation. However, we observe that these models often neglect semantically important tokens during inference, leading to biased or incomplete generations under classifier-free guidance. We attribute this issue to two key factors: distributional bias caused by the long-tailed token frequency in training data, and spatial misalignment in cross-attention where semantically important tokens are overshadowed by less informative ones. To address these issues, we propose Distribution-Aware Rectification and Spatial Ensemble (DARE), a unified framework that improves semantic guidance in diffusion models from the perspectives of distributional debiasing and spatial consistency. First, we introduce Distribution-Rectified Classifier-Free Guidance (DR-CFG), which regularizes the training process by dynamically suppressing dominant tokens with low semantic density, encouraging the model to better capture underrepresented semantic cues and learn a more balanced conditional distribution. This design mitigates the risk of the model distribution overfitting to tokens with low semantic density. Second, we propose Spatial Representation Alignment (SRA), which adaptively reweights cross-attention maps according to token importance and enforces representation consistency, enabling semantically important tokens to exert stronger spatial guidance during generation. This mechanism effectively prevents low semantic-density tokens from dominating the attention allocation, thereby avoiding the dilution of the spatial and distributional guidance provided by high semantic-density tokens. Extensive experiments on multiple benchmark datasets demonstrate that DARE consistently improves generation fidelity and semantic alignment, achieving significant gains over existing approaches.

URL PDF HTML ☆

赞 0 踩 0

2604.07025 2026-04-09 math.DS cs.LG cs.NA math.NA

Physics-Informed Functional Link Constrained Framework with Domain Mapping for Solving Bending Analysis of an Exponentially Loaded Perforated Beam

Iswari Sahu, Ramanath Garai, S. Chakraverty

2604.07023 2026-04-09 cs.CL

MARS: Enabling Autoregressive Models Multi-Token Generation

Ziqi Jin, Lei Wang, Ziwei Luo, Aixin Sun

Comments 15 pages, 4 fugures