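arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.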

2603.23650 2026-03-26 cs.CV

Foundation Model Embeddings Meet Blended Emotions: A Multimodal Fusion Approach for the BLEMORE Challenge

Masoumeh Chapariniya, Aref Farhadipour, Sarah Ebling, Volker Dellwo, Teodora Vukovic

Abstract

We present our system for the BLEMORE Challenge at FG 2026 on blended emotion recognition with relative salience prediction. Our approach combines six encoder families through late probability fusion: an S4D-ViTMoE face encoder adapted with soft-label KL training, frozen layer-selective Wav2Vec2 audio features, finetuned body-language encoders (TimeSformer, VideoMAE), and -- for the first time in emotion recognition -- Gemini Embedding 2.0, a large multimodal model whose video embeddings produce competitive presence accuracy (ACCP = 0.320) from only 2 seconds of input. Three key findings emerge from our experiments: selecting prosody-encoding layers (6--12) from frozen Wav2Vec2 outperforms end-to-end finetuning (Score 0.207 vs. 0.161), as the non-verbal nature of BLEMORE audio makes phonetic layers irrelevant; the post-processing salience threshold $\beta$ varies from 0.05 to 0.43 across folds, revealing that personalized expression styles are the primary bottleneck; and task-adapted encoders collectively receive 62% of ensemble weight over general-purpose baselines. Our 12-encoder system achieves Score = 0.279 (ACCP = 0.391, ACCS = 0.168) on the test set, placing 6th.

2603.23646 2026-03-26 cs.CL cs.AI

Swiss-Bench SBP-002: A Frontier Model Comparison on Swiss Legal and Regulatory Tasks

Fatih Uenal

Comments 21 pages, 5 figures, 7 tables. Code and data: https://github.com/FUenal/swiss-bench

2603.23627 2026-03-26 cs.CV cs.AI

Ukrainian Visual Word Sense Disambiguation Benchmark

Yurii Laba, Yaryna Mohytych, Ivanna Rohulia, Halyna Kyryleyza, Hanna Dydyk-Meush, Oles Dobosevych, Rostyslav Hryniv

2603.23626 2026-03-26 cs.LG cond-mat.stat-mech cs.AI cs.CL nlin.AO

A Theory of LLM Information Susceptibility

Zhuo-Yang Song, Hua Xing Zhu

Comments 16 pages, 9 figures

2603.23625 2026-03-26 cs.AI cs.CL

Evaluating a Multi-Agent Voice-Enabled Smart Speaker for Care Homes: A Safety-Focused Framework

Zeinab Dehghani, Rameez Raja Kureshi, Koorosh Aslansefat, Faezeh Alsadat Abedi, Dhavalkumar Thakker, Lisa Greaves, Bhupesh Kumar Mishra, Baseer Ahmad, Tanaya Maslekar

Abstract

Artificial intelligence (AI) is increasingly being explored in health and social care to reduce administrative workload and allow staff to spend more time on patient care. This paper evaluates a voice-enabled Care Home Smart Speaker designed to support everyday activities in residential care homes, including spoken access to resident records, reminders, and scheduling tasks. A safety-focused evaluation framework is presented that examines the system end-to-end, combining Whisper-based speech recognition with retrieval-augmented generation (RAG) approaches (hybrid, sparse, and dense). Using supervised care-home trials and controlled testing, we evaluated 330 spoken transcripts across 11 care categories, including 184 reminder-containing interactions. These evaluations focus on (i) correct identification of residents and care categories, (ii) reminder recognition and extraction, and (iii) end-to-end scheduling correctness under uncertainty (including safe deferral/clarification). Given the safety-critical nature of care homes, particular attention is also paid to reliability in noisy environments and across diverse accents, supported by confidence scoring, clarification prompts, and human-in-the-loop oversight. In the best-performing configuration (GPT-5.2), resident ID and care category matching reached 100% (95% CI: 98.86-100), while reminder recognition reached 89.09% (95% CI: 83.81-92.80) with zero missed reminders (100% recall) but some false positives. End-to-end scheduling via calendar integration achieved 84.65% exact reminder-count agreement (95% CI: 78.00-89.56), indicating remaining edge cases in converting informal spoken instructions into actionable events. The findings suggest that voice-enabled systems, when carefully evaluated and appropriately safeguarded, can support accurate documentation, effective task management, and trustworthy use of AI in care home settings.

2603.23624 2026-03-26 cs.CL

Revisiting Real-Time Digging-In Effects: No Evidence from NP/Z Garden-Paths

Amani Maina-Kilaas, Roger Levy

Comments 8 pages, 5 figures

2603.23617 2026-03-26 cs.CV

M3T: Discrete Multi-Modal Motion Tokens for Sign Language Production

Alexandre Symeonidis-Herzig, Jianhe Low, Ozge Mercanoglu Sincan, Richard Bowden

2603.23584 2026-03-26 cs.LG cs.AI q-fin.CP

LineMVGNN: Anti-Money Laundering with Line-Graph-Assisted Multi-View Graph Neural Networks

Chung-Hoo Poon, James Kwok, Calvin Chow, Jang-Hyeon Choi

Comments Published as a journal paper in AI 2025

2603.23580 2026-03-26 cs.LG

MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis

Wei Sun, Ting Wang, Xinran Tian, Wanshun Lan, Xuhan Feng, Haoyue Li, Fangxin Wang

2603.23578 2026-03-26 cs.LG physics.comp-ph

Residual Attention Physics-Informed Neural Networks for Robust Multiphysics Simulation of Steady-State Electrothermal Energy Systems

Yuqing Zhou, Ze Tao, Fujun Liu

2603.23577 2026-03-26 cs.LG cs.CL cs.CY

The Geometric Price of Discrete Logic: Context-driven Manifold Dynamics of Number Representations

Long Zhang, Dai-jun Lin, Wei-neng Chen

2603.23575 2026-03-26 cs.LG cs.AI

APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs

Meriem Bouzouad, Yuan-Hao Chang, Jalil Boukhobza

2603.23574 2026-03-26 cs.LG cs.AI

PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning

Tao Liu, Jiguang Lv, Dapeng Man, Weiye Xi, Yaole Li, Feiyu Zhao, Kuiming Wang, Yingchao Bian, Chen Xu, Wu Yang

2603.23573 2026-03-26 cs.LG cs.AI

Dual-Criterion Curriculum Learning: Application to Temporal Data

Gaspard Abel, Eloi Campagne, Mohamed Benloughmari, Argyris Kalogeratos

2603.23571 2026-03-26 cs.LG cs.AI

StateLinFormer: Stateful Training Enhancing Long-term Memory in Navigation

Zhiyuan Chen, Yuxuan Zhong, Fan Wang, Bo Yu, Pengtao Shao, Shaoshan Liu, Ning Ding

Comments 9 pages, 4 figures

2603.23568 2026-03-26 cs.LG stat.ML

Causal Reconstruction of Sentiment Signals from Sparse News Data

Stefania Stan, Marzio Lunghi, Vito Vargetto, Claudio Ricci, Rolands Repetto, Brayden Leo, Shao-Hong Gan

Comments 28 pages, 2 figures, 14 tables

2603.23558 2026-03-26 cs.LG cs.AI

Upper Entropy for 2-Monotone Lower Probabilities

Tuan-Anh Vu, Sébastien Destercke, Frédéric Pichon

Comments 14 pages, 3 figures

2603.23550 2026-03-26 cs.LG

Implicit Turn-Wise Policy Optimization for Proactive User-LLM Interaction

Haoyu Wang, Yuxin Chen, Liang Luo, Buyun Zhang, Ellie Dingqiao Wen, Pan Li

2603.23539 2026-03-26 cs.AI cs.CL cs.LG nlin.AO

PLDR-LLMs Reason At Self-Organized Criticality

Burc Gokden

2603.23534 2026-03-26 cs.CL cs.LG

Not All Pretraining are Created Equal: Threshold Tuning and Class Weighting for Imbalanced Polarization Tasks in Low-Resource Settings

Abass Oguntade

2603.23532 2026-03-26 cs.CL cs.AI

Generating Hierarchical JSON Representations of Scientific Sentences Using LLMs

Satya Sri Rajiteswari Nimmagadda, Ethan Young, Niladri Sengupta, Ananya Jana, Aniruddha Maiti

Comments accepted to 21th International Conference on Semantic Computing (IEEE ICSC 2026)

2603.23529 2026-03-26 cs.CL cs.AI

Konkani LLM: Multi-Script Instruction Tuning and Evaluation for a Low-Resource Indian Language

Reuben Chagas Fernandes, Gaurang S. Patkar

2603.23528 2026-03-26 cs.CL

The Compression Paradox in LLM Inference: Provider-Dependent Energy Effects of Prompt Compression

Warren Johnson

Comments 16 pages, 5 figures, 5 tables. Includes data/code availability, ethics statement, and competing interests

2603.23527 2026-03-26 cs.CL

Compression Method Matters: Benchmark-Dependent Output Dynamics in LLM Prompt Compression

Warren Johnson

Comments 19 pages. Includes figures and tables. Companion code/data repository and direct NVML calibration dataset are cited in manuscript

2603.23526 2026-03-26 cs.CL cs.HC cs.MA

Plato's Cave: A Human-Centered Research Verification System

Matheus Kunzler Maldaner, Raul Valle, Junsung Kim, Tonuka Sultan, Pranav Bhargava, Matthew Maloni, John Courtney, Hoang Nguyen, Aamogh Sawant, Kristian O'Connor, Stephen Wormald, Damon L. Woodard

Comments 15 pages, 4 figures

2603.23525 2026-03-26 cs.CL

Prompt Compression in Production Task Orchestration: A Pre-Registered Randomized Trial

Warren Johnson, Charles Lee

Comments 28 pages, 9 tables, 1 CONSORT figure; pre-registered randomized controlled trial on production orchestration prompts

2603.23524 2026-03-26 cs.CL cs.AI

Navigating the Concept Space of Language Models

Wilson E. Marcílio-Jr, Danilo M. Eler

2603.23523 2026-03-26 cs.CL cs.RO

Do 3D Large Language Models Really Understand 3D Spatial Relationships?

Xianzheng Ma, Tao Sun, Shuai Chen, Yash Bhalgat, Jindong Gu, Angel X Chang, Iro Armeni, Iro Laina, Songyou Peng, Victor Adrian Prisacariu

Comments ICLR 2026

2603.23522 2026-03-26 cs.CL cs.AI

Qworld: Question-Specific Evaluation Criteria for LLMs

Shanghua Gao, Yuchang Su, Pengwei Sui, Curtis Ginder, Marinka Zitnik

2603.23521 2026-03-26 cs.CL cs.AI cs.CV

Chitrakshara: A Large Multilingual Multimodal Dataset for Indian languages

Shaharukh Khan, Ali Faraz, Abhinav Ravi, Mohd Nauman, Mohd Sarfraz, Akshat Patidar, Raja Kolla, Chandra Khatri, Shubham Agarwal

Comments Accepted at "CVPR 2025: Workshop Vision Language Models For All"