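arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.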

2603.18082 2026-03-20 cs.MM cs.CV cs.SD

EgoAdapt: Enhancing Robustness in Egocentric Interactive Speaker Detection Under Missing Modalities

Xinyuan Qian, Xinjia Zhu, Alessio Brutti, Dong Liang

DOI: 10.1145/3797029

Abstract

The Talking to Me (TTM) task is a pivotal component in understanding human social interactions, aiming to determine who is engaged in conversation with the camera-wearer. Traditional models often face challenges in real-world scenarios because of missing visual data, neglect of head orientation, and background noise. This study addresses these limitations by introducing EgoAdapt, an adaptive framework designed for robust egocentric "Talking to Me" speaker detection under missing modalities. Specifically, EgoAdapt incorporates three key modules: (1) a Visual Speaker Target Recognition (VSTR) module that captures head orientation as a non-verbal cue and lip movement as a verbal cue, allowing a comprehensive interpretation of both verbal and non-verbal signals to address TTM, setting it apart from tasks focused solely on detecting speaking status; (2) a Parallel Shared-weight Audio (PSA) encoder for enhanced audio feature extraction in noisy environments; and (3) a Visual Modality Missing Awareness (VMMA) module that estimates the presence or absence of each modality at each frame to adjust the system response dynamically. Comprehensive evaluations on the TTM benchmark of the Ego4D dataset demonstrate that EgoAdapt achieves a mean Average Precision (mAP) of 67.39% and an Accuracy (Acc) of 62.01%, significantly outperforming the state-of-the-art method by 4.96% in Accuracy and 1.56% in mAP.

2603.18076 2026-03-20 q-bio.BM cs.LG physics.comp-ph

Generative Replica-Exchange: A Flow-based Framework for Accelerating Replica Exchange Simulations

Shengjie Huang, Sijie Yang, Jianqiao Yi, Rui Zheng, Haocong Liao, Muzammal Hussain, Yaoquan Tu, Xiaoyun Lu, Yang Zhou

2603.18063 2026-03-20 cs.CR cs.AI

MCP-38: A Comprehensive Threat Taxonomy for Model Context Protocol Systems (v1.0)

Yi Ting Shen, Kentaroh Toyoda, Alex Leung

Comments v1.0

2603.18054 2026-03-20 cs.AR cs.LG

An FPGA-Based SoC Architecture with a RISC-V Controller for Energy-Efficient Temporal-Coding Spiking Neural Networks

Mohammad Javad Sekonji, Ali Mahani, Maryam Mirsadeghi, Mahdi Taheri

2603.18043 2026-03-20 cs.MA cs.AI

The Provenance Paradox in Multi-Agent LLM Routing: Delegation Contracts and Attested Identity in LDP

Sunil Prakash

Comments 9 pages, 6 figures. Open-source: https://github.com/sunilp/ldp-protocol

2603.18042 2026-03-20 eess.IV cs.LG

A Novel Framework using Intuitionistic Fuzzy Logic with U-Net and U-Net++ Architecture: A Case Study of MRI Brain Image Segmentation

Hanuman Verma, Kiho Im, Akshansh Gupta, M. Tanveer

Comments 13 pages, 8 figures

2603.18039 2026-03-20 cs.NE cs.LG

Sharpness Aware Surrogate Training for Spiking Neural Networks

Maximilian Nicholson

2603.18034 2026-03-20 cs.CR cs.AI cs.LG

Semantic Chameleon: Corpus-Dependent Poisoning Attacks and Defenses in RAG Systems

Scott Thornton

Comments 10 pages, 5 figures

2603.18028 2026-03-20 cs.CY cs.AI q-bio.NC

Clinically Meaningful Explainability for NeuroAI: An ethical, technical, and clinical perspective

Laura Schopp, Ambra DImperio, Jalal Etesami, Marcello Ienca

Comments 20 pages, 2 figures

2603.18027 2026-03-20 eess.SP cs.AI cs.LG

KD-EKF: Knowledge-Distilled Adaptive Covariance EKF for Robust UWB/PDR Indoor Localization

Kyeonghyun Yoo, Wooyong Jung, Namkyung Yoon, Sangmin Lee, Sanghong Kim, Hwangnam Kim

Comments 16 pages, 7 figures

Abstract

Ultra-wideband (UWB) indoor localization provides centimeter-level accuracy and low latency, but its measurement reliability degrades severely under Non-Line-of-Sight (NLOS) conditions, leading to meter-scale ranging errors and inconsistent uncertainty characteristics. Inertial Measurement Unit (IMU)-based Pedestrian Dead Reckoning (PDR) complements UWB by providing infrastructure-free motion estimation; however, its error accumulates nonlinearly over time due to bias and noise propagation. Fusion methods based on Extended Kalman Filters (EKF) and Particle Filters (PF) can improve average localization accuracy through probabilistic state estimation. However, these approaches typically rely on manually tuned measurement covariances. Such fixed or heuristically tuned parameters are hard to sustain across varying indoor layouts, NLOS ratios, and motion patterns, leading to limited robustness and poor generalization of measurement uncertainty modeling in heterogeneous environments. To address this limitation, this work proposes an adaptive measurement covariance scaling framework in which reliability cues are learned from historical UWB/PDR trajectories. A large teacher model is employed offline to generate temporally consistent next-position predictions from structured UWB/PDR sequences, and this behavior is distilled into a lightweight student model suitable for real-time deployment. The student model continuously regulates EKF measurement covariances based on prediction residuals, enabling environment-aware fusion without manual re-tuning. Experimental results demonstrate that the proposed KD-EKF framework significantly reduces localization error, suppresses error spikes during Line-of-Sight (LOS)/NLOS transitions, and mitigates long-term drift compared to fixed-parameter EKF, thereby improving measurement robustness across diverse indoor environments.
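The residual-driven covariance scaling described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it uses a scalar Kalman filter rather than a full EKF, the class `AdaptiveFilter1D`, the quadratic scaling rule, and the `gain`, `q`, and `r0` values are all hypothetical, and the motion-propagated estimate stands in for the distilled student model's next-position prediction.

```python
from dataclasses import dataclass


@dataclass
class AdaptiveFilter1D:
    """Scalar Kalman filter whose measurement variance is rescaled per update."""
    x: float          # position estimate (m)
    p: float          # estimate variance (m^2)
    q: float = 0.01   # process noise added per PDR step (m^2), assumed value
    r0: float = 0.01  # nominal UWB measurement variance (m^2), assumed value

    def predict(self, pdr_step: float) -> None:
        # Propagate the state with the PDR displacement.
        self.x += pdr_step
        self.p += self.q

    def update(self, uwb_pos: float, predicted_pos: float, gain: float = 25.0) -> float:
        # Residual between the predictor's next-position guess and the UWB fix.
        residual = uwb_pos - predicted_pos
        # Hypothetical rule: inflate the measurement variance with the squared residual.
        r = self.r0 * (1.0 + gain * residual * residual)
        k = self.p / (self.p + r)          # Kalman gain
        self.x += k * (uwb_pos - self.x)   # state correction
        self.p *= (1.0 - k)                # variance correction
        return r


def run(measurements, steps, adaptive):
    """Return the position estimate after each update."""
    f = AdaptiveFilter1D(x=0.0, p=1.0)
    estimates = []
    for z, step in zip(measurements, steps):
        f.predict(step)
        # Stand-in for the student model: the motion-propagated estimate.
        # With adaptive=False the residual is zero, so the variance stays at r0.
        predicted = f.x if adaptive else z
        f.update(z, predicted)
        estimates.append(f.x)
    return estimates


# Walk 1 m per step; the third UWB reading is an NLOS-like outlier (true position 3 m).
uwb = [1.0, 2.0, 6.0, 4.0, 5.0]
pdr = [1.0, 1.0, 1.0, 1.0, 1.0]
fixed = run(uwb, pdr, adaptive=False)
adaptive = run(uwb, pdr, adaptive=True)
print("fixed   :", [round(v, 3) for v in fixed])
print("adaptive:", [round(v, 3) for v in adaptive])
```

In this toy setup the outlier pulls the fixed-variance filter far from the true position, while the adaptive variant largely discounts it; the real system would obtain the prediction from a learned model rather than from the filter's own state.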

2603.18026 2026-03-20 eess.SP cs.GR cs.LG

Physically Accurate Differentiable Inverse Rendering for Radio Frequency Digital Twin

Xingyu Chen, Xinyu Zhang, Kai Zheng, Xinmin Fang, Tzu-Mao Li, Chris Xiaoxuan Lu, Zhengxiong Li

2603.18025 2026-03-20 cs.CY cs.AI stat.AP

Understanding the Relationship Between Firms' AI Technology Innovation and Consumer Complaints

Yongchao Martin Ma, Zhongzhun Deng

2603.18024 2026-03-20 eess.AS cs.AI cs.CL cs.SD

ProKWS: Personalized Keyword Spotting via Collaborative Learning of Phonemes and Prosody

Jianan Pan, Yuanming Zhang, Kejie Huang

2603.18023 2026-03-20 eess.AS cs.AI cs.CL cs.SD

PCOV-KWS: Multi-task Learning for Personalized Customizable Open Vocabulary Keyword Spotting

Jianan Pan, Kejie Huang

2603.18022 2026-03-20 math.OC cs.AI cs.SY eess.SY

Using Laplace Transform To Optimize the Hallucination of Generation Models

Cheng Kang, Xinye Chen, Daniel Novak, Xujing Yao

Comments Corresponding author: Xujing Yao (xjyao@njtech.edu.cn)

2603.15571 2026-03-20 cs.AR cs.LG cs.SY eess.SY physics.app-ph

Co-Design of Memory-Storage Systems for Workload Awareness with Interpretable Models

Jay Sarkar, Vamsi Pavan Rayaprolu, Abhijeet Bhalerao

Comments 9 pages, 10 figures

2603.14601 2026-03-20 math.ST cs.LG math.PR stat.TH

$K-$means with learned metrics

Pablo Groisman, Matthieu Jonckheere, Jordan Serres, Mariela Sued

2603.12214 2026-03-20 cs.DC cs.AI

WORKSWORLD: A Domain for Integrated Numeric Planning and Scheduling of Distributed Pipelined Workflows

Taylor Paul, William Regli

Comments To be published in Proceedings of the International Conference on Automated Planning and Scheduling Volume 36 (2026)

2603.00601 2026-03-20 cs.SE cs.AI

Theory of Code Space: Do Code Agents Understand Software Architecture?

Grigory Sapunov

Comments updated figures and numbers

2602.13647 2026-03-20 cs.IR cs.AI

SF-RAG: Structure-Fidelity Retrieval-Augmented Generation for Academic Question Answering

Rui Yu, Tianyi Wang, Ruixia Liu, Yinglong Wang

2602.06175 2026-03-20 math.ST cs.AI cs.LG stat.ML stat.TH

Optimal rates for density and mode estimation with expand-and-sparsify representations

Kaushik Sinha, Christopher Tosh

Comments Accepted at AISTATS 2026

2602.02469 2026-03-20 cs.IT cs.LG eess.SP math.IT

Age-Aware Edge-Blind Federated Learning via Over-the-Air Aggregation

Ahmed M. Elshazly, Ahmed Arafa

Comments To appear in IEEE ICC 2026

2602.01671 2026-03-20 cs.HC cs.AI cs.CR

AI-Assisted Adaptive Rendering for High-Frequency Security Telemetry in Web Interfaces

Mona Rajhans

Comments To appear in IEEE ICCA 2025 proceedings

2601.10971 2026-03-20 cs.CR cs.CL

AJAR: Adaptive Jailbreak Architecture for Red-teaming

Yipu Dou, Wang Yang

Comments 7 pages, 3 figures. Code and data available at https://github.com/douyipu/ajar

2512.20305 2026-03-20 stat.ML cs.LG

A Structured Nonparametric Framework for Nonlinear Accelerated Failure Time Models (KAN-AFT)

Mebin Jose, Jisha Francis, Sudheesh Kumar Kattumannil

Comments A new development in Survival Analysis based on the celebrated Kolmogorov-Arnold Networks (KANs)

2512.18561 2026-03-20 cs.MA cs.AI

Adaptive Accountability in Networked MAS: Tracing and Mitigating Emergent Norms at Scale

Saad Alqithami

2511.18415 2026-03-20 cs.MM cs.CV

DuoTeach: Dual Role Self-Teaching for Coarse-to-Fine Decision Coordination in Vision-Language Models

Wei Yang, Yiran Zhu, Zilin Li, Xunjia Zhang, Jun Xia, Hongtao Wang

2511.14763 2026-03-20 cs.IR cs.AI

Membership Inference Attack against Large Language Model-based Recommendation Systems: A New Distillation-based Paradigm

Li Cuihong, Huang Xiaowen, Yin Chuanhuan, Sang Jitao

2510.20012 2026-03-20 stat.AP cs.CV

AI Pose Analysis and Kinematic Profiling of Range-of-Motion Variations in Resistance Training

Adam Diamant

2509.25722 2026-03-20 eess.SP cs.IT cs.LG math.IT

Transformer-Based Rate Prediction for Multi-Band Cellular Handsets

Ruibin Chen, Haozhe Lei, Hao Guo, Marco Mezzavilla, Hitesh Poddar, Tomoki Yoshimura, Sundeep Rangan

Comments Accepted to IEEE ICC 2026 Workshop on Intelligent Movable and Reconfigurable Antennas for Future Wireless Communication and Sensing (WS02)