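arXivDaily daily academic digest: synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and more.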

2604.00419 2026-04-02 cs.LG cs.AI

G-Drift MIA: Membership Inference via Gradient-Induced Feature Drift in LLMs

Ravi Ranjan, Utkarsh Grover, Xiaomin Lin, Agoritsa Polyzou

Comments 14 pages, 3 figures and tables. Accepted at the ICPR-2026 conference; to appear in the Springer LNCS proceedings

Abstract

Large language models (LLMs) are trained on massive web-scale corpora, raising growing concerns about privacy and copyright. Membership inference attacks (MIAs) aim to determine whether a given example was used during training. Existing LLM MIAs largely rely on output probabilities or loss values and often perform only marginally better than random guessing when members and non-members are drawn from the same distribution. We introduce G-Drift MIA, a white-box membership inference method based on gradient-induced feature drift. Given a candidate (x, y), we apply a single targeted gradient-ascent step that increases its loss and measure the resulting changes in internal representations, including logits, hidden-layer activations, and projections onto fixed feature directions, before and after the update. These drift signals are used to train a lightweight logistic classifier that effectively separates members from non-members. Across multiple transformer-based LLMs and datasets derived from realistic MIA benchmarks, G-Drift substantially outperforms confidence-based, perplexity-based, and reference-based attacks. We further show that memorized training samples systematically exhibit smaller and more structured feature drift than non-members, providing a mechanistic link between gradient geometry, representation stability, and memorization. Overall, our results demonstrate that small, controlled gradient interventions offer a practical tool for auditing training-data membership and assessing privacy risks in LLMs.
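
The drift measurement described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a tiny linear softmax classifier in place of an LLM, uses only logit-level drift (no hidden-layer activations or fixed feature directions), and the names `ascent_step` and `drift_features` and the step size `lr` are hypothetical. In the paper's setting, features like these would be fed to a logistic classifier, which is not shown here.

```python
import math

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def logits(W, x):
    # Linear "model": one logit per row of W (a stand-in for an LLM head).
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def loss(W, x, y):
    # Cross-entropy loss of the candidate (x, y).
    return -math.log(softmax(logits(W, x))[y])

def ascent_step(W, x, y, lr):
    """One gradient-ascent step on the cross-entropy loss of (x, y)."""
    p = softmax(logits(W, x))
    # dL/dW[k][j] = (p[k] - 1[k == y]) * x[j]; adding it raises the loss.
    return [[w + lr * (p[k] - (1.0 if k == y else 0.0)) * xj
             for w, xj in zip(row, x)]
            for k, row in enumerate(W)]

def drift_features(W, x, y, lr=0.1):
    """Compare logits before and after the single update (hypothetical features)."""
    before = logits(W, x)
    W_new = ascent_step(W, x, y, lr)
    after = logits(W_new, x)
    delta = [a - b for a, b in zip(after, before)]
    return {
        "logit_drift_l2": math.sqrt(sum(d * d for d in delta)),
        "target_logit_drift": delta[y],
        "loss_increase": loss(W_new, x, y) - loss(W, x, y),
    }

# Made-up weights and candidate, for illustration only.
W = [[0.5, -0.2], [0.1, 0.3], [-0.4, 0.2]]
feats = drift_features(W, x=[1.0, 2.0], y=1)
print(feats)
```

The update is applied to a copy of the weights, so the original model is left untouched between candidates; that design choice is an assumption of this sketch rather than something the abstract specifies.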

2604.00416 2026-04-02 cs.RO cs.AI cs.CV cs.LG

Learning Humanoid Navigation from Human Data

Weizhuo Wang, Yanjie Ze, C. Karen Liu, Monroe Kennedy

Comments 8 pages, 8 figures

2604.00414 2026-04-02 cs.AI cs.LG

Decision-Centric Design for LLM Systems

Wei Sun

2604.00404 2026-04-02 cs.CV

The 1st Winner for 5th PVUW MeViS-Text Challenge: Strong MLLMs Meet SAM3 for Referring Video Object Segmentation

Xusheng He, Canyang Wu, Jinrong Zhang, Weili Guan, Jianlong Wu, Liqiang Nie

Comments 1st Place Solution for the 5th PVUW MeViS-Text Challenge (CVPR 2026 Workshop)

2604.00401 2026-04-02 cs.RO

Sampling-based Task and Kinodynamic Motion Planning under Semantic Uncertainty

Qi Heng Ho, Zachary N. Sunberg, Morteza Lahijanian

2604.00399 2026-04-02 cs.LG

A Cross-graph Tuning-free GNN Prompting Framework

Yaqi Chen, Shixun Huang, Ryan Twemlow, Lei Wang, John Le, Sheng Wang, Willy Susilo, Jun Yan, Jun Shen

2604.00397 2026-04-02 cs.CV cs.AI

Improving Generalization of Deep Learning for Brain Metastases Segmentation Across Institutions

Yuchen Yang, Shuangyang Zhong, Haijun Yu, Langcuomu Suo, Hongbin Han, Florian Putz, Yixing Huang

Comments 5 figures and 1 table

2604.00396 2026-04-02 cs.CV

VLM-in-the-Loop: A Plug-In Quality Assurance Module for ECG Digitization Pipelines

Jiachen Li, Shihao Li, Soovadeep Bakshi, Wei Li, Dongmei Chen

2604.00395 2026-04-02 cs.CV

Advancing Complex Video Object Segmentation via Tracking-Enhanced Prompt: The 1st Winner for 5th PVUW MOSE Challenge

Jinrong Zhang, Canyang Wu, Xusheng He, Weili Guan, Jianlong Wu, Liqiang Nie

Comments 1st Place Solution for the 5th PVUW MOSE Challenge (CVPR 2026 Workshop)

2604.00391 2026-04-02 cs.RO cs.SY eess.SY

Behavioral Score Diffusion: Model-Free Trajectory Planning via Kernel-Based Score Estimation from Data

Shihao Li, Jiachen Li, Jiamin Xu, Dongmei Chen

2604.00388 2026-04-02 cs.LG cs.SY eess.SY

Gradient-Based Data Valuation Improves Curriculum Learning for Game-Theoretic Motion Planning

Shihao Li, Jiachen Li, Dongmei Chen

2604.00385 2026-04-02 cs.LG

GUIDE: Reinforcement Learning for Behavioral Action Support in Type 1 Diabetes

Saman Khamesian, Sri Harini Balaji, Di Yang Shi, Stephanie M. Carpenter, Daniel E. Rivera, W. Bradley Knox, Peter Stone, Hassan Ghasemzadeh

2604.00383 2026-04-02 cs.CV

Mine-JEPA: In-Domain Self-Supervised Learning for Mine-Like Object Classification in Side-Scan Sonar

Taeyoun Kwon, Youngwon Choi, Hyeonyu Kim, Myeongkyun Cho, Junhyeok Choi, Moon Hwan Kim

Comments 9 pages, 3 figures, 6 tables. Accepted at CVPR 2026 MACVi Workshop

2604.00382 2026-04-02 cs.CV eess.SP

mmAnomaly: Leveraging Visual Context for Robust Anomaly Detection in the Non-Visual World with mmWave Radar

Tarik Reza Toha, Shao-Jung Lu, Mahathir Monjur, Shahriar Nirjon

Comments Accepted at the 24th ACM/IEEE International Conference on Embedded Artificial Intelligence and Sensing Systems (SenSys 2026)

2604.00381 2026-04-02 cs.CV

UCMNet: Uncertainty-Aware Context Memory Network for Under-Display Camera Image Restoration

Daehyun Kim, Youngmin Kim, Yoon Ju Oh, Tae Hyun Kim

Comments We propose UCMNet, an uncertainty-aware adaptive framework that restores high-frequency details in regions with varying levels of degradation in under-display camera images

2604.00375 2026-04-02 cs.CL

Locally Confident, Globally Stuck: The Quality-Exploration Dilemma in Diffusion Language Models

Liancheng Fang, Aiwei Liu, Henry Peng Zou, Yankai Chen, Enze Ma, Leyi Pan, Chunyu Miao, Wei-Chieh Huang, Xue Liu, Philip S. Yu

2604.00372 2026-04-02 cs.CV

Dynamic Graph Neural Network with Adaptive Features Selection for RGB-D Based Indoor Scene Recognition

Qiong Liu, Ruofei Xiong, Xingzhen Chen, Muyao Peng, You Yang

2604.00371 2026-04-02 cs.CV

Neural Reconstruction of LiDAR Point Clouds under Jamming Attacks via Full-Waveform Representation and Simultaneous Laser Sensing

Ryo Yoshida, Takami Sato, Wenlun Zhang, Yuki Hayakawa, Shota Nagai, Takahiro Kado, Taro Beppu, Ibuki Fujioka, Yunshan Zhong, Kentaro Yoshioka

2604.00363 2026-04-02 cs.RO cs.CV

A Dual-Stream Transformer Architecture for Illumination-Invariant TIR-LiDAR Person Tracking

Yuki Minase, Kanji Tanaka

Comments 6 pages, 4 figures, technical report

2604.00362 2026-04-02 cs.AI cs.LG

In harmony with gpt-oss

Borislav Mavrin

2604.00360 2026-04-02 cs.CV

VADMamba++: Efficient Video Anomaly Detection via Hybrid Modeling in Grayscale Space

Jihao Lyu, Minghua Zhao, Jing Hu, Yifei Chen, Shuangli Du, Cheng Shi

2604.00356 2026-04-02 cs.AI cs.CL

Signals: Trajectory Sampling and Triage for Agentic Interactions

Shuguang Chen, Adil Hafeez, Salman Paracha

2604.00352 2026-04-02 cs.LG

Deep Learning-Accelerated Surrogate Optimization for High-Dimensional Well Control in Stress-Sensitive Reservoirs

Mahammad Valiyev, Jodel Cornelio, Behnam Jafarpour

2604.00350 2026-04-02 cs.RO cs.AI

Go Big or Go Home: Simulating Mobbing Behavior with Braitenbergian Robots

Elaheh Sanoubari

Comments This work was completed in 2019 as a final project for a graduate course at the University of Waterloo, titled: ECE 750 - Artificial Life: Embodied Intelligence

2604.00344 2026-04-02 cs.CL stat.AP

Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning

Eric Hanchen Jiang, Levina Li, Rui Sun, Xiao Liang, Yubei Li, Yuchen Wu, Haozheng Luo, Hengli Li, Zhi Zhang, Zhaolu Kang, Kai-Wei Chang, Ying Nian Wu

2604.00343 2026-04-02 cs.RO

Real Time Local Wind Inference for Robust Autonomous Navigation

Spencer Folk

Comments PhD Thesis, University of Pennsylvania, 2026. 152 pages

2604.00342 2026-04-02 cs.LG

Is One Token All It Takes? Graph Pooling Tokens for LLM-based GraphQA

Ankit Grover, Lodovico Giaretta, Rémi Bourgerie, Sarunas Girdzijauskas

Comments Accepted at LREC, KG-LLM Workshop 2026

Abstract

The integration of Graph Neural Networks (GNNs) with Large Language Models (LLMs) has emerged as a promising paradigm for Graph Question Answering (GraphQA). However, effective methods for encoding complex structural information into the LLM's latent space remain an open challenge. Current state-of-the-art architectures, such as G-Retriever, typically rely on standard GNNs and aggressive mean pooling to compress entire graph substructures into a single token, creating a severe information bottleneck. This work mitigates this bottleneck by investigating two orthogonal strategies: (1) increasing the bandwidth of the graph-to-LLM interface via multi-token pooling, and (2) enhancing the semantic quality of the graph encoder via global attention mechanisms. We evaluate a suite of hierarchical pruning and clustering-based pooling operators including Top-k, SAGPool, DiffPool, MinCutPool, and Virtual Node Pooling (VNPool) to project graph data into multiple learnable tokens. Empirically, we demonstrate that while pooling introduces significant instability during soft prompt tuning, the application of Low-Rank Adaptation (LoRA) effectively stabilizes specific hierarchical projections (notably VNPool and pruning methods), though dense clustering operators remain challenging. This stabilization allows compressed representations to rival full-graph baselines (achieving ~73% Hit@1 on WebQSP). Conceptually, we demonstrate that a Graph Transformer with VNPool implementation functions structurally as a single-layer Perceiver IO encoder. Finally, we adapt the FandE (Features and Edges) Score to the generative GraphQA domain. Our analysis reveals that the GraphQA benchmark suffers from representational saturation, where target answers are often highly correlated with isolated node features. The implementation is available at https://github.com/Agrover112/G-Retriever/tree/all_good/
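
The contrast between single-token mean pooling and multi-token pooling can be sketched as follows. This is a simplified illustration, not the paper's code: node embeddings are plain Python lists, `topk_pool` is a hypothetical reduced form of Top-k-style selection (it ranks nodes by a dot product with a fixed scoring vector and omits the gating and learned projection that real pooling operators use), and all values are made up.

```python
def mean_pool(nodes):
    """Compress all node embeddings into a single vector (one 'token')."""
    n = len(nodes)
    return [sum(col) / n for col in zip(*nodes)]

def topk_pool(nodes, score_vec, k):
    """Keep the k highest-scoring nodes as k separate tokens.

    Simplified: the score is a dot product with a fixed vector, an
    assumption of this sketch rather than a learned projection.
    """
    scores = [sum(a * b for a, b in zip(node, score_vec)) for node in nodes]
    keep = sorted(range(len(nodes)), key=lambda i: scores[i], reverse=True)[:k]
    # Return the survivors in their original node order.
    return [nodes[i] for i in sorted(keep)]

# Four made-up 2-dimensional node embeddings.
nodes = [[1.0, 0.0], [0.0, 2.0], [2.0, 2.0], [-1.0, 0.5]]

single_token = mean_pool(nodes)                    # 1 vector for the whole graph
multi_tokens = topk_pool(nodes, [1.0, 1.0], k=2)   # 2 vectors, one per kept node

print(single_token)
print(multi_tokens)
```

With mean pooling, the interface to the LLM carries one vector regardless of graph size; with the multi-token variant, the number of vectors passed along grows with `k`, which is the "bandwidth" increase the abstract refers to.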

2604.00339 2026-04-02 cs.LG

When Career Data Runs Out: Structured Feature Engineering and Signal Limits for Founder Success Prediction

Yagiz Ihlamur

Comments 4 pages, 4 tables. Accepted at SecureFinAI Contest @ IEEE IDS 2026. Code: https://github.com/ihlamury/vcbench

2603.30008 2026-04-02 cs.CV

Conditional Polarization Guidance for Camouflaged Object Detection

Qifan Zhang, Hao Wang, Xiangrong Qin, Ruijie Li

Comments 11 pages, 10 figures, 4 tables

2603.29962 2026-04-02 cs.CV

SurgTEMP: Temporal-Aware Surgical Video Question Answering with Text-guided Visual Memory for Laparoscopic Cholecystectomy

Shi Li, Vinkle Srivastav, Nicolas Chanel, Saurav Sharma, Nabani Banik, Lorenzo Arboit, Kun Yuan, Pietro Mascagni, Nicolas Padoy

Comments 29 pages, 14 figures, 9 tables