arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2505.11790 2026-03-11 cs.LG cs.CR

JULI: Jailbreak Large Language Models by Self-Introspection

Jesson Wang, Zhanhao Hu, David Wagner

Comments Accepted to ICLR 2026

详情

英文摘要

Large Language Models (LLMs) are trained with safety alignment to prevent generating malicious content. Although some attacks have highlighted vulnerabilities in these safety-aligned LLMs, they typically have limitations, such as necessitating access to the model weights or the generation process. Since proprietary models through API-calling do not grant users such permissions, these attacks find it challenging to compromise them. In this paper, we propose Jailbreaking Using LLM Introspection (JULI), which jailbreaks LLMs by manipulating the token log probabilities, using a tiny plug-in block, BiasNet. JULI relies solely on the knowledge of the target LLM's predicted token log probabilities. It can effectively jailbreak API-calling LLMs under a black-box setting and knowing only top-$5$ token log probabilities. Our approach demonstrates superior effectiveness, outperforming existing state-of-the-art (SOTA) approaches across multiple metrics.

URL PDF HTML ☆

赞 0 踩 0

2505.11595 2026-03-11 cs.LG cs.AI cs.CL

Stepwise Guided Policy Optimization: Coloring your Incorrect Reasoning in GRPO

Peter Chen, Xiaopeng Li, Ziniu Li, Xi Chen, Tianyi Lin

Comments Accepted by TMLR; 47 pages

2505.10931 2026-03-11 cs.CV

M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for optical-SAR Object Detection

Chao Wang, Wei Lu, Xiang Li, Jian Yang, Lei Luo

2505.01399 2026-03-11 cs.RO

Physics-Conditioned Grasping for Stable Tool Use

Noah Trupin, Zixing Wang, Ahmed H. Qureshi

Comments In submission and under review

2504.11922 2026-03-11 cs.CV

Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach

Lvpan Cai, Haowei Wang, Jiayi Ji, Yanshu Zhoumen, Shen Chen, Taiping Yao, Xiaoshuai Sun

Comments Accepted at AAAI2026

2504.01547 2026-03-11 cs.CV

Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training

Luca Ciampi, Gabriele Lagani, Giuseppe Amato, Fabrizio Falchi

2503.21622 2026-03-11 cs.CV

The MVTec AD 2 Dataset: Advanced Scenarios for Unsupervised Anomaly Detection

Lars Heckler-Kram, Jan-Hendrik Neudeck, Ulla Scheler, Rebecca König, Carsten Steger

Comments paper under review; dataset first released for the VAND3.0 challenge @ CVPR 2025 https://sites.google.com/view/vand30cvpr2025/challenge

2503.16203 2026-03-11 cs.AI

Logic Explanation of AI Classifiers by Categorical Explaining Functors

Stefano Fioravanti, Francesco Giannini, Paolo Frazzetto, Fabio Zanasi, Pietro Barbiero

2503.12902 2026-03-11 cs.LG

Experiments with Optimal Model Trees

Sabino Francesco Roselli, Eibe Frank

2503.12525 2026-03-11 cs.LG cs.AI

HyConEx: Hypernetwork classifier with counterfactual explanations for tabular data

Patryk Marszałek, Kamil Książek, Oleksii Furman, Ulvi Movsum-zada, Przemysław Spurek, Marek Śmieja

Comments Published in Neurocomputing (2026)

2503.08387 2026-03-11 cs.CV

Recognition-Synergistic Scene Text Editing

Zhengyao Fang, Pengyuan Lyu, Jingjing Wu, Chengquan Zhang, Jun Yu, Guangming Lu, Wenjie Pei

Comments accepted by CVPR2025

2503.08008 2026-03-11 cs.CV

A Survey on Wi-Fi Sensing Generalizability: Taxonomy, Techniques, Datasets, and Future Research Prospects

Fei Wang, Tingting Zhang, Wei Xi, Han Ding, Ge Wang, Di Zhang, Yuanhao Cui, Fan Liu, Jinsong Han, Jie Xu, Tony Xiao Han

Comments Accepted for publication in IEEE Communications Surveys & Tutorials 2026

详情

DOI: 10.1109/COMST.2026.3670854

英文摘要

Wi-Fi sensing has emerged as a powerful non-intrusive technology for recognizing human activities, monitoring vital signs, and enabling context-aware applications using commercial wireless devices. However, the performance of Wi-Fi sensing often degrades when applied to new users, devices, or environments due to significant domain shifts. To address this challenge, researchers have proposed a wide range of generalization techniques aimed at enhancing the robustness and adaptability of Wi-Fi sensing systems. In this survey, we provide a comprehensive and structured review of over 200 papers published since 2015, categorizing them according to the Wi-Fi sensing pipeline: experimental setup, signal preprocessing, feature learning, and model deployment. We analyze key techniques, including signal preprocessing, domain adaptation, meta-learning, metric learning, data augmentation, cross-modal alignment, federated learning, and continual learning. Furthermore, we summarize publicly available datasets across various tasks, such as activity recognition, user identification, indoor localization, and pose estimation, and provide insights into their domain diversity. We also discuss emerging trends and future directions, including large-scale pretraining, integration with multimodal foundation models, and continual deployment. To foster community collaboration, we introduce the Sensing Dataset Platform (SDP) for sharing datasets and models. This survey aims to serve as a valuable reference and practical guide for researchers and practitioners dedicated to improving the generalizability of Wi-Fi sensing systems. Survey papge: https://github.com/aiotgroup/awesome-wireless-sensing-generalization.

URL PDF HTML ☆

赞 0 踩 0

2503.01236 2026-03-11 cs.RO cs.AI

LLM-Advisor: An LLM Benchmark for Cost-efficient Path Planning across Multiple Terrains

Ling Xiao, Toshihiko Yamasaki

2503.00133 2026-03-11 cs.RO

A Magnetic-Actuated Vision-Based Whisker Array for Contact Perception and Grasping

Zhixian Hu, Juan Wachs, Yu She

Comments Accepted by IEEE International Conference on Robotics and Automation (ICRA) 2025

2502.18615 2026-03-11 cs.RO cs.LG

A Distributional Treatment of Real2Sim2Real for Object-Centric Agent Adaptation in Vision-Driven Deformable Linear Object Manipulation

Georgios Kamaras, Subramanian Ramamoorthy

2502.18215 2026-03-11 cs.CL

Connecting Voices: LoReSpeech as a Low-Resource Speech Parallel Corpus

Samy Ouzerrout

Comments This paper is withdrawn because the LoReSpeech dataset described in Section 2 is not currently available, which affects the reproducibility of the work and the validity of the experimental results

2502.14916 2026-03-11 cs.CL cs.AI

MKE-Coder: Multi-Axial Knowledge with Evidence Verification in ICD Coding for Chinese EMRs

Xinxin You, Xien Liu, Xue Yang, Ziyi Wang, Ji Wu

Comments We identified an error in the data preprocessing script that led to inconsistent results in the tables. As the current version contains inaccurate data, we are withdrawing it for further correction and verification

详情

英文摘要

The task of automatically coding the International Classification of Diseases (ICD) in the medical field has been well-established and has received much attention. Automatic coding of the ICD in the medical field has been successful in English but faces challenges when dealing with Chinese electronic medical records (EMRs). The first issue lies in the difficulty of extracting disease code-related information from Chinese EMRs, primarily due to the concise writing style and specific internal structure of the EMRs. The second problem is that previous methods have failed to leverage the disease-based multi-axial knowledge and lack of association with the corresponding clinical evidence. This paper introduces a novel framework called MKE-Coder: Multi-axial Knowledge with Evidence verification in ICD coding for Chinese EMRs. Initially, we identify candidate codes for the diagnosis and categorize each of them into knowledge under four coding axes.Subsequently, we retrieve corresponding clinical evidence from the comprehensive content of EMRs and filter credible evidence through a scoring model. Finally, to ensure the validity of the candidate code, we propose an inference module based on the masked language modeling strategy. This module verifies that all the axis knowledge associated with the candidate code is supported by evidence and provides recommendations accordingly. To evaluate the performance of our framework, we conduct experiments using a large-scale Chinese EMR dataset collected from various hospitals. The experimental results demonstrate that MKE-Coder exhibits significant superiority in the task of automatic ICD coding based on Chinese EMRs. In the practical evaluation of our method within simulated real coding scenarios, it has been demonstrated that our approach significantly aids coders in enhancing both their coding accuracy and speed.

URL PDF HTML ☆

赞 0 踩 0

2502.06574 2026-03-11 cs.AI cs.GT cs.LG

On the Impact of the Utility in Semivalue-based Data Valuation

Mélissa Tamine, Benjamin Heymann, Maxime Vono, Patrick Loiseau

Comments 44 pages, 19 figures. Accepted at ICLR 2026

2412.18380 2026-03-11 cs.CV cs.GR

ARSGaussian: 3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis

Yiling Yao, Bing Zhang, Wenjuan Zhang, Lianru Gao, Dailiang Peng, Bocheng Li, Yaning Wang, Bowen Wang

Comments This is the author's version of a work that was accepted for publication in [ISPRS]. Changes resulting from the publishing process... may not be reflected in this document

详情

DOI: 10.1016/j.isprsjprs.2025.10.022
Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing,Volume 231,2026,Pages 288-306,ISSN 0924-2716,

英文摘要

Novel View Synthesis (NVS) can reconstruct scenes from multi-view images and synthesize novel images from new viewpoints, which provides technical support for tasks such as target recognition and environmental perception. Aerial remote sensing can conveniently capture a wealth of multi-view images with just a few flights. However, the challenges brought by large distances and sparse viewing angles during collection can cause the model to easily produce floaters and overgrowth issues due to geometric estimation errors. This results in low visual quality and a lack of precise geometric estimation capabilities. Therefore, this study presents ARSGaussian, an innovative novel view synthesis (NVS) method for aerial remote sensing. The method incorporates LiDAR point cloud as constraints into the 3D Gaussian Splatting approach, adaptively guiding the Gaussians to grow and split along geometric benchmarks, thereby addressing the overgrowth and floaters issues. Additionally, considering the geometric distortions arising from data acquisition, coordinate transformations with distortion parameters are integrated to replace the simple pinhole camera model parameters to achieve pixel-level alignment between LiDAR point cloud and multi-view optical images, facilitating the accurate fusion of heterogeneous data and achieving the high-precision geo-alignment. Moreover, depth, normal and scale consistency losses are introduced into the regularization process to guide Gaussians toward real depth and plane representations, significantly improving geometric estimation accuracy. To address the current lack of dense airborne hybrid datasets, we have established and released AIR-LONGYAN, an open-source dataset containing a dense LiDAR point cloud (8 pts/m) and multi-view optical images captured by airborne scanners and cameras in diverse scenes....

URL PDF HTML ☆

赞 0 踩 0

2412.01297 2026-03-11 cs.RO cs.LG

Morphological-Symmetry-Equivariant Heterogeneous Graph Neural Network for Robotic Dynamics Learning

Fengze Xie, Sizhe Wei, Yue Song, Yisong Yue, Lu Gan

2411.16722 2026-03-11 cs.CV

Active Prompt Learning with Vision-Language Model Priors

Hoyoung Kim, Seokhee Jin, Changhwan Sung, Jaechang Kim, Jungseul Ok

2410.05564 2026-03-11 cs.LG cs.CV

Unsupervised Representation Learning from Sparse Transformation Analysis

Yue Song, Thomas Anderson Keller, Yisong Yue, Pietro Perona, Max Welling

Comments T-PAMI journal paper

2410.02840 2026-03-11 cs.LG cs.CY math.ST stat.TH

Overcoming Representation Bias in Fairness-Aware data Repair using Optimal Transport

Abigail Langbridge, Anthony Quinn, Robert Shorten

2410.01611 2026-03-11 cs.CV cs.AI cs.LG

DRUPI: Dataset Reduction Using Privileged Information

Shaobo Wang, Youxin Jiang, Tianle Niu, Yantai Yang, Ruiji Zhang, Shuhao Hu, Shuaiyu Zhang, Chenghao Sun, Weiya Li, Conghui He, Xuming Hu, Linfeng Zhang

Comments 21 pages, 5 figures, 11 tables

2409.18827 2026-03-11 cs.LG

ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning

Jannis Becktepe, Julian Dierkes, Carolin Benjamins, Aditya Mohan, David Salinas, Raghu Rajan, Frank Hutter, Holger Hoos, Marius Lindauer, Theresa Eimer

2408.17135 2026-03-11 cs.CV

TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation

Yabiao Wang, Shuo Wang, Jiangning Zhang, Ke Fan, Jiafu Wu, Zhucun Xue, Yong Liu

Comments Accepted to CVPR 2025. Project page: https://aigc-explorer.github.io/TIMotion-page/

2408.06699 2026-03-11 cs.LG cs.AI

Sparse Variational Student-t Processes for Heavy-tailed Modeling

Jian Xu, Delu Zeng, John Paisley

2407.21460 2026-03-11 cs.AI cs.LG

Multi-agent Assessment with QoS Enhancement for HD Map Updates in a Vehicular Network

Jeffrey Redondo, Nauman Aslam, Juan Zhang, Zhenhui Yuan

2407.04359 2026-03-11 cs.AI cs.NE cs.SE

Dance of the ADS: Orchestrating Failures through Historically-Informed Scenario Fuzzing

Tong Wang, Taotao Gu, Huan Deng, Hu Li, Xiaohui Kuang, Gang Zhao

Comments This paper was accepted by 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2024)

2406.07871 2026-03-11 cs.CV cs.MM cs.SD eess.AS

Controllable Dance Generation with Style-Guided Motion Diffusion

Hongsong Wang, Ying Zhu, Xin Geng, Liang Wang