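arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and related fields.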

2603.16192 2026-03-18 cs.CL

Structured Semantic Cloaking for Jailbreak Attacks on Large Language Models

Xiaobing Sun, Perry Lam, Shaohua Li, Zizhou Wang, Rick Siow Mong Goh, Yong Liu, Liangli Zhen

Comments 15 pages

Abstract

Modern LLMs employ safety mechanisms that extend beyond surface-level input filtering to latent semantic representations and generation-time reasoning, enabling them to recover obfuscated malicious intent during inference and refuse accordingly, and rendering many surface-level obfuscation jailbreak attacks ineffective. We propose Structured Semantic Cloaking (S2C), a novel multi-dimensional jailbreak attack framework that manipulates how malicious semantic intent is reconstructed during model inference. S2C strategically distributes and reshapes semantic cues such that full intent consolidation requires multi-step inference and long-range co-reference resolution within deeper latent representations. The framework comprises three complementary mechanisms: (1) Contextual Reframing, which embeds the request within a plausible high-stakes scenario to bias the model toward compliance; (2) Content Fragmentation, which disperses the semantic signature of the request across disjoint prompt segments; and (3) Clue-Guided Camouflage, which disguises residual semantic cues while embedding recoverable markers that guide output generation. By delaying and restructuring semantic consolidation, S2C degrades safety triggers that depend on coherent or explicitly reconstructed malicious intent at decoding time, while preserving sufficient instruction recoverability for functional output generation. We evaluate S2C across multiple open-source and proprietary LLMs using HarmBench and JBB-Behaviors, where it improves Attack Success Rate (ASR) by 12.4% and 9.7%, respectively, over the current SOTA. Notably, S2C achieves substantial gains on GPT-5-mini, outperforming the strongest baseline by 26% on JBB-Behaviors. We also analyse which combinations perform best against broad families of models, and characterise how the trade-off between the extent of obfuscation and input recoverability affects jailbreak success.

2603.16189 2026-03-18 cs.CV

Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning

Haomin Wang, Qi Wei, Qianli Ma, Shengyuan Ding, Jinhui Yin, Kai Chen, Hongjie Zhang

2603.16188 2026-03-18 cs.CV

ECHO: Edge-Cloud Humanoid Orchestration for Language-to-Motion Control

Haozhe Jia, Jianfei Song, Yuan Zhang, Honglei Jin, Youcheng Fan, Wenshuo Chen, Wei Zhang, Yutao Yue

2603.16185 2026-03-18 cs.LG cs.AI q-bio.QM

Sample-Efficient Adaptation of Drug-Response Models to Patient Tumors under Strong Biological Domain Shift

Camille Jimenez Cortes, Philippe Lalanda, German Vega

2603.16184 2026-03-18 cs.CL

Polyglot-Lion: Efficient Multilingual ASR for Singapore via Balanced Fine-Tuning of Qwen3-ASR

Quy-Anh Dang, Chris Ngo

2603.16181 2026-03-18 cs.CV cs.CR

KidsNanny: A Two-Stage Multimodal Content Moderation Pipeline Integrating Visual Classification, Object Detection, OCR, and Contextual Reasoning for Child Safety

Viraj Panchal, Tanmay Talsaniya, Parag Patel, Meet Patel

Comments 12 pages, 2 figures, 6 tables

2603.16166 2026-03-18 cs.RO cs.CV

SignNav: Leveraging Signage for Semantic Visual Navigation in Large-Scale Indoor Environments

Jian Sun, Yuming Huang, He Li, Shuqi Xiao, Shenyan Guo, Maani Ghaffari, Qingbiao Li, Chengzhong Xu, Hui Kong

2603.16165 2026-03-18 cs.CV cs.AI

Homogeneous and Heterogeneous Consistency progressive Re-ranking for Visible-Infrared Person Re-identification

Yiming Wang

2603.16163 2026-03-18 cs.CV cs.CL

STARK: Spatio-Temporal Attention for Representation of Keypoints for Continuous Sign Language Recognition

Suvajit Patra, Soumitra Samanta

2603.16161 2026-03-18 cs.AI

SQL-ASTRA: Alleviating Sparse Feedback in Agentic SQL via Column-Set Matching and Trajectory Aggregation

Long Li, Zhijian Zhou, Jiangxuan Long, Peiyang Liu, Weidi Xu, Zhe Wang, Shirui Pan, Chao Qu

Comments 17 pages

2603.16160 2026-03-18 cs.CV

Segmentation-before-Staining Improves Structural Fidelity in Virtual IHC-to-Multiplex IF Translation

Junhyeok Lee, Han Jang, Heeseong Eum, Joon Jang, Kyu Sung Choi

Comments 11 pages, 2 figures, 2 tables. Submitted to MICCAI 2026

2603.16159 2026-03-18 cs.CV cs.CY

AI-Generated Figures in Academic Publishing: Policies, Tools, and Practical Guidelines

Davie Chen

2603.16158 2026-03-18 cs.LG

Execution-Grounded Credit Assignment for GRPO in Code Generation

Abhijit Kumar, Natalya Kumar, Shikhar Gupta

Comments Accepted at SPOT ICLR 2026 (https://openreview.net/forum?id=nqkVB5EVXJ)

2603.16157 2026-03-18 cs.LG cs.AI

DyJR: Preserving Diversity in Reinforcement Learning with Verifiable Rewards via Dynamic Jensen-Shannon Replay

Long Li, Zhijian Zhou, Tianyi Wang, Weidi Xu, Zuming Huang, Wei Chu, Zhe Wang, Shirui Pan, Chao Qu, Yuan Qi

Comments 14 pages, 3 figures

2603.16154 2026-03-18 cs.CV cs.AI

GATS: Gaussian Aware Temporal Scaling Transformer for Invariant 4D Spatio-Temporal Point Cloud Representation

Jiayi Tian, Jiaze Wang

2603.16152 2026-03-18 cs.LG cs.AI cs.CL

HIPO: Instruction Hierarchy via Constrained Reinforcement Learning

Keru Chen, Jun Luo, Sen Lin, Yingbin Liang, Alvaro Velasquez, Nathaniel Bastian, Shaofeng Zou

Comments 9 pages + appendix. Under review

2603.16151 2026-03-18 cs.CV

EFF-Grasp: Energy-Field Flow Matching for Physics-Aware Dexterous Grasp Generation

Yukun Zhao, Zichen Zhong, Yongshun Gong, Yilong Yin, Haoliang Sun

2603.16148 2026-03-18 cs.AI

NeuronSpark: A Spiking Neural Network Language Model with Selective State Space Dynamics

Zhengzheng Tang

Comments 10 pages, 6 figures, 6 tables. Preprint

2603.16140 2026-03-18 cs.LG

Noisy Data is Destructive to Reinforcement Learning with Verifiable Rewards

Yuxuan Zhu, Daniel Kang

Comments 16 pages, 17 figures

2603.16139 2026-03-18 cs.CV

Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training

Peng Sun, Jun Xie, Tao Lin

Comments https://github.com/LINs-lab/IOMM

2603.16134 2026-03-18 cs.CV cs.AI cs.LG

When Generative Augmentation Hurts: A Benchmark Study of GAN and Diffusion Models for Bias Correction in AI Classification Systems

Shesh Narayan Gupta, Nik Bear Brown

2603.16133 2026-03-18 cs.CV

DualPrim: Compact 3D Reconstruction with Positive and Negative Primitives

Xiaoxu Meng, Zhongmin Chen, Bo Yang, Weikai Chen, Weixiao Liu, Lin Gao

2603.16131 2026-03-18 cs.CL

SciZoom: A Large-scale Benchmark for Hierarchical Scientific Summarization across the LLM Era

Han Jang, Junhyeok Lee, Kyu Sung Choi

Comments 12 pages, 7 figures, Submitted to KDD 2026

2603.16129 2026-03-18 cs.CV

Boosting Quantitive and Spatial Awareness for Zero-Shot Object Counting

Da Zhang, Bingyu Li, Feiyu Wang, Zhiyuan Zhao, Junyu Gao

Comments Accepted to CVPR 2026

2603.16127 2026-03-18 cs.CL cs.LG

Pre-training LLM without Learning Rate Decay Enhances Supervised Fine-Tuning

Kazuki Yano, Shun Kiyono, Sosuke Kobayashi, Sho Takase, Jun Suzuki

Comments 25 pages, accepted by ICLR 2026 as a conference paper

2603.16122 2026-03-18 cs.CV

Out-of-Distribution Object Detection in Street Scenes via Synthetic Outlier Exposure and Transfer Learning

Sadia Ilyas, Annika Mütze, Klaus Friedrichs, Thomas Kurbiel, Matthias Rottmann

2603.16118 2026-03-18 cs.RO

SE(3)-LIO: Smooth IMU Propagation With Jointly Distributed Poses on SE(3) Manifold for Accurate and Robust LiDAR-Inertial Odometry

Gunhee Shin, Seungjae Lee, Jei Kong, Youngwoo Seo, Hyun Myung

2603.16113 2026-03-18 cs.CV cs.AI

PathGLS: Evaluating Pathology Vision-Language Models without Ground Truth through Multi-Dimensional Consistency

Minbing Chen, Zhu Meng, Fei Su

2603.16112 2026-03-18 cs.CL cs.AI cs.CE

ASDA: Automated Skill Distillation and Adaptation for Financial Reasoning

Tik Yu Yim, Wenting Tan, Sum Yee Chan, Tak-Wah Lam, Siu Ming Yiu

2603.16110 2026-03-18 cs.AI

VIGIL: Towards Edge-Extended Agentic AI for Enterprise IT Support

Sarthak Ahuja, Neda Kordjazi, Evren Yortucboylu, Vishaal Kapoor, Mariam Dundua, Yiming Li, Derek Ho, Vaibhavi Padala, Jennifer Whitted, Rebecca Steinert