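arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and related fields.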

2601.05738 2026-03-23 cs.CV

FeatureSLAM: Feature-enriched 3D gaussian splatting SLAM in real time

Christopher Thirgood, Oscar Mendez, Erin Ling, Jon Storey, Simon Hadfield

Details

English abstract

We present a real-time tracking SLAM system that unifies efficient camera tracking with photorealistic feature-enriched mapping using 3D Gaussian Splatting (3DGS). Our main contribution is integrating dense feature rasterization into the novel-view synthesis, aligned with a visual foundation model. This yields strong semantics, going beyond basic RGB-D input, aiding both tracking and mapping accuracy. Unlike previous semantic SLAM approaches (which embed pre-defined class labels), FeatureSLAM enables entirely new downstream tasks via free-viewpoint, open-set segmentation. Across standard benchmarks, our method achieves real-time tracking on par with state-of-the-art systems while improving tracking stability and map fidelity without prohibitive compute. Quantitatively, we obtain 9% lower pose error and 8% higher mapping accuracy compared to recent fixed-set SLAM baselines. Our results confirm that real-time feature-embedded SLAM is not only valuable for enabling new downstream applications; it also improves the performance of the underlying tracking and mapping subsystems, providing semantic and language masking results that are on par with offline 3DGS models, alongside state-of-the-art tracking, depth and RGB rendering.

2601.03302 2026-03-23 cs.CV cs.AI cs.RO

CageDroneRF: A Large-Scale RF Benchmark and Toolkit for Drone Perception

Mohammad Rostami, Atik Faysal, Hongtao Xia, Hadi Kasasbeh, Ziang Gao, Huaxia Wang

2601.03273 2026-03-23 cs.CL cs.AI cs.HC cs.LG

A Multi-Perspective Benchmark and Moderation Model for Evaluating Safety and Adversarial Robustness

Naseem Machlovi, Maryam Saleki, Ruhul Amin, Mohamed Rahouti, Shawqi Al-Maliki, Junaid Qadir, Mohamed M. Abdallah, Ala Al-Fuqaha

2512.06332 2026-03-23 cs.CV

CryoHype: Reconstructing a thousand cryo-EM structures with transformer-based hypernetworks

Jeffrey Gu, Minkyu Jeon, Ambri Ma, Serena Yeung-Levy, Ellen D. Zhong

Comments CVPR 2026

2512.02435 2026-03-23 cs.LG

Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering

Zhongjian Qiao, Rui Yang, Jiafei Lyu, Chenjia Bai, Xiu Li, Siyang Gao, Shuang Qiu

2511.09792 2026-03-23 cs.LG cs.MA

Beyond Monotonicity: Revisiting Factorization Principles in Multi-Agent Q-Learning

Tianmeng Hu, Yongzheng Cui, Rui Tang, Biao Luo, Ke Li

Comments Accepted at AAAI 2026

2511.08015 2026-03-23 cs.CV cs.AI

Invisible Triggers, Visible Threats! Road-Style Adversarial Creation Attack for Visual 3D Detection in Autonomous Driving

Jian Wang, Lijun He, Yixing Yong, Haixia Bi, Fan Li

Comments Accepted by the AAAI 2026 (Main Track)

Details

DOI: 10.1609/aaai.v40i12.37955
Journal ref: AAAI Conference on Artificial Intelligence, 40(12), 9903-9911. (2026)

English abstract

Modern autonomous driving (AD) systems leverage 3D object detection to perceive foreground objects in 3D environments for subsequent prediction and planning. Visual 3D detection based on RGB cameras provides a cost-effective solution compared to the LiDAR paradigm. While achieving promising detection accuracy, current deep neural network-based models remain highly susceptible to adversarial examples. The underlying safety concerns motivate us to investigate realistic adversarial attacks in AD scenarios. Previous work has demonstrated the feasibility of placing adversarial posters on the road surface to induce hallucinations in the detector. However, the unnatural appearance of the posters makes them easily noticeable by humans, and their fixed content can be readily targeted and defended against. To address these limitations, we propose AdvRoad to generate diverse road-style adversarial posters. The adversaries have naturalistic appearances resembling the road surface while compromising the detector to perceive non-existent objects at the attack locations. We employ a two-stage approach, termed Road-Style Adversary Generation and Scenario-Associated Adaptation, to maximize the attack effectiveness on the input scene while ensuring the natural appearance of the poster, allowing the attack to be carried out stealthily without drawing human attention. Extensive experiments show that AdvRoad generalizes well to different detectors, scenes, and spoofing locations. Moreover, physical attacks further demonstrate the practical threats in real-world environments.

2511.07889 2026-03-23 cs.CV cs.AI

Generating Sketches in a Hierarchical Auto-Regressive Process for Flexible Sketch Drawing Manipulation at Stroke-Level

Sicong Zang, Shuhui Gao, Zhijun Fang

Comments Accepted by AAAI 2026

2511.03992 2026-03-23 cs.CV

Camera-Aware Cross-View Alignment for Referring 3D Gaussian Splatting Segmentation

Yuwen Tao, Kanglei Zhou, Xin Tan, Yuan Xie

Comments Accepted to ICME 2026

2510.23571 2026-03-23 cs.RO cs.AI cs.CV cs.LG

RobotArena ∞: Scalable Robot Benchmarking via Real-to-Sim Translation

Yash Jangir, Yidi Zhang, Pang-Chi Lo, Kashu Yamazaki, Chenyu Zhang, Kuan-Hsun Tu, Tsung-Wei Ke, Lei Ke, Yonatan Bisk, Katerina Fragkiadaki

Comments Website: https://robotarenainf.github.io

2510.21599 2026-03-23 cs.LG cs.CC cs.FL quant-ph

SHAP Meets Tensor Networks: Provably Tractable Explanations with Parallelism

Reda Marzouk, Shahaf Bassan, Guy Katz

Comments To appear in NeurIPS 2025

2510.19350 2026-03-23 cs.CL

Modeling Turn-Taking with Semantically Informed Gestures

Varsha Suresh, M. Hamza Mughal, Christian Theobalt, Vera Demberg

Comments EACL 2026

2510.18041 2026-03-23 cs.LG cs.AI

Sensing Without Colocation: Operator-Based Virtual Instrumentation for Domains Beyond Physical Reach

Jay Phil Yoo, Kazuma Kobayashi, Souvik Chakraborty, Syed Bahauddin Alam

2510.13219 2026-03-23 cs.CV

Prompt-based Adaptation in Large-scale Vision Models: A Survey

Xi Xiao, Yunbei Zhang, Lin Zhao, Yiyang Liu, Xiaoying Liao, Zheda Mai, Xingjian Li, Xiao Wang, Hao Xu, Jihun Hamm, Xue Lin, Min Xu, Qifan Wang, Tianyang Wang, Cheng Han

Comments Accepted by TMLR 2026

2510.12752 2026-03-23 cs.LG

KoALA: KL-L0 Adversarial Detector via Label Agreement

Siqi Li, Yasser Shoukry

2510.11283 2026-03-23 cs.LG

Gym-TORAX: Open-source software for integrating reinforcement learning with plasma control simulators in tokamak research

Antoine Mouchamps, Arthur Malherbe, Adrien Bolland, Damien Ernst

2510.10518 2026-03-23 cs.CV

VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning

Qunzhong Wang, Jie Liu, Jiajun Liang, Yilei Jiang, Yuanxing Zhang, Yaozhi Zheng, Xintao Wang, Pengfei Wan, Xiangyu Yue, Jiaheng Liu

2510.05154 2026-03-23 cs.CL

Can AI Truly Represent Your Voice in Deliberations? A Comprehensive Study of Large-Scale Opinion Aggregation with LLMs

Shenzhe Zhu, Shu Yang, Michiel A. Bakker, Alex Pentland, Jiaxin Pei

2509.24129 2026-03-23 cs.RO cs.CV

Mash, Spread, Slice! Learning to Manipulate Object States via Visual Spatial Progress

Priyanka Mandikal, Jiaheng Hu, Shivin Dass, Sagnik Majumder, Roberto Martín-Martín, Kristen Grauman

Comments Accepted at ICRA 2026

2509.24005 2026-03-23 cs.LG stat.ML

Does Weak-to-strong Generalization Happen under Spurious Correlations?

Chenruo Liu, Yijun Dong, Qi Lei

2509.22469 2026-03-23 cs.RO

Uncertainty-Aware Multi-Robot Task Allocation With Strongly Coupled Inter-Robot Rewards

Ben Rossano, Jaein Lim, Jonathan P. How

Comments 9 pages

2509.20057 2026-03-23 cs.CL cs.AI

Responsible AI Technical Report

KT: Yunjin Park, Jungwon Yoon, Junhyung Moon, Myunggyo Oh, Wonhyuk Lee, Sujin Kim, Youngchol Kim, Eunmi Kim, Hyoungjun Park, Eunyoung Shin, Wonyoung Lee, Somin Lee, Minwook Ju, Minsung Noh, Dongyoung Jeong, Jeongyeop Kim, Wanjin Park, Soonmin Bae

Comments 23 pages, 8 figures

2509.19080 2026-03-23 cs.RO cs.AI

World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation

Zhennan Jiang, Kai Liu, Yuxin Qin, Shuai Tian, Yupeng Zheng, Mingcai Zhou, Chao Yu, Haoran Li, Dongbin Zhao

2509.16835 2026-03-23 cs.CL cs.AI

Semantic-Driven Topic Modeling for Analyzing Creativity in Virtual Brainstorming

Melkamu Abay Mersha, Jugal Kalita

2509.00183 2026-03-23 cs.LG

FNODE: Flow-Matching for data-driven simulation of constrained multibody systems

Hongyu Wang, Jingquan Wang, Dan Negrut

Comments 36 pages, 19 figures

2508.18839 2026-03-23 cs.LG cs.CR

DRMD: Deep Reinforcement Learning for Malware Detection under Concept Drift

Shae McFadden, Myles Foley, Mario D'Onghia, Chris Hicks, Vasilios Mavroudis, Nicola Paoletti, Fabio Pierazzi

Comments The Fortieth AAAI Conference on Artificial Intelligence (AAAI-26)

Details

DOI: 10.1609/aaai.v40i2.37053
Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 40, No. 2, pp. 854-862, 2026

English abstract

Malware detection in real-world settings must deal with evolving threats, limited labeling budgets, and uncertain predictions. Traditional classifiers, without additional mechanisms, struggle to maintain performance under concept drift in malware domains, as their supervised learning formulation cannot optimize when to defer decisions to manual labeling and adaptation. Modern malware detection pipelines combine classifiers with monthly active learning (AL) and rejection mechanisms to mitigate the impact of concept drift. In this work, we develop a novel formulation of malware detection as a one-step Markov Decision Process and train a deep reinforcement learning (DRL) agent, simultaneously optimizing sample classification performance and rejecting high-risk samples for manual labeling. We evaluated the joint detection and drift mitigation policy learned by the DRL-based Malware Detection (DRMD) agent through time-aware evaluations on Android malware datasets subject to realistic drift requiring multi-year performance stability. The policies learned under these conditions achieve a higher Area Under Time (AUT) performance compared to standard classification approaches used in the domain, showing improved resilience to concept drift. Specifically, the DRMD agent achieved an average AUT improvement of 8.66 and 10.90 for the classification-only and classification-rejection policies, respectively. Our results demonstrate for the first time that DRL can facilitate effective malware detection and improved resiliency to concept drift in the dynamic setting of Android malware detection.

2508.08570 2026-03-23 cs.CV cs.AI cs.LG

Superclass-Guided Representation Disentanglement for Spurious Correlation Mitigation

Chenruo Liu, Hongjun Liu, Zeyu Lai, Yiqiu Shen, Chen Zhao, Qi Lei

2507.23295 2026-03-23 cs.CV

LED Benchmark: Diagnosing Structural Layout Errors for Document Layout Analysis

Inbum Heo, Taewook Hwang, Jeesu Jung, Sangkeun Jung

Comments This work has been substantially revised, including updates to the title and content. The revised version is available as arXiv:2603.17265

2507.18014 2026-03-23 cs.LG

Predictive Scaling Laws for Efficient GRPO Training of Large Reasoning Models

Datta Nimmaturi, Vaishnavi Bhargava, Rajat Ghosh, Johnu George, Debojyoti Dutta

2507.16214 2026-03-23 cs.RO cs.AI

Adaptive Relative Pose Estimation Framework with Dual Noise Tuning for Safe Approaching Maneuvers

Batu Candan, Murat Berke Oktay, Simone Servadio