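arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and other fields.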

Lianqing Zheng, Long Yang, Qunshu Lin, Wenjin Ai, Minghao Liu, Shouyi Lu, Jianan Liu, Hongze Ren, Jingyue Mo, Xiaokai Bai, Jie Bai, Zhixiong Ma, Xichan Zhu

Comments Accepted by IEEE TPAMI

2411.17257 2026-02-10 cs.LG cs.AI

Disentangled Parameter-Efficient Linear Model for Long-Term Time Series Forecasting

Yuang Zhao, Tianyu Li, Jiadong Chen, Shenrong Ye, Fuxin Jiang, Xiaofeng Gao

Comments Accepted by DASFAA 2026. (Submitted Manuscript Version)

2410.03613 2026-02-10 cs.LG

Understanding Large Language Models in Your Pockets: Performance Study on COTS Mobile Devices

Jie Xiao, Qianyi Huang, Xu Chen, Chen Tian

Comments Corrected a typographical error on page 12: "4604%" has been corrected to "60%."

2409.09777 2026-02-10 cs.CV cs.RO

EgoFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Self-Driving

Haisheng Su, Wei Wu, Zhenjie Yang, Isabel Guan

Comments Accepted to ICRA2026

2409.01392 2026-02-10 cs.CL cs.AI cs.CV

ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems

Xiangyuan Xue, Zeyu Lu, Di Huang, Zidong Wang, Wanli Ouyang, Lei Bai

2407.03035 2026-02-10 cs.RO cs.AI cs.LG

NLP Sampling: Combining MCMC and NLP Methods for Diverse Constrained Sampling

Marc Toussaint, Cornelius V. Braun, Joaquim Ortiz-Haro

2407.02502 2026-02-10 cs.RO cs.SY eess.SY

Impact of an Autonomous Shuttle Service on Urban Road Capacity: Experiments by Microscopic Traffic Simulation

Sudipta Roy, Bat-hen Nahmias-Biran, Samiul Hasan

Comments 16 Pages, 5 Figures, 6 Tables. Accepted in Transportation Research Board Annual Meeting 2024

Details

DOI: 10.3390/smartcities9020029

English abstract

Autonomous vehicles are expected to transform transportation systems with rapid technological advancement. Human mobility would become more accessible and safer with the emergence of driverless vehicles. To this end, autonomous shuttle services are currently being introduced in different urban conditions throughout the world. As a result, studies are needed to assess the safety and mobility performance of such autonomous shuttle services. However, calibrating the movement of autonomous shuttles in a simulation environment has been a difficult task due to the absence of any real-world data. This study aims to calibrate autonomous shuttles in a microscopic traffic simulation model and consequently assess the impact of the shuttle service on urban road capacity through simulation experiments. For this analysis, a prototype of an operational shuttle system at Lake Nona, Orlando, Florida is emulated in a microscopic traffic simulator during different times of the day. The movements of autonomous vehicles are calibrated using real-world trajectory data, which help replicate the driving behavior of the shuttle in the simulation. The analysis reveals that as the frequency of the shuttle service increases, the delay time percentage of the shared road sections increases and the traveling speed decreases. It is also found that increasing the speed of shuttles up to 5 mph during off-peak hours and 10 mph during peak hours will improve traffic conditions. The findings from this study will help policymakers and transportation agencies revise policies for deploying autonomous shuttles and for planning road infrastructure for shared use by autonomous shuttles and human-driven vehicles.

2407.02136 2026-02-10 cs.CL

Black Big Boxes: Tracing Adjective Order Preferences in Large Language Models

Jaap Jumelet, Lisa Bylinina, Willem Zuidema, Jakub Szymanik

2405.04605 2026-02-10 cs.CV cs.AI cs.LG eess.IV

Reproducible Benchmarking for Lung Nodule Detection and Malignancy Classification Across Multiple Low-Dose CT Datasets

Fakrul Islam Tushar, Avivah Wang, Lavsen Dahal, Ehsan Samei, Michael R. Harowicz, Jayashree Kalpathy-Cramer, Kyle J. Lafata, Tina D. Tailor, Cynthia Rudin, Joseph Y. Lo

Comments 3 tables, 2 supplement tables, 5 figures

2401.10777 2026-02-10 cs.CV

Determination of efficiency indicators of the stand for intelligent control of manual operations in industrial production

Anton Sergeev, Victor Minchenkov, Aleksei Soldatov

2306.09322 2026-02-10 cs.CV

Fast Image-based Neural Relighting with Translucency-Reflection Modeling

Shizhan Zhu, Shunsuke Saito, Aljaz Bozic, Carlos Aliaga, Trevor Darrell, Christoph Lassner

Comments v2: Major revision and bug fix: New method with significantly improved results. Corrects an error in v1 (arXiv:2306.09322v1) in the evaluation of baseline NRTF due to an implementation bug. Results in v2 supersede those in v1

2305.04195 2026-02-10 cs.CV cs.CL

Cross-Modal Retrieval for Motion and Text via DropTriple Loss

Sheng Yan, Yang Liu, Haoqiang Wang, Xin Du, Mengyuan Liu, Hong Liu

Comments This paper has been accepted by ACM MM Asia 2023 (Best Paper Candidate)

2106.15332 2026-02-10 cs.CV

Winner Team Mia at TextVQA Challenge 2021: Vision-and-Language Representation Learning with Pre-trained Sequence-to-Sequence Model

Yixuan Qiao, Hao Chen, Jun Wang, Shanshan Zhao, Yihao Chen, Xianbin Ye, Ziliang Li, Xianbiao Qi, Peng Gao, Guotong Xie

Comments Winner of TextVQA 2021

2602.08112 2026-02-10 cs.CV cs.LG

MMLSv2: A Multimodal Dataset for Martian Landslide Detection in Remote Sensing Imagery

Sidike Paheding, Abel Reyes-Angulo, Leo Thomas Ramos, Angel D. Sappa, Rajaneesh A., Hiral P. B., Sajin Kumar K. S., Thomas Oommen

2602.08105 2026-02-10 cs.LG physics.data-an stat.ML

Mutual information and task-relevant latent dimensionality

Paarth Gulati, Eslam Abdelaleem, Audrey Sederberg, Ilya Nemenman

2602.08100 2026-02-10 cs.CL cs.AI

Emergent Search and Backtracking in Latent Reasoning Models

Jasmine Cui, Charles Ye

2602.08099 2026-02-10 cs.CV cs.AI

VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval

Issar Tzachor, Dvir Samuel, Rami Ben-Ari

Comments Project page: https://iyttor.github.io/VidVec/

2602.08092 2026-02-10 cs.AI cs.ET

Objective Decoupling in Social Reinforcement Learning: Recovering Ground Truth from Sycophantic Majorities

Majid Ghasemi, Mark Crowley

2602.08086 2026-02-10 cs.LG

Probability Hacking and the Design of Trustworthy ML for Signal Processing in C-UAS: A Scenario Based Method

Liisa Janssens, Laura Middeldorp

Comments 6 pages, Pre-publication. Copyright 2026 IEEE. Peer Reviewed. Accepted at ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), scheduled for 3-8 May 2026 in Barcelona, Spain

2602.08082 2026-02-10 cs.LG cs.AI eess.SP

Spectral Guardrails for Agents in the Wild: Detecting Tool Use Hallucinations via Attention Topology

Valentin Noël

Comments 32 pages, 2 figures, 18 tables

2602.08077 2026-02-10 cs.LG cs.AI

Multimodal normative modeling in Alzheimers Disease with introspective variational autoencoders

Sayantan Kumar, Peijie Qiu, Aristeidis Sotiras

Comments Conference on Health, Inference, and Learning (CHIL)

2602.08071 2026-02-10 cs.CV

ViT-5: Vision Transformers for The Mid-2020s

Feng Wang, Sucheng Ren, Tiezheng Zhang, Predrag Neskovic, Anand Bhattad, Cihang Xie, Alan Yuille

Comments Code is available at https://github.com/wangf3014/ViT-5

2602.08068 2026-02-10 cs.CV

ReRoPE: Repurposing RoPE for Relative Camera Control

Chunyang Li, Yuanbo Yang, Jiahao Shao, Hongyu Zhou, Katja Schwarz, Yiyi Liao

2602.08067 2026-02-10 cs.LG

Enhancing Bandit Algorithms with LLMs for Time-varying User Preferences in Streaming Recommendations

Chenglei Shen, Yi Zhan, Weijie Yu, Xiao Zhang, Jun Xu

2602.08063 2026-02-10 cs.LG

Efficient Distribution Learning with Error Bounds in Wasserstein Distance

Eduardo Figueiredo, Steven Adams, Luca Laurenti

2602.08062 2026-02-10 cs.LG cs.CR

Efficient and Adaptable Detection of Malicious LLM Prompts via Bootstrap Aggregation

Shayan Ali Hassan, Tao Ni, Zafar Ayyub Qazi, Marco Canini

Details

English abstract

Large Language Models (LLMs) have demonstrated remarkable capabilities in natural language understanding, reasoning, and generation. However, these systems remain susceptible to malicious prompts that induce unsafe or policy-violating behavior through harmful requests, jailbreak techniques, and prompt injection attacks. Existing defenses face fundamental limitations: black-box moderation APIs offer limited transparency and adapt poorly to evolving threats, while white-box approaches using large LLM judges impose prohibitive computational costs and require expensive retraining for new attacks. Current systems force designers to choose between performance, efficiency, and adaptability. To address these challenges, we present BAGEL (Bootstrap AGgregated Ensemble Layer), a modular, lightweight, and incrementally updatable framework for malicious prompt detection. BAGEL employs an ensemble of fine-tuned models, inspired by bootstrap aggregation and mixture of experts, in which each model is specialized on a different attack dataset. At inference, BAGEL uses a random forest router to identify the most suitable ensemble member, then applies stochastic selection to sample additional members for prediction aggregation. When new attacks emerge, BAGEL updates incrementally by fine-tuning a small prompt-safety classifier (86M parameters) and adding the resulting model to the ensemble. BAGEL achieves an F1 score of 0.92 by selecting just 5 ensemble members (430M parameters), outperforming the OpenAI Moderation API and ShieldGemma, which require billions of parameters. Performance remains robust after nine incremental updates, and BAGEL provides interpretability through its router's structural features. Our results show that ensembles of small fine-tuned classifiers can match or exceed billion-parameter guardrails while offering the adaptability and efficiency required for production systems.
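The inference flow described in this abstract (route to the most suitable member, stochastically sample additional members, aggregate their predictions) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function name `route_and_aggregate`, the score-weighted sampling without replacement, the mean-probability aggregation, and the 0.5 threshold are all assumptions made for illustration, and the random forest router is replaced by a plain list of precomputed, non-negative router scores.

```python
import random
from typing import Callable, List, Optional, Sequence, Tuple

# Hypothetical member type: maps a prompt to P(malicious) in [0, 1].
Classifier = Callable[[str], float]


def route_and_aggregate(
    prompt: str,
    members: Sequence[Classifier],
    router_scores: Sequence[float],
    k: int = 5,
    threshold: float = 0.5,
    rng: Optional[random.Random] = None,
) -> Tuple[bool, float, List[int]]:
    """Select k ensemble members and average their predictions.

    router_scores stands in for the router's output (one non-negative
    suitability score per member); the abstract's router is a random
    forest, which is not modeled here.
    """
    if len(members) != len(router_scores):
        raise ValueError("need exactly one router score per member")
    if not 1 <= k <= len(members):
        raise ValueError("k must be between 1 and the ensemble size")
    if rng is None:
        rng = random.Random()

    # Step 1: the member the router ranks highest is always included.
    top = max(range(len(members)), key=lambda i: router_scores[i])
    chosen = [top]

    # Step 2 (assumed scheme): sample the remaining k-1 members without
    # replacement, with probability proportional to their router score.
    pool = [i for i in range(len(members)) if i != top]
    while len(chosen) < k:
        weights = [router_scores[i] + 1e-9 for i in pool]
        pick = rng.choices(pool, weights=weights, k=1)[0]
        pool.remove(pick)
        chosen.append(pick)

    # Step 3 (assumed scheme): aggregate by averaging the probabilities.
    mean_p = sum(members[i](prompt) for i in chosen) / len(chosen)
    return mean_p >= threshold, mean_p, chosen
```

With the default `k=5`, five members of 86M parameters each would total 430M parameters, which matches the figure quoted in the abstract.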

2602.08061 2026-02-10 cs.AI q-bio.OT

Securing Dual-Use Pathogen Data of Concern

Doni Bloomfield, Allison Berke, Moritz S. Hanke, Aaron Maiwald, James R. M. Black, Toby Webster, Tina Hernandez-Boussard, Oliver M. Crook, Jassi Pannu

Comments 39th Conference on Neural Information Processing Systems (NeurIPS 2025) Workshop: Biosecurity Safeguards for Generative AI

2602.08059 2026-02-10 cs.CV cs.AI

DICE: Disentangling Artist Style from Content via Contrastive Subspace Decomposition in Diffusion Models

Tong Zhang, Ru Zhang, Jianyi Liu

2602.08057 2026-02-10 cs.CV cs.AI

Weak to Strong: VLM-Based Pseudo-Labeling as a Weakly Supervised Training Strategy in Multimodal Video-based Hidden Emotion Understanding Tasks

Yufei Wang, Haixu Liu, Tianxiang Xu, Chuancheng Shi, Hongsheng Xing

2602.08054 2026-02-10 cs.LG cs.AI

Epigraph-Guided Flow Matching for Safe and Performant Offline Reinforcement Learning

Manan Tayal, Mumuksh Tayal

Comments 23 pages, 8 figures

OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving