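arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems science, and other fields.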

2503.13587 2026-02-27 cs.CV

UniFuture: A 4D Driving World Model for Future Generation and Perception

Dingkang Liang, Dingyuan Zhang, Xin Zhou, Sifan Tu, Tianrui Feng, Xiaofan Li, Yumeng Zhang, Mingyang Du, Xiao Tan, Xiang Bai

Comments Accepted by ICRA 2026

Abstract

We present UniFuture, a unified 4D Driving World Model designed to simulate the dynamic evolution of the 3D physical world. Unlike existing driving world models that focus solely on 2D pixel-level video generation (lacking geometry) or static perception (lacking temporal dynamics), our approach bridges appearance and geometry to construct a holistic 4D representation. Specifically, we treat future RGB images and depth maps as coupled projections of the same 4D reality and model them jointly within a single framework. To achieve this, we introduce a Dual-Latent Sharing (DLS) scheme, which maps visual and geometric modalities into a shared spatio-temporal latent space, implicitly entangling texture with structure. Furthermore, we propose a Multi-scale Latent Interaction (MLI) mechanism, which enforces bidirectional consistency: geometry constrains visual synthesis to prevent structural hallucinations, while visual semantics refine geometric estimation. During inference, UniFuture can forecast high-fidelity, geometrically consistent 4D scene sequences (image-depth pairs) from a single current frame. Extensive experiments on the nuScenes and Waymo datasets demonstrate that our method outperforms specialized models in both future generation and geometry perception, highlighting the efficacy of unified 4D modeling for autonomous driving. The code is available at https://github.com/dk-liang/UniFuture.

2503.10981 2026-02-27 cs.CV

CLIP-Free, Label Free, Unsupervised Concept Bottleneck Models

Fawaz Sammani, Jonas Fischer, Nikos Deligiannis

Comments CVPR 2026 (Findings)

2503.10503 2026-02-27 cs.LG

Sample Compression for Self Certified Continual Learning

Jacob Comeau, Mathieu Bazinet, Pascal Germain, Cem Subakan

2503.05560 2026-02-27 cs.LG cond-mat.soft physics.bio-ph q-bio.QM

Global graph features unveiled by unsupervised geometric deep learning

Mirja Granfors, Jesús Pineda, Blanca Zufiria Gerbolés, Joana B. Pereira, Carlo Manzo, Giovanni Volpe

Comments 28 pages, 6 figures

2502.14377 2026-02-27 cs.CV

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

Ke Cao, Jing Wang, Ao Ma, Jiasong Feng, Xuanhua He, Run Ling, Haowei Liu, Jian Lu, Wei Feng, Haozhe Wang, Hongjuan Pei, Yihua Shao, Zhanjie Zhang, Jie Zhang

Comments AAAI 2026

2502.12108 2026-02-27 cs.LG cs.AI stat.ML

Using the Path of Least Resistance to Explain Deep Networks

Sina Salek, Joseph Enguehard

2502.11816 2026-02-27 cs.LG

Mixing It Up: Exploring Mixer Networks for Irregular Multivariate Time Series Forecasting

Christian Klötergens, Tim Dernedde, Lars Schmidt-Thieme, Vijaya Krishna Yalavarthi

2502.02088 2026-02-27 cs.CV cs.AI

Dual-IPO: Dual-Iterative Preference Optimization for Text-to-Video Generation

Xiaomeng Yang, Mengping Yang, Jia Gong, Luozheng Qin, Zhiyu Tan, Hao Li

Comments To appear in ICLR 2026, GitHub Code: https://github.com/SAIS-FUXI/IPO

2502.01932 2026-02-27 cs.RO cs.AI cs.LG

VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play

Zelai Xu, Ruize Zhang, Chao Yu, Huining Yuan, Xiangmin Yi, Shilong Ji, Chuqi Wang, Wenhao Tang, Feng Gao, Wenbo Ding, Xinlei Chen, Yu Wang

Comments Accepted by NeurIPS 2025

2501.16904 2026-02-27 cs.CV

Diffusion or Non-Diffusion Adversarial Defenses: Rethinking the Relation between Classifier and Adversarial Purifier

Yuan-Chih Chen, Chun-Shien Lu

2501.02158 2026-02-27 cs.CV

Joint Optimization for 4D Human-Scene Reconstruction in the Wild

Zhizheng Liu, Joe Lin, Wayne Wu, Bolei Zhou

Comments Project Page: https://vail-ucla.github.io/JOSH/

2412.17287 2026-02-27 cs.AI

LLM4AD: A Platform for Algorithm Design with Large Language Model

Fei Liu, Rui Zhang, Zhuoliang Xie, Rui Sun, Kai Li, Qinglong Hu, Ping Guo, Xi Lin, Xialiang Tong, Mingxuan Yuan, Zhenkun Wang, Zhichao Lu, Qingfu Zhang

2412.06491 2026-02-27 cs.CV cs.RO

PPT: Pretraining with Pseudo-Labeled Trajectories for Motion Forecasting

Yihong Xu, Yuan Yin, Éloi Zablocki, Tuan-Hung Vu, Alexandre Boulch, Matthieu Cord

Comments 8 pages, 6 figures, accepted to ICRA 2026

2410.15047 2026-02-27 cs.LG

Testing the Efficacy of Hyperparameter Optimization Algorithms in Short-Term Load Forecasting

Tugrul Cabir Hakyemez, Omer Adar

Comments This is a conference paper submitted to the 2nd IEEE International Conference on IoT, Communication and Automation Technology (ICICAT 2024). It is currently under review.

2410.12439 2026-02-27 cs.LG

Beyond Attribution: Unified Concept-Level Explanations

Junhao Liu, Haonan Yu, Xin Zhang

2408.17251 2026-02-27 cs.CV cs.AI

Abstracted Gaussian Prototypes for True One-Shot Concept Learning

Chelsea Zou, Kenneth J. Kurtz

2408.12791 2026-02-27 cs.CV

Open-Set Deepfake Detection: A Parameter-Efficient Adaptation Method with Forgery Style Mixture

Chenqi Kong, Anwei Luo, Peijun Bao, Haoliang Li, Renjie Wan, Zengwei Zheng, Anderson Rocha, Alex C. Kot

2408.10517 2026-02-27 cs.LG cs.AI

Decision MetaMamba: Enhancing Selective SSM in Offline RL with Heterogeneous Sequence Mixing

Wall Kim, Chaeyoung Song, Hanul Kim

Comments 17 pages; Previously this version appeared as arXiv:2602.19805 which was submitted as a new work by accident. This is a revised version of the previously withdrawn manuscript, updated with new experiments and results

2408.08781 2026-02-27 cs.AI cs.CL

Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions

Bhuvanashree Murugadoss, Christian Poelitz, Ian Drosos, Vu Le, Nick McKenna, Carina Suzana Negreanu, Chris Parnin, Advait Sarkar

2408.01503 2026-02-27 cs.LG

Efficient Graph Coloring with Neural Networks: A Physics-Inspired Approach for Large Graphs

Lorenzo Colantonio, Andrea Cacioppo, Federico Scarpati, Maria Chiara Angelini, Federico Ricci-Tersenghi, Stefano Giagu

Comments 15 pages, 9 figures

2407.17120 2026-02-27 cs.LG cs.AI

Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective

Jingren Liu, Zhong Ji, YunLong Yu, Jiale Cao, Yanwei Pang, Jungong Han, Xuelong Li

2406.09293 2026-02-27 cs.CV cs.GR

StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning

Giuseppe Vecchio

2402.16639 2026-02-27 cs.LG stat.CO

Differentiable Particle Filtering using Optimal Placement Resampling

Domonkos Csuzdi, Olivér Törő, Tamás Bécsi

2309.15604 2026-02-27 cs.LG q-bio.MN q-bio.QM stat.ML

Entropic Matching for Expectation Propagation of Markov Jump Processes

Yannick Eich, Bastian Alt, Heinz Koeppl

Comments AISTATS 2025

2305.01898 2026-02-27 cs.AI cs.RO cs.SE

VSRQ: Quantitative Assessment Method for Safety Risk of Vehicle Intelligent Connected System

Tian Zhang, Wenshan Guan, Hao Miao, Xiujie Huang, Zhiquan Liu, Chaonan Wang, Quanlong Guan, Liangda Fang, Zhifei Duan

DOI: 10.1109/TVT.2024.3469389
Journal ref: IEEE Transactions on Vehicular Technology, vol. 74, no. 2, pp. 2635-2651, 2025

Abstract

The field of intelligent connected systems in modern vehicles continues to expand, and vehicle functions are becoming increasingly complex. This has also led to a growing number of vehicle vulnerabilities and many safety issues. Therefore, it is particularly important to identify high-risk vehicle intelligent connected systems, because doing so can inform security personnel which systems are most vulnerable to attacks, allowing them to conduct more thorough inspections and tests. In this paper, we develop a new model for vehicle risk assessment, the VSRQ model, by combining I-FAHP with FCA clustering. We extract important indicators related to vehicle safety, use fuzzy cluster analysis (FCA) combined with the fuzzy analytic hierarchy process (FAHP) to mine the vulnerable components of the vehicle intelligent connected system, and conduct priority testing on vulnerable components to reduce risks and ensure vehicle safety. We evaluate the model on OpenPilot and experimentally demonstrate the effectiveness of the VSRQ model in identifying the safety of vehicle intelligent connected systems. The experiment fully complies with the ISO 26262 and ISO/SAE 21434 standards, and our model has a higher accuracy rate than other models. These results provide a promising new research direction for predicting the security risks of vehicle intelligent connected systems and provide typical application tasks for VSRQ. The experimental results show that the accuracy rate is 94.36%, and the recall rate is 73.43%, which is at least 14.63% higher than all other known indicators.

2202.03045 2026-02-27 cs.LG stat.ML

Metric-valued regression

Dan Tsir Cohen, Aryeh Kontorovich

2602.22660 2026-02-27 cs.LG

LEDA: Latent Semantic Distribution Alignment for Multi-domain Graph Pre-training

Lianze Shan, Jitao Zhao, Dongxiao He, Siqi Liu, Jiaxu Cui, Weixiong Zhang

Comments Accepted by WWW-26, 12 pages, 2 figures

2602.22659 2026-02-27 cs.CV cs.MM

Scaling Audio-Visual Quality Assessment Dataset via Crowdsourcing

Renyu Yang, Jian Jin, Lili Meng, Meiqin Liu, Yilin Wang, Balu Adsumilli, Weisi Lin

Comments Accepted to ICASSP 2026. 5 pages (main paper) + 8 pages (supplementary material)

2602.22650 2026-02-27 cs.AI

AHBid: An Adaptable Hierarchical Bidding Framework for Cross-Channel Advertising

Xinxin Yang, Yangyang Tang, Yikun Zhou, Yaolei Liu, Yun Li, Bo Yang

Comments 11 pages, 6 figures, accepted by WWW'2026

DOI: 10.1145/3774904.3792322

Abstract

In online advertising, the inherent complexity and dynamic nature of advertising environments necessitate the use of auto-bidding services to assist advertisers in bid optimization. This complexity is further compounded in multi-channel scenarios, where effective allocation of budgets and constraints across channels with distinct behavioral patterns becomes critical for optimizing return on investment. Current approaches predominantly rely on either optimization-based strategies or reinforcement learning techniques. However, optimization-based methods lack flexibility in adapting to dynamic market conditions, while reinforcement learning approaches often struggle to capture essential historical dependencies and observational patterns within the constraints of Markov Decision Process frameworks. To address these limitations, we propose AHBid, an Adaptable Hierarchical Bidding framework that integrates generative planning with real-time control. The framework employs a high-level generative planner based on diffusion models to dynamically allocate budgets and constraints by effectively capturing historical context and temporal patterns. We introduce a constraint enforcement mechanism to ensure compliance with specified constraints, along with a trajectory refinement mechanism that enhances adaptability to environmental changes through the utilization of historical data. The system further incorporates a control-based bidding algorithm that synergistically combines historical knowledge with real-time information, significantly improving both adaptability and operational efficacy. Extensive experiments conducted on large-scale offline datasets and through online A/B tests demonstrate the effectiveness of AHBid, yielding a 13.57% increase in overall return compared to existing baselines.

2602.22649 2026-02-27 cs.CV

Interactive Medical-SAM2 GUI: A Napari-based semi-automatic annotation tool for medical images

Woojae Hong, Jong Ha Hwang, Jiyong Chung, Joongyeon Choi, Hyunngun Kim, Yong Hwy Kim

Comments 6 pages, 2 figures, planning to submit to JOSS (Journal of Open Source Software)