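arXivDaily daily academic digest: synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and more.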

Haitao Li, Yifan Chen, Shuo Miao, Qian Dong, Jia Chen, Yiran Hu, Junjie Chen, Minghao Qin, Yueyue Wu, Yujia Zhou, Qingyao Ai, Yiqun Liu, Cheng Luo, Quan Zhou, Ya Zhang, Jikun Hu

Comments 25 pages, v1

2602.00450 2026-02-04 cs.CV

Model Optimization for Multi-Camera 3D Detection and Tracking

Ethan Anderson, Justin Silva, Kyle Zheng, Sameer Pusegaonkar, Yizhou Wang, Zheng Tang, Sujit Biswas

2601.21494 2026-02-04 cs.AI cs.CL

The Path of Least Resistance: Guiding LLM Reasoning Trajectories with Prefix Consensus

Ishan Jindal, Sai Prashanth Akuthota, Jayant Taneja, Sachin Dev Sharma

Comments Accepted at ICLR 2026. https://openreview.net/forum?id=hrnSqERgPn

2601.18930 2026-02-04 cs.LG cs.AI cs.RO

Toward Learning POMDPs Beyond Full-Rank Actions and State Observability

Seiji Shaw, Travis Manderson, Chad Kessens, Nicholas Roy

Comments Update abstract

2601.18795 2026-02-04 cs.LG cs.AI cs.CL

Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes

Amrith Setlur, Zijian Wang, Andrew Cohen, Paria Rashidinejad, Sang Michael Xie

2601.11265 2026-02-04 cs.LG

Sample-Near-Optimal Agnostic Boosting with Improved Running Time

Arthur da Cunha, Mikael Møller Høgsgaard, Andrea Paudice

Comments 28 pages, 0 figures. Accepted at the 37th International Conference on Algorithmic Learning Theory (ALT 2026)

2601.07891 2026-02-04 cs.LG cs.AI cs.CL

KVzap: Fast, Adaptive, and Faithful KV Cache Pruning

Simon Jegou, Maximilian Jeblick

2601.07020 2026-02-04 cs.CL cs.AI

TurkBench: A Benchmark for Evaluating Turkish Large Language Models

Çağrı Toraman, Ahmet Kaan Sever, Ayse Aysu Cengiz, Elif Ecem Arslan, Görkem Sevinç, Mete Mert Birdal, Yusuf Faruk Güldemir, Ali Buğra Kanburoğlu, Sezen Felekoğlu, Osman Gürlek, Sarp Kantar, Birsen Şahin Kütük, Büşra Tufan, Elif Genç, Serkan Coşkun, Gupse Ekin Demir, Muhammed Emin Arayıcı, Olgun Dursun, Onur Gungor, Susan Üsküdarlı, Abdullah Topraksoy, Esra Darıcı

Comments Accepted by EACL 2026 SIGTURK

2601.05083 2026-02-04 cs.CV cs.AI cs.RO

Driving on Registers

Ellington Kirby, Alexandre Boulch, Yihong Xu, Yuan Yin, Gilles Puy, Éloi Zablocki, Andrei Bursuc, Spyros Gidaris, Renaud Marlet, Florent Bartoccioni, Anh-Quan Cao, Nermin Samet, Tuan-Hung VU, Matthieu Cord

2512.08042 2026-02-04 cs.CV

Towards Sustainable Universal Deepfake Detection with Frequency-Domain Masking

Chandler Timm C. Doloriel, Habib Ullah, Kristian Hovde Liland, Fadi Al Machot, Ngai-Man Cheung

Comments Accepted to ACM TOMM

2511.02570 2026-02-04 cs.LG

Dynamic Priors in Bayesian Optimization for Hyperparameter Optimization

Lukas Fehring, Marcel Wever, Maximilian Spliethöver, Leona Hennig, Henning Wachsmuth, Marius Lindauer

Comments 8 pages plus references and appendix

2510.24473 2026-02-04 cs.LG

Methodology for Comparing Machine Learning Algorithms for Survival Analysis

Lucas Buk Cardoso, Simone Aldrey Angelo, Yasmin Pacheco Gil Bonilha, Fernando Maia, Adeylson Guimarães Ribeiro, Maria Paula Curado, Gisele Aparecida Fernandes, Vanderlei Cunha Parro, Flávio Almeida de Magalhães Cipparrone, Alexandre Dias Porto Chiavegatto Filho, Victor Wünsch Filho, Tatiana Natasha Toporcov

2510.22926 2026-02-04 cs.LG

Simple Denoising Diffusion Language Models

Huaisheng Zhu, Zhengyu Chen, Shijie Zhou, Zhihui Xie, Yige Yuan, Shiqi Chen, Zhimeng Guo, Siyuan Xu, Hangfan Zhang, Vasant Honavar, Teng Xiao

2510.19326 2026-02-04 cs.CL

Slot Filling as a Reasoning Task for SpeechLLMs

Kadri Hacioglu, Manjunath K E, Andreas Stolcke

Journal ref Proc. IEEE ICASSP, 2026

2510.16004 2026-02-04 cs.AI physics.flu-dyn

PAINT: Parallel-in-time Neural Twins for Dynamical System Reconstruction

Andreas Radler, Vincent Seyfried, Johannes Brandstetter, Thomas Lichtenegger

Comments 28 pages, 23 figures

2510.07743 2026-02-04 cs.CL

OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment

Tianci Liu, Ran Xu, Tony Yu, Ilgee Hong, Carl Yang, Tuo Zhao, Haoyu Wang

Comments The first two authors contributed equally. Updated OpenRubrics dataset, RMs, and results

2509.26468 2026-02-04 cs.LG

fev-bench: A Realistic Benchmark for Time Series Forecasting

Oleksandr Shchur, Abdul Fatir Ansari, Caner Turkmen, Lorenzo Stella, Nick Erickson, Pablo Guerron, Michael Bohlke-Schneider, Yuyang Wang

2509.23155 2026-02-04 cs.RO

LAGEA: Language Guided Embodied Agents for Robotic Manipulation

Abdul Monaf Chowdhury, Akm Moshiur Rahman Mazumder, Rabeya Akter, Safaeid Hossain Arib

2509.11442 2026-02-04 cs.CV

MultiMAE for Brain MRIs: Robustness to Missing Inputs Using Multi-Modal Masked Autoencoder

Ayhan Can Erdur, Christian Beischl, Daniel Scholz, Jiazhen Pan, Benedikt Wiestler, Daniel Rueckert, Jan C Peeken

Comments Official implementation: https://github.com/chris-beischl/multimae-for-brain-mri

2509.05356 2026-02-04 cs.RO cs.AI cs.LG

Spiking Neural Networks for Continuous Control via End-to-End Model-Based Learning

Justus Huebotter, Pablo Lanillos, Marcel van Gerven, Serge Thill

2508.02831 2026-02-04 cs.CV

Affine-Equivariant Kernel Space Encoding for NeRF Editing

Mikołaj Zieliński, Krzysztof Byrski, Tomasz Szczepanik, Dominik Belter, Przemysław Spurek

2507.23440 2026-02-04 cs.AI

Self-Foveate: Enhancing Diversity and Difficulty of Synthesized Instructions from Unsupervised Text via Multi-Level Foveation

Mingzhe Li, Xin Lu, Yanyan Zhao

Comments Accepted to ACL 2025 (Findings). 23 pages, 4 figures

2507.04075 2026-02-04 cs.LG cs.AI cs.CV

Accurate and Efficient World Modeling with Masked Latent Transformers

Maxime Burchi, Radu Timofte

2506.21551 2026-02-04 cs.LG

Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Ziyue Li, Chenrui Fan, Tianyi Zhou

Comments Accepted at ICLR 2026

2506.18939 2026-02-04 cs.LG cs.AI

Damba-ST: Domain-Adaptive Mamba for Efficient Urban Spatio-Temporal Prediction

Rui An, Yifeng Zhang, Ziran Liang, Wenqi Fan, Yuxuan Liang, Xuequn Shang, Qing Li

Comments Accepted by ICDE 2026

Abstract

Training urban spatio-temporal foundation models that generalize well across diverse regions and cities is critical for deploying urban services in unseen or data-scarce regions. Recent studies have typically focused on fusing cross-domain spatio-temporal data to train unified Transformer-based models. However, these models suffer from quadratic computational complexity and high memory overhead, limiting their scalability and practical deployment. Inspired by the efficiency of Mamba, a state space model with linear time complexity, we explore its potential for efficient urban spatio-temporal prediction. However, directly applying Mamba as a spatio-temporal backbone leads to negative transfer and severe performance degradation. This is primarily due to spatio-temporal heterogeneity and the recursive mechanism of Mamba's hidden state updates, which limit cross-domain generalization. To overcome these challenges, we propose Damba-ST, a novel domain-adaptive Mamba-based model for efficient urban spatio-temporal prediction. Damba-ST retains Mamba's linear complexity advantage while significantly enhancing its adaptability to heterogeneous domains. Specifically, we introduce two core innovations: (1) a domain-adaptive state space model that partitions the latent representation space into a shared subspace for learning cross-domain commonalities and independent, domain-specific subspaces for capturing intra-domain discriminative features; (2) three distinct Domain Adapters, which serve as domain-aware proxies to bridge disparate domain distributions and facilitate the alignment of cross-domain commonalities. Extensive experiments demonstrate the generalization and efficiency of Damba-ST. It achieves state-of-the-art performance on prediction tasks and demonstrates strong zero-shot generalization, enabling seamless deployment in new urban environments without extensive retraining or fine-tuning.

2506.04536 2026-02-04 cs.LG cs.AI q-bio.NC

NOBLE -- Neural Operator with Biologically-informed Latent Embeddings to Capture Experimental Variability in Biological Neuron Models

Luca Ghafourpour, Valentin Duruisseaux, Bahareh Tolooshams, Philip H. Wong, Costas A. Anastassiou, Anima Anandkumar

Abstract

Characterizing the cellular properties of neurons is fundamental to understanding their function in the brain. In this quest, the generation of bio-realistic models is central towards integrating multimodal cellular data sets and establishing causal relationships. However, current modeling approaches remain constrained by the limited availability and intrinsic variability of experimental neuronal data. The deterministic formalism of bio-realistic models currently precludes accounting for the natural variability observed experimentally. While deep learning is becoming increasingly relevant in this space, it fails to capture the full biophysical complexity of neurons, their nonlinear voltage dynamics, and variability. To address these shortcomings, we introduce NOBLE, a neural operator framework that learns a mapping from a continuous frequency-modulated embedding of interpretable neuron features to the somatic voltage response induced by current injection. Trained on synthetic data generated from bio-realistic neuron models, NOBLE predicts distributions of neural dynamics accounting for the intrinsic experimental variability. Unlike conventional bio-realistic neuron models, interpolating within the embedding space offers models whose dynamics are consistent with experimentally observed responses. NOBLE enables the efficient generation of synthetic neurons that closely resemble experimental data and exhibit trial-to-trial variability, offering a $4200\times$ speedup over the numerical solver. NOBLE is the first scaled-up deep learning framework that validates its generalization with real experimental data. To this end, NOBLE captures fundamental neural properties in a unique and emergent manner that opens the door to a better understanding of cellular composition and computations, neuromorphic architectures, large-scale brain circuits, and general neuroAI applications.

2505.20272 2026-02-04 cs.CV

Ground-R1: Incentivizing Grounded Visual Reasoning via Reinforcement Learning

Meng Cao, Haoze Zhao, Can Zhang, Xiaojun Chang, Ian Reid, Xiaodan Liang

2505.17730 2026-02-04 cs.LG

Redirection for Erasing Memory (REM): Towards a universal unlearning method for corrupted data

Stefan Schoepf, Michael Curtis Mozer, Nicole Elyse Mitchell, Alexandra Brintrup, Georgios Kaissis, Peter Kairouz, Eleni Triantafillou

Comments Accepted as a main track paper at ICLR 2026 https://openreview.net/forum?id=xG0mQ4Xsfm

2505.17001 2026-02-04 cs.CV

Seeing through Satellite Images at Street Views

Ming Qian, Bin Tan, Qiuyu Wang, Xianwei Zheng, Hanjiang Xiong, Gui-Song Xia, Yujun Shen, Nan Xue

Comments Accepted to IEEE TPAMI. Initially submitted in July 2024. Code is available on https://qianmingduowan.github.io/sat2density-pp/

2505.16552 2026-02-04 cs.CL

Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains

Wenhui Tan, Jiaze Li, Jianzhong Ju, Zhenbo Luo, Ruihua Song, Jian Luan

Comments 15 pages, 8 figures

AI Large Models

Vision and Robotics

Science and Healthcare

LegalOne: A Family of Foundation Models for Reliable Legal Reasoning