arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2602.16711 2026-02-19 cs.CV

TeCoNeRV: Leveraging Temporal Coherence for Compressible Neural Representations for Videos

Namitha Padmanabhan, Matthew Gwilliam, Abhinav Shrivastava

详情

英文摘要

Implicit Neural Representations (INRs) have recently demonstrated impressive performance for video compression. However, since a separate INR must be overfit for each video, scaling to high-resolution videos while maintaining encoding efficiency remains a significant challenge. Hypernetwork-based approaches predict INR weights (hyponetworks) for unseen videos at high speeds, but with low quality, large compressed size, and prohibitive memory needs at higher resolutions. We address these fundamental limitations through three key contributions: (1) an approach that decomposes the weight prediction task spatially and temporally, by breaking short video segments into patch tubelets, to reduce the pretraining memory overhead by 20$\times$; (2) a residual-based storage scheme that captures only differences between consecutive segment representations, significantly reducing bitstream size; and (3) a temporal coherence regularization framework that encourages changes in the weight space to be correlated with video content. Our proposed method, TeCoNeRV, achieves substantial improvements of 2.47dB and 5.35dB PSNR over the baseline at 480p and 720p on UVG, with 36% lower bitrates and 1.5-3$\times$ faster encoding speeds. With our low memory usage, we are the first hypernetwork approach to demonstrate results at 480p, 720p and 1080p on UVG, HEVC and MCL-JCV. Our project page is available at https://namithap10.github.io/teconerv/ .

URL PDF HTML ☆

赞 0 踩 0

2602.16710 2026-02-19 cs.RO

EgoScale: Scaling Dexterous Manipulation with Diverse Egocentric Human Data

Ruijie Zheng, Dantong Niu, Yuqi Xie, Jing Wang, Mengda Xu, Yunfan Jiang, Fernando Castañeda, Fengyuan Hu, You Liang Tan, Letian Fu, Trevor Darrell, Furong Huang, Yuke Zhu, Danfei Xu, Linxi Fan

2602.16709 2026-02-19 cs.LG math.ST stat.ME stat.TH

Knowledge-Embedded Latent Projection for Robust Representation Learning

Weijing Tang, Ming Yuan, Zongqi Xia, Tianxi Cai

2602.16704 2026-02-19 cs.CL

Reinforced Fast Weights with Next-Sequence Prediction

Hee Seung Hwang, Xindi Wu, Sanghyuk Chun, Olga Russakovsky

2602.16702 2026-02-19 cs.CV

Saliency-Aware Multi-Route Thinking: Revisiting Vision-Language Reasoning

Mingjia Shi, Yinhan He, Yaochen Zhu, Jundong Li

Comments preprint 10 pages, 4 figures

2602.16697 2026-02-19 cs.LG cs.DS

Protecting the Undeleted in Machine Unlearning

Aloni Cohen, Refael Kohen, Kobbi Nissim, Uri Stemmer

2602.16689 2026-02-19 cs.CV cs.LG

Are Object-Centric Representations Better At Compositional Generalization?

Ferdinand Kapl, Amir Mohammad Karimi Mamaghan, Maximilian Seitzer, Karl Henrik Johansson, Carsten Marr, Stefan Bauer, Andrea Dittadi

2602.16687 2026-02-19 cs.SD cs.CL eess.AS

Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens

Potsawee Manakul, Woody Haosheng Gan, Martijn Bartelds, Guangzhi Sun, William Held, Diyi Yang

2602.16684 2026-02-19 cs.LG

Retrieval-Augmented Foundation Models for Matched Molecular Pair Transformations to Recapitulate Medicinal Chemistry Intuition

Bo Pan, Peter Zhiping Zhang, Hao-Wei Pang, Alex Zhu, Xiang Yu, Liying Zhang, Liang Zhao

2602.16681 2026-02-19 cs.CV

VETime: Vision Enhanced Zero-Shot Time Series Anomaly Detection

Yingyuan Yang, Tian Lan, Yifei Gao, Yimeng Lu, Wenjun He, Meng Wang, Chenghao Liu, Chen Zhang

2602.16675 2026-02-19 cs.RO

Learning to unfold cloth: Scaling up world models to deformable object manipulation

Jack Rome, Stephen James, Subramanian Ramamoorthy

Comments 8 pages, 5 figures, 3 tables

2602.16673 2026-02-19 cs.LG cs.IR

Neighborhood Stability as a Measure of Nearest Neighbor Searchability

Thomas Vecchiato, Sebastian Bruch

2602.16669 2026-02-19 cs.CV

PredMapNet: Future and Historical Reasoning for Consistent Online HD Vectorized Map Construction

Bo Lang, Nirav Savaliya, Zhihao Zheng, Jinglun Feng, Zheng-Hang Yeh, Mooi Choo Chuah

Comments WACV 2026

2602.16664 2026-02-19 cs.CV

Unpaired Image-to-Image Translation via a Self-Supervised Semantic Bridge

Jiaming Liu, Felix Petersen, Yunhe Gao, Yabin Zhang, Hyojin Kim, Akshay S. Chaudhari, Yu Sun, Stefano Ermon, Sergios Gatidis

Comments 36 pages

2602.16660 2026-02-19 cs.CL cs.AI cs.LG

Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment

Yuyan Bu, Xiaohao Liu, ZhaoXing Ren, Yaodong Yang, Juntao Dai

Comments Accepted by ICLR 2026

2602.16643 2026-02-19 cs.LG cond-mat.stat-mech

Factorization Machine with Quadratic-Optimization Annealing for RNA Inverse Folding and Evaluation of Binary-Integer Encoding and Nucleotide Assignment

Shuta Kikuchi, Shu Tanaka

Comments 17 pages, 10 figures

2602.16641 2026-02-19 cs.RO

Towards Autonomous Robotic Kidney Ultrasound: Spatial-Efficient Volumetric Imaging via Template Guided Optimal Pivoting

Xihan Ma, Haichong Zhang

2602.16640 2026-02-19 cs.CL

Quecto-V1: Empirical Analysis of 8-bit Quantized Small Language Models for On-Device Legal Retrieval

Subrit Dikshit

Comments 5 pages, 2 tables

2602.16639 2026-02-19 cs.CL

AREG: Adversarial Resource Extraction Game for Evaluating Persuasion and Resistance in Large Language Models

Adib Sakhawat, Fardeen Sadab

Comments 15 pages, 5 figures, 11 tables. Includes appendix with detailed experimental results and prompts

2602.16629 2026-02-19 cs.LG cs.AI

Almost Sure Convergence of Differential Temporal Difference Learning for Average Reward Markov Decision Processes

Ethan Blaser, Jiuqi Wang, Shangtong Zhang

2602.16626 2026-02-19 cs.LG cs.AI q-bio.NC

A Systematic Evaluation of Sample-Level Tokenization Strategies for MEG Foundation Models

SungJun Cho, Chetan Gohil, Rukuang Huang, Oiwi Parker Jones, Mark W. Woolrich

Comments 15 pages, 10 figures, 1 table

2602.16609 2026-02-19 cs.CL cs.IR

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models

Antoine Chaffin, Luca Arnaboldi, Amélie Chatelain, Florent Krzakala

Comments 9 pages, 5 tables, 2 figures

2602.16607 2026-02-19 cs.CL

CitiLink-Summ: Summarization of Discussion Subjects in European Portuguese Municipal Meeting Minutes

Miguel Marques, Ana Luísa Fernandes, Ana Filipa Pacheco, Rute Rebouças, Inês Cantante, José Isidro, Luís Filipe Cunha, Alípio Jorge, Nuno Guimarães, Sérgio Nunes, António Leal, Purificação Silvano, Ricardo Campos

2602.16600 2026-02-19 cs.LG

Predicting The Cop Number Using Machine Learning

Meagan Mann, Christian Muise, Erin Meger

Comments 8 pages

2602.16590 2026-02-19 cs.CV cs.AI cs.LG

A Contrastive Learning Framework Empowered by Attention-based Feature Adaptation for Street-View Image Classification

Qi You, Yitai Cheng, Zichao Zeng, James Haworth

2602.16579 2026-02-19 cs.LG cs.AI physics.app-ph

AIFL: A Global Daily Streamflow Forecasting Model Using Deterministic LSTM Pre-trained on ERA5-Land and Fine-tuned on IFS

Maria Luisa Taccari, Kenza Tazi, Oisín M. Morrison, Andreas Grafberger, Juan Colonese, Corentin Carton de Wiart, Christel Prudhomme, Cinzia Mazzetti, Matthew Chantry, Florian Pappenberger

2602.16578 2026-02-19 cs.AI cs.CL

Creating a digital poet

Vered Tohar, Tsahi Hayat, Amir Leshem

Comments 24 pages, 3 figures

2602.16573 2026-02-19 cs.LG

MoDE-Boost: Boosting Shared Mobility Demand with Edge-Ready Prediction Models

Antonios Tziorvas, George S. Theodoropoulos, Yannis Theodoridis

Comments 25 pages

2602.16570 2026-02-19 cs.LG cs.DS

Steering diffusion models with quadratic rewards: a fine-grained analysis

Ankur Moitra, Andrej Risteski, Dhruv Rohatgi

2602.16569 2026-02-19 cs.CV cs.CR

Arc2Morph: Identity-Preserving Facial Morphing with Arc2Face

Nicolò Di Domenico, Annalisa Franco, Matteo Ferrara, Davide Maltoni