arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.12026 2026-03-13 cs.LG

Efficient Generative Modeling with Unitary Matrix Product States Using Riemannian Optimization

Haotong Duan, Zhongming Chen, Ngai Wong

详情

英文摘要

Tensor networks, which are originally developed for characterizing complex quantum many-body systems, have recently emerged as a powerful framework for capturing high-dimensional probability distributions with strong physical interpretability. This paper systematically studies matrix product states (MPS) for generative modeling and shows that unitary MPS, which is a tensor-network architecture that is both simple and expressive, offers clear benefits for unsupervised learning by reducing ambiguity in parameter updates and improving efficiency. To overcome the inefficiency of standard gradient-based MPS training, we develop a Riemannian optimization approach that casts probabilistic modeling as an optimization problem with manifold constraints, and further derive an efficient space-decoupling algorithm. Experiments on Bars-and-Stripes and EMNIST datasets demonstrate fast adaptation to data structure, stable updates, and strong performance while maintaining the efficiency and expressive power of MPS.

URL PDF HTML ☆

赞 0 踩 0

2603.12020 2026-03-13 cs.RO cs.AI

Sim-to-reality adaptation for Deep Reinforcement Learning applied to an underwater docking application

Alaaeddine Chaarani, Narcis Palomeras, Pere Ridao

Comments Currently under review by IROS 2026

2603.12016 2026-03-13 cs.CV q-bio.QM

Nyxus: A Next Generation Image Feature Extraction Library for the Big Data and AI Era

Nicholas Schaub, Andriy Kharchenko, Hamdah Abbasi, Sameeul Samee, Hythem Sidky, Nathan Hotaling

Comments 29 pages, 9 figures, 6 supplemental tables

2603.12015 2026-03-13 cs.LG cs.AI

Flowcean - Model Learning for Cyber-Physical Systems

Maximilian Schmidt, Swantje Plambeck, Markus Knitt, Hendrik Rose, Goerschwin Fey, Jan Christian Wieck, Stephan Balduin

2603.12013 2026-03-13 cs.CV

Pano360: Perspective to Panoramic Vision with Geometric Consistency

Zhengdong Zhu, Weiyi Xue, Zuyuan Yang, Wenlve Zhou, Zhiheng Zhou

Comments Accepted by CVPR2026

2603.12012 2026-03-13 cs.LG

Deep Learning-Based Metamodeling of Nonlinear Stochastic Dynamic Systems under Parametric and Predictive Uncertainty

Haimiti Atila, Seymour M. J. Spence

2603.12011 2026-03-13 cs.AI

Can RL Improve Generalization of LLM Agents? An Empirical Study

Zhiheng Xi, Xin Guo, Jiaqi Liu, Jiazheng Zhang, Yutao Fan, Zhihao Zhang, Shichun Liu, Mingxu Chai, Xiaowei Shi, Yitao Zhai, Xunliang Cai, Tao Gui, Qi Zhang, Xuanjing Huang

Comments Preprint, under review

2603.12008 2026-03-13 cs.CV

CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation

Ziqi Ye, Ziyang Gong, Ning Liao, Xiaoxing Hu, Di Wang, Hongruixuan Chen, Chen Huang, Yiguo He, Yuru Jia, Xiaoxing Wang, Haipeng Wang, Xue Yang, Junchi Yan

Comments 26 pages, 15 figures

2603.11992 2026-03-13 cs.AI cs.LG

Few-for-Many Personalized Federated Learning

Ping Guo, Tiantian Zhang, Xi Lin, Xiang Li, Zhi-Ri Tang, Qingfu Zhang

2603.11991 2026-03-13 cs.CL cs.AI cs.LG stat.ML

BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs

Ilias Aarab

Comments Accepted at ICLR 2026. 31 pages, 5 figures, 9 tables. Code: https://github.com/IliasAarab/btzsc ; Dataset: https://huggingface.co/datasets/btzsc/btzsc ; Leaderboard: https://huggingface.co/spaces/btzsc/btzsc-leaderboard . Proceedings of the Fourteenth International Conference on Learning Representations (ICLR 2026), 2026

详情

英文摘要

Zero-shot text classification (ZSC) offers the promise of eliminating costly task-specific annotation by matching texts directly to human-readable label descriptions. While early approaches have predominantly relied on cross-encoder models fine-tuned for natural language inference (NLI), recent advances in text-embedding models, rerankers, and instruction-tuned large language models (LLMs) have challenged the dominance of NLI-based architectures. Yet, systematically comparing these diverse approaches remains difficult. Existing evaluations, such as MTEB, often incorporate labeled examples through supervised probes or fine-tuning, leaving genuine zero-shot capabilities underexplored. To address this, we introduce BTZSC, a comprehensive benchmark of 22 public datasets spanning sentiment, topic, intent, and emotion classification, capturing diverse domains, class cardinalities, and document lengths. Leveraging BTZSC, we conduct a systematic comparison across four major model families, NLI cross-encoders, embedding models, rerankers and instruction-tuned LLMs, encompassing 38 public and custom checkpoints. Our results show that: (i) modern rerankers, exemplified by Qwen3-Reranker-8B, set a new state-of-the-art with macro F1 = 0.72; (ii) strong embedding models such as GTE-large-en-v1.5 substantially close the accuracy gap while offering the best trade-off between accuracy and latency; (iii) instruction-tuned LLMs at 4--12B parameters achieve competitive performance (macro F1 up to 0.67), excelling particularly on topic classification but trailing specialized rerankers; (iv) NLI cross-encoders plateau even as backbone size increases; and (v) scaling primarily benefits rerankers and LLMs over embedding models. BTZSC and accompanying evaluation code are publicly released to support fair and reproducible progress in zero-shot text understanding.

URL PDF HTML ☆

赞 0 踩 0

2603.11989 2026-03-13 cs.LG math.OC stat.ML

On-Average Stability of Multipass Preconditioned SGD and Effective Dimension

Simon Vary, Tyler Farghly, Ilja Kuzborskij, Patrick Rebeschini

Comments 35 pages, 1 figure

2603.11987 2026-03-13 cs.AI

LABSHIELD: A Multimodal Benchmark for Safety-Critical Reasoning and Planning in Scientific Laboratories

Qianpu Sun, Xiaowei Chi, Yuhan Rui, Ying Li, Kuangzhi Ge, Jiajun Li, Sirui Han, Shanghang Zhang

2603.11984 2026-03-13 cs.CV

Ada3Drift: Adaptive Training-Time Drifting for One-Step 3D Visuomotor Robotic Manipulation

Chongyang Xu, Yixian Zou, Ziliang Feng, Fanman Meng, Shuaicheng Liu

2603.11980 2026-03-13 cs.RO

Learning Visuomotor Policy for Multi-Robot Laser Tag Game

Kai Li, Shiyu Zhao

2603.11972 2026-03-13 cs.LG cs.NE math.FA

Topological DeepONets and a generalization of the Chen-Chen operator approximation theorem

Vugar Ismailov

Comments 22 pages, 1 figure, 23 references

2603.11970 2026-03-13 cs.LG

Statistical and structural identifiability in representation learning

Walter Nelson, Marco Fumero, Theofanis Karaletsos, Francesco Locatello

Comments International Conference on Learning Representations (ICLR) 2026

2603.11963 2026-03-13 cs.RO

Energy Prediction on Sloping Ground for Quadruped Robots

Mohamed Ounally, Cyrille Pierre, Johann Laconte

Comments Presented at 3D-Advice (Advanced 3D Vision for Complex Environments) Workshop, ECMR 2025

2603.11957 2026-03-13 cs.CL

CHiL(L)Grader: Calibrated Human-in-the-Loop Short-Answer Grading

Pranav Raikote, Korbinian Randl, Ioanna Miliou, Athanasios Lakes, Panagiotis Papapetrou

2603.11955 2026-03-13 cs.CL

PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents

Minjia Wang, Yunfeng Wang, Xiao Ma, Dexin Lv, Qifan Guo, Lynn Zheng, Benliang Wang, Lei Wang, Jiannan Li, Yongwei Xing, David Xu, Zheng Sun

Comments EACL 2026 Industry Track

2603.11952 2026-03-13 cs.CV

Preliminary analysis of RGB-NIR Image Registration techniques for off-road forestry environments

Pankaj Deoli, Karthik Ranganath, Karsten Berns

Comments Preliminary results

2603.11950 2026-03-13 cs.AI cs.LG

Learning Transferable Sensor Models via Language-Informed Pretraining

Yuliang Chen, Arvind Pillai, Yu Yvonne Wu, Tess Z. Griffin, Lisa Marsch, Michael V. Heinz, Nicholas C. Jacobson, Andrew Campbell

2603.11947 2026-03-13 cs.SD cs.CL cs.MM eess.AS

Resurfacing Paralinguistic Awareness in Large Audio Language Models

Hao Yang, Minghan Wang, Tongtong Wu, Lizhen Qu, Ehsan Shareghi, Gholamreza Haffari

Comments Submitted to Interspeech 2026

2603.11944 2026-03-13 cs.LG cs.AI

Effective Resistance Rewiring: A Simple Topological Correction for Over-Squashing

Bertran Miquel-Oliver, Manel Gil-Sorribes, Victor Guallar, Alexis Molina

详情

英文摘要

Graph Neural Networks struggle to capture long-range dependencies due to over-squashing, where information from exponentially growing neighborhoods must pass through a small number of structural bottlenecks. While recent rewiring methods attempt to alleviate this limitation, many rely on local criteria such as curvature, which can overlook global connectivity constraints that restrict information flow. We introduce Effective Resistance Rewiring (ERR), a simple topology correction strategy that uses effective resistance as a global signal to detect structural bottlenecks. ERR iteratively adds edges between node pairs with the largest resistance while removing edges with minimal resistance, strengthening weak communication pathways while controlling graph densification under a fixed edge budget. The procedure is parameter-free beyond the rewiring budget and relies on a single global measure aggregating all paths between node pairs. Beyond predictive performance with GCN models, we analyze how rewiring affects message propagation. By tracking cosine similarity between node embeddings across layers, we examine how the relationship between initial node features and learned representations evolves during message passing, comparing graphs with and without rewiring. This analysis helps determine whether improvements arise from better long-range communication rather than changes in embedding geometry. Experiments on homophilic and heterophilic graphs, including directed settings with DirGCN, reveal a trade-off between over-squashing and oversmoothing, where oversmoothing corresponds to the loss of representation diversity across layers. Resistance-guided rewiring improves connectivity and signal propagation but can accelerate representation mixing in deep models. Combining ERR with normalization techniques such as PairNorm stabilizes this trade-off and improves performance.

URL PDF HTML ☆

赞 0 踩 0

2603.11942 2026-03-13 cs.LG

Causal Matrix Completion under Multiple Treatments via Mixed Synthetic Nearest Neighbors

Minrui Luo, Zhiheng Zhang

2603.10929 2026-03-13 cs.CV cs.RO

Lifelong Imitation Learning with Multimodal Latent Replay and Incremental Adjustment

Fanqi Yu, Matteo Tiezzi, Tommaso Apicella, Cigdem Beyan, Vittorio Murino

Comments Accepted to CVPR 2026

2603.10725 2026-03-13 cs.SD cs.AI

Towards Robust Speech Deepfake Detection via Human-Inspired Reasoning

Artem Dvirniak, Evgeny Kushnir, Dmitrii Tarasov, Artem Iudin, Oleg Kiriukhin, Mikhail Pautov, Dmitrii Korzh, Oleg Y. Rogov

2603.10713 2026-03-13 cs.SD cs.AI

Probabilistic Verification of Voice Anti-Spoofing Models

Evgeny Kushnir, Alexandr Kozodaev, Dmitrii Korzh, Mikhail Pautov, Oleg Kiriukhin, Oleg Y. Rogov

Comments The paper was submitted for review to Interspeech 2026

2603.10573 2026-03-13 cs.LG cs.AI

Implicit Statistical Inference in Transformers: Approximating Likelihood-Ratio Tests In-Context

Faris Chaudhry, Siddhant Gadkari

Comments Accepted at the Latent and Implicit Thinking Workshop (ICLR 2026)

2603.09427 2026-03-13 cs.LG

Impact of Markov Decision Process Design on Sim-to-Real Reinforcement Learning

Tatjana Krau, Jorge Mandlmaier, Tobias Damm, Frieder Heieck

Comments This work has been submitted to the IEEE for possible publication

2603.07504 2026-03-13 cs.CV

High-Fidelity Medical Shape Generation via Skeletal Latent Diffusion

Guoqing Zhang, Jingyun Yang, Siqi Chen, Anping Zhang, Yang Li

Comments 11 pages, 5 figures, journal