arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2408.01180 2026-03-17 cs.SD cs.IR cs.LG eess.AS

Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio Generation

HaeJun Yoo, Hao-Wen Dong, Jongmin Jung, Dasaem Jeong

Comments Accepted at 25th International Society for Music Information Retrieval Conference (ISMIR 2024)

2407.11477 2026-03-17 cs.LG cs.AI

TraffiDent: A Dataset for Understanding the Interplay Between Traffic Dynamics and Incidents

Xiaochuan Gou, Ziyue Li, Tian Lan, Junpeng Lin, Zhishuai Li, Bingyu Zhao, Chen Zhang, Di Wang, Xiangliang Zhang

2407.00104 2026-03-17 cs.LG cs.AI cs.CV cs.IR eess.IV

MultiTask Learning AI system to assist BCC diagnosis with dual explanation

Iván Matas, Carmen Serrano, Francisca Silva, Amalia Serrano, Tomás Toledo-Pastrana, Begoña Acha

Comments 23 pages, 4 figures, 5 tables, under review in Scientific Reports

2405.20791 2026-03-17 cs.CV cs.LG

MetaGS: A Meta-Learned Gaussian-Phong Model for Out-of-Distribution 3D Scene Relighting

Yumeng He, Yunbo Wang, Xiaokang Yang

Comments Accepted by NeurIPS 2025 (Spotlight). Code: https://github.com/raynehe/MetaGS

2405.13791 2026-03-17 cs.LG

Multi-Type Point Cloud Autoencoder: A Complete Equivariant Embedding for Molecule Conformation and Pose

Michael Kilgour, Mark Tuckerman, Jutta Rogal

2405.07615 2026-03-17 cs.CL

ViWikiFC: Fact-Checking for Vietnamese Wikipedia-Based Textual Knowledge Source

Hung Tuan Le, Long Truong To, Manh Trong Nguyen, Kiet Van Nguyen

2404.08522 2026-03-17 cs.LG physics.ao-ph

Fuxi-DA: A Generalized Deep Learning Data Assimilation Framework for Assimilating Satellite Observations

Xiaoze Xu, Xiuyu Sun, Wei Han, Xiaohui Zhong, Lei Chen, Hao Li

详情

DOI: 10.1038/s41612-025-01039-3

英文摘要

Data assimilation (DA), as an indispensable component within contemporary Numerical Weather Prediction (NWP) systems, plays a crucial role in generating the analysis that significantly impacts forecast performance. Nevertheless, the development of an efficient DA system poses significant challenges, particularly in establishing intricate relationships between the background data and the vast amount of multi-source observation data within limited time windows in operational settings. To address these challenges, researchers design complex pre-processing methods for each observation type, leveraging approximate modeling and the power of super-computing clusters to expedite solutions. The emergence of deep learning (DL) models has been a game-changer, offering unified multi-modal modeling, enhanced nonlinear representation capabilities, and superior parallelization. These advantages have spurred efforts to integrate DL models into various domains of weather modeling. Remarkably, DL models have shown promise in matching, even surpassing, the forecast accuracy of leading operational NWP models worldwide. This success motivates the exploration of DL-based DA frameworks tailored for weather forecasting models. In this study, we introduces FuxiDA, a generalized DL-based DA framework for assimilating satellite observations. By assimilating data from Advanced Geosynchronous Radiation Imager (AGRI) aboard Fengyun-4B, FuXi-DA consistently mitigates analysis errors and significantly improves forecast performance. Furthermore, through a series of single-observation experiments, Fuxi-DA has been validated against established atmospheric physics, demonstrating its consistency and reliability.

URL PDF HTML ☆

赞 0 踩 0

2403.13681 2026-03-17 cs.CL cs.AI cs.LG

Ayn: A Tiny yet Competitive Indian Legal Language Model Pretrained from Scratch

Mitodru Niyogi, Eric Gaussier, Arnab Bhattacharya

Comments LREC 2026

2212.11738 2026-03-17 cs.AI

Towards Sustainable Artificial Intelligence: An Overview of Environmental Protection Uses and Issues

Arnault Pachot, Céline Patissier

2603.14811 2026-03-17 cs.RO cs.CV

Ego to World: Collaborative Spatial Reasoning in Embodied Systems via Reinforcement Learning

Heng Zhou, Li Kang, Yiran Qin, Xiufeng Song, Ao Yu, Zilu Zhang, Haoming Song, Kaixin Xu, Yuchen Fan, Dongzhan Zhou, Xiaohong Liu, Ruimao Zhang, Philip Torr, Lei Bai, Zhenfei Yin

2603.14809 2026-03-17 cs.RO

A Unified Calibration Framework for Coordinate and Kinematic Parameters in Dual-Arm Robots

Tianyu Huang, Bohan Yang, Bin Li, Wenpan Li, Haoang Li, Wenlong Li, Yun-Hui Liu

Comments 21 pages, 12 figures

详情

英文摘要

Precise collaboration in vision-based dual-arm robot systems requires accurate system calibration. Recent dual-robot calibration methods have achieved strong performance by simultaneously solving multiple coordinate transformations. However, these methods either treat kinematic errors as implicit noise or handle them through separated error modeling, resulting in non-negligible accumulated errors. In this paper, we present a novel framework for unified calibration of the coordinate transformations and kinematic parameters in both robot arms. Our key idea is to unify all the tightly coupled parameters within a single Lie-algebraic formulation. To this end, we construct a consolidated error model grounded in the product-of-exponentials formula, which naturally integrates the coordinate and kinematic parameters in twist forms. Our model introduces no artificial error separation and thus greatly mitigates the error propagation. In addition, we derive a closed-form analytical Jacobian from this model using Lie derivatives. By exploring the Jacobian rank property, we analyze the identifiability of all calibration parameters and show that our joint optimization is well-posed under mild conditions. This enables off-the-shelf iterative solvers to stably optimize these parameters on the manifold space. Besides, to ensure robust convergence of our joint optimization, we develop a certifiably correct algorithm for initializing the unknown coordinates. Relying on semidefinite relaxation, our algorithm can yield a reliable estimate whose near-global optimality can be verified a posteriori. Extensive experiments validate the superior accuracy of our approach over previous baselines under identical visual measurements. Meanwhile, our certifiable initialization consistently outperforms several coordinate-only baselines, proving its reliability as a starting point for joint optimization.

URL PDF HTML ☆

赞 0 踩 0

2603.14807 2026-03-17 cs.CV cs.RO

HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System

Kailin Lyu, Kangyi Wu, Pengna Li, Xiuyu Hu, Qingyi Si, Cui Miao, Ning Yang, Zihang Wang, Long Xiao, Lianyu Hu, Jingyuan Sun, Ce Hao

Comments 9 pages, 7 figures

2603.14802 2026-03-17 cs.LG

OpenReservoirComputing: GPU-Accelerated Reservoir Computing in JAX

Jan Williams, Dima Tretiak, Steven L. Brunton, J. Nathan Kutz, Krithika Manohar

2603.14799 2026-03-17 cs.LG cs.AI cs.CL

Universe Routing: Why Self-Evolving Agents Need Epistemic Control

Zhaohui Geoffrey Wang

Comments 10 pages. Accepted at the LLA Workshop at ICLR 2026 (camera-ready version)

2603.14797 2026-03-17 cs.LG cs.AI

Multi-Task Genetic Algorithm with Multi-Granularity Encoding for Protein-Nucleotide Binding Site Prediction

Yiming Gao, Liuyi Xu, Pengshan Cui, Yining Qian, An-Yang Lu, Xianpeng Wang

2603.14796 2026-03-17 cs.CV cs.RO

Global Truncated Loss Minimization for Robust and Threshold-Resilient Geometric Estimation

Tianyu Huang, Liangzu Peng, Xinyue Zhang, Tongfan Guan, Jinhu Dong, Haoang Li, Laurent Kneip, Yun-Hui Liu

Comments 19 pages, 10 figures

详情

英文摘要

To achieve outlier-robust geometric estimation, robust objective functions are generally employed to mitigate the influence of outliers. The widely used consensus maximization(CM) is highly robust when paired with global branch-and-bound(BnB) search. However, CM relies solely on inlier counts and is sensitive to the inlier threshold. Besides, the discrete nature of CM leads to loose bounds, necessitating extensive BnB iterations and computation cost. Truncated losses(TL), another continuous alternative, leverage residual information more effectively and could potentially overcome these issues. But to our knowledge, no prior work has systematically explored globally minimizing TL with BnB and its potential for enhanced threshold resilience or search efficiency. In this work, we propose GTM, the first unified BnB-based framework for globally-optimal TL loss minimization across diverse geometric problems. GTM involves a hybrid solving design: given an n-dimensional problem, it performs BnB search over an (n-1)-dimensional subspace while the remaining 1D variable is solved by bounding the objective function. Our hybrid design not only reduces the search space, but also enables us to derive Lipschitz-continuous bounding functions that are general, tight, and can be efficiently solved by a classic global Lipschitz solver named DIRECT, which brings further acceleration. We conduct a systematic evaluation on various BnB-based methods for CM and TL on the robust linear regression problem, showing that GTM enjoys remarkable threshold resilience and the highest efficiency compared to baseline methods. Furthermore, we apply GTM on different geometric estimation problems with diverse residual forms. Extensive experiments demonstrate that GTM achieves state-of-the-art outlier-robustness and threshold-resilience while maintaining high efficiency across these estimation tasks.

URL PDF HTML ☆

赞 0 踩 0

2603.14793 2026-03-17 cs.LG

GARCH-FIS: A Hybrid Forecasting Model with Dynamic Volatility-Driven Parameter Adaptation

Wen-Jing Li, Da-Qing Zhang

2603.14792 2026-03-17 cs.LG cs.AI

LaPro-DTA: Latent Dual-View Drug Representations and Salient Protein Feature Extraction for Generalizable Drug--Target Affinity Prediction

Zihan Dun, Liuyi Xu, An-Yang Lu, Shuang Li, Yining Qian

2603.14789 2026-03-17 cs.RO

GraspALL: Adaptive Structural Compensation from Illumination Variation for Robotic Garment Grasping in Any Low-Light Conditions

Haifeng Zhong, Wenshuo Han, Zhouyu Wang, Runyang Feng, Fan Tang, Tong-Yee Lee, Zipei Fan, Ruihai Wu, Yuran Wang, Hao Dong, Hechang Chen, Hyung Jin Chang, Yixing Gao

2603.14786 2026-03-17 cs.RO

CORAL: COntextual Reasoning And Local Planning in A Hierarchical VLM Framework for Underwater Monitoring

Zhenqi Wu, Yuanjie Lu, Xuesu Xiao, Xiaomin Lin

Comments Submitted to IROS 2026

2603.14783 2026-03-17 cs.LG

Orthogonal Subspace Clustering: Enhancing High-Dimensional Data Analysis through Adaptive Dimensionality Reduction and Efficient Clustering

Qing-Yuan Wen, Da-Qing Zhang

2603.14782 2026-03-17 cs.CL

Information Asymmetry across Language Varieties: A Case Study on Cantonese-Mandarin and Bavarian-German QA

Renhao Pei, Siyao Peng, Verena Blaschke, Robert Litschko, Barbara Plank

Comments 23 pages, accepted at LREC 2026 as an oral presentation

2603.14781 2026-03-17 cs.CV

High-Fidelity 3D Facial Avatar Synthesis with Controllable Fine-Grained Expressions

Yikang He, Jichao Zhang, Wei Wang, Nicu Sebe, Yao Zhao

2603.14779 2026-03-17 cs.CL

Vietnamese Automatic Speech Recognition: A Revisit

Thi Vu, Linh The Nguyen, Dat Quoc Nguyen

Comments Accepted to EACL 2026 Findings

2603.14772 2026-03-17 cs.CV

Zero-Shot Reconstruction of Animatable 3D Avatars with Cloth Dynamics from a Single Image

Joohyun Kwon, Geonhee Sim, Gyeongsik Moon

Comments Accepted to CVPR 2026

2603.14770 2026-03-17 cs.CV

AnyPhoto: Multi-Person Identity Preserving Image Generation with ID Adaptive Modulation on Location Canvas

Longhui Yuan

2603.14769 2026-03-17 cs.LG cs.AI

POLCA: Stochastic Generative Optimization with LLM

Xuanfei Ren, Allen Nie, Tengyang Xie, Ching-An Cheng

2603.14768 2026-03-17 cs.LG stat.ML

Understanding the geometry of deep learning with decision boundary volume

Matthew Burfitt, Jacek Brodzki, Pawel Dłotko

2603.14767 2026-03-17 cs.SD cs.LG

Investigating the Impact of Speech Enhancement on Audio Deepfake Detection in Noisy Environments

Anacin, Angela, Shruti Kshirsagar, Anderson R. Avila

2603.14765 2026-03-17 cs.CV

SSR: A Training-Free Approach for Streaming 3D Reconstruction

Hui Deng, Yuxin Mao, Yuxin He, Yuchao Dai

Comments 8 pages