arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

Chang Yang, Chuang Zhou, Yilin Xiao, Su Dong, Luyao Zhuang, Yujing Zhang, Zhu Wang, Zijin Hong, Zheng Yuan, Zhishang Xiang, Shengyuan Chen, Huachi Zhou, Qinggang Zhang, Ninghao Liu, Jinsong Su, Xinrun Wang, Yi Chang, Xiao Huang

2602.05660 2026-02-06 cs.LG cs.AI

Probabilistic Multi-Regional Solar Power Forecasting with Any-Quantile Recurrent Neural Networks

Slawek Smyl, Paweł Pełka, Grzegorz Dudek

2602.05650 2026-02-06 cs.CV cs.AI cs.LG

Enhancing Personality Recognition by Comparing the Predictive Power of Traits, Facets, and Nuances

Amir Ansari, Jana Subirana, Bruna Silva, Sergio Escalera, David Gallardo-Pujol, Cristina Palmero

Comments Accepted to the 2025 13th International Conference on Affective Computing and Intelligent Interaction (Late Breaking Results)

2602.05648 2026-02-06 cs.CL

Modelling the Morphology of Verbal Paradigms: A Case Study in the Tokenization of Turkish and Hebrew

Giuseppe Samo, Paola Merlo

Comments 13 pages, 7 figures, to appear as proceedings of the SIGTURK 2026 Workshop

2602.05646 2026-02-06 cs.LG

Empowering Time Series Analysis with Large-Scale Multimodal Pretraining

Peng Chen, Siyuan Wang, Shiyan Hu, Xingjian Wu, Yang Shu, Zhongwen Rao, Meng Wang, Yijie Li, Bin Yang, Chenjuan Guo

2602.05635 2026-02-06 cs.LG

Structural Disentanglement in Bilinear MLPs via Architectural Inductive Bias

Ojasva Nema, Kaustubh Sharma, Aditya Chauhan, Parikshit Pareek

2602.05633 2026-02-06 cs.CL

CASTLE: A Comprehensive Benchmark for Evaluating Student-Tailored Personalized Safety in Large Language Models

Rui Jia, Ruiyi Lan, Fengrui Liu, Zhongxiang Dai, Bo Jiang, Jing Shao, Jingyuan Chen, Guandong Xu, Fei Wu, Min Zhang

2602.05619 2026-02-06 cs.LG cs.AI

Mode-Dependent Rectification for Stable PPO Training

Mohamad Mohamad, Francesco Ponzio, Xavier Descombes

2602.05617 2026-02-06 cs.CV cs.GR

Unified Sensor Simulation for Autonomous Driving

Nikolay Patakin, Arsenii Shirokov, Anton Konushin, Dmitry Senushkin

2602.05616 2026-02-06 cs.LG cs.AI

Path-Guided Flow Matching for Dataset Distillation

Xuhui Li, Zhengquan Luo, Xiwei Liu, Yongqiang Yu, Zhiqiang Xu

2602.05605 2026-02-06 cs.LG cs.AI cs.CV

Shiva-DiT: Residual-Based Differentiable Top-$k$ Selection for Efficient Diffusion Transformers

Jiaji Zhang, Hailiang Zhao, Guoxuan Zhu, Ruichao Sun, Jiaju Wu, Xinkui Zhao, Hanlin Tang, Weiyi Lu, Kan Liu, Tao Lan, Lin Qu, Shuiguang Deng

2602.05602 2026-02-06 cs.CV

Multi-instance robust fitting for non-classical geometric models

Zongliang Zhang, Shuxiang Li, Xingwang Huang, Zongyue Wang

2602.05599 2026-02-06 cs.AI cs.CL cs.LG

BhashaSetu: Cross-Lingual Knowledge Transfer from High-Resource to Extreme Low-Resource Languages

Subhadip Maji, Arnab Bhattacharya

Comments Accepted as a long paper at IJCNLP-AACL Main Conference

2602.05598 2026-02-06 cs.CV cs.AI

CAViT -- Channel-Aware Vision Transformer for Dynamic Feature Fusion

Aon Safdar, Mohamed Saadeldin

Comments Presented at the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025 (CVPR 25) in the 4th Workshop on Transformers for Visions - T4V (https://sites.google.com/view/t4v-cvpr25/) Accepted for Publication at 33rd International Conference on Artificial Intelligence and Cognitive Science (AICS 2025), where it was shortlisted for Best Paper Award. (https://aicsconf.org/?page_id=278)

2602.05590 2026-02-06 cs.CV cs.ET cs.GR

EgoPoseVR: Spatiotemporal Multi-Modal Reasoning for Egocentric Full-Body Pose in Virtual Reality

Haojie Cheng, Shaun Jing Heng Ong, Shaoyu Cai, Aiden Tat Yang Koh, Fuxi Ouyang, Eng Tat Khoo

2602.05588 2026-02-06 cs.CV cs.ET cs.GR

A Mixed Reality System for Robust Manikin Localization in Childbirth Training

Haojie Cheng, Chang Liu, Abhiram Kanneganti, Mahesh Arjandas Choolani, Arundhati Tushar Gosavi, Eng Tat Khoo

2602.05582 2026-02-06 cs.CV

Geometric Observability Index: An Operator-Theoretic Framework for Per-Feature Sensitivity, Weak Observability, and Dynamic Effects in SE(3) Pose Estimation

Joe-Mei Feng, Sheng-Wei Yu

详情

英文摘要

We present a unified operator-theoretic framework for analyzing per-feature sensitivity in camera pose estimation on the Lie group SE(3). Classical sensitivity tools - conditioning analyses, Euclidean perturbation arguments, and Fisher information bounds - do not explain how individual image features influence the pose estimate, nor why dynamic or inconsistent observations can disproportionately distort modern SLAM and structure-from-motion systems. To address this gap, we extend influence function theory to matrix Lie groups and derive an intrinsic perturbation operator for left-trivialized M-estimators on SE(3). The resulting Geometric Observability Index (GOI) quantifies the contribution of a single measurement through the curvature operator and the Lie algebraic structure of the observable subspace. GOI admits a spectral decomposition along the principal directions of the observable curvature, revealing a direct correspondence between weak observability and amplified sensitivity. In the population regime, GOI coincides with the Fisher information geometry on SE(3), yielding a single-measurement analogue of the Cramer-Rao bound. The same spectral mechanism explains classical degeneracies such as pure rotation and vanishing parallax, as well as dynamic feature amplification along weak curvature directions. Overall, GOI provides a geometrically consistent description of measurement influence that unifies conditioning analysis, Fisher information geometry, influence function theory, and dynamic scene detectability through the spectral geometry of the curvature operator. Because these quantities arise directly within Gauss-Newton pipelines, the curvature spectrum and GOI also yield lightweight, training-free diagnostic signals for identifying dynamic features and detecting weak observability configurations without modifying existing SLAM architectures.

URL PDF HTML ☆

赞 0 踩 0

2602.05577 2026-02-06 cs.CV

LocateEdit-Bench: A Benchmark for Instruction-Based Editing Localization

Shiyu Wu, Shuyan Li, Jing Li, Jing Liu, Yequan Wang

Comments 11 pages, 7 figures

2602.05576 2026-02-06 cs.LG

OpenMAG: A Comprehensive Benchmark for Multimodal-Attributed Graph

Chenxi Wan, Xunkai Li, Yilong Zuo, Haokun Deng, Sihan Li, Bowen Fan, Hongchao Qin, Ronghua Li, Guoren Wang

2602.05573 2026-02-06 cs.CV

Visual Implicit Geometry Transformer for Autonomous Driving

Arsenii Shirokov, Mikhail Kuznetsov, Danila Stepochkin, Egor Evdokimov, Daniil Glazkov, Nikolay Patakin, Anton Konushin, Dmitry Senushkin

2602.05572 2026-02-06 cs.CV

ShapeGaussian: High-Fidelity 4D Human Reconstruction in Monocular Videos via Vision Priors

Zhenxiao Liang, Ning Zhang, Youbao Tang, Ruei-Sung Lin, Qixing Huang, Peng Chang, Jing Xiao

2602.05571 2026-02-06 cs.LG

EdgeMask-DG*: Learning Domain-Invariant Graph Structures via Adversarial Edge Masking

Rishabh Bhattacharya, Naresh Manwani

2602.05570 2026-02-06 cs.AI

TangramSR: Can Vision-Language Models Reason in Continuous Geometric Space?

Yikun Zong, Cheston Tan

Comments 13 pages, 4 figures

2602.05567 2026-02-06 cs.LG

MAGPrompt: Message-Adaptive Graph Prompt Tuning for Graph Neural Networks

Long D. Nguyen, Binh P. Nguyen

2602.05557 2026-02-06 cs.CV cs.RO

PIRATR: Parametric Object Inference for Robotic Applications with Transformers in 3D Point Clouds

Michael Schwingshackl, Fabio F. Oberweger, Mario Niedermeyer, Huemer Johannes, Markus Murschitz

Comments 8 Pages, 11 Figures, Accepted at 2026 IEEE International Conference on Robotics & Automation (ICRA) Vienna

2602.05555 2026-02-06 cs.CV cs.RO

IndustryShapes: An RGB-D Benchmark dataset for 6D object pose estimation of industrial assembly components and tools

Panagiotis Sapoutzoglou, Orestis Vaggelis, Athina Zacharia, Evangelos Sartinas, Maria Pateraki

Comments To appear in ICRA 2026

2602.05552 2026-02-06 cs.RO cs.CV

VLN-Pilot: Large Vision-Language Model as an Autonomous Indoor Drone Operator

Bessie Dominguez-Dager, Sergio Suescun-Ferrandiz, Felix Escalona, Francisco Gomez-Donoso, Miguel Cazorla

2602.05547 2026-02-06 cs.CL cs.AI cs.LG

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks

Shyam Sundhar Ramesh, Xiaotong Ji, Matthieu Zimmer, Sangwoong Yoon, Zhiyong Wang, Haitham Bou Ammar, Aurelien Lucchi, Ilija Bogunovic

Comments Preprint

2602.05544 2026-02-06 cs.AI

Reasoning-guided Collaborative Filtering with Language Models for Explainable Recommendation

Fahad Anwaar, Adil Mehmood Khan, Muhammad Khalid, Usman Zia, Kezhi Wang

2602.05539 2026-02-06 cs.LG cs.AI cs.CL

Steering Large Reasoning Models towards Concise Reasoning via Flow Matching

Yawei Li, Benjamin Bergner, Yinghan Zhao, Vihang Prakash Patil, Bei Chen, Cheng Wang

Comments This paper has been accepted to Transactions on Machine Learning Research (TMLR)

AI 大模型

视觉与机器人

科学与医疗

Graph-based Agent Memory: Taxonomy, Techniques, and Applications