arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2602.21709 2026-02-26 cs.CV

Assessing airborne laser scanning and aerial photogrammetry for deep learning-based stand delineation

Håkon Næss Sandum, Hans Ole Ørka, Oliver Tomic, Terje Gobakken

Comments 20 pages, 4 figures, 4 tables

详情

英文摘要

Accurate forest stand delineation is essential for forest inventory and management but remains a largely manual and subjective process. A recent study has shown that deep learning can produce stand delineations comparable to expert interpreters when combining aerial imagery and airborne laser scanning (ALS) data. However, temporal misalignment between data sources limits operational scalability. Canopy height models (CHMs) derived from digital photogrammetry (DAP) offer better temporal alignment but may smoothen canopy surface and canopy gaps, raising the question of whether they can reliably replace ALS-derived CHMs. Similarly, the inclusion of a digital terrain model (DTM) has been suggested to improve delineation performance, but has remained untested in published literature. Using expert-delineated forest stands as reference data, we assessed a U-Net-based semantic segmentation framework with municipality-level cross-validation across six municipalities in southeastern Norway. We compared multispectral aerial imagery combined with (i) an ALS-derived CHM, (ii) a DAP-derived CHM, and (iii) a DAP-derived CHM in combination with a DTM. Results showed comparable performance across all data combinations, reaching overall accuracy values between 0.90-0.91. Agreement between model predictions was substantially larger than agreement with the reference data, highlighting both model consistency and the inherent subjectivity of stand delineation. The similar performance of DAP-CHMs, despite the reduced structural detail, and the lack of improvements of the DTM indicate that the framework is resilient to variations in input data. These findings indicate that large datasets for deep learning-based stand delineations can be assembled using projects including temporally aligned ALS data and DAP point clouds.

URL PDF HTML ☆

赞 0 踩 0

2602.21706 2026-02-26 cs.CV cs.AI

SurGo-R1: Benchmarking and Modeling Contextual Reasoning for Operative Zone in Surgical Video

Guanyi Qin, Xiaozhen Wang, Zhu Zhuo, Chang Han Low, Yuancan Xiao, Yibing Fu, Haofeng Liu, Kai Wang, Chunjiang Li, Yueming Jin

2602.21704 2026-02-26 cs.CV cs.AI

Dynamic Multimodal Activation Steering for Hallucination Mitigation in Large Vision-Language Models

Jianghao Yin, Qin Chen, Kedi Chen, Jie Zhou, Xingjiao Wu, Liang He

Comments Accepted by ICLR 2026

2602.21703 2026-02-26 cs.CV cs.LG

Brain Tumor Segmentation with Special Emphasis on the Non-Enhancing Brain Tumor Compartment

T. Schaffer, A. Brawanski, S. Wein, A. M. Tomé, E. W. Lang

2602.21701 2026-02-26 cs.LG physics.data-an stat.ML

Learning Complex Physical Regimes via Coverage-oriented Uncertainty Quantification: An application to the Critical Heat Flux

Michele Cazzola, Alberto Ghione, Lucia Sargentini, Julien Nespoulous, Riccardo Finotello

Comments 34 pages, 14 figures

详情

英文摘要

A central challenge in scientific machine learning (ML) is the correct representation of physical systems governed by multi-regime behaviours. In these scenarios, standard data analysis techniques often fail to capture the nature of the data, as the system's response varies significantly across the state space due to its stochasticity and the different physical regimes. Uncertainty quantification (UQ) should thus not be viewed merely as a safety assessment, but as a support to the learning task itself, guiding the model to internalise the behaviour of the data. We address this by focusing on the Critical Heat Flux (CHF) benchmark and dataset presented by the OECD/NEA Expert Group on Reactor Systems Multi-Physics. This case study represents a test for scientific ML due to the non-linear dependence of CHF on the inputs and the existence of distinct microscopic physical regimes. These regimes exhibit diverse statistical profiles, a complexity that requires UQ techniques to internalise the data behaviour and ensure reliable predictions. In this work, we conduct a comparative analysis of UQ methodologies to determine their impact on physical representation. We contrast post-hoc methods, specifically conformal prediction, against end-to-end coverage-oriented pipelines, including (Bayesian) heteroscedastic regression and quality-driven losses. These approaches treat uncertainty not as a final metric, but as an active component of the optimisation process, modelling the prediction and its behaviour simultaneously. We show that while post-hoc methods ensure statistical calibration, coverage-oriented learning effectively reshapes the model's representation to match the complex physical regimes. The result is a model that delivers not only high predictive accuracy but also a physically consistent uncertainty estimation that adapts dynamically to the intrinsic variability of the CHF.

URL PDF HTML ☆

赞 0 踩 0

2602.21699 2026-02-26 cs.CV

SF3D-RGB: Scene Flow Estimation from Monocular Camera and Sparse LiDAR

Rajai Alhimdiat, Ramy Battrawy, René Schuster, Didier Stricker, Wesam Ashour

Comments Accepted in Computer Vision Conference (CVC) 2026

2602.21698 2026-02-26 cs.CV

E-comIQ-ZH: A Human-Aligned Dataset and Benchmark for Fine-Grained Evaluation of E-commerce Posters with Chain-of-Thought

Meiqi Sun, Mingyu Li, Junxiong Zhu

Comments 21pages, 19figures, accepted by CVPR 2026

2602.21696 2026-02-26 cs.RO

Dual-Regime Hybrid Aerodynamic Modeling of Winged Blimps With Neural Mixing

Xiaorui Wang, Hongwu Wang, Yue Fan, Hao Cheng, Feitian Zhang

2602.21693 2026-02-26 cs.LG

TiMi: Empower Time Series Transformers with Multimodal Mixture of Experts

Jiafeng Lin, Yuxuan Wang, Huakun Luo, Zhongyi Pei, Jianmin Wang

2602.21691 2026-02-26 cs.RO

Trajectory Generation with Endpoint Regulation and Momentum-Aware Dynamics for Visually Impaired Scenarios

Yuting Zeng, Manping Fan, You Zhou, Yongbin Yu, Zhiwen Zheng, Jingtao Zhang, Liyong Ren, Zhenglin Yang

Comments 9 pages, 7 figures

2602.21684 2026-02-26 cs.RO cs.LG

Primary-Fine Decoupling for Action Generation in Robotic Imitation

Xiaohan Lei, Min Wang, Wengang Zhou, Xingyu Lu, Houqiang Li

Comments The Fourteenth International Conference on Learning Representations (ICLR), 2026

2602.21682 2026-02-26 cs.RO

SunnyParking: Multi-Shot Trajectory Generation and Motion State Awareness for Human-like Parking

Jishu Miao, Han Chen, Jiankun Zhai, Qi Liu, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi

2602.21680 2026-02-26 cs.LG cs.MA

Hierarchical Lead Critic based Multi-Agent Reinforcement Learning

David Eckel, Henri Meeß

Comments 16 pages, 10 Figures, Preprint

2602.21674 2026-02-26 cs.LG cs.LO

Error-awareness Accelerates Active Automata Learning

Loes Kruger, Sebastian Junges, Jurriaan Rot

2602.21669 2026-02-26 cs.CL

DWA-KD: Dual-Space Weighting and Time-Warped Alignment for Cross-Tokenizer Knowledge Distillation

Duc Trung Vu, Pham Khanh Chi, Dat Phi Van, Linh Ngo Van, Sang Dinh, Trung Le

Comments EACL Findings

2602.21668 2026-02-26 cs.CV cs.GR

Space-Time Forecasting of Dynamic Scenes with Motion-aware Gaussian Grouping

Junmyeong Lee, Hoseung Choi, Minsu Cho

Comments 20 pages, 13 figures

2602.21666 2026-02-26 cs.RO

Biomechanical Comparisons Reveal Divergence of Human and Humanoid Gaits

Luying Feng, Yaochu Jin, Hanze Hu, Wei Chen

2602.21662 2026-02-26 cs.CV

HybridINR-PCGC: Hybrid Lossless Point Cloud Geometry Compression Bridging Pretrained Model and Implicit Neural Representation

Wenjie Huang, Qi Yang, Shuting Xia, He Huang, Zhu Li, Yiling Xu

Comments 8 pages, 10 figures

2602.21657 2026-02-26 cs.CV cs.AI

Following the Diagnostic Trace: Visual Cognition-guided Cooperative Network for Chest X-Ray Diagnosis

Shaoxuan Wu, Jingkun Chen, Chong Ma, Cong Shen, Xiao Zhang, Jun Feng

2602.21652 2026-02-26 cs.CL cs.AI

Sparsity Induction for Accurate Post-Training Pruning of Large Language Models

Minhao Jiang, Zhikai Li, Xuewen Liu, Jing Zhang, Mengjuan Chen, Qingyi Gu

Comments 5 pages, 1 figure, 4 tables

2602.21648 2026-02-26 cs.LG q-bio.QM

Multimodal Survival Modeling and Fairness-Aware Clinical Machine Learning for 5-Year Breast Cancer Risk Prediction

Toktam Khatibi

2602.21645 2026-02-26 cs.CV

Lie Flow: Video Dynamic Fields Modeling and Predicting with Lie Algebra as Geometric Physics Principle

Weidong Qiao, Wangmeng Zuo, Hui Li

Comments 10pages,5 figures

2602.21638 2026-02-26 cs.CL

Multi-dimensional Assessment and Explainable Feedback for Counselor Responses to Client Resistance in Text-based Counseling with LLMs

Anqi Li, Ruihan Wang, Zhaoming Chen, Yuqian Chen, Yu Lu, Yi Zhu, Yuan Xie, Zhenzhong Lan

Comments 8 pages

2602.21634 2026-02-26 cs.LG cs.MA

AgentLTV: An Agent-Based Unified Search-and-Evolution Framework for Automated Lifetime Value Prediction

Chaowei Wu, Huazhu Chen, Congde Yuan, Qirui Yang, Guoqing Song, Yue Gao, Li Luo, Frank Youhua Chen, Mengzhuo Guo

Comments 12 pages, 4 figures, submitted to KDD 2026: 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining, ADS Track

2602.21633 2026-02-26 cs.RO cs.AI cs.CV

Self-Correcting VLA: Online Action Refinement via Sparse World Imagination

Chenyv Liu, Wentao Tan, Lei Zhu, Fengling Li, Jingjing Li, Guoli Yang, Heng Tao Shen

2602.21631 2026-02-26 cs.CV

UniHand: A Unified Model for Diverse Controlled 4D Hand Motion Modeling

Zhihao Sun, Tong Wu, Ruirui Tu, Daoguo Dong, Zuxuan Wu

2602.21622 2026-02-26 cs.RO

ADM-DP: Adaptive Dynamic Modality Diffusion Policy through Vision-Tactile-Graph Fusion for Multi-Agent Manipulation

Enyi Wang, Wen Fan, Dandan Zhang

Comments Accepted to IEEE International Conference on Robotics and Automation (ICRA 2026)

2602.21619 2026-02-26 cs.CL

When More Is Less: A Systematic Analysis of Spatial and Commonsense Information for Visual Spatial Reasoning

Muku Akasaka, Soyeon Caren Han

Comments 5 pages, 6 figures, Under review

2602.21613 2026-02-26 cs.CV cs.AI

Virtual Biopsy for Intracranial Tumors Diagnosis on MRI

Xinzhe Luo, Shuai Shao, Yan Wang, Jiangtao Wang, Yutong Bai, Jianguo Zhang

2602.21612 2026-02-26 cs.RO

Jumping Control for a Quadrupedal Wheeled-Legged Robot via NMPC and DE Optimization

Xuanqi Zeng, Lingwei Zhang, Linzhu Yue, Zhitao Song, Hongbo Zhang, Tianlin Zhang, Yun-Hui Liu

Comments 8 pages, 12 figures