arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.12912 2026-03-16 cs.CV cs.AI

FedBPrompt: Federated Domain Generalization Person Re-Identification via Body Distribution Aware Visual Prompts

Xin Xu, Weilong Li, Wei Liu, Wenke Huang, Zhixi Yu, Bin Yang, Xiaoying Liao, Kui Jiang

详情

英文摘要

Federated Domain Generalization for Person Re-Identification (FedDG-ReID) learns domain-invariant representations from decentralized data. While Vision Transformer (ViT) is widely adopted, its global attention often fails to distinguish pedestrians from high similarity backgrounds or diverse viewpoints -- a challenge amplified by cross-client distribution shifts in FedDG-ReID. To address this, we propose Federated Body Distribution Aware Visual Prompt (FedBPrompt), introducing learnable visual prompts to guide Transformer attention toward pedestrian-centric regions. FedBPrompt employs a Body Distribution Aware Visual Prompts Mechanism (BAPM) comprising: Holistic Full Body Prompts to suppress cross-client background noise, and Body Part Alignment Prompts to capture fine-grained details robust to pose and viewpoint variations. To mitigate high communication costs, we design a Prompt-based Fine-Tuning Strategy (PFTS) that freezes the ViT backbone and updates only lightweight prompts, significantly reducing communication overhead while maintaining adaptability. Extensive experiments demonstrate that BAPM effectively enhances feature discrimination and cross-domain generalization, while PFTS achieves notable performance gains within only a few aggregation rounds. Moreover, both BAPM and PFTS can be easily integrated into existing ViT-based FedDG-ReID frameworks, making FedBPrompt a flexible and effective solution for federated person re-identification. The code is available at https://github.com/leavlong/FedBPrompt.

URL PDF HTML ☆

赞 0 踩 0

2603.12906 2026-03-16 cs.CL cs.AI

Learning from Child-Directed Speech in Two-Language Scenarios: A French-English Case Study

Liel Binyamin, Elior Sulem

Comments Accepted to Findings of EACL 2026

2603.12905 2026-03-16 cs.LG cs.CV

DirPA: Addressing Prior Shift in Imbalanced Few-shot Crop-type Classification

Joana Reuss, Ekaterina Gikalo, Marco Körner

Comments 20 pages, 9 Figures, 28 Tables

2603.12904 2026-03-16 cs.RO

Consistent and Efficient MSCKF-based LiDAR-Inertial Odometry with Inferred Cluster-to-Plane Constraints for UAVs

Jinwen Zhu, Xudong Zhao, Fangcheng Zhu, Jun Hu, Shi Jin, Yinian Mao, Guoquan Huang

2603.12903 2026-03-16 cs.CV

Spectral-Geometric Neural Fields for Pose-Free LiDAR View Synthesis

Yinuo Jiang, Jun Cheng, Yiran Wang, Cheng Cheng

Comments Accepted by CVPR 2026

2603.12893 2026-03-16 cs.CV cs.AI cs.LG cs.NE stat.ML

Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models

David McAllister, Miika Aittala, Tero Karras, Janne Hellsten, Angjoo Kanazawa, Timo Aila, Samuli Laine

Comments Code available at https://github.com/NVlabs/finite-difference-flow-optimization

2603.12078 2026-03-16 cs.CV

Node-RF: Learning Generalized Continuous Space-Time Scene Dynamics with Neural ODE-based NeRFs

Hiran Sarkar, Liming Kuang, Yordanka Velikova, Benjamin Busam

Comments Accepted to CVPR 2026. 13 pages, 9 figures

2603.12067 2026-03-16 cs.CV cs.AI

Beyond Convolution: A Taxonomy of Structured Operators for Learning-Based Image Processing

Simone Cammarasana

2603.11975 2026-03-16 cs.CV cs.AI cs.CR

HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios

Jiayue Pu, Zhongxiang Sun, Zilu Zhang, Xiao Zhang, Jun Xu

2603.11279 2026-03-16 cs.AI

AI Psychometrics: Evaluating the Psychological Reasoning of Large Language Models with Psychometric Validities

Yibai Li, Xiaolin Lin, Zhenghui Sha, Zhiye Jin, Xiaobing Li

Comments Accepted for publication in the Proceedings of the 58th Hawaii International Conference on System Sciences (HICSS), 2025

2603.11139 2026-03-16 cs.LG

H2LooP Spark Preview: Continual Pretraining of Large Language Models for Low-Level Embedded Systems Code

Amit Singh, Vedant Nipane, Pulkit Agrawal, Jatin Kishnani, Sairanjan Mishra

2603.08207 2026-03-16 cs.CL

The Conundrum of Trustworthy Research on Attacking Personally Identifiable Information Removal Techniques

Sebastian Ochs, Ivan Habernal

Comments Accepted to Computational Linguistics

2603.05330 2026-03-16 cs.CV

Dark3R: Learning Structure from Motion in the Dark

Andrew Y Guo, Anagh Malik, SaiKiran Tedla, Yutong Dai, Yiqian Qin, Zach Salehe, Benjamin Attal, Sotiris Nousias, Kiriakos N. Kutulakos, David B. Lindell

Comments CVPR 2026, Project Page: https://andrewguo.com/pub/dark3r

2603.01000 2026-03-16 cs.CV

Let Your Image Move with Your Motion! -- Implicit Multi-Object Multi-Motion Transfer

Yuze Li, Dong Gong, Xiao Cao, Junchao Yuan, Dongsheng Li, Lei Zhou, Yun Sing Koh, Cheng Yan, Xinyu Zhang

Comments 15 pages, 11 figures, cvpr 2026, see https://ethan-li123.github.io/FlexiMMT_page/

2602.24084 2026-03-16 cs.CV

FoV-Net: Rotation-Invariant CAD B-rep Learning via Field-of-View Ray Casting

Matteo Ballegeer, Dries F. Benoit

Comments Manuscript accepted at CVPR 2026

2602.21905 2026-03-16 cs.CV

TIRAuxCloud: A Thermal Infrared Dataset for Day and Night Cloud Detection

Alexis Apostolakis, Vasileios Botsos, Niklas Wölki, Andrea Spichtinger, Nikolaos Ioannis Bountos, Ioannis Papoutsis, Panayiotis Tsanakas

2602.14027 2026-03-16 cs.CV

Train Short, Inference Long: Training-free Horizon Extension for Autoregressive Video Generation

Jia Li, Xiaomeng Fu, Xurui Peng, Weifeng Chen, Youwei Zheng, Tianyu Zhao, Jiexi Wang, Fangmin Chen, Xing Wang, Hayden Kwok-Hay So

Comments 19 pages, 15 figures

2602.13172 2026-03-16 cs.CV

LongStream: Long-Sequence Streaming Autoregressive Visual Geometry

Chong Cheng, Xianda Chen, Tao Xie, Wei Yin, Weiqiang Ren, Qian Zhang, Xiaoyang Guo, Hao Wang

Comments CVPR2026 accepted

2602.12078 2026-03-16 cs.AI cs.CL

Tiny Recursive Reasoning with Mamba-2 Attention Hybrid

Wenlong Wang, Fergal Reid

Comments Published at ICLR 2026 Latent & Implicit Thinking Workshop

2602.11506 2026-03-16 cs.LG cs.AI cs.AR cs.PF

RooflineBench: A Benchmarking Framework for On-Device LLMs via Roofline Analysis

Zhen Bi, Xueshu Chen, Luoyang Sun, Yuhang Yao, Qing Shen, Jungang Lou, Cheng Deng

2602.04096 2026-03-16 cs.LG

CORE: Context-Robust Remasking for Diffusion Language Models

Kevin Zhai, Sabbir Mollah, Zhenyi Wang, Mubarak Shah

Comments Project Page: https://ucf-crcv.github.io/core/

2601.21866 2026-03-16 cs.LG cs.AI

MoHETS: Long-term Time Series Forecasting with Mixture-of-Heterogeneous-Experts

Evandro S. Ortigossa, Guy Lutsker, Eran Segal

Comments Under review

2601.20518 2026-03-16 cs.LG cs.AI

CCMamba: Topologically-Informed Selective State-Space Networks on Combinatorial Complexes for Higher-Order Graph Learning

Jiawen Chen, Qi Shao, Mingtong Zhou, Duxin Chen, Wenwu Yu

2601.19577 2026-03-16 cs.CV

MaDiS: Taming Masked Diffusion Language Models for Sign Language Generation

Ronglai Zuo, Rolandos Alexandros Potamias, Qi Sun, Evangelos Ververas, Jiankang Deng, Stefanos Zafeiriou

2512.22587 2026-03-16 cs.LG stat.ML

Structural Incompatibility of Differentiable Sorting and Within-Vector Rank Normalization

Taeyun Kim

Comments 6 pages

2512.11946 2026-03-16 cs.LG cs.AI stat.ML

Global Sensitivity Analysis for Engineering Design Based on Individual Conditional Expectations

Pramudita Satria Palar, Paul Saves, Rommel G. Regis, Koji Shimoyama, Shigeru Obayashi, Nicolas Verstaevel, Joseph Morlier

Comments Published in Aerospace Science and Technology, 2026

详情

DOI: 10.1016/j.ast.2026.112091

英文摘要

Explainable machine learning techniques have gained increasing attention in engineering applications, especially in aerospace design and analysis, where understanding how input variables influence data-driven models is essential. Partial Dependence Plots (PDPs) are widely used for interpreting black-box models by showing the average effect of an input variable on the prediction. However, their global sensitivity metric can be misleading when strong interactions are present, as averaging tends to obscure interaction effects. To address this limitation, we propose a global sensitivity metric based on Individual Conditional Expectation (ICE) curves. The method computes the expected feature importance across ICE curves, along with their standard deviation, to more effectively capture the influence of interactions. We provide a mathematical proof demonstrating that the PDP-based sensitivity is a lower bound of the proposed ICE-based metric under truncated orthogonal polynomial expansion. In addition, we introduce an ICE-based correlation value to quantify how interactions modify the relationship between inputs and the output. Comparative evaluations were performed on three cases: a 5-variable analytical function, a 5-variable wind-turbine fatigue problem, and a 9-variable airfoil aerodynamics case, where ICE-based sensitivity was benchmarked against PDP, SHapley Additive exPlanations (SHAP), and Sobol' indices. The results show that ICE-based feature importance provides richer insights than the traditional PDP-based approach, while visual interpretations from PDP, ICE, and SHAP complement one another by offering multiple perspectives.

URL PDF HTML ☆

赞 0 踩 0

2512.05343 2026-03-16 cs.CV cs.AI

SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling

Elisabetta Fedele, Francis Engelmann, Ian Huang, Or Litany, Marc Pollefeys, Leonidas Guibas

Comments Project page: https://spacecontrol3d.github.io/

2511.17361 2026-03-16 cs.CV

SuperQuadricOcc: Real-Time Self-Supervised Semantic Occupancy Estimation with Superquadric Volume Rendering

Seamie Hayes, Alexandre Boulch, Andrei Bursuc, Reenu Mohandas, Ganesh Sistu, Tim Brophy, Ciaran Eising

2511.13421 2026-03-16 cs.LG stat.ML

Larger Datasets Can Be Repeated More: A Theoretical Analysis of Multi-Epoch Scaling in Linear Regression

Tingkai Yan, Haodong Wen, Binghui Li, Kairong Luo, Wenguang Chen, Kaifeng Lyu

详情

英文摘要

While data scaling laws of large language models (LLMs) have been widely examined in the one-pass regime with massive corpora, their form under limited data and repeated epochs remains largely unexplored. This paper presents a theoretical analysis of how a common workaround, training for multiple epochs on the same dataset, reshapes the data scaling laws in linear regression. Concretely, we ask: to match the performance of training on a dataset of size $N$ for $K$ epochs, how much larger must a dataset be if the model is trained for only one pass? We quantify this using the \textit{effective reuse rate} of the data, $E(K, N)$, which we define as the multiplicative factor by which the dataset must grow under one-pass training to achieve the same test loss as $K$-epoch training. Our analysis precisely characterizes the scaling behavior of $E(K, N)$ for SGD in linear regression under either strong convexity or Zipf-distributed data: (1) When $K$ is small, we prove that $E(K, N) \approx K$, indicating that every new epoch yields a linear gain; (2) As $K$ increases, $E(K, N)$ plateaus at a problem-dependent value that grows with $N$ ($Θ(\log N)$ for the strongly-convex case), implying that larger datasets can be repeated more times before the marginal benefit vanishes. These theoretical findings point out a neglected factor in a recent empirical study (Muennighoff et al. (2023)), which claimed that training LLMs for up to $4$ epochs results in negligible loss differences compared to using fresh data at each step, \textit{i.e.}, $E(K, N) \approx K$ for $K \le 4$ in our notation. Supported by further empirical validation with LLMs, our results reveal that the maximum $K$ value for which $E(K, N) \approx K$ in fact depends on the data size and distribution, and underscore the need to explicitly model both factors in future studies of scaling laws with data reuse.

URL PDF HTML ☆

赞 0 踩 0

2511.11266 2026-03-16 cs.CV

GraphPilot: Grounded Scene Graph Conditioning for Language-Based Autonomous Driving

Fabian Schmidt, Markus Enzweiler, Abhinav Valada