arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

Weiguang Zhao, Haoran Xu, Xingyu Miao, Qin Zhao, Rui Zhang, Kaizhu Huang, Ning Gao, Peizhou Cao, Mingze Sun, Mulin Yu, Tao Lu, Linning Xu, Junting Dong, Jiangmiao Pang

2602.04436 2026-02-05 cs.LG

Hand Gesture Recognition from Doppler Radar Signals Using Echo State Networks

Towa Sano, Gouhei Tanaka

Comments Submitted to IJCNN 2026. 21 pages, 10figures

2602.04428 2026-02-05 cs.CL

Fine-Grained Activation Steering: Steering Less, Achieving More

Zijian Feng, Tianjiao Li, Zixiao Zhu, Hanzhang Zhou, Junlang Qian, Li Zhang, Jia Jim Deryl Chua, Lee Onn Mak, Gee Wah Ng, Kezhi Mao

Comments ICLR 2026

2602.04417 2026-02-05 cs.LG cs.AI

EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL

Lunjun Zhang, Jimmy Ba

2602.04416 2026-02-05 cs.CV cs.AI

Med-MMFL: A Multimodal Federated Learning Benchmark in Healthcare

Aavash Chhetri, Bibek Niroula, Pratik Shrestha, Yash Raj Shrestha, Lesley A Anderson, Prashnna K Gyawali, Loris Bazzani, Binod Bhattarai

2602.04413 2026-02-05 cs.CL cs.AI cs.MM

History-Guided Iterative Visual Reasoning with Self-Correction

Xinglong Yang, Zhilin Peng, Zhanzhan Liu, Haochen Shi, Sheng-Jun Huang

2602.04406 2026-02-05 cs.CV

LCUDiff: Latent Capacity Upgrade Diffusion for Faithful Human Body Restoration

Jue Gong, Zihan Zhou, Jingkai Wang, Shu Li, Libo Liu, Jianliang Lan, Yulun Zhang

Comments 8 pages, 7 figures. The code and model will be at https://github.com/gobunu/LCUDiff

2602.04405 2026-02-05 cs.CV cs.MM

Interactive Spatial-Frequency Fusion Mamba for Multi-Modal Image Fusion

Yixin Zhu, Long Lv, Pingping Zhang, Xuehu Liu, Tongdan Tang, Feng Tian, Weibing Sun, Huchuan Lu

Comments This work is accepted by IEEE Transactions on Image Processing. More modifications may be performed

2602.04404 2026-02-05 cs.LG cond-mat.dis-nn

Theory of Speciation Transitions in Diffusion Models with General Class Structure

Beatrice Achilli, Marco Benedetti, Giulio Biroli, Marc Mézard

Comments 17 pages, 6 figures

2602.04399 2026-02-05 cs.CL

Swordsman: Entropy-Driven Adaptive Block Partition for Efficient Diffusion Language Models

Yu Zhang, Xinchen Li, Jialei Zhou, Hongnan Ma, Zhongwei Wan, Yiwei Shi, Duoqian Miao, Qi Zhang, Longbing Cao

2602.04398 2026-02-05 cs.CL cs.AI

Bi-directional Bias Attribution: Debiasing Large Language Models without Modifying Prompts

Yujie Lin, Kunquan Li, Yixuan Liao, Xiaoxin Chen, Jinsong Su

2602.04392 2026-02-05 cs.CL

Evaluating the Presence of Sex Bias in Clinical Reasoning by Large Language Models

Isabel Tsintsiper, Sheng Wong, Beth Albert, Shaun P Brennecke, Gabriel Davis Jones

2602.04391 2026-02-05 cs.CL

Beyond Rejection Sampling: Trajectory Fusion for Scaling Mathematical Reasoning

Jie Deng, Hanshuang Tong, Jun Li, Shining Liang, Ning Wu, Hongzhi Li, Yutao Xie

2602.04388 2026-02-05 cs.LG

On the use of LLMs to generate a dataset of Neural Networks

Nadia Daoudi, Jordi Cabot

2602.04385 2026-02-05 cs.AI cs.SE

Digital Twins & ZeroConf AI: Structuring Automated Intelligent Pipelines for Industrial Applications

Marco Picone, Fabio Turazza, Matteo Martinelli, Marco Mamei

Comments Author-accepted manuscript of a paper published in the 2025 IEEE International Conference on Systems, Man and Cybernetics (IEEE SMC), October 2025, doi: 10.1109/SMC58881.2025.11343418

Journal ref 2025 IEEE International Conference on Systems, Man and Cybernetics (SMC)

2602.04384 2026-02-05 cs.LG cs.AI cs.CR

Blockchain Federated Learning for Sustainable Retail: Reducing Waste through Collaborative Demand Forecasting

Fabio Turazza, Alessandro Neri, Marcello Pietri, Maria Angela Butturi, Marco Picone, Marco Mamei

Comments Author-accepted manuscript of a paper published in the IEEE International Symposium on Computers and Communications (ISCC), 2025, pp. 1-6. doi: https://doi.org/10.1109/ISCC65549.2025.11326299

Journal ref IEEE International Symposium on Computers and Communications (ISCC), 2025, pp. 1-6

2602.04380 2026-02-05 cs.LG cs.AI

Beyond KL Divergence: Policy Optimization with Flexible Bregman Divergences for LLM Reasoning

Rui Yuan, Mykola Khandoga, Vinay Kumar Sankarapu

2602.04373 2026-02-05 cs.LG

Reducing the labeling burden in time-series mapping using Common Ground: a semi-automated approach to tracking changes in land cover and species over time

Geethen Singh, Jasper A Slingsby, Tamara B Robinson, Glenn Moncrieff

详情

英文摘要

Reliable classification of Earth Observation data depends on consistent, up-to-date reference labels. However, collecting new labelled data at each time step remains expensive and logistically difficult, especially in dynamic or remote ecological systems. As a response to this challenge, we demonstrate that a model with access to reference data solely from time step t0 can perform competitively on both t0 and a future time step t1, outperforming models trained separately on time-specific reference data (the gold standard). This finding suggests that effective temporal generalization can be achieved without requiring manual updates to reference labels beyond the initial time step t0. Drawing on concepts from change detection and semi-supervised learning (SSL), the most performant approach, "Common Ground", uses a semi-supervised framework that leverages temporally stable regions-areas with little to no change in spectral or semantic characteristics between time steps-as a source of implicit supervision for dynamic regions. We evaluate this strategy across multiple classifiers, sensors (Landsat-8, Sentinel-2 satellite multispectral and airborne imaging spectroscopy), and ecological use cases. For invasive tree species mapping, we observed a 21-40% improvement in classification accuracy using Common Ground compared to naive temporal transfer, where models trained at a single time step are directly applied to a future time step. We also observe a 10 -16% higher accuracy for the introduced approach compared to a gold-standard approach. In contrast, when broad land cover categories were mapped across Europe, we observed a more modest 2% increase in accuracy compared to both the naive and gold-standard approaches. These results underscore the effectiveness of combining stable reference screening with SSL for scalable and label-efficient multi-temporal remote sensing classification.

URL PDF HTML ☆

赞 0 踩 0

2602.04365 2026-02-05 cs.LG

EXaMCaP: Subset Selection with Entropy Gain Maximization for Probing Capability Gains of Large Chart Understanding Training Sets

Jiapeng Liu, Liang Li, Bing Li, Peng Fu, Xiyan Gao, Chengyang Fang, Xiaoshuai Hao, Can Ma

2602.04356 2026-02-05 cs.CV

When and Where to Attack? Stage-wise Attention-Guided Adversarial Attack on Large Vision Language Models

Jaehyun Kwak, Nam Cao, Boryeong Cho, Segyu Lee, Sumyeong Ahn, Se-Young Yun

Comments Pre-print

2602.04355 2026-02-05 cs.CL

Can Vision Replace Text in Working Memory? Evidence from Spatial n-Back in Vision-Language Models

Sichu Liang, Hongyu Zhu, Wenwen Wang, Deyu Zhou

2602.04352 2026-02-05 cs.LG

Mosaic Learning: A Framework for Decentralized Learning with Model Fragmentation

Sayan Biswas, Davide Frey, Romaric Gaudel, Nirupam Gupta, Anne-Marie Kermarrec, Dimitri Lerévérend, Rafael Pires, Rishi Sharma, François Taïani, Martijn de Vos

2602.03351 2026-02-05 cs.AI cs.CY cs.LG

Building Interpretable Models for Moral Decision-Making

Mayank Goel, Aritra Das, Paras Chopra

Comments 8 pages, 4 figures, accepted to AAAI'26 Machine Ethics Workshop

2602.03307 2026-02-05 cs.SD

GRAM: Spatial general-purpose audio representations for real-world environments

Goksenin Yuksel, Marcel van Gerven, Kiki van der Heijden

Comments I have accidentally uploaded a revised version of my old paper. I meant to revise arXiv:2506.00934 rather than upload a new version

2602.03305 2026-02-05 cs.LG

medR: Reward Engineering for Clinical Offline Reinforcement Learning via Tri-Drive Potential Functions

Qianyi Xu, Gousia Habib, Feng Wu, Yanrui Du, Zhihui Chen, Swapnil Mishra, Dilruk Perera, Mengling Feng

2602.02776 2026-02-05 cs.LG

Verification and Identification in ECG biometric on large-scale

Scagnetto Arjuna

2602.02515 2026-02-05 cs.AI cs.CL cs.LG

CreditAudit: 2$^\text{nd}$ Dimension for LLM Evaluation and Selection

Yiliang Song, Hongjun An, Jiangong Xiao, Haofei Zhao, Jiawei Shao, Xuelong Li

Comments Second update

2602.02499 2026-02-05 cs.CL

ROSA-Tuning: Enhancing Long-Context Modeling via Suffix Matching

Yunao Zheng, Xiaojie Wang, Lei Ren, Wei Chen

2602.02388 2026-02-05 cs.CV cs.LG

Personalized Image Generation via Human-in-the-loop Bayesian Optimization

Rajalaxmi Rajagopalan, Debottam Dutta, Yu-Lin Wei, Romit Roy Choudhury

2601.23174 2026-02-05 cs.LG cs.AI cs.SD

Beyond Fixed Frames: Dynamic Character-Aligned Speech Tokenization

Luca Della Libera, Cem Subakan, Mirco Ravanelli

Comments 18 pages, 3 figures

AI 大模型

视觉与机器人

科学与医疗

SynthVerse: A Large-Scale Diverse Synthetic Dataset for Point Tracking