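arXivDaily daily academic digest: synchronized with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.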

2603.26389 2026-03-30 cs.LG

Maintaining Difficulty: A Margin Scheduler for Triplet Loss in Siamese Networks Training

Roberto Sprengel Minozzo Tomchak, Oge Marques, Lucas Garcia Pedroso, Luiz Eduardo Oliveira, Paulo Lisboa de Almeida

English abstract

The Triplet Margin Ranking Loss is one of the most widely used loss functions in Siamese Networks for solving Distance Metric Learning (DML) problems. This loss function depends on a margin parameter μ, which defines the minimum distance that should separate positive and negative pairs during training. In this work, we show that, during training, the effective margin of many triplets often exceeds the predefined value of μ, provided that a sufficient number of triplets violating this margin is observed. This behavior indicates that fixing the margin throughout training may limit the learning process. Based on this observation, we propose a margin scheduler that adjusts the value of μ according to the proportion of easy triplets observed at each epoch, with the goal of maintaining training difficulty over time. We show that the proposed strategy leads to improved performance when compared to both a constant margin and a monotonically increasing margin scheme. Experimental results on four different datasets show consistent gains in verification performance.
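
The scheduling idea in the abstract can be illustrated with a small sketch. It uses the standard triplet loss max(0, d(a,p) - d(a,n) + μ) and counts a triplet as easy when its loss is zero. The update rule itself is an assumption made for illustration: the abstract does not state the authors' formula, so the `target` fraction, the `step` size, and the function names `easy_fraction` and `update_margin` are all hypothetical.

```python
def triplet_loss(d_ap, d_an, margin):
    """Standard triplet margin loss for one triplet: max(0, d_ap - d_an + margin)."""
    return max(0.0, d_ap - d_an + margin)


def easy_fraction(distances, margin):
    """Fraction of (d_ap, d_an) pairs whose triplet loss is zero at this margin."""
    easy = sum(1 for d_ap, d_an in distances
               if triplet_loss(d_ap, d_an, margin) == 0.0)
    return easy / len(distances)


def update_margin(margin, frac_easy, target=0.5, step=0.05, min_margin=0.0):
    """Hypothetical rule (not the paper's): raise the margin when more than
    `target` of the triplets are easy, lower it when fewer are."""
    if frac_easy > target:
        return margin + step
    if frac_easy < target:
        return max(min_margin, margin - step)
    return margin


# Illustrative (anchor-positive, anchor-negative) distances from one epoch.
epoch_distances = [(0.2, 1.0), (0.3, 0.9), (0.5, 0.6), (0.4, 1.2)]
margin = 0.2
frac = easy_fraction(epoch_distances, margin)  # three of four triplets are easy
margin = update_margin(margin, frac)           # margin is raised by one step
```

In this toy epoch most triplets already satisfy the margin, so the sketch raises μ to keep later epochs from becoming trivially easy. A real scheduler would compute the distances from the network's embeddings at each epoch.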

2603.26385 2026-03-30 cs.CV

Restore, Assess, Repeat: A Unified Framework for Iterative Image Restoration

I-Hsiang Chen, Isma Hadji, Enrique Sanchez, Adrian Bulat, Sy-Yen Kuo, Radu Timofte, Georgios Tzimiropoulos, Brais Martinez

Comments Accepted by CVPR2026; Project Page: https://restore-assess-repeat.github.io

2603.26378 2026-03-30 cs.LG cs.AI

Generative Modeling in Protein Design: Neural Representations, Conditional Generation, and Evaluation Standards

Senura Hansaja Wanasekara, Minh-Duong Nguyen, Xiaochen Liu, Nguyen H. Tran, Ken-Tye Yong

Comments 20 pages, 7 tables, 4 figures

2603.26365 2026-03-30 cs.CV

Dynamic Token Compression for Efficient Video Understanding through Reinforcement Learning

Shida Wang, YongXiang Hua, Zhou Tao, Haoyu Cao, Linli Xu

2603.26364 2026-03-30 cs.SD

LLaDA-TTS: Unifying Speech Synthesis and Zero-Shot Editing via Masked Diffusion Modeling

Xiaoyu Fan, Huizhi Xie, Wei Zou, Yunzhang Chen

Comments 11 pages, 6 figures, 2 tables

2603.26363 2026-03-30 cs.LG cs.CL

A Formal Framework for Uncertainty Analysis of Text Generation with Large Language Models

Steffen Herbold, Florian Lemmerich

2603.26362 2026-03-30 cs.CV

HandVQA: Diagnosing and Improving Fine-Grained Spatial Reasoning about Hands in Vision-Language Models

MD Khalequzzaman Chowdhury Sayem, Mubarrat Tajoar Chowdhury, Yihalem Yimolal Tiruneh, Muneeb A. Khan, Muhammad Salman Ali, Binod Bhattarai, Seungryul Baek

Comments Accepted in CVPR 2026; Project page, code, and dataset: https://kcsayem.github.io/handvqa/

2603.26360 2026-03-30 cs.RO

Realtime-VLA V2: Learning to Run VLAs Fast, Smooth, and Accurate

Chen Yang, Yucheng Hu, Yunchao Ma, Yunhuan Yang, Jing Tan, Haoqiang Fan

2603.26356 2026-03-30 cs.CV

From Pen to Pixel: Translating Hand-Drawn Plots into Graphical APIs via a Novel Benchmark and Efficient Adapter

Zhenghao Xu, Mengning Yang

2603.26354 2026-03-30 cs.CV

Only Whats Necessary: Pareto Optimal Data Minimization for Privacy Preserving Video Anomaly Detection

Nazia Aslam, Abhisek Ray, Thomas B. Moeslund, Kamal Nasrollahi

Comments 10 pages, CVPR conference

2603.26351 2026-03-30 cs.CV cs.LG

DuSCN-FusionNet: An Interpretable Dual-Channel Structural Covariance Fusion Framework for ADHD Classification Using Structural MRI

Qurat Ul Ain, Alptekin Temizel, Soyiba Jawed

Comments 5 pages, 5 figures

2603.26348 2026-03-30 cs.CV cs.AI

Reflect to Inform: Boosting Multimodal Reasoning via Information-Gain-Driven Verification

Shuai Lv, Chang Liu, Feng Tang, Yujie Yuan, Aojun Zhou, Kui Zhang, Xi Yang, Yangqiu Song

2603.26347 2026-03-30 cs.RO cs.SY eess.SY

Optimal Prioritized Dissipation and Closed-Form Damping Limitation under Actuator Constraints for Haptic Interfaces

Camilla Celli, Andrea Bini, Valerio Novelli, Alessandro Filippeschi, Francesco Porcini, Antonio Frisoli

2603.26341 2026-03-30 cs.CV

HINT: Composed Image Retrieval with Dual-path Compositional Contextualized Network

Mingyu Zhang, Zixu Li, Zhiwei Chen, Zhiheng Fu, Xiaowei Zhu, Jiajia Nie, Yinwei Wei, Yupeng Hu

Comments Accepted by ICASSP 2026

2603.26339 2026-03-30 cs.LG cs.RO cs.SY eess.SY

Curvature-aware Expected Free Energy as an Acquisition Function for Bayesian Optimization

Ajith Anil Meera, Wouter Kouw

Comments under review

2603.26336 2026-03-30 cs.CV

From Pixels to Privacy: Temporally Consistent Video Anonymization via Token Pruning for Privacy Preserving Action Recognition

Nazia Aslam, Abhisek Ray, Joakim Bruslund Haurum, Lukas Esterle, Kamal Nasrollahi

Comments 10 pages, CVPR paper

2603.26332 2026-03-30 cs.CL cs.AI

CALRK-Bench: Evaluating Context-Aware Legal Reasoning in Korean Law

JiHyeok Jung, TaeYoung Yoon, HyunSouk Cho

Comments 15 pages

2603.26330 2026-03-30 cs.CV cs.AI

Mitigating the Reasoning Tax in Vision-Language Fine-Tuning with Input-Adaptive Depth Aggregation

Yiming Ren, Yujiu Yang, Junjie Wang

2603.26328 2026-03-30 cs.CV

Verify Claimed Text-to-Image Models via Boundary-Aware Prompt Optimization

Zidong Zhao, Yihao Huang, Qing Guo, Tianlin Li, Anran Li, Kailong Wang, Jin Song Dong, Geguang Pu

Comments Accepted to CVPR 2026 (Findings)

2603.26323 2026-03-30 cs.CL cs.AI

From Human Cognition to Neural Activations: Probing the Computational Primitives of Spatial Reasoning in LLMs

Jiyuan An, Liner Yang, Mengyan Wang, Luming Lu, Weihua An, Erhong Yang

2603.26322 2026-03-30 cs.RO

DiffusionAnything: End-to-End In-context Diffusion Learning for Unified Navigation and Pre-Grasp Motion

Iana Zhura, Yara Mahmoud, Jeffrin Sam, Hung Khang Nguyen, Didar Seyidov, Miguel Altamirano Cabrera, Dzmitry Tsetserukou

2603.26317 2026-03-30 cs.CV cs.AI cs.LG

Label-Free Cross-Task LoRA Merging with Null-Space Compression

Wonyoung Lee, Wooseong Jeong, Kuk-Jin Yoon

Comments Accepted at CVPR 2026

2603.26316 2026-03-30 cs.CV cs.LG

SALMUBench: A Benchmark for Sensitive Association-Level Multimodal Unlearning

Cai Selvas-Sala, Lei Kang, Lluis Gomez

Comments Accepted to CVPR 2026. Project page: http://cvc-mmu.github.io/salmubench

2603.26314 2026-03-30 cs.RO

Line-of-Sight-Constrained Multi-Robot Mapless Navigation via Polygonal Visible Regions

Ruofei Bai, Shenghai Yuan, Xinhang Xu, Xingyu Ji, Xiaowei Li, Hongliang Guo, Wei-Yun Yau, Lihua Xie

Comments 10 pages, 7 figures. See videos and code: https://github.com/bairuofei/LoS_constrained_navigation

2603.26308 2026-03-30 cs.LG

D-GATNet: Interpretable Temporal Graph Attention Learning for ADHD Identification Using Dynamic Functional Connectivity

Qurat Ul Ain, Alptekin Temizel, Soyiba Jawed

Comments 5 pages, 4 figures

2603.26299 2026-03-30 cs.CV cs.AI cs.LG

Preference-Aligned LoRA Merging: Preserving Subspace Coverage and Addressing Directional Anisotropy

Wooseong Jeong, Wonyoung Lee, Kuk-Jin Yoon

Comments Accepted at CVPR 2026

2603.26264 2026-03-30 cs.LG cs.SY eess.SY

Topology-Aware Graph Reinforcement Learning for Energy Storage Systems Optimal Dispatch in Distribution Networks

Shuyi Gao, Stavros Orfanoudakis, Shengren Hou, Peter Palensky, Pedro P. Vergara

Comments 15 pages, 10 figures

2603.26263 2026-03-30 cs.CV cs.RO

DRUM: Diffusion-based Raydrop-aware Unpaired Mapping for Sim2Real LiDAR Segmentation

Tomoya Miyawaki, Kazuto Nakashima, Yumi Iwashita, Ryo Kurazume

Comments ICRA 2026

2603.26262 2026-03-30 cs.CV cs.LG

GLASS: Geometry-aware Local Alignment and Structure Synchronization Network for 2D-3D Registration

Zhixin Cheng, Jiacheng Deng, Xinjun Li, Bohao Liao, Li Liu, Xiaotian Yin, Baoqun Yin, Tianzhu Zhang

Comments Accepted by IEEE Transactions on Circuits and Systems for Video Technology

2603.26261 2026-03-30 cs.LG stat.ML

Contrastive Conformal Sets

Yahya Alkhatib, Wee Peng Tay