arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.09123 2026-04-13 cs.CL

Prototype-Regularized Federated Learning for Cross-Domain Aspect Sentiment Triplet Extraction

Zongming Cai, Jianhang Tang, Zhenyong Zhang, Jinghui Qin, Kebing Jin, Hankz Hankui Zhuo

详情

英文摘要

Aspect Sentiment Triplet Extraction (ASTE) aims to extract all sentiment triplets of aspect terms, opinion terms, and sentiment polarities from a sentence. Existing methods are typically trained on individual datasets in isolation, failing to jointly capture the common feature representations shared across domains. Moreover, data privacy constraints prevent centralized data aggregation. To address these challenges, we propose Prototype-based Cross-Domain Span Prototype extraction (PCD-SpanProto), a prototype-regularized federated learning framework to enable distributed clients to exchange class-level prototypes instead of full model parameters. Specifically, we design a weighted performance-aware aggregation strategy and a contrastive regularization module to improve the global prototype under domain heterogeneity and the promotion between intra-class compactness and inter-class separability across clients. Extensive experiments on four ASTE datasets demonstrate that our method outperforms baselines and reduces communication costs, validating the effectiveness of prototype-based cross-domain knowledge transfer.

URL PDF HTML ☆

赞 0 踩 0

2604.09114 2026-04-13 cs.CV cs.LG

FIRE-CIR: Fine-grained Reasoning for Composed Fashion Image Retrieval

François Gardères, Camille-Sovanneary Gauthier, Jean Ponce, Shizhe Chen

2604.09100 2026-04-13 cs.CV cs.RO

Physically Grounded 3D Generative Reconstruction under Hand Occlusion using Proprioception and Multi-Contact Touch

Gabriele Mario Caddeo, Pasquale Marra, Lorenzo Natale

Comments 27 pages, 10 figures, under review

2604.09096 2026-04-13 cs.CV cs.MM eess.IV

Off-the-shelf Vision Models Benefit Image Manipulation Localization

Zhengxuan Zhang, Keji Song, Junmin Hu, Ao Luo, Yuezun Li

2604.09094 2026-04-13 cs.SD cs.CL

Few-Shot Contrastive Adaptation for Audio Abuse Detection in Low-Resource Indic Languages

Aditya Narayan Sankaran, Reza Farahbakhsh, Noel Crespi

Comments 14 pages, preprint under review

2604.09091 2026-04-13 cs.LG

Synthesizing real-world distributions from high-dimensional Gaussian Noise with Fully Connected Neural Network

Joanna Komorniczak

2604.09088 2026-04-13 cs.CV

Memory-Efficient Transfer Learning with Fading Side Networks via Masked Dual Path Distillation

Yutong Zhang, Jiaxin Chen, Honglin Chen, Kaiqi Zheng, Shengcai Liao, Hanwen Zhong, Weixin Li, Yunhong Wang

Comments CVPR2026 Accepted

2604.09085 2026-04-13 cs.LG cs.AI

Beyond Isolated Clients: Integrating Graph-Based Embeddings into Event Sequence Models

Harry Proshian, Nikita Severin, Sergey Nikolenko, Kireev Ivan, Andrey Savchenko, Ivan Sergeev, Maria Postnova, Ilya Makarov

Comments Short paper accepted at ACM Web Conference 2026 (WWW '26)

2604.09076 2026-04-13 cs.CV

Cross-Modal Knowledge Distillation from Spatial Transcriptomics to Histology

Arbel Hizmi, Artemii Bakulin, Shai Bagon, Nir Yosef

Comments Accepted to the CVMI Workshop at CVPR 2026. Project page: https://cross-modal-distillation.github.io/

2604.09075 2026-04-13 cs.CL

Hierarchical Alignment: Enforcing Hierarchical Instruction-Following in LLMs through Logical Consistency

Shu Yang, Zihao Zhou, Di Wang, Wenda Li

2604.09072 2026-04-13 cs.AI

Overhang Tower: Resource-Rational Adaptation in Sequential Physical Planning

Ruihong Shen, Shiqian Li, Yixin Zhu

Comments 8 pages, 4 figures, CogSci 2026

2604.09069 2026-04-13 cs.CL cs.AI cs.LG

NyayaMind- A Framework for Transparent Legal Reasoning and Judgment Prediction in the Indian Legal System

Parjanya Aditya Shukla, Shubham Kumar Nigam, Debtanu Datta, Balaramamahanthi Deepak Patnaik, Noel Shallum, Pradeep Reddy Vanga, Saptarshi Ghosh, Arnab Bhattacharya

2604.09067 2026-04-13 cs.LG

Temporal Patch Shuffle (TPS): Leveraging Patch-Level Shuffling to Boost Generalization and Robustness in Time Series Forecasting

Jafar Bakhshaliyev, Johannes Burchert, Niels Landwehr, Lars Schmidt-Thieme

Comments 25 pages, 7 figures, 17 tables

2604.09064 2026-04-13 cs.LG

Feature-Label Modal Alignment for Robust Partial Multi-Label Learning

Yu Chen, Weijun Lv, Yue Huang, Xiaozhao Fang, Jie Wen, Yong Xu, Guanbin Li

2604.09062 2026-04-13 cs.CV

Nested Radially Monotone Polar Occupancy Estimation: Clinically-Grounded Optic Disc and Cup Segmentation for Glaucoma Screening

Rimsa Goperma, Rojan Basnet, Liang Zhao

2604.09059 2026-04-13 cs.CV cs.AI

Learning Vision-Language-Action World Models for Autonomous Driving

Guoqing Wang, Pin Tang, Xiangxuan Ren, Guodongfang Zhao, Bailan Feng, Chao Ma

Comments Accepted by CVPR2026 findings

2604.09058 2026-04-13 cs.LG cs.AI

PDE-regularized Dynamics-informed Diffusion with Uncertainty-aware Filtering for Long-Horizon Dynamics

Min Young Baeg, Yoon-Yeong Kim

2604.09051 2026-04-13 cs.CV cs.RO

Fine-Grained Action Segmentation for Renorrhaphy in Robot-Assisted Partial Nephrectomy

Jiaheng Dai, Huanrong Liu, Tailai Zhou, Tongyu Jia, Qin Liu, Yutong Ban, Zeju Li, Yu Gao, Xin Ma, Qingbiao Li

2604.09047 2026-04-13 cs.CV

Text-Conditioned Multi-Expert Regression Framework for Fully Automated Multi-Abutment Design

Mianjie Zheng, Xinquan Yang, Xuefen Liu, Xuguang Li, Kun Tang, He Meng, Linlin Shen

2604.09045 2026-04-13 cs.CV

Scene-Agnostic Object-Centric Representation Learning for 3D Gaussian Splatting

Tsuheng Hsu, Guiyu Liu, Juho Kannala, Janne Heikkilä

2604.09038 2026-04-13 cs.RO cs.CV cs.LG

Towards Lifelong Aerial Autonomy: Geometric Memory Management for Continual Visual Place Recognition in Dynamic Environments

Xingyu Shao, Zhiqiang Yan, Liangzheng Sun, Mengfan He, Chao Chen, Jinhui Zhang, Chunyu Li, Ziyang Meng

2604.09037 2026-04-13 cs.CV cs.CL cs.HC

SiMing-Bench: Evaluating Procedural Correctness from Continuous Interactions in Clinical Skill Videos

Xiyang Huang, Jiawei Lin, Keying Wu, Jiaxin Huang, Kailai Yang, Renxiong Wei, Cheng zeng, Jiayi Xiang, Ziyan Kuang, Min Peng, Qianqian Xie, Sophia Ananiadou

2604.09036 2026-04-13 cs.RO

V-CAGE: Vision-Closed-Loop Agentic Generation Engine for Robotic Manipulation

Yaru Liu, Ao-bo Wang, Nanyang Ye

2604.09035 2026-04-13 cs.AI cs.LG

Advantage-Guided Diffusion for Model-Based Reinforcement Learning

Daniele Foffano, Arvid Eriksson, David Broman, Karl H. Johansson, Alexandre Proutiere

2604.09034 2026-04-13 cs.LG

The nextAI Solution to the NeurIPS 2023 LLM Efficiency Challenge

Gyuwon Park, DongIl Shin, SolGil Oh, SangGi Ryu, Byung-Hak Kim

2604.09030 2026-04-13 cs.CV

NTIRE 2026 The 3rd Restore Any Image Model (RAIM) Challenge: Multi-Exposure Image Fusion in Dynamic Scenes (Track 2)

Lishen Qu, Yao Liu, Jie Liang, Hui Zeng, Wen Dai, Guanyi Qin, Ya-nan Guan, Shihao Zhou, Jufeng Yang, Lei Zhang, Radu Timofte, Xiyuan Yuan, Wanjie Sun, Shihang Li, Bo Zhang, Bin Chen, Jiannan Lin, Yuxu Chen, Qinquan Gao, Tong Tong, Song Gao, Jiacong Tang, Tao Hu, Xiaowen Ma, Qingsen Yan, Sunhan Xu, Juan Wang, Xinyu Sun, Lei Qi, He Xu, Jiachen Tu, Guoyi Xu, Yaoxin Jiang, Jiajia Liu, Yaokun Shi

Comments Accepted by CVPRW 2026

2604.09029 2026-04-13 cs.CL cs.AI

CONDESION-BENCH: Conditional Decision-Making of Large Language Models in Compositional Action Space

Yeonjun Hwang, Sungyong Park, Minju Kim, Dongha Lee, Jinyoung Yeo

Comments preprint

2604.09024 2026-04-13 cs.CV cs.AI cs.CR cs.LG

Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Injection

Zedian Shao, Hongbin Liu, Yuepeng Hu, Neil Zhenqiang Gong

Comments Appeared in ACL 2026 main conference

2604.09023 2026-04-13 cs.CV

CAD 100K: A Comprehensive Multi-Task Dataset for Car Related Visual Anomaly Detection

Jiahua Pang, Ying Li, Dongpu Cao, Jingcai Luo, Yanuo Zheng, Bao Yunfan, Yujie Lei, Rui Yuan, Yuxi Tian, Guojin Yuan, Hongchang Chen, Zhi Zheng, Yongchun Liu

2604.09022 2026-04-13 cs.CV

BlendFusion -- Scalable Synthetic Data Generation for Diffusion Model Training

Thejas Venkatesh, Suguna Varshini Velury