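arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems science, and other fields.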

2603.26138 2026-03-30 cs.LG cs.CV

PruneFuse: Efficient Data Selection via Weight Pruning and Network Fusion

Humaira Kousar, Hasnain Irshad Bhatti, Jaekyun Moon

Comments Published in TMLR (Featured Certification). arXiv admin note: substantial text overlap with arXiv:2501.01118

Journal ref: Transactions on Machine Learning Research (TMLR), March 2026

Abstract

Efficient data selection is crucial for enhancing the training efficiency of deep neural networks and minimizing annotation requirements. Traditional methods often face high computational costs, limiting their scalability and practical use. We introduce PruneFuse, a novel strategy that leverages pruned networks for data selection and later fuses them with the original network to optimize training. PruneFuse operates in two stages: First, it applies structured pruning to create a smaller pruned network that, due to its structural coherence with the original network, is well-suited for the data selection task. This small network is then trained and selects the most informative samples from the dataset. Second, the trained pruned network is seamlessly fused with the original network. This integration leverages the insights gained during the training of the pruned network to facilitate the learning process of the fused network while leaving room for the network to discover more robust solutions. Extensive experimentation on various datasets demonstrates that PruneFuse significantly reduces computational costs for data selection, achieves better performance than baselines, and accelerates the overall training process.
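The two stages described in this abstract can be illustrated with a toy sketch. The abstract does not state the pruning criterion, the informativeness score, or the fusion rule, so everything below is an assumption made for illustration: units are pruned by L1 norm, samples are ranked by predictive entropy, and fusion copies the trained small-network rows back into their original positions. The function names (`prune_rows`, `select_most_uncertain`, `fuse`) are hypothetical and are not taken from the paper.

```python
import math


def prune_rows(weights, keep):
    """Structured pruning (assumed criterion): keep the `keep` rows with the largest L1 norm."""
    norms = [sum(abs(w) for w in row) for row in weights]
    ranked = sorted(range(len(weights)), key=lambda i: -norms[i])
    kept = sorted(ranked[:keep])
    return kept, [list(weights[i]) for i in kept]


def entropy(probs):
    """Shannon entropy of one predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_most_uncertain(prob_rows, budget):
    """Data selection (assumed score): pick the `budget` samples with the highest entropy."""
    ranked = sorted(range(len(prob_rows)), key=lambda i: -entropy(prob_rows[i]))
    return sorted(ranked[:budget])


def fuse(original, kept, trained_small):
    """Fusion (assumed rule): overwrite the kept rows of the original layer with trained rows."""
    fused = [list(row) for row in original]
    for idx, row in zip(kept, trained_small):
        fused[idx] = list(row)
    return fused


# One toy layer with four units and two inputs per unit.
original = [[0.1, -0.2], [1.0, 0.5], [0.0, 0.05], [-0.7, 0.9]]
kept, small = prune_rows(original, keep=2)

# Stand-in for training the pruned network: nudge every kept weight.
trained_small = [[w + 0.1 for w in row] for row in small]

# Hypothetical softmax outputs of the pruned network on four unlabeled samples.
probs = [[0.9, 0.1], [0.5, 0.5], [0.6, 0.4], [0.99, 0.01]]
selected = select_most_uncertain(probs, budget=2)

fused = fuse(original, kept, trained_small)
print("kept units:", kept)
print("selected samples:", selected)
```

In this sketch the fused layer keeps the original initialization for the pruned-away units, which is one simple way to read "leaving room for the network to discover more robust solutions"; the paper may do this differently.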

2603.26135 2026-03-30 cs.LG

TinyML for Acoustic Anomaly Detection in IoT Sensor Networks

Amar Almaini, Jakob Folz, Ghadeer Ashour

2603.26134 2026-03-30 cs.CV

InstaVSR: Taming Diffusion for Efficient and Temporally Consistent Video Super-Resolution

Jintong Hu, Bin Chen, Zhenyu Hu, Jiayue Liu, Guo Wang, Lu Qi

Comments 12 pages, 7 figures

2603.26128 2026-03-30 cs.CV

TaxaAdapter: Vision Taxonomy Models are Key to Fine-grained Image Generation over the Tree of Life

Mridul Khurana, Amin Karimi Monsefi, Justin Lee, Medha Sawhney, David Carlyn, Julia Chae, Jianyang Gu, Rajiv Ramnath, Sara Beery, Wei-Lun Chao, Anuj Karpatne, Cheng Zhang

2603.26127 2026-03-30 cs.CV cs.AI cs.CL cs.LG cs.MM

Finding Distributed Object-Centric Properties in Self-Supervised Transformers

Samyak Rawlekar, Amitabh Swain, Yujun Cai, Yiwei Wang, Ming-Hsuan Yang, Narendra Ahuja

Comments Computer Vision and Pattern Recognition (CVPR) 2026

2603.26126 2026-03-30 cs.CV

Beyond Where to Look: Trajectory-Guided Reinforcement Learning for Multimodal RLVR

Jinda Lu, Junkang Wu, Jinghan Li, Kexin Huang, Shuo Yang, Mingzhu Chen, Jiancan Wu, Kuien Liu, Xiang Wang

2603.26122 2026-03-30 cs.CV cs.AI

SkinGPT-X: A Self-Evolving Collaborative Multi-Agent System for Transparent and Trustworthy Dermatological Diagnosis

Zhangtianyi Chen, Yuhao Shen, Florensia Widjaja, Yan Xu, Liyuan Sun, Zijian Wang, Hongyi Chen, Wufei Dai, Juexiao Zhou

Abstract

Although recent Large Language Models have significantly advanced dermatological diagnosis, monolithic LLMs frequently struggle with fine-grained, large-scale multi-class diagnostic tasks and rare skin disease diagnosis owing to training data sparsity, and they lack the interpretability and traceability essential for clinical reasoning. Multi-agent systems can offer more transparent and explainable diagnostics, but existing frameworks concentrate primarily on Visual Question Answering and conversational tasks, and their heavy reliance on static knowledge bases restricts adaptability in complex real-world clinical settings. Here, we present SkinGPT-X, a multimodal collaborative multi-agent system for dermatological diagnosis integrated with a self-evolving dermatological memory mechanism. By simulating the diagnostic workflow of dermatologists and enabling continuous memory evolution, SkinGPT-X delivers transparent and trustworthy diagnostics for the management of complex and rare dermatological cases. To validate the robustness of SkinGPT-X, we design a three-tier comparative experiment. First, we benchmark SkinGPT-X against four state-of-the-art LLMs across four public datasets, demonstrating its state-of-the-art performance with a +9.6% accuracy improvement on DDI31 and a +13% weighted F1 gain on Dermnet over the state-of-the-art model. Second, we construct a large-scale multi-class dataset covering 498 distinct dermatological categories to evaluate its fine-grained classification capabilities. Finally, we curate a rare skin disease dataset, the first benchmark to address the scarcity of clinical data on rare skin diseases, which contains 564 clinical samples covering eight rare dermatological diseases. On this dataset, SkinGPT-X achieves a +9.8% accuracy improvement, a +7.1% weighted F1 improvement, and a +10% Cohen's Kappa improvement.
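For readers unfamiliar with two of the metrics reported in this abstract, the following minimal sketch computes Cohen's kappa and support-weighted F1 from label lists using their standard definitions. The labels are made-up toy data, not results from the paper, and the helper names are chosen here for illustration.

```python
from collections import Counter


def cohens_kappa(y_true, y_pred):
    """Agreement beyond chance: (observed - expected) / (1 - expected)."""
    n = len(y_true)
    observed = sum(t == p for t, p in zip(y_true, y_pred)) / n
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    return (observed - expected) / (1 - expected)


def weighted_f1(y_true, y_pred):
    """Per-class F1 averaged with weights equal to each class's share of true labels."""
    n = len(y_true)
    total = 0.0
    for cls, support in Counter(y_true).items():
        tp = sum(t == cls and p == cls for t, p in zip(y_true, y_pred))
        fp = sum(t != cls and p == cls for t, p in zip(y_true, y_pred))
        fn = sum(t == cls and p != cls for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        total += support * f1
    return total / n


# Toy labels (hypothetical): two classes, one misclassified sample.
y_true = ["a", "a", "b", "b"]
y_pred = ["a", "b", "b", "b"]

kappa = cohens_kappa(y_true, y_pred)
wf1 = weighted_f1(y_true, y_pred)
print(f"kappa={kappa:.3f} weighted_f1={wf1:.3f}")
```

Weighted F1 is sensitive to class imbalance through the support weights, while kappa discounts agreement that would occur by chance, which is why the two are often reported together for many-class diagnostic tasks.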

2603.26114 2026-03-30 cs.LG cs.AI

DPD-Cancer: Explainable Graph-based Deep Learning for Small Molecule Anti-Cancer Activity Prediction

Magnus H. Strømme, Alex G. C. de Sá, David B. Ascher

2603.26109 2026-03-30 cs.CV

SDDF: Specificity-Driven Dynamic Focusing for Open-Vocabulary Camouflaged Object Detection

Jiaming Liang, Yifeng Zhan, Chunlin Liu, Weihua Zheng, Bingye Peng, Qiwei Liang, Boyang Cai, Xiaochun Mai, Qiang Nie

Comments Accepted by CVPR 2026

2603.26108 2026-03-30 cs.LG cs.CV

Accurate Precipitation Forecast by Efficiently Learning from Massive Atmospheric Variables and Unbalanced Distribution

Shuangliang Li, Siwei Li, Li Li, Weijie Zou, Jie Yang, Maolin Zhang

2603.26106 2026-03-30 cs.CL

LLM Benchmark-User Need Misalignment for Climate Change

Oucheng Liu, Lexing Xie, Jing Jiang

Comments 37 pages (8 main), 31 figures, 14 tables

2603.26105 2026-03-30 cs.LG

Are LLM-Enhanced Graph Neural Networks Robust against Poisoning Attacks?

Yuhang Ma, Jie Wang, Zheng Yan

Comments To appear at 2026 IEEE Symposium on Security and Privacy (SP)

2603.26098 2026-03-30 cs.SD cs.AI cs.LG

A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning

Harunori Kawano, Takeshi Sasaki

2603.26097 2026-03-30 cs.LG cs.AI stat.ML

Dynamic Tokenization via Reinforcement Patching: End-to-end Training and Zero-shot Transfer

Yulun Wu, Sravan Kumar Ankireddy, Samuel Sharpe, Nikita Seleznev, Dehao Yuan, Hyeji Kim, Nam H. Nguyen

2603.26096 2026-03-30 cs.LG cs.CV

AcTTA: Rethinking Test-Time Adaptation via Dynamic Activation

Hyeongyu Kim, Geonhui Han, Dosik Hwang

Comments Accepted at CVPR 2026

2603.26095 2026-03-30 cs.CL

IndoBERT-Relevancy: A Context-Conditioned Relevancy Classifier for Indonesian Text

Muhammad Apriandito Arya Saputra, Andry Alamsyah, Dian Puteri Ramadhani, Thomhert Suprapto Siadari, Hanif Fakhrurroja

Comments 9 pages, 3 figures, 6 tables

2603.26092 2026-03-30 cs.CV cs.LG

CD-Buffer: Complementary Dual-Buffer Framework for Test-Time Adaptation in Adverse Weather Object Detection

Youngjun Song, Hyeongyu Kim, Dosik Hwang

Comments Accepted at CVPR 2026

2603.26088 2026-03-30 cs.CV

Learnable Instance Attention Filtering for Adaptive Detector Distillation

Chen Liu, Qizhen Lan, Zhicheng Ding, Xinyu Chu, Qing Tian

2603.26078 2026-03-30 cs.CV cs.AI

When Identities Collapse: A Stress-Test Benchmark for Multi-Subject Personalization

Zhihan Chen, Yuhuan Zhao, Yijie Zhu, Xinyu Yao

Comments 10 pages, 7 figures, accepted by CVPR 2026 Workshop P13N

2603.26076 2026-03-30 cs.AI cs.CL cs.IR

Semi-Automated Knowledge Engineering and Process Mapping for Total Airport Management

Darryl Teo, Adharsha Sam, Chuan Shen Marcus Koh, Rakesh Nagi, Nuno Antunes Ribeiro

2603.26071 2026-03-30 cs.CV cs.LG

MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality

Kyungwon Kim, Dosik Hwang

Comments Accepted to CVPR 2026. 10 pages, 5 figures, supplementary included

2603.26067 2026-03-30 cs.CV cs.AI

R-PGA: Robust Physical Adversarial Camouflage Generation via Relightable 3D Gaussian Splatting

Tianrui Lou, Siyuan Liang, Jiawei Liang, Yuze Gao, Xiaochun Cao

Comments Under review

2603.26066 2026-03-30 cs.LG

Adversarial Bandit Optimization with Globally Bounded Perturbations to Linear Losses

Zhuoyu Cheng, Kohei Hatano, Eiji Takimoto

2603.26055 2026-03-30 cs.CV

Pioneering Perceptual Video Fluency Assessment: A Novel Task with Benchmark Dataset and Baseline

Qizhi Xie, Kun Yuan, Yunpeng Qu, Ming Sun, Chao Zhou, Jihong Zhu

Comments 14 pages, 6 figures. Accepted by CVPR 2026 findings track

2603.26052 2026-03-30 cs.CV cs.AI

Bridging Pixels and Words: Mask-Aware Local Semantic Fusion for Multimodal Media Verification

Zizhao Chen, Ping Wei, Ziyang Ren, Huan Li, Xiangru Yin

Comments Accepted by CVPR 2026

2603.26049 2026-03-30 cs.CV cs.AI

Seeing Like Radiologists: Context- and Gaze-Guided Vision-Language Pretraining for Chest X-rays

Kang Liu, Zhuoqi Ma, Siyu Liang, Yunan Li, Xiyue Gao, Chao Liang, Kun Xie, Qiguang Miao

Comments Code: https://github.com/mk-runner/CoGaze

2603.26046 2026-03-30 cs.CL

Retrieval-Augmented Generation Based Nurse Observation Extraction

Kyomin Hwang, Nojun Kwak

2603.26045 2026-03-30 cs.LG cs.AI cs.CL cs.NE

H-Node Attack and Defense in Large Language Models

Eric Yocam, Varghese Vaidyan, Yong Wang

Comments 17 pages, 7 figures, 6 tables

2603.26036 2026-03-30 cs.CV

Face2Parts: Exploring Coarse-to-Fine Inter-Regional Facial Dependencies for Generalized Deepfake Detection

Kutub Uddin, Nusrat Tasnim, Byung Tae Oh

2603.26034 2026-03-30 cs.CL

AgentCollab: A Self-Evaluation-Driven Collaboration Paradigm for Efficient LLM Agents

Wenbo Gao, Renxi Liu, Xian Wang, Fang Guo, Shuai Yang, Xi Chen, Hui-Ling Zhen, Hanting Chen, Weizhe Lin, Xiaosong Li, Yaoyuan Wang