arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.02837 2026-04-06 cs.CR cs.AI

Towards Secure Agent Skills: Architecture, Threat Taxonomy, and Security Analysis

Zhiyuan Li, Jingzheng Wu, Xiang Ling, Xing Cui, Tianyue Luo

详情

英文摘要

Agent Skills is an emerging open standard that defines a modular, filesystem-based packaging format enabling LLM-based agents to acquire domain-specific expertise on demand. Despite rapid adoption across multiple agentic platforms and the emergence of large community marketplaces, the security properties of Agent Skills have not been systematically studied. This paper presents the first comprehensive security analysis of the Agent Skills framework. We define the full lifecycle of an Agent Skill across four phases -- Creation, Distribution, Deployment, and Execution -- and identify the structural attack surface each phase introduces. Building on this lifecycle analysis, we construct a threat taxonomy comprising seven categories and seventeen scenarios organized across three attack layers, grounded in both architectural analysis and real-world evidence. We validate the taxonomy through analysis of five confirmed security incidents in the Agent Skills ecosystem. Based on these findings, we discuss defense directions for each threat category, identify open research challenges, and provide actionable recommendations for stakeholders. Our analysis reveals that the most severe threats arise from structural properties of the framework itself, including the absence of a data-instruction boundary, a single-approval persistent trust model, and the lack of mandatory marketplace security review, and cannot be addressed through incremental mitigations alone.

URL PDF HTML ☆

赞 0 踩 0

2604.02783 2026-04-06 cs.HC cs.AI

Disrupting Cognitive Passivity: Rethinking AI-Assisted Data Literacy through Cognitive Alignment

Yongsu Ahn, Nam Wook Kim, Benjamin Bach

2604.02767 2026-04-06 cs.CR cs.AI cs.MA

SentinelAgent: Intent-Verified Delegation Chains for Securing Federal Multi-Agent AI Systems

KrishnaSaiReddy Patil

Comments 12 pages, 2 figures, 9 tables. Includes TLA+ mechanical verification, DelegationBench v4 benchmark (516 scenarios), live LangChain agent integration, and independent red-team evaluation

2604.02742 2026-04-06 eess.IV cs.CV

Task-Guided Prompting for Unified Remote Sensing Image Restoration

Wenli Huang, Yang Wu, Xiaomeng Xin, Zhihong Liu, Jinjun Wang, Ye Deng

Comments 17 pages, 11 figures

详情

DOI: 10.1109/TGRS.2025.3649021
Journal ref: IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 64, 2026

英文摘要

Remote sensing image restoration (RSIR) is essential for recovering high-fidelity imagery from degraded observations, enabling accurate downstream analysis. However, most existing methods focus on single degradation types within homogeneous data, restricting their practicality in real-world scenarios where multiple degradations often across diverse spectral bands or sensor modalities, creating a significant operational bottleneck. To address this fundamental gap, we propose TGPNet, a unified framework capable of handling denoising, cloud removal, shadow removal, deblurring, and SAR despeckling within a single, unified architecture. The core of our framework is a novel Task-Guided Prompting (TGP) strategy. TGP leverages learnable, task-specific embeddings to generate degradation-aware cues, which then hierarchically modulate features throughout the decoder. This task-adaptive mechanism allows the network to precisely tailor its restoration process for distinct degradation patterns while maintaining a single set of shared weights. To validate our framework, we construct a unified RSIR benchmark covering RGB, multispectral, SAR, and thermal infrared modalities for five aforementioned restoration tasks. Experimental results demonstrate that TGPNet achieves state-of-the-art performance on both unified multi-task scenarios and unseen composite degradations, surpassing even specialized models in individual domains such as cloud removal. By successfully unifying heterogeneous degradation removal within a single adaptive framework, this work presents a significant advancement for multi-task RSIR, offering a practical and scalable solution for operational pipelines. The code and benchmark will be released at https://github.com/huangwenwenlili/TGPNet.

URL PDF HTML ☆

赞 0 踩 0

2604.02740 2026-04-06 cs.SI cs.AI

Cross Event Detection and Topic Evolution Mining in cross events for Man Made Disasters in Social Media Streams

Pramod Bide, Sudhir Dhage, Mohammed Afaan Ansari, Rudresh Veerkhare

2604.02738 2026-04-06 stat.ML cs.LG math.OC stat.CO

State estimations and noise identifications with intermittent corrupted observations via Bayesian variational inference

Peng Sun, Ruoyu Wang, Xue Luo

Comments 8 pages, 6 figures

2604.02729 2026-04-06 cs.SE cs.AI cs.CL

IndustryCode: A Benchmark for Industry Code Generation

Puyu Zeng, Zhaoxi Wang, Zhixu Duan, Liang Feng, Shaobo Wang, Cunxiang Wang, Jinghang Wang, Bing Zhao, Hu Wei, Linfeng Zhang

Comments 37 pages, 28 figures, 4 tables. Includes appendix

2604.02678 2026-04-06 stat.ME cs.AI stat.AP

Eligibility-Aware Evidence Synthesis: An Agentic Framework for Clinical Trial Meta-Analysis

Yao Zhao, Zhiyue Zhang, Yanxun Xu

详情

英文摘要

Clinical evidence synthesis requires identifying relevant trials from large registries and aggregating results that account for population differences. While recent LLM-based approaches have automated components of systematic review, they do not support end-to-end evidence synthesis. Moreover, conventional meta-analysis weights studies by statistical precision without considering clinical compatibility reflected in eligibility criteria. We propose EligMeta, an agentic framework that integrates automated trial discovery with eligibility-aware meta-analysis, translating natural-language queries into reproducible trial selection and incorporating eligibility alignment into study weighting to produce cohort-specific pooled estimates. EligMeta employs a hybrid architecture separating LLM-based reasoning from deterministic execution: LLMs generate interpretable rules from natural-language queries and perform schema-constrained parsing of trial metadata, while all logical operations, weight computations, and statistical pooling are executed deterministically to ensure reproducibility. The framework structures eligibility criteria and computes similarity-based study weights reflecting population alignment between target and comparator trials. In a gastric cancer landscape analysis, EligMeta reduced 4,044 candidate trials to 39 clinically relevant studies through rule-based filtering, recovering all 13 guideline-cited trials. In an olaparib adverse events meta-analysis across four trials, eligibility-aware weighting shifted the pooled risk ratio from 2.18 (95% CI: 1.71-2.79) under conventional Mantel-Haenszel estimation to 1.97 (95% CI: 1.76-2.20), demonstrating quantifiable impact of incorporating eligibility alignment. EligMeta bridges automated trial discovery with eligibility-aware meta-analysis, providing a scalable and reproducible framework for evidence synthesis in precision medicine.

URL PDF HTML ☆

赞 0 踩 0

2604.02674 2026-04-06 cs.MA cs.AI

Do Agent Societies Develop Intellectual Elites? The Hidden Power Laws of Collective Cognition in LLM Multi-Agent Systems

Kavana Venkatesh, Jiaming Cui

2604.02648 2026-04-06 cs.SE cs.AI

GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers

Shufan Jiang, Chios Chen, Zhiyang Chen

Comments Accepted as a workshop paper at the Fourteenth International Conference on Learning Representations (ICLR 2026)

2604.02629 2026-04-06 cs.HC cs.AI

Toys that listen, talk, and play: Understanding Children's Sensemaking and Interactions with AI Toys

Aayushi Dangol, Meghna Gupta, Daeun Yoo, Robert Wolfe, Jason Yip, Franziska Roesner, Julie A. Kientz

2604.02624 2026-04-06 physics.optics cs.CV cs.NE physics.app-ph

Wavelength-multiplexed massively parallel diffractive optical information storage and image projection

Che-Yung Shen, Yuhang Li, Cagatay Isil, Jingxi Li, Leon Lenk, Tianyi Gan, Guangdong Ma, Fazil Onuralp Ardic, Mona Jarrahi, Aydogan Ozcan

Comments 28 Pages, 8 Figures

2604.02610 2026-04-06 stat.ML cs.LG

Structure-Preserving Multi-View Embedding Using Gromov-Wasserstein Optimal Transport

Rafael Pereira Eufrazio, Eduardo Fernandes Montesuma, Charles Casimiro Cavalcante

Comments This manuscript is currently under review for possible publication in the journal Signal Processing (ELSEVIER)

2604.02581 2026-04-06 stat.ML cs.LG cs.NA math.NA

Learning interacting particle systems from unlabeled data

Viska Wei, Fei Lu

Comments 39 pages, 7 figures

2604.02578 2026-04-06 cs.MA cs.AI cs.CL cs.GT

High Volatility and Action Bias Distinguish LLMs from Humans in Group Coordination

Sahaj Singh Maini, Robert L. Goldstone, Zoran Tiganj

2604.02574 2026-04-06 cs.CR cs.AI cs.LG

Understanding the Effects of Safety Unalignment on Large Language Models

John T. Halloran

Comments 12 pages, 2 figures, 5 tables

2604.02567 2026-04-06 cs.CY cs.AI cs.ET cs.HC

Generative AI Use in Entrepreneurship: An Integrative Review and an Empowerment-Entrapment Framework

Jackson G. Lu, Gerui Gloria Zhao, Anna Manyi Zheng

2604.02555 2026-04-06 cs.DS cs.LG

Robust Learning with Optimal Error

Guy Blanc

2604.02549 2026-04-06 q-fin.ST cs.LG

Financial Anomaly Detection for the Canadian Market

Luigi Caputi, Nicholas Meadows

2604.02548 2026-04-06 cs.CR cs.AI

From Theory to Practice: Code Generation Using LLMs for CAPEC and CWE Frameworks

Murtuza Shahzad, Joseph Wilson, Ibrahim Al Azher, Hamed Alhoori, Mona Rahimi

2604.02539 2026-04-06 cs.IR cs.LG

Synapse: Evolving Job-Person Fit with Explainable Two-phase Retrieval and LLM-guided Genetic Resume Optimization

Ansel Kaplan Erol, Seohee Yoon, Keenan Hom, Xisheng Zhang

2604.02524 2026-04-06 cond-mat.mtrl-sci cs.LG

AQVolt26: High-Temperature r$^2$SCAN Halide Dataset for Universal ML Potentials and Solid-State Batteries

Jiyoon Kim, Chuhong Wang, Aayush R. Singh, Tyler Sours, Shivang Agarwal, AJ Nish, Paul Abruzzo, Ang Xiao, Omar Allam

2604.02522 2026-04-06 cs.CR cs.AI

Opal: Private Memory for Personal AI

Darya Kaviani, Alp Eren Ozdarendeli, Jinhao Zhu, Yu Ding, Raluca Ada Popa

2604.02520 2026-04-06 physics.data-an cs.LG

Neural posterior estimation for scalable and accurate inverse parameter inference in Li-ion batteries

Malik Hassanaly, Corey R. Randall, Peter J. Weddle, Paul J. Gasper, Conlain Kelly, Tanvir R. Tanim, Kandler Smith

2604.02513 2026-04-06 eess.SP cs.AI

Sparse Bayesian Learning Algorithms Revisited: From Learning Majorizers to Structured Algorithmic Learning using Neural Networks

Rushabha Balaji, Kuan-Lin Chen, Danijela Cabric, Bhaskar D. Rao

详情

英文摘要

Sparse Bayesian Learning is one of the most popular sparse signal recovery methods, and various algorithms exist under the SBL paradigm. However, given a performance metric and a sparse recovery problem, it is difficult to know a-priori the best algorithm to choose. This difficulty is in part due to a lack of a unified framework to derive SBL algorithms. We address this issue by first showing that the most popular SBL algorithms can be derived using the majorization-minimization (MM) principle, providing hitherto unknown convergence guarantees to this class of SBL methods. Moreover, we show that the two most popular SBL update rules not only fall under the MM framework but are both valid descent steps for a common majorizer, revealing a deeper analytical compatibility between these algorithms. Using this insight and properties from MM theory we expand the class of SBL algorithms, and address finding the best SBL algorithm via data within the MM framework. Second, we go beyond the MM framework by introducing the powerful modeling capabilities of deep learning to further expand the class of SBL algorithms, aiming to learn a superior SBL update rule from data. We propose a novel deep learning architecture that can outperform the classical MM based ones across different sparse recovery problems. Our architecture's complexity does not scale with the measurement matrix dimension, hence providing a unique opportunity to test generalization capability across different matrices. For parameterized dictionaries, this invariance allows us to train and test the model across different parameter ranges. We also showcase our model's ability to learn a functional mapping by its zero-shot performance on unseen measurement matrices. Finally, we test our model's performance across different numbers of snapshots, signal-to-noise ratios, and sparsity levels.

URL PDF HTML ☆

赞 0 踩 0

2604.02507 2026-04-06 stat.ML cs.LG

Reinforcement Learning from Human Feedback: A Statistical Perspective

Pangpang Liu, Chengchun Shi, Will Wei Sun

2604.02505 2026-04-06 math.OC cs.LG

Optimal Projection-Free Adaptive SGD for Matrix Optimization

Dmitry Kovalev

2604.02490 2026-04-06 cs.CR cs.AI

Automated Malware Family Classification using Weighted Hierarchical Ensembles of Large Language Models

Samita Bai, Hamed Jelodar, Tochukwu Emmanuel Nwankwo, Parisa Hamedi, Mohammad Meymani, Roozbeh Razavi-Far, Ali A. Ghorbani

2604.02483 2026-04-06 physics.flu-dyn cs.AI

A Multimodal Vision Transformer-based Modeling Framework for Prediction of Fluid Flows in Energy Systems

Kiran Yalamanchi, Shivam Barwey, Ibrahim Jarrah, Pinaki Pal

2604.02448 2026-04-06 eess.IV cs.AI cs.CV

Managing Diabetic Retinopathy with Deep Learning: A Data Centric Overview

Shramana Dey, Zahir Khan, T. A. PramodKumar, B. Uma Shankar, Ashis K. Dhara, Ramachandran Rajalakshmi, Rajiv Raman, Sushmita Mitra