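arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.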

2604.26997 2026-05-01 cs.CR cs.AI cs.MA

Agent Name Service (ANS): A Proof-of-Concept Trust Layer for Secure AI Agent Discovery, Identity, and Governance in Kubernetes

Akshay Mittal, Elyson De La Cruz

Comments 9 pages, 2 figures

Abstract

Autonomous AI agent ecosystems require stronger mechanisms for secure discovery, identity verification, capability attestation, and policy governance. Current deployments frequently lack (1) uniform agent discovery, (2) cryptographic agent authentication, (3) capability proofs that protect secrets, and (4) enforceable policy controls. This paper presents an implementation-oriented proof of concept for the Agent Name Service (ANS), a DNS-inspired trust layer for AI agent discovery and interoperability in Kubernetes, grounded in the ANS protocol specification [huang2025ans]. The implementation uses Decentralized Identifiers (DIDs), Verifiable Credentials (VCs), policy-as-code enforcement with Open Policy Agent (OPA), and Kubernetes-native integration patterns (CRDs, admission controls, service mesh integration). In a demo research environment (3-node cluster, 50-agent workflow simulation), we observe sub-10ms response in demonstrated service paths and full success for scripted demo deployment scenarios. We explicitly scope these findings as proof-of-concept evidence rather than production certification. We further provide a threat model, assumptions, and limitations to separate implemented evidence from protocol-defined and roadmap capabilities. The result is an evidence-grounded pathway from ANS protocol concepts to reproducible engineering practice for secure multi-agent systems.

2604.26983 2026-05-01 cs.IR cs.LG stat.ML

Value-Aware Product Recommendation by Customer Segmentation using a suitable High-Dimensional Similarity Measure

María Florencia Acosta, Rodrigo García Arancibia, Pamela Llop, Mariel Lovatto, Lucas Mansilla

2604.26981 2026-05-01 cs.IR cs.LG

Budget-Constrained Online Retrieval-Augmented Generation: The Chunk-as-a-Service Model

Shawqi Al-Maliki, Ammar Gharaibeh, Mohamed Rahouti, Mohammad Ruhul Amin, Mohamed Abdallah, Junaid Qadir, Ala Al-Fuqaha

DOI: 10.1109/TAI.2026.3666170

Abstract

Large Language Models (LLMs) have revolutionized the field of natural language processing. However, they exhibit some limitations, including a lack of reliability and transparency: they may hallucinate and fail to provide sources that support the generated output. Retrieval-Augmented Generation (RAG) was introduced to address such limitations in LLMs. One popular implementation, RAG-as-a-Service (RaaS), has shortcomings that hinder its adoption and accessibility. For instance, RaaS pricing is based on the number of submitted prompts, without considering whether the prompts are enriched by relevant chunks, i.e., text segments retrieved from a vector database, or the quality of the utilized chunks (i.e., their degree of relevance). This results in an opaque and less cost-effective payment model. We propose Chunk-as-a-Service (CaaS) as a transparent and cost-effective alternative. CaaS includes two variants: Open-Budget CaaS (OB-CaaS) and Limited-Budget CaaS (LB-CaaS), which is enabled by our "Utility-Cost Online Selection Algorithm (UCOSA)". UCOSA further extends the cost-effectiveness and the accessibility of the OB-CaaS variant by enriching, in an online manner, a subset of the submitted prompts based on budget constraints and utility-cost tradeoff. Our experiments demonstrate the efficacy of the proposed UCOSA compared to both offline and relevance-greedy selection baselines. In terms of the performance metric, the number of enriched prompts (NEP) multiplied by the Average Relevance (AR), UCOSA outperforms random selection by approximately 52% and achieves around 75% of the performance of offline selection methods. Additionally, in terms of budget utilization, LB-CaaS and OB-CaaS achieve higher performance-to-budget ratios of 140% and 86%, respectively, compared to RaaS, indicating their superior efficiency.
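
The metric named in this abstract (NEP multiplied by AR) and the idea of enriching only some prompts under a budget can be illustrated with a minimal sketch. This is not UCOSA, whose details the abstract does not give. It assumes a plain threshold rule: a prompt is enriched when its relevance per unit cost clears a fixed bar and it still fits in the remaining budget. The names `Prompt`, `select_online`, `nep_times_ar`, and `min_ratio` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Prompt:
    relevance: float  # relevance of the retrieved chunks, assumed to lie in [0, 1]
    cost: float       # price of enriching this prompt, assumed to be > 0


def select_online(prompts, budget, min_ratio):
    """Illustrative threshold rule, NOT the paper's UCOSA.

    Prompts arrive one at a time. Each is enriched only if its
    relevance-to-cost ratio is at least `min_ratio` and its cost fits
    in the remaining budget. Earlier decisions are never revisited.
    """
    chosen, remaining = [], budget
    for p in prompts:
        if p.cost <= remaining and p.relevance / p.cost >= min_ratio:
            chosen.append(p)
            remaining -= p.cost
    return chosen


def nep_times_ar(chosen):
    """Number of enriched prompts (NEP) multiplied by average relevance (AR)."""
    if not chosen:
        return 0.0
    nep = len(chosen)
    ar = sum(p.relevance for p in chosen) / nep
    return nep * ar
```

Because NEP times AR equals the summed relevance of the enriched prompts, a selection rule can raise the metric either by enriching more prompts or by enriching more relevant ones.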

2604.26979 2026-05-01 cs.AR cs.AI cs.ET

Multibit neural inference in a N-ary crossbar architecture

Anatole Moureaux, Anthony Lopes Temporao, Flavio Abreu Araujo

Comments 24 pages, 7 figures, 3 tables

2604.26976 2026-05-01 cs.LO cs.AI

Fitting Horn DL Ontologies to ABox and Query Examples: A Tale of Simulation Quantifiers and Finite Models

Marvin Grosser, Carsten Lutz

Comments Submitted to the 23rd International Conference on Principles of Knowledge Representation and Reasoning (KR2026)

2604.26973 2026-05-01 cs.NE cs.LG stat.CO

MAEO: Multiobjective Animorphic Ensemble Optimization for Scalable Large-scale Engineering Applications

Omer F. Erdem, Dean Price, Paul Seurin, Majdi I. Radaideh

Comments 33 pages, 9 figures, 5 tables, under peer review

Abstract

Multiobjective optimization remains challenging for many scientific and engineering problems due to the need to balance convergence, diversity, and computational efficiency across high-dimensional objective landscapes. This work presents the Multiobjective Animorphic Ensemble Optimization (MAEO) framework, a parallelizable ensemble strategy that unifies state-of-the-art evolutionary algorithms within an island-based architecture, overcoming the limitations of relying on a single optimizer, as implied by the No Free Lunch theorem. MAEO uses a parameter-free hypervolume indicator for island performance assessment and a strict Pareto-rank-based individual scoring formulation that incorporates crowding distance and nadir-point proximity to ensure consistent selection pressure within each front. The framework is initiated using four algorithms (NSGA-III, CTAEA, AGEMOEA2, SPEA2) and evaluated through extensive benchmarking on 12 DTLZ/ZDT functions under 36 dimensionality settings using Wilcoxon signed-rank tests with both hypervolume and inverse generational distance metrics. Results show that MAEO achieves balanced convergence-diversity performance, outperforming or matching some of the leading multiobjective optimization algorithms across different benchmark problems. To demonstrate practical applicability, MAEO is applied to the equilibrium-cycle optimization of a small modular nuclear reactor. Eight discrete design variables and three objectives (levelized cost of electricity, peak soluble boron concentration, fuel cycle length) are optimized under two safety constraints. The algorithm carried out roughly 40,000 evaluations using computer simulations. MAEO identifies core designs that lower both the levelized cost of electricity and the peak boron concentration, while preserving fuel cycle length and meeting all safety constraints.
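
The Pareto ranking that this abstract builds on starts from the set of non-dominated solutions (the first front). The sketch below shows only that basic step, assuming every objective is minimized (an objective to be maximized can be negated first). It is not MAEO's scoring formulation, which per the abstract also uses crowding distance and nadir-point proximity. The function names `dominates` and `pareto_front` are hypothetical.

```python
def dominates(a, b):
    """True if objective vector `a` Pareto-dominates `b` (all objectives minimized).

    `a` must be no worse than `b` in every objective and strictly
    better in at least one.
    """
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(points):
    """Return the points that no other point dominates, in input order."""
    return [p for p in points if not any(dominates(q, p) for q in points)]
```

This filter compares every pair of points, so its cost grows quadratically with population size; that is acceptable for a sketch but not a claim about how any of the named algorithms implement sorting.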

2604.26972 2026-05-01 cs.CC cs.LG

How Hard Is Continuous Clustering? Lower Bounds from the Existential Theory of the Reals

Angshul Majumdar

2604.26970 2026-05-01 cs.IR cs.AI cs.LG q-bio.QM

Not All Memories Age the Same: Autodiscovery of Adaptive Decay in Knowledge Graphs

Mandar Karhade

Comments 27 pages, 2 figures, 19 tables (including appendix). Preprint under review

Abstract

Knowledge graphs used for retrieval treat all facts as equally current. Existing temporal approaches apply uniform decay, using a single forgetting curve regardless of knowledge type. We show this is fundamentally misspecified: different knowledge types exhibit different temporal dynamics, and the core retrieval problem is not latency or throughput but identifying what is important at query time. We propose a hierarchical framework that replaces uniform decay with a continuous decay surface parameterized by two orthogonal signals: velocity (how frequently a concept is observed) and volatility (how much the value changes between observations, measured via embedding distance). The decay surface is decomposed into three learnable levels: domain-level parameters capture universal patterns (some predicates are inherently permanent, others inherently transient), context-level parameters capture setting-dependent variation, and entity-level adaptation personalizes decay to specific subjects. All parameters emerge from data through survival analysis on observed value lifetimes, requiring no predefined taxonomies or domain expertise. We formulate edge lifetime as a survival problem where the event is value supersession (a meaningfully different value replacing the current one), distinct from mere re-observation. Experiments on synthetic temporal knowledge graphs demonstrate recovery of planted hierarchical parameters (HDBSCAN ARI = 1.0). Validation on 107 Wikipedia articles and 1,163 patient records from the Synthea clinical EHR simulator shows that velocity-volatility clusters emerge naturally, align with observable persistence patterns, and near-universally exhibit the Lindy effect (Weibull shape k < 1). Uniform decay performs 18x worse than no temporal weighting. Heterogeneous decay recovers from this, with each hierarchy level contributing measurable improvement.
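
The Lindy effect mentioned in this abstract corresponds to a Weibull shape parameter k < 1, for which the hazard (here, the instantaneous rate of value supersession) falls as a value ages. The sketch below uses the standard Weibull survival and hazard formulas to show this. It does not reproduce the paper's hierarchical velocity-volatility decay surface, and the function names and any parameter values passed to them are illustrative assumptions.

```python
import math


def weibull_survival(t, shape, scale):
    """Probability a value is still current at age t: S(t) = exp(-(t/scale)**shape)."""
    return math.exp(-((t / scale) ** shape))


def weibull_hazard(t, shape, scale):
    """Supersession rate at age t > 0: h(t) = (shape/scale) * (t/scale)**(shape - 1)."""
    return (shape / scale) * (t / scale) ** (shape - 1)
```

With `shape < 1` the hazard decreases in `t`, so a value that has already survived a long time becomes less likely to be replaced soon; with `shape == 1` the hazard is constant, which is the single exponential forgetting curve that a uniform-decay scheme would assume.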

2604.26968 2026-05-01 cs.AR cs.AI cs.DC cs.PF

Predictive Multi-Tier Memory Management for KV Cache in Large-Scale GPU Inference

Sanjeev Rao Ganjihal

Comments 9 pages, 9 tables, 1 figure. Under review at a systems conference

2604.26965 2026-05-01 cs.CY cs.AI cs.SI

The Impact of AI-Generated Text on the Internet

Jonas Dolezal, Sawood Alam, Mark Graham, Maty Bohacek

2604.26964 2026-05-01 cs.CY cs.AI cs.LG

Learning-to-Explain through 20Q Gaming: An Explainable Recommender for Cybersecurity Education

Mary Nusrat, Sarfuddin Bhuiyan, Gahangir Hossain

2604.26960 2026-05-01 cs.CY cs.AI

LLM Biases

Jinhui Han, Ming Hu, Xilin Zhang

Abstract

Transformer-based agentic AI is rapidly being deployed on major platforms to help users shop, watch, and navigate content with less effort. While these systems can deliver impressive performance, a key concern is whether they may be less reliable than they appear. We ask a simple but fundamental question: can the mechanisms that make transformer-based agents effective also induce systematic biases or distortions? We study this question through a theoretical analysis of transformer-based generative recommenders, in which the next user interaction is generated sequentially from the user history. Focusing on how the model allocates attention across historical evidence, we identify four bias channels: (i) Positional bias: stronger positional encoding shifts influence toward recent history, improving responsiveness but potentially reducing stability and long-term diversity; (ii) Popularity amplification: small frequency differences in data can be magnified into disproportionate exposure, contributing to Matthew effects and echo chambers; (iii) Latent driver bias: when important drivers of user choices are not directly observed, the model can place overly concentrated weight on a small subset of past events, creating overconfident attributions; and (iv) Synthetic data bias: when users increasingly follow AI suggestions and platforms retrain on model-shaped synthetic logs, outputs can concentrate over time, and long-tail alternatives can disappear first. Our analysis highlights mechanism-level reliability risks that may not be visible in offline performance metrics. The four bias channels indicate that large-scale deployment may systematically distort exposure and choice. For managers, the immediate implication is to treat these as operational risk factors and to monitor concentration and drift over time, rather than assuming that performance gains alone guarantee reliability.
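
The popularity amplification channel described in this abstract can be illustrated with a toy softmax model. This is an assumption made for illustration, not the paper's analysis: item scores are taken to be `sharpness * log(frequency)`, so exposure shares are proportional to `frequency ** sharpness`. The function name `exposure_shares` and the parameter `sharpness` are hypothetical.

```python
import math


def exposure_shares(frequencies, sharpness):
    """Softmax over sharpness * log(frequency).

    The result equals frequency**sharpness normalized to sum to 1, so
    sharpness > 1 widens the gap between popular and unpopular items.
    All frequencies are assumed to be strictly positive.
    """
    logits = [sharpness * math.log(f) for f in frequencies]
    peak = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(z - peak) for z in logits]
    total = sum(weights)
    return [w / total for w in weights]
```

Under this toy model, two items with data frequencies of 0.52 and 0.48 keep those shares when `sharpness` is 1, while a larger `sharpness` pushes the more frequent item's share well above its share of the data.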

2604.26959 2026-05-01 cs.CY cs.AI cs.MA

CareGuardAI: Context-Aware Multi-Agent Guardrails for Clinical Safety & Hallucination Mitigation in Patient-Facing LLMs

Elham Nasarian, Abhilash Neog, Kwok-Leung Tsui, Niyousha HosseiniChimeh

2604.26958 2026-05-01 cs.CY cs.AI

Designing Ethical Learning for Agentic AI: Toegye Yi Hwang's Ethical Emotion Regulation Framework

Ji Yeon Kim

2604.26957 2026-05-01 cs.CY cs.AI

Simulating Validity: Modal Decoupling in MLLM Generated Feedback on Science Drawings

Arne Bewersdorff, Nejla Yuruk, Xiaoming Zhai

Comments Accepted as AIED Short Paper 2026, Seoul, South Korea. Submission #1147. This is the long paper version

2604.26956 2026-05-01 cs.CY cs.AI cs.HC

Can AI be a moral victim? The role of moral patiency and ownership perceptions in ethical judgments of using AI-generated content

Hyesun Choung, Soojong Kim

Comments Honourable Mention Award, ACM CHI 2026

2604.26955 2026-05-01 cs.CY cs.AI

Policy-Governed LLM Routing with Intent Matching for Instrument Laboratories

Emmanuel A. Olowe, Danial Chitnis

Comments IEEE EduCon

2604.26954 2026-05-01 cs.CY cs.AI

The Impact of LLM Self-Consistency and Reasoning Effort on Automated Scoring Accuracy and Cost

Scott Frohn

Comments 14 pages, 10 tables, 2 figures. Presented at the 2026 National Council on Measurement in Education (NCME) Annual Meeting, April 11, 2026, Los Angeles, CA

2604.26103 2026-05-01 cs.AR cs.AI cs.DC cs.LG

AMMA: A Multi-Chiplet Memory-Centric Architecture for Low-Latency 1M Context Attention Serving

Zhongkai Yu, Haotian Ye, Chenyang Zhou, Ohm Rishabh Venkatachalam, Zaifeng Pan, Zhengding Hu, Junsung Kim, Won Woo Ro, Po-An Tsai, Shuyi Pei, Yangwook Kang, Yufei Ding

2604.25977 2026-05-01 econ.EM cs.AI cs.LG q-fin.PM

Auditing Marketing Budget Allocation with Hindsight Regret

Nilavra Pathak, Olivier Jeunen, Eric Lambert

Comments 6 pages, 8 figures

2604.25711 2026-05-01 cs.SE cs.AI

Learning Generalizable Multimodal Representations for Software Vulnerability Detection

Zeming Dong, Yuejun Guo, Qiang Hu, Yao Zhang, Maxime Cordy, Hao Liu, Mike Papadakis, Yongqiang Lyu

2604.23341 2026-05-01 cs.CR cs.AI

Evaluating Jailbreaking Vulnerabilities in LLMs Deployed as Assistants for Smart Grid Operations: A Benchmark Against NERC Standards

Taha Hammadia, Lucas Rea, Ahmad Mohammad Saber, Amr Youssef, Deepa Kundur

2604.20577 2026-05-01 cs.SE cs.LG

Evaluating Assurance Cases as Text-Attributed Graphs for Structure and Provenance Analysis

Fariz Ikhwantri, Dusica Marijan

Comments 10 pages, 4 figures, 8 tables. Accepted to EASE 2026 AI Models / Data track, Glasgow, United Kingdom. Fixed the captions of Tables 7 and 8

2604.11817 2026-05-01 quant-ph cs.CV

QMC-Net: Data-Aware Quantum Representations for Remote Sensing Image Classification

Md Aminur Hossain, Ayush V. Patel, Biplab Banerjee

Comments 15 pages

2604.11119 2026-05-01 stat.ML cs.LG

DDO-RM: Distribution-Level Policy Improvement after Reward Learning

Tiantian Zhang, Jierui Zuo, Michael Chen, Wenping Wang

Comments 8 pages, 4 figures

2604.09718 2026-05-01 cs.DC cs.AI cs.PL

Agentic Compilation: Mitigating the LLM Rerun Crisis for Minimized-Inference-Cost Web Automation

Jagadeesh Chundru

Comments 12 pages, 4 figures, 2 tables. v2: Expanded literature review and clarified architecture limitations

2604.06616 2026-05-01 cs.DB cs.AI cs.IR

CubeGraph: Efficient Retrieval-Augmented Generation for Spatial and Temporal Data

Mingyu Yang, Wentao Li, Wei Wang

Comments Updated Report

2603.22366 2026-05-01 quant-ph cs.AI cs.LG

Modeling Quantum Federated Autoencoder for Anomaly Detection in IoT Networks

Devashish Chaudhary, Sutharshan Rajasegarar, Shiva Raj Pokhrel

Comments This paper has been accepted at ICOIN 2026

2603.13566 2026-05-01 stat.ML cs.LG

EmDT: Embedding Diffusion Transformer for Tabular Data Generation in Fraud Detection

En-Ya Kuo, Sebastien Motsch

Comments Updated the first page to include the IEEE submission notice required for previously posted electronic preprint versions

2603.10252 2026-05-01 stat.ML cs.LG physics.data-an stat.ME

Bayesian Hierarchical Models and the Maximum Entropy Principle

Brendon J. Brewer

Comments 6 pages, 2 figures. To appear in the proceedings of the 44th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering (MaxEnt 2025), held in Auckland, New Zealand