arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2503.07555 2026-04-02 cs.LG

Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference

Fateme Jamshidi, Mohammad Shahverdikondori, Negar Kiyavash

详情

英文摘要

We study multi-armed bandits under network interference, where each unit's reward depends on its own treatment and those of its neighbors in a given graph. This induces an exponentially large action space, making standard approaches computationally impractical. We propose a novel algorithm that uses the local graph structure to minimize regret. We derive a graph-dependent upper bound on cumulative regret that improves over prior work. Additionally, we provide the first lower bounds for bandits with arbitrary network interference, where each bound involves a distinct structural property of the graph. These bounds show that for both dense and sparse graphs, our algorithm is nearly optimal, with matching upper and lower bounds up to logarithmic factors. When the interference graph is unknown, a variant of our algorithm is Pareto optimal: no algorithm can uniformly outperform it across all instances. We complement our theoretical results with numerical experiments, showing that our approach outperforms the baseline methods.

URL PDF HTML ☆

赞 0 踩 0

2502.21060 2026-04-02 cs.LG cs.IT math.IT

VT-Former: Efffcient Transformer-based Decoder for Varshamov-Tenengolts Codes

Yali Wei, Alan J. X. Guo, Zihui Yan, Yufan Dai, Wenjia Fan

Comments 9 pages, 10 figures, 5 tables

2502.14883 2026-04-02 cs.CV cs.AI

How Blind and Low-Vision Individuals Prefer Large Vision-Language Model-Generated Scene Descriptions

Na Min An, Eunki Kim, Wan Ju Kang, Sangryul Kim, James Thorne, Hyunjung Shim

Comments This paper has been superseded by version 2 of arXiv:2510.00766

2502.01861 2026-04-02 cs.LG stat.ML

Learning Hyperparameters via a Data-Emphasized Variational Objective

Ethan Harvey, Mikhail Petrov, Michael C. Hughes

Comments arXiv admin note: text overlap with arXiv:2410.19675

2502.01556 2026-04-02 cs.LG stat.ML

A Gaussian Process View on Observation Noise and Initialization in Wide Neural Networks

Sergio Calvo-Ordoñez, Jonathan Plenk, Richard Bergna, Alvaro Cartea, Jose Miguel Hernandez-Lobato, Konstantina Palla, Kamil Ciosek

Comments AISTATS 2026, Camera-ready version

2501.09821 2026-04-02 cs.LG math.PR

BN-Pool: Bayesian Nonparametric Pooling for Graphs

Daniele Castellana, Filippo Maria Bianchi

2501.09136 2026-04-02 cs.AI cs.CL cs.IR

Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG

Aditi Singh, Abul Ehtesham, Saket Kumar, Tala Talaei Khoei, Athanasios V. Vasilakos

详情

英文摘要

Large Language Models (LLMs) have advanced artificial intelligence by enabling human-like text generation and natural language understanding. However, their reliance on static training data limits their ability to respond to dynamic, real-time queries, resulting in outdated or inaccurate outputs. Retrieval-Augmented Generation (RAG) has emerged as a solution, enhancing LLMs by integrating real-time data retrieval to provide contextually relevant and up-to-date responses. Despite its promise, traditional RAG systems are constrained by static workflows and lack the adaptability required for multi-step reasoning and complex task management. Agentic Retrieval-Augmented Generation (Agentic RAG) transcends these limitations by embedding autonomous AI agents into the RAG pipeline. These agents leverage agentic design patterns reflection, planning, tool use, and multi-agent collaboration to dynamically manage retrieval strategies, iteratively refine contextual understanding, and adapt workflows through operational structures ranging from sequential steps to adaptive collaboration. This integration enables Agentic RAG systems to deliver flexibility, scalability, and context-awareness across diverse applications. This paper presents an analytical survey of Agentic RAG systems. It traces the evolution of RAG paradigms, introduces a principled taxonomy of Agentic RAG architectures based on agent cardinality, control structure, autonomy, and knowledge representation, and provides a comparative analysis of design trade-offs across existing frameworks. The survey examines applications in healthcare, finance, education, and enterprise document processing, and distills practical lessons for system designers and practitioners. Finally, it identifies key open research challenges related to evaluation, coordination, memory management, efficiency, and governance, outlining directions for future research.

URL PDF HTML ☆

赞 0 踩 0

2412.01114 2026-04-02 cs.LG

Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations

Cevahir Koprulu, Po-han Li, Tianyu Qiu, Ruihan Zhao, Tyler Westenbroek, David Fridovich-Keil, Sandeep Chinchali, Ufuk Topcu

2411.17499 2026-04-02 cs.LG

Time-Series Forecasting in Smart Manufacturing Systems: An Experimental Evaluation of the State-of-the-art Algorithms

Mojtaba A. Farahani, Fadi El Kalach, Austin Harper, M. R. McCormick, Ramy Harik, Thorsten Wuest

详情

DOI: 10.1016/j.rcim.2025.103010
Journal ref: Robotics and Computer-Integrated Manufacturing 95 (2025): 103010

英文摘要

TSF is growing in various domains including manufacturing. Although numerous TSF algorithms have been developed recently, the validation and evaluation of algorithms hold substantial value for researchers and practitioners and are missing. This study aims to fill this gap by evaluating the SoTA TSF algorithms on thirteen manufacturing datasets, focusing on their applicability in manufacturing. Each algorithm was selected based on its TSF category to ensure a representative set of algorithms. The evaluation includes different scenarios to evaluate the models using two problem categories and two forecasting horizons. To evaluate the performance, the WAPE was calculated, and additional post hoc analyses were conducted to assess the significance of observed differences. Only algorithms with codes from open-source libraries were utilized, and no hyperparameter tuning was done. This allowed us to evaluate the algorithms as "out-of-the-box" solutions that can be easily implemented, ensuring their usability within the manufacturing by practitioners with limited technical knowledge. This aligns to facilitate the adoption of these techniques in smart manufacturing systems. Based on the results, transformer and MLP-based architectures demonstrated the best performance with MLP-based architecture winning the most scenarios. For univariate TSF, PatchTST emerged as the most robust, particularly for long-term horizons, while for multivariate problems, MLP-based architectures like N-HITS and TiDE showed superior results. The study revealed that simpler algorithms like XGBoost could outperform complex algorithms in certain tasks. These findings challenge the assumption that more sophisticated models produce better results. Additionally, the research highlighted the importance of computational resource considerations, showing variations in runtime and memory usage across different algorithms.

URL PDF HTML ☆

赞 0 踩 0

2411.13181 2026-04-02 cs.CV cs.AI cs.CY

Cross-Camera Distracted Driver Classification through Feature Disentanglement and Contrastive Learning

Luigi Celona, Simone Bianco, Paolo Napoletano

2410.19899 2026-04-02 cs.CV

Exploring Self-Supervised Learning with U-Net Masked Autoencoders and EfficientNet-B7 for Improved Gastrointestinal Abnormality Classification in Video Capsule Endoscopy

Vamshi Krishna Kancharla, Pavan Kumar Kaveti, Dasari Naga Raju

Comments Capsule Vision 2024 Challenge

2410.09512 2026-04-02 cs.RO

The Indirect Method for Generating Libraries of Optimal Periodic Trajectories and Its Application to Economical Bipedal Walking

Maximilian Raff, Kathrin Flaßkamp, C. David Remy

Comments submitted to the International Journal of Robotics Research (IJRR)

2410.03131 2026-04-02 cs.AI cs.CL cs.LG

Code Comprehension then Auditing for Unsupervised LLM Evaluation

Bhrij Patel, Souradip Chakraborty, Mengdi Wang, Dinesh Manocha, Amrit Singh Bedi

Comments 19 pages

2409.20146 2026-04-02 cs.CV

VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection

Huilin Deng, Hongchen Luo, Wei Zhai, Yang Cao, Yu Kang

2408.07575 2026-04-02 cs.AI math.ST stat.ME stat.TH

A General Framework on Conditions for Constraint-based Causal Learning

Kai Z. Teh, Kayvan Sadeghi, Terry Soo

2408.06788 2026-04-02 cs.CV cs.HC

Visual Neural Decoding via Improved Visual-EEG Semantic Consistency

Hongzhou Chen, Lianghua He, Yihang Liu, Longzhen Yang, Shaohua Shang, MengChu Zhou

2407.01967 2026-04-02 cs.CV

Harnessing the Power of Local Representations for Few-Shot Classification

Shi Tang, Guiming Luo, Xinchen Ye, Zhiyi Xia

2407.01570 2026-04-02 cs.RO cs.AI

Ego-Foresight: Self-supervised Learning of Agent-Aware Representations for Improved RL

Manuel Serra Nunes, Atabak Dehban, Yiannis Demiris, José Santos-Victor

Comments 13 pages, 8 figures, conference

2406.15904 2026-04-02 cs.LG stat.ME stat.ML

Learning When the Concept Shifts: Confounding, Invariance, and Dimension Reduction

Kulunu Dharmakeerthi, YoonHaeng Hur, Tengyuan Liang

2405.15556 2026-04-02 cs.LG cs.CL cs.CR

Certifiably Robust RAG against Retrieval Corruption

Chong Xiang, Tong Wu, Zexuan Zhong, David Wagner, Danqi Chen, Prateek Mittal

2401.14295 2026-04-02 cs.CL cs.AI cs.LG

Demystifying Chains, Trees, and Graphs of Thoughts

Maciej Besta, Florim Memedi, Zhenyu Zhang, Robert Gerstenberger, Guangyuan Piao, Nils Blach, Piotr Nyczyk, Marcin Copik, Grzegorz Kwaśniewski, Jürgen Müller, Lukas Gianinazzi, Ales Kubicek, Hubert Niewiadomski, Aidan O'Mahony, Onur Mutlu, Torsten Hoefler

详情

DOI: 10.1109/TPAMI.2025.3598182
Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 47, Issue 12, pages 10967-10989 (December 2025)

英文摘要

The field of natural language processing (NLP) has witnessed significant progress in recent years, with a notable focus on improving large language models' (LLM) performance through innovative prompting techniques. Among these, prompt engineering coupled with structures has emerged as a promising paradigm, with designs such as Chain-of-Thought, Tree of Thoughts, or Graph of Thoughts, in which the overall LLM reasoning is guided by a structure such as a graph. As illustrated with numerous examples, this paradigm significantly enhances the LLM's capability to solve numerous tasks, ranging from logical or mathematical reasoning to planning or creative writing. To facilitate the understanding of this growing field and pave the way for future developments, we devise a general blueprint for effective and efficient LLM reasoning schemes. For this, we conduct an in-depth analysis of the prompt execution pipeline, clarifying and clearly defining different concepts. We then build the first taxonomy of structure-enhanced LLM reasoning schemes. We focus on identifying fundamental classes of harnessed structures, and we analyze the representations of these structures, algorithms executed with these structures, and many others. We refer to these structures as reasoning topologies, because their representation becomes to a degree spatial, as they are contained within the LLM context. Our study compares existing prompting schemes using the proposed taxonomy, discussing how certain design choices lead to different patterns in performance and cost. We also outline theoretical underpinnings, relationships between prompting and other parts of the LLM ecosystem such as knowledge bases, and the associated research challenges. Our work will help to advance future prompt engineering techniques.

URL PDF HTML ☆

赞 0 踩 0

2401.09244 2026-04-02 cs.CL

Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges

Aiqi Jiang, Arkaitz Zubiaga

Comments 35 pages, 7 figures

2310.20641 2026-04-02 cs.LG

Performance Improvement in Multi-class Classification via Automated Hierarchy Generation and Exploitation through Extended LCPN Schemes

Celal Alagoz

2303.08250 2026-04-02 cs.CV cs.LG

CHEEM: Continual Learning by Reuse, New, Adapt and Skip -- A Hierarchical Exploration-Exploitation Approach

Chinmay Savadikar, Michelle Dai, Tianfu Wu

Comments CVPR 2026

2303.06561 2026-04-02 cs.LG cond-mat.dis-nn math.OC stat.ML

Phase Diagram of Initial Condensation for Two-layer Neural Networks

Zhengan Chen, Yuqing Li, Tao Luo, Zhangchen Zhou, Zhi-Qin John Xu

2604.00324 2026-04-02 cs.LG cs.AI

The Persistent Vulnerability of Aligned AI Systems

Aengus Lynch

Comments PhD thesis, University College London, 2025. 157 pages. Supervised by Ricardo Silva

2604.00323 2026-04-02 cs.CL cs.CY

Large Language Models in the Abuse Detection Pipeline

Suraj Kath, Sanket Badhe, Preet Shah, Ashwin Sampathkumar, Shivani Gupta

2604.00320 2026-04-02 cs.RO cs.SY eess.SY

Hierarchical Motion Planning and Control under Unknown Nonlinear Dynamics via Predicted Reachability

Zhiquan Zhang, Melkior Ornik

2604.00319 2026-04-02 cs.AI cs.MA

Collaborative AI Agents and Critics for Fault Detection and Cause Analysis in Network Telemetry

Syed Eqbal Alam, Zhan Shu

2604.00310 2026-04-02 cs.LG cs.AI

Robust Multimodal Safety via Conditional Decoding

Anurag Kumar, Raghuveer Peri, Jon Burnsky, Alexandru Nelus, Rohit Paturi, Srikanth Vishnubhotla, Yanjun Qi

Comments 8 pages + Appendix section. Submitted to ACL 2026