arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.13050 2026-04-16 cs.DB cs.LG

Exploring Urban Land Use Patterns by Pattern Mining and Unsupervised Learning

Zdena Dobesova, Tai Dinh, Pavel Novak

详情

英文摘要

Urban areas are intricate systems shaped by socioeconomic, environmental, and infrastructural factors, with land use patterns serving as aspects of urban morphology. This paper proposes a novel methodology leveraging frequent item set mining and unsupervised learning techniques to identify similar cities based on co-occurring land use patterns. The Copernicus program's Urban Atlas data are used as source data. The methodology involves data preprocessing, pattern mining using the negFIN algorithm, postprocessing, and knowledge extraction and visualization. The preprocessing of spatial datasets results in a publicly available transaction dataset. The framework is scalable and the source code is made publicly available.

URL PDF HTML ☆

赞 0 踩 0

2604.13049 2026-04-16 cs.SI cs.AI

Hijacking online reviews: sparse manipulation and behavioral buffering in popularity-biased rating systems

Itsuki Fujisaki, Kunhao Yang

Comments 18page, 3figures

2604.13048 2026-04-16 cs.DB cs.AI cs.SE

From Natural Language to PromQL: A Catalog-Driven Framework with Dynamic Temporal Resolution for Cloud-Native Observability

Twinkll Sisodia

Comments 15 pages, 7 tables, 1 figure

2604.13047 2026-04-16 cs.SI cs.AI cs.CY

Integration of Deep Reinforcement Learning and Agent-based Simulation to Explore Strategies Counteracting Information Disorder

Luigi Lomasto, Andrea Camoia, Alfonso Guarino, Nicola Lettieri, Delfina Malandrino, Rocco Zaccagnino

2604.13046 2026-04-16 cs.DB cs.CL cs.IR cs.LG cs.PL

A Domain-Specific Language for LLM-Driven Trigger Generation in Multimodal Data Collection

Philipp Reis, Philipp Rigoll, Martin Zehetner, Jacqueline Henle, Stefan Otten, Eric Sax

Comments Version submitted to the IEEE International Conference on Intelligent Transportation Systems (ITSC 2026)

2604.13042 2026-04-16 cs.DB cs.AI cs.SE

A Pythonic Functional Approach for Semantic Data Harmonisation in the ILIAD Project

Erik Johan Nystad, Francisco Martín-Recuerda

Comments 17 pages, 9 figures

2604.13041 2026-04-16 cs.DB cs.AI

TableNet A Large-Scale Table Dataset with LLM-Powered Autonomous

Ruilin Zhang, Kai Yang

Comments The 40th Annual AAAI Conference on Artificial Intelligence Bridge Program on Logic & AI

详情

英文摘要

Table Structure Recognition (TSR) requires the logical reasoning ability of large language models (LLMs) to handle complex table layouts, but current datasets are limited in scale and quality, hindering effective use of this reasoning capacity. We thus present TableNet dataset, a new table structure recognition dataset collected and generated through multiple sources. Central to our approach is the first LLM-powered autonomous table generation and recognition multi-agent system that we developed. The generation part of our system integrates controllable visual, structural, and semantic parameters into the synthesis of table images. It facilitates the creation of a wide array of semantically coherent tables, adaptable to user-defined configurations along with annotations, thereby supporting large-scale and detailed dataset construction. This capability enables a comprehensive and nuanced table image annotation taxonomy, potentially advancing research in table-related domains. In contrast to traditional data collection methods, This approach facilitates the theoretically infinite, domain-agnostic, and style-flexible generation of table images, ensuring both efficiency and precision. The recognition part of our system is a diversity-based active learning paradigm that utilizes tables from multiple sources and selectively samples most informative data to finetune a model, achieving a competitive performance on TableNet test set while reducing training samples by a large margin compared with baselines, and a much higher performance on web-crawled real-world tables compared with models trained on predominant table datasets. To the best of our knowledge, this is the first work which employs active learning into the structure recognition of tables which is diverse in numbers of rows or columns, merged cells, cell contents, etc, which fits better for diversity-based active learning.

URL PDF HTML ☆

赞 0 踩 0

2604.13037 2026-04-16 cs.DB cs.AI

OVT-MLCS: An Online Visual Tool for MLCS Mining from Long or Big Sequences

Zhi Wang, Yanni Li, Tihua Duan, Bing Liu, Liyong Zhang, Hui Li

2604.12737 2026-04-16 cs.CR cs.LG

Evaluating Differential Privacy Against Membership Inference in Federated Learning: Insights from the NIST Genomics Red Team Challenge

Gustavo de Carvalho Bertoli

Comments 21 pages

2604.11671 2026-04-16 eess.SP cs.RO

VLMaterial: Vision-Language Model-Based Camera-Radar Fusion for Physics-Grounded Material Identification

Jiangyou Zhu, He Chen

详情

英文摘要

Accurate material recognition is a fundamental capability for intelligent perception systems to interact safely and effectively with the physical world. For instance, distinguishing visually similar objects like glass and plastic cups is critical for safety but challenging for vision-based methods due to specular reflections, transparency, and visual deception. While millimeter-wave (mmWave) radar offers robust material sensing regardless of lighting, existing camera-radar fusion methods are limited to closed-set categories and lack semantic interpretability. In this paper, we introduce VLMaterial, a training-free framework that fuses vision-language models (VLMs) with domain-specific radar knowledge for physics-grounded material identification. First, we propose a dual-pipeline architecture: an optical pipeline uses the segment anything model and VLM for material candidate proposals, while an electromagnetic characterization pipeline extracts the intrinsic dielectric constant from radar signals via an effective peak reflection cell area (PRCA) method and weighted vector synthesis. Second, we employ a context-augmented generation (CAG) strategy to equip the VLM with radar-specific physical knowledge, enabling it to interpret electromagnetic parameters as stable references. Third, an adaptive fusion mechanism is introduced to intelligently integrate outputs from both sensors by resolving cross-modal conflicts based on uncertainty estimation. We evaluated VLMaterial in over 120 real-world experiments involving 41 diverse everyday objects and 4 typical visually deceptive counterfeits across varying environments. Experimental results demonstrate that VLMaterial achieves a recognition accuracy of 96.08%, delivering performance on par with state-of-the-art closed-set benchmarks while eliminating the need for extensive task-specific data collection and training.

URL PDF HTML ☆

赞 0 踩 0

2604.11165 2026-04-16 stat.ML cs.AI cs.LG math.ST stat.TH

Cost-optimal Sequential Testing via Doubly Robust Q-learning

Doudou Zhou, Yiran Zhang, Dian Jin, Yingye Zheng, Lu Tian, Tianxi Cai

2604.09752 2026-04-16 cs.DC cs.AI

A-IO: Adaptive Inference Orchestration for Memory-Bound NPUs

Chen Zhang, Yan Ding, Haotian Wang, Chubo Liu, Keqin Li, Kenli Li

2604.09613 2026-04-16 cs.DC cs.AI

Token-Budget-Aware Pool Routing for Cost-Efficient LLM Inference

Huamin Chen, Xunzhuo Liu, Junchen Jiang, Bowei He, Xue Liu

Comments duplicate of arXiv:2604.08075

2604.08791 2026-04-16 cs.NI cs.AI

eBandit: Kernel-Driven Reinforcement Learning for Adaptive Video Streaming

Mahdi Alizadeh

2604.07662 2026-04-16 math.OC cs.LG

Parameter-Free Non-Ergodic Extragradient Algorithms for Solving Monotone Variational Inequalities

Lingqing Shen, Fatma Kılınç-Karzan

2604.02811 2026-04-16 cs.AR cs.AI

ChatSVA: Bridging SVA Generation for Hardware Verification via Task-Specific LLMs

Lik Tung Fu, Jie Zhou, Shaokai Ren, Mengli Zhang, Jia Xiong, Hugo Jiang, Nan Guan, Xi Wang, Jun Yang

Comments Accepted by DAC 2026

2603.29088 2026-04-16 cs.SE cs.AI

WybeCoder: Verified Imperative Code Generation

Fabian Gloeckle, Mantas Baksys, Darius Feher, Kunhao Zheng, Amaury Hayat, Sean B. Holden, Gabriel Synnaeve, Peter O'Hearn

2603.27306 2026-04-16 cs.MA cs.AI cs.SY eess.SY

GUIDE: Guided Updates for In-context Decision Evolution in LLM-Driven Spacecraft Operations

Alejandro Carrasco, Mariko Storey-Matsutani, Victor Rodriguez-Fernandez, Richard Linares

Comments Accepted to AI4Space@CVPR Workshop in CVPR 2026

2603.25749 2026-04-16 eess.SP cs.AI cs.LG

A Lightweight, Transferable, and Self-Adaptive Framework for Intelligent DC Arc-Fault Detection in Photovoltaic Systems

Xiaoke Yang, Long Gao, Haoyu He, Hanyuan Hang, Qi Liu, Shuai Zhao, Qiantu Tuo, Rui Li

Comments 10 pages, 13 figures

2603.24654 2026-04-16 quant-ph cs.LG stat.ML

Spectral methods: crucial for machine learning, natural for quantum computers?

Vasilis Belis, Joseph Bowles, Rishabh Gupta, Evan Peters, Maria Schuld

Comments 25 pages, 8 figures

2603.23682 2026-04-16 cs.HC cs.AI

Assessment Design in the AI Era: A Method for Identifying Items Functioning Differentially for Humans and Chatbots

Licol Zeinfeld, Alona Strugatski, Ziva Bar-Dov, Ron Blonder, Shelley Rap, Giora Alexandron

2603.20340 2026-04-16 cs.SE cs.AI

ContractSkill: Repairable Contract-Based Skills for Multimodal Web Agents

Zijian Lu, Yiping Zuo, Yupeng Nie, Xin He, Weibei Fan, Lianyong Qi, Shi Jin

Comments 10 pages, 4 figures, 6 tables

2603.15970 2026-04-16 cs.DB cs.AI

100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

Yeounoh Chung, Rushabh Desai, Jian He, Yu Xiao, Thibaud Hottelier, Yves-Laurent Kom Samo, Pushkar Khadilkar, Xianshun Chen, Sam Idicula, Fatma Özcan, Alon Halevy, Yannis Papakonstantinou

2603.11875 2026-04-16 cs.CR cs.AI

The Mirror Design Pattern: Strict Data Geometry over Model Scale for Prompt Injection Detection

J Alex Corll

2603.05957 2026-04-16 cs.DC cs.AI

Domain-Adaptive Model Merging Across Disconnected Modes

Junming Liu, Yusen Zhang, Rongchao Zhang, Wenkai Zhu, Tian Wu

Comments 5 pages, 1 figure, 3 tables; Accepted by ICASSP 2026

2603.03959 2026-04-16 cs.SE cs.LG

LoRA-MME: Multi-Model Ensemble of LoRA-Tuned Encoders for Code Comment Classification

Md Akib Haider, Ahsan Bulbul, Nafis Fuad Shahid, Aimaan Ahmed, Mohammad Ishrak Abedin

Comments Accepted at the ICSE co-located Workshop NLBSE 2026

2602.15472 2026-04-16 physics.flu-dyn cs.LG

Fluids You Can Trust: Property-Preserving Operator Learning for Incompressible Flows

Ramansh Sharma, Matthew Lowery, Houman Owhadi, Varun Shankar

2602.13156 2026-04-16 cs.CR cs.AI

In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach

Yiran Gao, Kim Hammar, Tao Li

Comments 2026 AAAI Summer Symposium on Human-Aware AI Agents for the Cyber Battlefield

2512.20481 2026-04-16 q-bio.NC cs.CL

Coherence in the brain unfolds across separable temporal regimes

Davide Staub, Finn Rabe, Akhil Misra, Yves Pauli, Roya Hüppi, Ni Yang, Nils Lang, Lars Michels, Victoria Edkins, Sascha Frühholz, Iris Sommer, Wolfram Hinzen, Philipp Homan

2512.09953 2026-04-16 cs.CR cs.AI cs.LG

ZK-APEX: Zero-Knowledge Approximate Personalized Unlearning with Executable Proofs

Mohammad M Maheri, Sunil Cotterill, Alex Davidson, Hamed Haddadi

Comments Accepted at the 9th Conference on Machine Learning and Systems (MLSys 2026)