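arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and other fields.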

2604.06186 2026-04-09 cs.HC cs.AI

Full State-Space Visualisation of the 8-Puzzle: Feasibility, Design, and Educational Use

Ian Frank, Kanata Kawanishi

Comments This is a preprint of a paper accepted to IEEE ITET 2026

Details

English abstract

Search algorithms are a foundational topic in artificial intelligence education, yet even simple domains can generate large state spaces that challenge learners' ability to form accurate mental models. This paper presents an interactive learning system that demonstrates the feasibility of visualising the entire reachable state space of the 8-puzzle (181,440 states), while tightly coupling abstract graph structure with concrete puzzle manipulation. Built using Unity and modern GPU-based rendering techniques, the system enables real-time exploration of global structure, step-by-step execution of search algorithms, and direct comparison of how different strategies traverse the same space. We describe the system's design, visualisation layouts, and educational use, reporting findings from an initial classroom deployment and pilot study with students at different levels of university education. Overall, the results indicate that full state-space visualisation is both technically feasible and educationally valuable for supporting conceptual understanding of search behaviour within this canonical problem domain.
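The figure of 181,440 reachable states quoted in the abstract is half of 9! = 362,880, since only half of all tile permutations can be reached from any given configuration. The sketch below checks that count with a plain breadth-first search. It is an illustration only and is not taken from the paper's Unity system; the tuple encoding, the goal layout, and the function names `neighbours` and `reachable_states` are assumptions made for this example.

```python
from collections import deque

# Hypothetical encoding: a state is a 9-tuple read row by row, 0 marks the blank.
GOAL = (1, 2, 3, 4, 5, 6, 7, 8, 0)


def neighbours(state):
    """Yield every state reachable by sliding one tile into the blank."""
    blank = state.index(0)
    row, col = divmod(blank, 3)
    for d_row, d_col in ((-1, 0), (1, 0), (0, -1), (0, 1)):
        r, c = row + d_row, col + d_col
        if 0 <= r < 3 and 0 <= c < 3:
            target = r * 3 + c
            nxt = list(state)
            nxt[blank], nxt[target] = nxt[target], nxt[blank]
            yield tuple(nxt)


def reachable_states(start=GOAL):
    """Breadth-first search returning {state: depth} for all reachable states."""
    depth = {start: 0}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        for nxt in neighbours(state):
            if nxt not in depth:
                depth[nxt] = depth[state] + 1
                queue.append(nxt)
    return depth


depths = reachable_states()
print(len(depths))  # 181440
```

Because the search stores a depth for every state, the same dictionary could feed a layered layout in which each state is placed according to its distance from the goal, although whether the paper's layouts work this way is not stated in the abstract.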

2604.06185 2026-04-09 cs.HC cs.AI cs.CL

Benchmarking LLM Tool-Use in the Wild

Peijie Yu, Wei Liu, Yifan Yang, Jinjian Li, Zelong Zhang, Xiao Feng, Feng Zhang

Comments accepted by ICLR 2026

2604.06184 2026-04-09 cs.HC cs.AI

A Goal-Oriented Chatbot for Engaging the Elderly Through Family Photo Conversations

Raymond Chung, Keith Ng, CD Shum

Comments Accepted at 2025 IEEE 49th Annual Computers, Software, and Applications Conference (COMPSAC)

2604.06182 2026-04-09 cs.HC cs.AI

VenusBench-Mobile: A Challenging and User-Centric Benchmark for Mobile GUI Agents with Capability Diagnostics

Yichen Gong, Zhuohan Cai, Sunhao Dai, Yuqi Zhou, Zhangxuan Gu, Changhua Meng, Shuheng Shen

2604.06180 2026-04-09 eess.IV cs.CV cs.LG cs.MA

MedRoute: RL-Based Dynamic Specialist Routing in Multi-Agent Medical Diagnosis

Ashmal Vayani, Parth Parag Kulkarni, Joseph Fioresi, Song Wang, Mubarak Shah

2604.06179 2026-04-09 cs.IR cs.CL

ARIA: Adaptive Retrieval Intelligence Assistant -- A Multimodal RAG Framework for Domain-Specific Engineering Education

Yue Luo, Dibakar Roy Sarkar, Rachel Herring Sangree, Somdatta Goswami

2604.06177 2026-04-09 cs.IR cs.AI cs.CL

WebExpert: domain-aware web agents with critic-guided expert experience for high-precision search

Yuelin Hu, Zhengxue Cheng, Ronghua Wu, Qunshan Gu, Hongwei Hu, Wei Liu, Qiao Liang, Li Song

Comments Accepted by ICASSP 2026

2604.06176 2026-04-09 cs.IR cs.AI cs.CL

Robustness Risk of Conversational Retrieval: Identifying and Mitigating Noise Sensitivity in Qwen3-Embedding Model

Weishu Chen, Zhouhui Hou, Mingjie Zhan, Zhicheng Zhao, Fei Su

2604.06172 2026-04-09 cs.IR cs.AI

EviSnap: Faithful Evidence-Cited Explanations for Cold-Start Cross-Domain Recommendation

Yingjun Dai, Ahmed El-Roby

Comments 8 pages

2604.06098 2026-04-09 cs.IR cs.CL

JUÁ -- A Benchmark for Information Retrieval in Brazilian Legal Text Collections

Jayr Pereira, Leandro Fernandes, Erick de Brito, Roberto Lotufo, Luiz Bonifacio

2604.06018 2026-04-09 cs.CY cs.AI cs.HC

Governance and Regulation of Artificial Intelligence in Developing Countries: A Case Study of Nigeria

Uloma Okoro, Tammy Mackenzie, Branislav Radeljic

2604.05429 2026-04-09 eess.SY cs.AI cs.CL cs.SY

Bridging Natural Language and Microgrid Dynamics: A Context-Aware Simulator and Dataset

Tinko Sebastian Bartels, Ruixiang Wu, Xinyu Lu, Yikai Lu, Fanzeng Xia, Haoxiang Yang, Yue Chen, Tongxin Li

2604.03486 2026-04-09 cs.HC cs.AI cs.CV cs.LG cs.MA

VisionClaw: Always-On AI Agents through Smart Glasses

Xiaoan Liu, DaeHo Lee, Eric J Gonzalez, Mar Gonzalez-Franco, Ryo Suzuki

Comments 17 pages, 11 figures, plus appendix

2603.29660 2026-04-09 astro-ph.IM cs.CV

STRADAViT: Towards a Foundational Model for Radio Astronomy through Self-Supervised Transfer

Andrea DeMarco, Ian Fenech Conti, Hayley Camilleri, Ardiana Bushi, Simone Riggi

Comments 19 pages

Details

English abstract

Next-generation radio astronomy surveys are delivering millions of resolved sources, but robust and scalable morphology analysis remains difficult across heterogeneous telescopes and imaging pipelines. We present STRADAViT, a self-supervised Vision Transformer continued-pretraining framework for learning transferable encoders from radio astronomy imagery. The framework combines mixed-survey data curation, radio astronomy-aware training-view generation, and a ViT-MAE-initialized encoder family with optional register tokens. It supports reconstruction-only, contrastive-only, and two-stage branches. Our pretraining dataset comprises radio astronomy cutouts drawn from four complementary sources. We evaluate transfer with linear probing and fine-tuning on three morphology benchmarks spanning binary and multi-class settings. Relative to the ViT-MAE initialization used for continued pretraining, the best two-stage models improve Macro-F1 in all reported linear-probe settings and in two of three fine-tuning settings, with the largest gain on RGZ DR1. Relative to DINOv2, gains are selective rather than universal: the best two-stage models achieve higher mean Macro-F1 than the strongest DINOv2 baseline on LoTSS DR2 and RGZ DR1 under linear probing, and on MiraBest and RGZ DR1 under fine-tuning. A targeted DINOv2 initialization ablation further indicates that the adaptation recipe is not specific to the ViT-MAE starting point and that, under the same recipe. The ViT-MAE-based STRADAViT checkpoint is retained as the released checkpoint because it combines competitive transfer with substantially lower token count and downstream cost than the DINOv2-based alternative. These results indicate that radio astronomy-aware view generation and staged continued pretraining can provide a stronger domain-adapted starting point than off-the-shelf ViT checkpoints for radio astronomy transfer.
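The abstract reports its transfer results as Macro-F1. For readers unfamiliar with the metric, the sketch below computes it in the usual way, as the unweighted mean of per-class F1 scores. The abstract does not say which implementation the authors used, so this is an assumption; the function name `macro_f1` and the toy labels are hypothetical and are not drawn from the paper's benchmarks.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); treated as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical three-class toy example (not data from the paper).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
print(round(macro_f1(y_true, y_pred), 4))  # 0.6556
```

Since every class contributes equally regardless of its size, Macro-F1 penalizes poor performance on rare morphology classes more than plain accuracy does.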

2603.20654 2026-04-09 cs.DC cs.AI cs.AR

Modernizing Amdahl's Law: How AI Scaling Laws Shape Computer Architecture

Chien-Ping Lu

Comments Use: 12 pages, 1 table, 5 figures. arXiv version v4

2603.06257 2026-04-09 stat.ML cs.LG

Robust support vector model based on bounded asymmetric elastic net loss for binary classification

Haiyan Du, Hu Yang

Comments Upon re-examination, we found fundamental flaws in the BAEN-SVM model that undermine our conclusions. The design inadequately addresses geometrical rationality on slack variables, questioning generalizability. Thus, we retract this manuscript. We are exploring a different model and will resubmit after thorough validation. We apologize for any confusion

2602.22220 2026-04-09 cs.IR cs.AI cs.CL

What Makes an Ideal Quote? Recommending "Unexpected yet Rational" Quotations via Novelty

Bowei Zhang, Jin Xiao, Guanglei Yue, Qianyu He, Yanghua Xiao, Deqing Yang, Jiaqing Liang

Comments Accepted to ACL 2026 main conference; code available at <https://github.com/Chang-pw/NoQuote>

2602.18377 2026-04-09 quant-ph cs.LG

Theory and interpretability of Quantum Extreme Learning Machines: a Pauli-transfer matrix approach

Markus Gross, Hans-Martin Rieser

Comments 45 pages, 15 figures

2602.15889 2026-04-09 stat.AP cs.AI cs.CL physics.ed-ph

Daily and Weekly Periodicity in Large Language Model Performance and Its Implications for Research

Paul Tschisgale, Peter Wulff

Comments The Supplementary Information can be found in the OSF repository cited in the Data Availability Statement

2601.16294 2026-04-09 cs.DC cs.AI

Space Filling Curves is All You Need: Communication-Avoiding Matrix Multiplication Made Simple

Evangelos Georganas, Alexander Heinecke, Pradeep Dubey

2601.11680 2026-04-09 eess.IV cs.CV

FourierPET: Deep Fourier-based Unrolled Network for Low-count PET Reconstruction

Zheng Zhang, Hao Tang, Yingying Hu, Zhanli Hu, Jing Qin

Comments Accepted for oral presentation at AAAI 2026

2601.02624 2026-04-09 cs.CR cs.AI

LAsset: An LLM-assisted Security Asset Identification Framework for System-on-Chip (SoC) Verification

Md Ajoad Hasan, Dipayan Saha, Khan Thamid Hasan, Nashmin Alam, Azim Uddin, Sujan Kumar Saha, Mark Tehranipoor, Farimah Farahmandi

Comments This paper will be presented at Design, Automation and Test in Europe Conference (DATE) 2026

2601.01311 2026-04-09 math.OC cs.LG

Concave Certificates: Geometric Framework for Distributionally Robust Risk and Complexity Analysis

Hong T. M. Chu

Comments 32 pages, 10 figures

2512.21389 2026-04-09 physics.med-ph cs.LG physics.app-ph physics.bio-ph

Deep learning-enhanced dual-mode multiplexed optical sensor for point-of-care diagnostics of cardiovascular diseases

Gyeo-Re Han, Merve Eryilmaz, Artem Goncharov, Yuzhu Li, Shun Ye, Aoi Tomoeda, Emily Ngo, Margherita Scussat, Xiao Wang, Zixiang Ji, Max Zhang, Jeffrey J. Hsu, Omai B. Garner, Dino Di Carlo, Aydogan Ozcan

Comments 32 Pages, 6 Figures, 2 Tables

Details

DOI: 10.1038/s41377-026-02275-9
Journal ref: Light: Science & Applications (2026)

English abstract

Rapid and accessible cardiac biomarker testing is essential for the timely diagnosis and risk assessment of myocardial infarction (MI) and heart failure (HF), two interrelated conditions that frequently coexist and drive recurrent hospitalizations with high mortality. However, current laboratory and point-of-care testing systems are limited by long turnaround times, narrow dynamic ranges for the tested biomarkers, and single-analyte formats that fail to capture the complexity of cardiovascular disease. Here, we present a deep learning-enhanced dual-mode multiplexed vertical flow assay (xVFA) with a portable optical reader and a neural network-based quantification pipeline. This optical sensor integrates colorimetric and chemiluminescent detection within a single paper-based cartridge to complementarily cover a large dynamic range (spanning ~6 orders of magnitude) for both low- and high-abundance biomarkers, while maintaining quantitative accuracy. Using 50 uL of serum, the optical sensor simultaneously quantifies cardiac troponin I (cTnI), creatine kinase-MB (CK-MB), and N-terminal pro-B-type natriuretic peptide (NT-proBNP) within 23 min. The xVFA achieves sub-pg/mL sensitivity for cTnI and sub-ng/mL sensitivity for CK-MB and NT-proBNP, spanning the clinically relevant ranges for these biomarkers. Neural network models trained and blindly tested on 92 patient serum samples yielded a robust quantification performance (Pearson's r > 0.96 vs. reference assays). By combining high sensitivity, multiplexing, and automation in a compact and cost-effective optical sensor format, the dual-mode xVFA enables rapid and quantitative cardiovascular diagnostics at the point of care.

2512.14735 2026-04-09 q-fin.CP cs.AI cs.CV

PyFi: Toward Pyramid-like Financial Image Understanding for VLMs via Adversarial Agents

Yuqun Zhang, Yuxuan Zhao, Sijia Chen

2511.18258 2026-04-09 cs.MA cs.AI cs.LG

Hybrid Agentic AI and Multi-Agent Systems in Smart Manufacturing

Mojtaba A. Farahani, Md Irfan Khan, Thorsten Wuest

Details

DOI: 10.1016/j.jmsy.2026.04.002

English abstract

The convergence of agentic AI and multi-agent systems (MAS) enables a new paradigm for intelligent decision making in smart manufacturing systems (SMS). Traditional MAS architectures emphasize distributed coordination and specialized autonomy, while recent advances in agentic AI driven by large language models (LLMs) introduce higher-order reasoning, planning, and tool orchestration capabilities. This paper presents a hybrid agentic AI and multi-agent framework for a prescriptive maintenance (RxM) use case, where LLM-based agents provide strategic orchestration and adaptive reasoning, complemented by rule-based and small language model (SLM) agents performing efficient, domain-specific tasks on the edge. The proposed framework adopts a layered architecture that consists of perception, preprocessing, analytics, and optimization layers, coordinated through an LLM Planner Agent that manages workflow decisions and context retention. Specialized agents autonomously handle schema discovery, intelligent feature analysis, model selection, and prescriptive optimization, while a human-in-the-loop (HITL) interface ensures transparency and auditability of generated maintenance recommendations. This hybrid design supports dynamic model adaptation, cost-efficient maintenance scheduling, and interpretable decision making. An initial proof-of-concept implementation is validated on two industrial manufacturing datasets. The developed framework is modular and extensible, supporting seamless integration of new agents or domain modules as capabilities evolve. The results demonstrate the system's capability to automatically detect schemas, adapt preprocessing pipelines, optimize model performance through adaptive intelligence, and generate actionable, prioritized maintenance recommendations. The framework shows promise in achieving improved robustness, scalability, and explainability for RxM in smart manufacturing, bridging the gap between high-level agentic reasoning and low-level autonomous execution.

2511.07176 2026-04-09 cs.NI cs.CL

Graph Representation-based Model Poisoning on the Heterogeneous Internet of Agents

Hanlin Cai, Houtianfu Wang, Haofan Dong, Kai Li, Sai Zou, Ozgur B. Akan

Comments This paper has been accepted by the IEEE 22nd International Wireless Communications & Mobile Computing Conference (IWCMC 2026, Shanghai, China)

2510.25235 2026-04-09 eess.AS cs.SD

Disentangling peripheral hearing loss from central and cognitive effects on speech intelligibility in older adults

Toshio Irino, Ayako Yamamoto, Fuki Miyazaki

Comments This manuscript was submitted to Speech Communication on April 8, 2026

2510.23642 2026-04-09 cs.SE cs.AI cs.CL cs.PL

VisCoder2: Building Multi-Language Visualization Coding Agents

Yuansheng Ni, Songcheng Cai, Xiangchao Chen, Jiarong Liang, Zhiheng Lyu, Jiaqi Deng, Kai Zou, Ping Nie, Fei Yuan, Xiang Yue, Wenhu Chen

2510.19225 2026-04-09 cs.DC cs.LG

RLBoost: Harvesting Preemptible Resources for Cost-Efficient Reinforcement Learning on LLMs

Yongji Wu, Xueshen Liu, Haizhong Zheng, Juncheng Gu, Beidi Chen, Z. Morley Mao, Arvind Krishnamurthy, Ion Stoica

Details

English abstract

Reinforcement learning (RL) has become essential for unlocking advanced reasoning capabilities in large language models (LLMs). RL workflows involve interleaving rollout and training stages with fundamentally different resource requirements. Rollout typically dominates overall execution time, yet scales efficiently through multiple independent instances. In contrast, training requires tightly-coupled GPUs with full-mesh communication. Existing RL frameworks fall into two categories: co-located and disaggregated architectures. Co-located frameworks fail to address this resource tension by forcing both stages to share the same GPUs. Disaggregated architectures, without modifications of well-established RL algorithms, suffer from resource under-utilization. Meanwhile, preemptible GPU resources, i.e., spot instances on public clouds and spare capacity in production clusters, present significant cost-saving opportunities for accelerating RL workflows, if efficiently harvested for rollout. In this paper, we present RLBoost, a framework for cost-efficient RL training that harvests preemptible GPU resources. Our key insight is that rollout's stateless and embarrassingly parallel nature aligns perfectly with preemptible and often fragmented resources. To efficiently utilize these resources despite frequent and unpredictable availability changes, RLBoost adopts a hybrid architecture with three key techniques: (1) adaptive rollout offload to dynamically adjust workloads on the reserved (on-demand) cluster, (2) pull-based weight transfer that quickly provisions newly available instances, and (3) token-level response collection and migration for efficient preemption handling and continuous load balancing. Extensive experiments show RLBoost increases training throughput by 1.51x-1.97x while improving cost efficiency by 28%-49% compared to using only on-demand GPU resources.
