arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.09158 2026-04-13 cs.HC cs.AI

Structuring versus Problematizing: How LLM-based Agents Scaffold Learning in Diagnostic Reasoning

Fatma Betül Güreş, Tanya Nazaretsky, Seyed Parsa Neshaei, Tanja Käser

Comments 12 pages, 8 figures. Accepted at LAK 2026

详情

DOI: 10.1145/3785022.3785105
Journal ref: In Proceedings of the 16th International Learning Analytics and Knowledge Conference (LAK 2026), April 27-May 1, 2026, Bergen, Norway. ACM, New York, NY, USA

英文摘要

Supporting students in developing diagnostic reasoning is a key challenge across educational domains. Novices often face cognitive biases such as premature closure and over-reliance on heuristics, and they struggle to transfer diagnostic strategies to new cases. Scenario-based learning (SBL) enhanced by Learning Analytics (LA) and large language models (LLM) offers a promising approach by combining realistic case experiences with personalized scaffolding. Yet, how different scaffolding approaches shape reasoning processes remains insufficiently explored. This study introduces PharmaSim Switch, an SBL environment for pharmacy technician training, extended with an LA- and LLM-powered pharmacist agent that implements pedagogical conversations rooted in two theory-driven scaffolding approaches: \emph{structuring} and \emph{problematizing}, as well as a student learning trajectory. In a between-groups experiment, 63 vocational students completed a learning scenario, a near-transfer scenario, and a far-transfer scenario under one of the two scaffolding conditions. Results indicate that both scaffolding approaches were effective in supporting the use of diagnostic strategies. Performance outcomes were primarily influenced by scenario complexity rather than students' prior knowledge or the scaffolding approach used. The structuring approach was associated with more accurate Active and Interactive participation, whereas problematizing elicited more Constructive engagement. These findings underscore the value of combining scaffolding approaches when designing LA- and LLM-based systems to effectively foster diagnostic reasoning.

URL PDF HTML ☆

赞 0 踩 0

2604.09135 2026-04-13 stat.ML cs.LG math.ST stat.ME stat.TH

Identifying Causal Effects Using a Single Proxy Variable

Silvan Vollmer, Niklas Pfister, Sebastian Weichwald

Comments Equal contribution between Pfister and Weichwald

2604.09124 2026-04-13 cs.DC cs.AR cs.LG

MATCHA: Efficient Deployment of Deep Neural Networks on Multi-Accelerator Heterogeneous Edge SoCs

Enrico Russo, Mohamed Amine Hamdi, Alessandro Ottaviano, Francesco Conti, Angelo Garofalo, Daniele Jahier Pagliari, Maurizio Palesi, Luca Benini, Alessio Burrello

Comments Accepted at the 63rd ACM/IEEE Design Automation Conference (DAC26)

2604.09115 2026-04-13 cs.NI cs.RO

"Take Me Home, Wi-Fi Drone": A Drone-based Wireless System for Wilderness Search and Rescue

Weiying Hou, Luca Jiang-Tao Yu, Chenshu Wu

Comments 16 pages, 12 figures, 1 table. Project page: https://aiot-lab.github.io/Wi2SAR

详情

DOI: 10.1145/3795866.3796679
Journal ref: In The 32nd Annual International Conference on Mobile Computing and Networking (MobiCom '26), October 26-30, 2026, Austin, TX, USA. ACM, New York, NY, USA

英文摘要

Wilderness Search and Rescue (WiSAR) represents a longstanding and critical societal challenge, demanding innovative and automatic technological solutions. In this paper, we introduce Wi2SAR, a novel autonomous drone-based wireless system for long-range, through-occlusion WiSAR operations, without relying on existing infrastructure. Our basic insight is to leverage the automatic reconnection behavior of modern Wi-Fi devices to known networks. By mimicking these networks via on-drone Wi-Fi, Wi2SAR uniquely facilitates the discovery and localization of victims through their accompanying mobile devices. Translating this simple idea into a practical system poses substantial technical challenges. Wi2SAR overcomes these challenges via three distinct innovations: (1) a rapid and energy-efficient device discovery mechanism to discover and identify the target victim, (2) a novel RSS-only, long-range direction finding approach using a 3D-printed Luneburg Lens, amplifying the directional signal strength differences and significantly extending the operational range, and (3) an adaptive drone navigation scheme that guides the drone toward the target efficiently. We implement an end-to-end prototype and evaluate Wi2SAR across various mobile devices and real-world wilderness scenarios. Experimental results demonstrate Wi2SAR's high performance, efficiency, and practicality, highlighting its potential to advance autonomous WiSAR solutions. Wi2SAR is open-sourced at https://aiot-lab.github.io/Wi2SAR to facilitate further research and real-world deployment.

URL PDF HTML ☆

赞 0 踩 0

2604.09107 2026-04-13 cs.DC cs.AI

TensorHub: Scalable and Elastic Weight Transfer for LLM RL Training

Chenhao Ye, Huaizheng Zhang, Mingcong Han, Baoquan Zhong, Xiang Li, Qixiang Chen, Xinyi Zhang, Weidong Zhang, Kaihua Jiang, Wang Zhang, He Sun, Wencong Xiao, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau

2604.09104 2026-04-13 cs.CY cs.AI

Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence

Tommy Shaffer Shane, Simon Mylius, Hamish Hobbs

Comments 44 pages, 4 figures, 5 tables (main text). Includes 5 appendices

详情

英文摘要

Scheming, the covert pursuit of misaligned goals by AI systems, represents a potentially catastrophic risk, yet scheming research suffers from significant limitations. In particular, scheming evaluations demonstrate behaviours that may not occur in real-world settings, limiting scientific understanding, hindering policy development, and not enabling real-time detection of loss of control incidents. Real-world evidence is needed, but current monitoring techniques are not effective for this purpose. This paper introduces a novel open-source intelligence (OSINT) methodology for detecting real-world scheming incidents: collecting and analysing transcripts from chatbot conversations or command-line interactions shared online. Analysing over 183,420 transcripts from X (formerly Twitter), we identify 698 real-world scheming-related incidents between October 2025 and March 2026. We observe a statistically significant 4.9x increase in monthly incidents from the first to last month, compared to a 1.7x increase in posts discussing scheming. We find evidence of multiple scheming-related behaviours in real-world deployments previously reported only in experiments, many resulting in real-world harms. While we did not detect catastrophic scheming incidents, the behaviours observed demonstrate concerning precursors, such as willingness to disregard instructions, circumvent safeguards, lie to users, and single-mindedly pursue goals in harmful ways. As AI systems become more capable, these could evolve into more strategic scheming with potentially catastrophic consequences. Our findings demonstrate the viability of transcript-based OSINT as a scalable approach to real-world scheming detection supporting scientific research, policy development, and emergency response. We recommend further investment towards OSINT techniques for monitoring scheming and loss of control.

URL PDF HTML ☆

赞 0 踩 0

2604.09101 2026-04-13 cs.CR cs.AI cs.CV cs.LG

CLIP-Inspector: Model-Level Backdoor Detection for Prompt-Tuned CLIP via OOD Trigger Inversion

Akshit Jindal, Saket Anand, Chetan Arora, Vikram Goyal

Comments 17 pages (8 main + 2 references + 7 supplementary), Accepted to CVPR Findings 2026

2604.09089 2026-04-13 cs.SE cs.AI cs.CR

DeepGuard: Secure Code Generation via Multi-Layer Semantic Aggregation

Li Huang, Zhongxin Liu, Yifan Wu, Tao Yin, Dong Li, Jichao Bi, Nankun Mu, Hongyu Zhang, Meng Yan

Comments ACL 2026 main conference

2604.09048 2026-04-13 cs.DC cs.AI

Watt Counts: Energy-Aware Benchmark for Sustainable LLM Inference on Heterogeneous GPU Architectures

Mauricio Fadel Argerich, Jonathan Fürst, Marta Patiño-Martínez

Comments Under review

2604.09028 2026-04-13 cs.MA cs.LG cs.NI

Plasticity-Enhanced Multi-Agent Mixture of Experts for Dynamic Objective Adaptation in UAVs-Assisted Emergency Communication Networks

Wen Qiu, Zhiqiang He, Wei Zhao, Hiroshi Masui

Comments 20 pages, 12 figures, 3 tables

2604.08979 2026-04-13 cs.HC cs.SD

Accessible Fine-grained Data Representation via Spatial Audio

Can Liu, Wenjie Jiang, Shaolun Ruan, Kotaro Hara, Yong Wang

Comments Accepted by IEEE Computer Graphics and Applications (IEEE CG&A)

2604.08969 2026-04-13 stat.ML cs.LG math.ST stat.TH

Online Quantile Regression for Nonparametric Additive Models

Haoran Zhan

2604.08935 2026-04-13 stat.ML cs.LG

A novel hybrid approach for positive-valued DAG learning

Yao Zhao

Comments 13 pages, 2 tables. Accepted at CLeaR 2026

2604.08920 2026-04-13 cs.IR cs.AI cs.CL cs.LG

Beyond Relevance: Utility-Centric Retrieval in the LLM Era

Hengran Zhang, Minghao Tang, Keping Bi, Jiafeng Guo

Comments Accepted by SIGIR2026

2604.08894 2026-04-13 cs.NE cs.AI cs.CV

Ge$^\text{2}$mS-T: Multi-Dimensional Grouping for Ultra-High Energy Efficiency in Spiking Transformer

Zecheng Hao, Shenghao Xie, Kang Chen, Wenxuan Liu, Zhaofei Yu, Tiejun Huang

2604.08868 2026-04-13 eess.IV cs.AI cs.CV cs.LG

MedFormer-UR: Uncertainty-Routed Transformer for Medical Image Classification

Mohammed Maaz Sibhai, Abedalrhman Alkhateeb, Saad B. Ahmed

2604.08866 2026-04-13 cs.HC cs.AI

AI-Induced Human Responsibility (AIHR) in AI-Human teams

Greg Nyilasy, Brock Bastian, Jennifer Overbeck, Abraham Ryan Ade Putra Hito

2604.08805 2026-04-13 cs.CR cs.AI

Building Better Environments for Autonomous Cyber Defence

Chris Hicks, Elizabeth Bates, Shae McFadden, Isaac Symes Thompson, Myles Foley, Ed Chapman, Nickolas Espinosa Dice, Ankita Samaddar, Joshua Sylvester, Himanshu Neema, Nicholas Butts, Nate Foster, Ahmad Ridley, Zoe M, Paul Jones

2604.08804 2026-04-13 stat.ML cs.LG stat.ME

Policy-Aware Design of Large-Scale Factorial Experiments

Xin Wen, Xi Chen, Will Wei Sun, Yichen Zhang

2604.08803 2026-04-13 cs.CY cs.AI

Scrapyard AI

Marc Böhlen, Sai Krishna

Comments 13 pages, 4 figures, XcoAx 2026 pre-publication

2604.08800 2026-04-13 cs.CR cs.LG

Tracing the Chain: Deep Learning for Stepping-Stone Intrusion Detection

Nate Mathews, Nicholas Hopper, Matthew Wright

2604.08799 2026-04-13 cs.GR cs.CV

MeshOn: Intersection-Free Mesh-to-Mesh Composition

Hyunwoo Kim, Itai Lang, Hadar Averbuch-Elor, Silvia Sellán, Rana Hanocka

Comments Project page: \hyperlink{https://threedle.github.io/MeshOn/}{this https URL}

2604.08781 2026-04-13 eess.IV cs.AI cs.CV eess.SP physics.med-ph

PSIRNet: Deep Learning-based Free-breathing Rapid Acquisition Late Enhancement Imaging

Arda Atalik, Hui Xue, Rhodri H. Davies, Thomas A. Treibel, Daniel K. Sodickson, Michael S. Hansen, Peter Kellman

Comments 25 pages, 5 figures, 4 tables

详情

英文摘要

Purpose: To develop and evaluate a deep learning (DL) method for free-breathing phase-sensitive inversion recovery (PSIR) late gadolinium enhancement (LGE) cardiac MRI that produces diagnostic-quality images from a single acquisition over two heartbeats, eliminating the need for 8 to 24 motion-corrected (MOCO) signal averages. Materials and Methods: Raw data comprising 800,653 slices from 55,917 patients, acquired on 1.5T and 3T scanners across multiple sites from 2016 to 2024, were used in this retrospective study. Data were split by patient: 640,000 slices (42,822 patients) for training and the remainder for validation and testing, without overlap. The training and testing data were from different institutions. PSIRNet, a physics-guided DL network with 845 million parameters, was trained end-to-end to reconstruct PSIR images with surface coil correction from a single interleaved IR/PD acquisition over two heartbeats. Reconstruction quality was evaluated using SSIM, PSNR, and NRMSE against MOCO PSIR references. Two expert cardiologists performed an independent qualitative assessment, scoring image quality on a 5-point Likert scale across bright blood, dark blood, and wideband LGE variants. Paired superiority and equivalence (margin = 0.25 Likert points) were tested using exact Wilcoxon signed-rank tests at a significance level of 0.05 using R version 4.5.2. Results: Both readers rated single-average PSIRNet reconstructions superior to MOCO PSIR for dark blood LGE (conservative P = .002); for bright blood and wideband, one reader rated it superior and the other confirmed equivalence (all P < .001). Inference required approximately 100 msec per slice versus more than 5 sec for MOCO PSIR. Conclusion: PSIRNet produces diagnostic-quality free-breathing PSIR LGE images from a single acquisition, enabling 8- to 24-fold reduction in acquisition time.

URL PDF HTML ☆

赞 0 踩 0

2604.08772 2026-04-13 physics.ao-ph cs.LG

CERBERUS: A Three-Headed Decoder for Vertical Cloud Profiles

Emily K. deJong, Nipun Gunawardena, Kevin Smalley, Hassan Beydoun, Peter Caldwell

Comments Accepted for oral presentation at 2026 ICLR workshop on Machine Learning for Remote Sensing

2604.08763 2026-04-13 quant-ph cs.LG cs.NA math.NA

Weak Adversarial Neural Pushforward Method for the Wigner Transport Equation

Andrew Qing He, Wei Cai, Sihong Shao

Comments 9 pages, 1 algorithm

2604.08759 2026-04-13 cs.IT cs.CL math.IT

Optimal Multi-bit Generative Watermarking Schemes Under Worst-Case False-Alarm Constraints

Yu-Shin Huang, Chao Tian, Krishna Narayanan

Comments 41 pages, 8 tables

2604.08755 2026-04-13 cs.CE cs.LG stat.ML

Accurate and Reliable Uncertainty Estimates for Deterministic Predictions Extensions to Under and Overpredictions

Rileigh Bandy, Enrico Camporeale, Andong Hu, Thomas Berger, Rebecca Morrison

2604.08744 2026-04-13 physics.chem-ph cond-mat.mtrl-sci cs.LG physics.comp-ph

Active Learning for Generalizable Detonation Performance Prediction of Energetic Materials

R. Seaton Ullberg, Megan C. Davis, Jeremy N. Schroeder, Andrew H. Salij, M. J. Cawkwell, Christopher J. Snyder, Wilton J. M. Kort-Kamp, Ivana Matanovic

2604.08742 2026-04-13 math.OC cs.LG

Adam-HNAG: A Convergent Reformulation of Adam with Accelerated Rate

Yaxin Yu, Long Chen, Zeyi Xu

Comments 27 pages, 4 figures

2604.08739 2026-04-13 cs.CR cs.LG

RansomTrack: A Hybrid Behavioral Analysis Framework for Ransomware Detection

Busra Caliskan, Ibrahim Gulatas, H. Hakan Kilinc, A. Halim Zaim

Comments 20 pages, 7 figures