arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.06862 2026-03-16 cs.CR cs.AI cs.CL

Supporting Artifact Evaluation with LLMs: A Study with Published Security Research Papers

David Heye, Karl Kindermann, Robin Decker, Johannes Lohmöller, Anastasiia Belova, Sandra Geisler, Klaus Wehrle, Jan Pennekamp

详情

DOI: 10.1109/BigData66926.2025.11401815

英文摘要

Artifact Evaluation (AE) is essential for ensuring the transparency and reliability of research, closing the gap between exploratory work and real-world deployment is particularly important in cybersecurity, particularly in IoT and CPSs, where large-scale, heterogeneous, and privacy-sensitive data meet safety-critical actuation. Yet, manual reproducibility checks are time-consuming and do not scale with growing submission volumes. In this work, we demonstrate that Large Language Models (LLMs) can provide powerful support for AE tasks: (i) text-based reproducibility rating, (ii) autonomous sandboxed execution environment preparation, and (iii) assessment of methodological pitfalls. Our reproducibility-assessment toolkit yields an accuracy of over 72% and autonomously sets up execution environments for 28% of runnable cybersecurity artifacts. Our automated pitfall assessment detects seven prevalent pitfalls with high accuracy ($F_1$ > 92%). Hence, the toolkit significantly reduces reviewer effort and, when integrated into established AE processes, could incentivize authors to submit higher-quality and more reproducible artifacts. IoT, CPS, and cybersecurity conferences and workshops may integrate the toolkit into their peer-review processes to support reviewers' decisions on awarding artifact badges, improving the overall sustainability of the process.

URL PDF HTML ☆

赞 0 踩 0

2602.11638 2026-03-16 cs.GR cs.AI

Variation-aware Flexible 3D Gaussian Editing

Hao Qin, Yukai Sun, Meng Wang, Ming Kong, Mengxu Lu, Qiang Zhu

2602.08917 2026-03-16 cs.IR cs.AI

Automatic In-Domain Exemplar Construction and LLM-Based Refinement of Multi-LLM Expansions for Query Expansion

Minghan Li, Ercong Nie, Siqi Zhao, Tongna Chen, Huiping Huang, Guodong Zhou

Comments Preprint. This paper is under consideration at Pattern Recognition Letters

2601.10436 2026-03-16 cs.IR cs.AI

Development of Ontological Knowledge Bases by Leveraging Large Language Models

Le Ngoc Luyen, Marie-Hélène Abel, Philippe Gouspillou

2512.00299 2026-03-16 q-fin.MF cs.LG q-fin.PM q-fin.RM

Stochastic Dominance Constrained Optimization with S-shaped Utilities: Poor-Performance-Region Algorithm and Neural Network

Zeyun Hu, Yang Liu

Comments 30 pages

2510.01930 2026-03-16 stat.ML cond-mat.dis-nn cs.LG

Precise Dynamics of Diagonal Linear Networks: A Unifying Analysis by Dynamical Mean-Field Theory

Sota Nishiyama, Masaaki Imaizumi

Comments 48 pages, accepted at AISTATS 2026 (Spotlight)

2509.26471 2026-03-16 eess.AS cs.AI

On Deepfake Voice Detection -- It's All in the Presentation

Héctor Delgado, Giorgio Ramondetti, Emanuele Dalmasso, Gennady Karvitsky, Daniele Colibro, Haydar Talib

Comments ICASSP 2026. \c{opyright}IEEE Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

2509.12431 2026-03-16 cond-mat.str-el cs.AI cs.LG quant-ph

Neural-Quantum-States Impurity Solver for Quantum Embedding Problems

Yinzhanghao Zhou, Tsung-Han Lee, Ao Chen, Nicola Lanatà, Hong Guo

Comments 10 pages main text, and 4 figures. Note that YinZhangHao Zhou and Zhanghao Zhouyin are the same person, I use them both

2503.14550 2026-03-16 eess.IV cs.AI cs.CV cs.LG

Novel AI-Based Quantification of Breast Arterial Calcification to Predict Cardiovascular Risk

Theodorus Dapamede, Aisha Urooj, Vedant Joshi, Gabrielle Gershon, Frank Li, Mohammadreza Chavoshi, Beatrice Brown-Mulry, Rohan Satya Isaac, Aawez Mansuri, Chad Robichaux, Chadi Ayoub, Reza Arsanjani, Laurence Sperling, Judy Gichoya, Marly van Assen, Charles W. ONeill, Imon Banerjee, Hari Trivedi

2412.14841 2026-03-16 cs.SE cs.AI

Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis

Greta Dolcetti, Vincenzo Arceri, Eleonora Iotti, Sergio Maffeis, Agostino Cortesi, Enea Zaffanella

详情

DOI: 10.1007/s44163-026-01009-5
Journal ref: Discover Artificial Intelligence, 2026

英文摘要

Large Language Models (LLMs) are one of the most promising developments in the field of artificial intelligence, and the software engineering community has readily noticed their potential role in the software development life-cycle. Developers routinely ask LLMs to generate code snippets, increasing productivity but also potentially introducing ownership, privacy, correctness, and security issues. Previous work highlighted how code generated by mainstream commercial LLMs is often not safe, containing vulnerabilities, bugs, and code smells. In this paper, we present a framework that leverages testing and static analysis to assess the quality, and guide the self-improvement, of code generated by general-purpose, open-source LLMs. First, we ask LLMs to generate C code to solve a number of programming tasks. Then we employ ground-truth tests to assess the (in)correctness of the generated code, and a static analysis tool to detect potential safety vulnerabilities. Next, we assess the models ability to evaluate the generated code, by asking them to detect errors and vulnerabilities. Finally, we test the models ability to fix the generated code, providing the reports produced during the static analysis and incorrectness evaluation phases as feedback. Our results show that models often produce incorrect code, and that the generated code can include safety issues. Moreover, they perform very poorly at detecting either issue. On the positive side, we observe a substantial ability to fix flawed code when provided with information about failed tests or potential vulnerabilities, indicating a promising avenue for improving the safety of LLM-based code generation tools.

URL PDF HTML ☆

赞 0 踩 0

2411.15266 2026-03-16 astro-ph.IM cond-mat.dis-nn cond-mat.mtrl-sci cs.RO physics.class-ph

Continuous Design and Reprogramming of Totimorphic Structures for Space Applications

Dominik Dold, Amy Thomas, Nicole Rosi, Jai Grover, Dario Izzo

Comments Code: https://github.com/esa/LattyMorph/tree/main

2409.01523 2026-03-16 cond-mat.mtrl-sci cs.LG

Machine learning approach for vibronically renormalized electronic band structures

Niraj Aryal, Sheng Zhang, Weiguo Yin, Gia-Wei Chern

Comments 17 pages, 7 figures

2407.03131 2026-03-16 cs.NE cs.AI eess.SP

MVGT: A Multi-view Graph Transformer Based on Spatial Relations for EEG Emotion Recognition

Yanjie Cui, Xiaohong Liu, Jing Liang, Yamin Fu

Comments Accepted by ICONIP 2025 (Oral). 16 pages, 5 figures

2208.13701 2026-03-16 stat.ME cs.LG math.OC stat.ML

Data-Driven Influence Functions for Optimization-Based Causal Inference

Michael I. Jordan, Yixin Wang, Angela Zhou

Comments Revision

2103.01801 2026-03-16 eess.SP cs.LG

Deep Reinforcement Learning for URLLC data management on top of scheduled eMBB traffic

Fabio Saggese, Luca Pasqualini, Marco Moretti, Andrea Abrardo

Comments This work has been submitted to the IEEE for possible publication

2603.12880 2026-03-16 eess.SP cs.LG

Explainable AI Using Inherently Interpretable Components for Wearable-based Health Monitoring

Maurice Kuschel, Solveig Vieluf, Claus Reinsberger, Tobias Loddenkemper, Tanuj Hasija

Comments Submitted to the IEEE Journal of Biomedical and Health Informatics

2603.12870 2026-03-16 math.NA cs.CE cs.LG cs.NA

Surrogates for Physics-based and Data-driven Modelling of Parametric Systems: Review and New Perspectives

Matteo Giacomini, Pedro Díez

2603.12849 2026-03-16 eess.SP cs.RO

AoI-FusionNet: Age-Aware Tightly Coupled Fusion of UWB-IMU under Sparse Ranging Conditions

Tehmina Bibi, Anselm Köhler, Jan-Thomas Fischer, Falko Dressler

2603.12828 2026-03-16 eess.SY cs.LG cs.SY

From AI Weather Prediction to Infrastructure Resilience: A Correction-Downscaling Framework for Tropical Cyclone Impacts

You Wu, Zhenguo Wang, Naiyu Wang

2603.12781 2026-03-16 cs.CY cs.AI cs.HC

The RIGID Framework: Research-Integrated, Generative AI-Mediated Instructional Design

Yerin Kwak, Zachary A. Pardos

2603.12752 2026-03-16 cs.IR cs.LG

Taming the Long Tail: Efficient Item-wise Sharpness-Aware Minimization for LLM-based Recommender Systems

Jiaming Zhang, Yuyuan Li, Xiaohua Feng, Li Zhang, Longfei Li, Jun Zhou, Chaochao Chen

2603.12739 2026-03-16 cs.NE cs.AI cs.AR

SRAM-Based Compute-in-Memory Accelerator for Linear-decay Spiking Neural Networks

Hongyang Shang, Shuai Dong, Yahan Yang, Junyi Yang, Peng Zhou, Arindam Basu

2603.12734 2026-03-16 stat.ML cs.LG

VecMol: Vector-Field Representations for 3D Molecule Generation

Yuchen Hua, Xingang Peng, Jianzhu Ma, Muhan Zhang

2603.12726 2026-03-16 cs.IR cs.LG

Anchored Alignment: Preventing Positional Collapse in Multimodal Recommender Systems

Yonghun Jeong, David Yoon Suk Kang, Yeon-Chang Lee

Comments 5 pages, 5 figures

2603.12715 2026-03-16 eess.IV cs.CV

Deep Learning Based Estimation of Blood Glucose Levels from Multidirectional Scleral Blood Vessel Imaging

Muhammad Ahmed Khan, Manqiang Peng, Ding Lin, Saif Ur Rehman Khan

2603.12712 2026-03-16 cs.SE cs.LG

Design-Specification Tiling for ICL-based CAD Code Generation

Yali Du, San-Zhuo Xi, Hui Sun, Ming Li

2603.12701 2026-03-16 cs.HC cs.AI

Seeing Eye to Eye: Enabling Cognitive Alignment Through Shared First-Person Perspective in Human-AI Collaboration

Zhuyu Teng, Pei Chen, Yichen Cai, Ruoqing Lu, Zhaoqu Jiang, Jiayang Li, Weitao You, Lingyun Sun

Comments 19 pages, 11 figures. Accepted at ACM CHI 2026, Barcelona

2603.12642 2026-03-16 eess.AS cs.CL cs.LG cs.SD

Self-Supervised Speech Models Encode Phonetic Context via Position-dependent Orthogonal Subspaces

Kwanghee Choi, Eunjung Yeo, Cheol Jun Cho, David R. Mortensen, David Harwath

Comments Submitted to Interspeech 2026

2603.12636 2026-03-16 math.OC cs.LG

Weakly Time-Coupled Approximation of Markov Decision Processes

Negar Soheili, Selvaprabu Nadarajah, Bo Yang

2603.12630 2026-03-16 econ.TH cs.AI cs.CY cs.HC econ.EM

The Economics of AI Supply Chain Regulation

Sihan Qian, Amit Mehra, Dengpan Liu

Comments An earlier version of this paper, titled "The Economics of Fine-Tuning for Large-Scale AI Models," was presented at WISE 2023, where it won the Best Student Paper Award