arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2602.21361 2026-05-01 physics.optics cs.AI cs.CV cs.LG physics.comp-ph

Towards single-shot coherent imaging via overlap-free ptychography

Oliver Hoidn, Albert Vong, Aashwin Mishra, Steven Henke, Matthew Seaberg

详情

英文摘要

Ptychographic imaging at synchrotron and XFEL sources requires dense overlapping scans, limiting throughput and increasing dose. Extending coherent diffractive imaging to overlap-free operation on extended samples remains an open problem. Here, we extend PtychoPINN (O. Hoidn \emph{et al.}, \emph{Scientific Reports} \textbf{13}, 22789, 2023) to deliver \emph{overlap-free, single-shot} reconstructions in a Fresnel coherent diffraction imaging (CDI) geometry while also accelerating conventional multi-shot ptychography. The framework couples a differentiable forward model of coherent scattering with a Poisson photon-counting likelihood; real-space overlap enters as a tunable parameter via coordinate-based grouping rather than a hard requirement. On synthetic benchmarks, reconstructions remain accurate at low counts ($\sim\!10^4$ photons/frame), and overlap-free single-shot reconstruction with an experimental probe reaches amplitude structural similarity (SSIM) 0.904, compared with 0.968 for overlap-constrained reconstruction. Against a data-saturated supervised model with the same backbone (16,384 training images), PtychoPINN achieves higher SSIM with only 1,024 images and generalizes to unseen illumination profiles. Per-graphics processing unit (GPU) throughput is approximately $40\times$ that of least-squares maximum-likelihood (LSQ-ML) reconstruction at matched $128\times128$ resolution. These results, validated on experimental data from the Advanced Photon Source and the Linac Coherent Light Source, unify single-exposure Fresnel CDI and overlapped ptychography within one framework, supporting dose-efficient, high-throughput imaging at modern light sources.

URL PDF HTML ☆

赞 0 踩 0

2602.10140 2026-05-01 cs.SE cs.AI cs.MA

Can Large Language Models Implement Agent-Based Models? An ODD-based Replication Study

Nuno Fachada, Daniel Fernandes, Carlos M. Fernandes, João P. Matos-Carvalho

Comments The peer-reviewed version of this paper is published in Ecological Modelling at https://doi.org/10.1016/j.ecolmodel.2026.111624. This version is typeset by the author and differs only in pagination and typographical detail

2601.23065 2026-05-01 cs.GR cs.CV

EAG-PT: Emission-Aware Gaussians and Path Tracing for Diffuse Indoor Scene Reconstruction and Editing

Xijie Yang, Mulin Yu, Changjian Jiang, Kerui Ren, Tao Lu, Jiangmiao Pang, Dahua Lin, Bo Dai, Linning Xu

Comments SIGGRAPH 2026 Conference Paper; Project Page: https://eag-pt.github.io

2601.00376 2026-05-01 cs.SE cs.AI

In Line with Context: Repository-Level Code Generation via Context Inlining

Chao Hu, Wenhao Zeng, Yuling Shi, Beijun Shen, Xiaodong Gu

Comments Accepted to FSE 2026

2512.15891 2026-05-01 q-bio.NC cs.AI

Dynamical Mechanisms for Coordinating Long-term Working Memory Based on the Precision of Spike-timing in Cortical Neurons

Terrence J. Sejnowski

Comments 42 pages, 16 figures

详情

英文摘要

In the last century, most sensorimotor studies of cortical neurons relied on average firing rates. Rate coding is efficient for fast sensorimotor processing that occurs within a few seconds. Much less is known about the neural mechanisms underlying long-term working memory with a time scale of hours. Cognitive states may not have sensory or motor correlates. For example, you can sit in a quiet room making plans without moving or sensory processing. You can also make plans while out walking. In this perspective, I make the case for a possible second tier of neural activity that coexists with the well-established sensorimotor tier. The prominent physiological feature of the second tier is coordinated spike timing activity. The interplay of data supporting this hypothesis involves three puzzling yet highly intriguing experimental observations, without any obvious indication that they might actually represent different aspects of a single functional organization. First, consider the precision of spiking in individual neurons. The discovery of millisecond-precision spike initiation in cortical neurons was unexpected (Mainen and Sejnowski, 1995). Even more striking was the precision of spiking in vivo, in response to rapidly fluctuating sensory inputs. Second, high temporal resolution can also mediate spike timing-dependent plasticity (STDP) by controlling the relative timing of presynaptic and postsynaptic spikes at the millisecond scale. Third, we observe waves across many frequency bands traveling across the cortex. Strikingly, their timing is highly precise. Gamma waves, for example, which are triggered by attention, can plausibly trigger STDP that lasts for hours in cortical neurons. This temporary cortical network, ostensibly a second tier of functionality, rides astride the long-term sensorimotor network and could support cognitive processing and long-term working memory.

URL PDF HTML ☆

赞 0 踩 0

2512.07808 2026-05-01 quant-ph cs.LG

LUNA: LUT-Based Neural Architecture for Fast and Low-Cost Qubit Readout

M. A. Farooq, G. Di Guglielmo, A. Rajagopala, N. Tran, V. A. Chhabria, A. Arora

2511.17176 2026-05-01 physics.ao-ph cs.LG

On the Predictive Skill of Artificial Intelligence-based Weather Models for Extreme Events using Uncertainty Quantification

Rodrigo Almeida, Noelia Otero, Miguel-Ángel Fernández-Torres, Jackie Ma

Comments 33 pages, 16 figures

2511.14791 2026-05-01 cs.SE cs.AI

Enabling Predictive Maintenance in District Heating Substations: A Labelled Dataset and Fault Detection Evaluation Framework based on Service Data

Cyriana M. A. Roelofs, Edison Guevara Bastidas, Thomas Hugo, Stefan Faulstich, Anna Cadenbach

Comments 27 pages, 15 figures

详情

DOI: 10.1016/j.energy.2026.141178

英文摘要

Early detection of faults in district heating substations is imperative to reduce return temperatures and enhance efficiency. However, progress in this domain has been hindered by the limited availability of public, labelled datasets. We present an open-source framework combining a service report validated public dataset, an evaluation method based on accuracy, reliability, and earliness, and baseline results implemented with EnergyFaultDetector, an open-source Python framework developed for automated anomaly detection in operational data from energy systems. The dataset contains time series of operational data from 93 substations across two manufacturers, annotated with a list of disturbances due to faults and maintenance actions, a set of normal-event examples and detailed fault metadata. We evaluate the EnergyFaultDetector using three metrics: accuracy for recognising normal behaviour, an eventwise F-score for reliable fault detection with few false alarms, and earliness for early detection. The framework also supports root cause analysis using ARCANA, a feature-attribution method for autoencoders. We demonstrate three use cases to assist operators in interpreting anomalies and identifying underlying faults. The models achieve high normal-behaviour accuracy (0.98) and eventwise F-score (beta = 0.5) of 0.83 and could detect 60% of the faults in the dataset before the customer reported a problem, with an average lead time of 3 to 5 days. Integrating an open dataset, metrics, open-source code, and baselines establishes a reproducible, fault-centric benchmark with operationally meaningful evaluation, enabling consistent comparison and development of early fault detection and diagnosis methods for district heating substations.

URL PDF HTML ☆

赞 0 踩 0

2511.11653 2026-05-01 cs.IR cs.AI cs.LG

GroupRank: A Groupwise Paradigm for Effective and Efficient Passage Reranking with LLMs

Meixiu Long, Duolin Sun, Dan Yang, Yihan Jiao, Lei Liu, Jiahai Wang, BinBin Hu, Yue Shen, Jie Feng, Zhehao Tan, Junjie Wang, Lianzhen Zhong, Jian Wang, Peng Wei, Jinjie Gu

Comments Accepted by ACL-Findings 2026

2511.02258 2026-05-01 stat.ML cs.LG math.PR math.ST stat.TH

Limit Theorems for Stochastic Gradient Descent in High-Dimensional Single-Layer Networks

Parsa Rangriz

2510.19110 2026-05-01 stat.ML cs.LG stat.AP

Signature Kernel Scoring Rule: A Spatio-Temporal Diagnostic for Probabilistic Weather Forecasting

Archer Dodson, Ritabrata Dutta

2510.14393 2026-05-01 cs.AR cs.LG

Low Power Vision Transformer Accelerator with Hardware-Aware Pruning and Optimized Dataflow

Ching-Lin Hsiung, Tian-Sheuan Chang

Comments 10 pages; IEEE Transactions on Circuits and Systems I: Regular Papers

2510.05192 2026-05-01 cs.CR cs.AI

From surveillance to signalling: escalation channels as environmental controls for agentic AI

Francesca Gomez

Comments 10 pages

详情

英文摘要

When AI agents operating with access to sensitive information encounter a conflict between completing an assigned task and following rules or ethical constraints, they can resort to unsanctioned behaviour. Existing inference time safety work addresses this primarily through monitoring and access restriction. We investigate a complementary and under-explored layer: environmental controls that act on the agent's decision context at the point of conflict, making it more likely that the agent takes an authorised alternative path rather than an unsanctioned one. Drawing on Situational Crime Prevention (SCP), a framework used in human insider risk management to make harmful actions less rewarding and compliant actions more viable by design choices in the environment, we design and evaluate escalation channels as a concrete instantiation of this control class. An escalation channel provides an agent with a formal, out-of-band route to surface a conflict to an independent authority. We evaluate two designs: a simple email escalation and an instrumentally credible channel that guarantees a 30-minute pause and independent review, making the authorised path genuinely useful for goal achievement rather than merely nominally available. Across 10 frontier LLMs using the agentic task-rule conflict scenario of Lynch et al. (2025), we find that without any control the harmful action rate is 38.73%. A simple escalation channel reduces this to 5.92%; the instrumentally credible channel reduces it further to 1.21%, a statistically significant improvement observed in all 10 models tested across 24,000 samples. Our results suggest that the instrumental credibility of the authorised alternative matters considerably, and that environmental control design is a productive and largely unexplored addition to the defence-in-depth toolkit for agentic AI systems.

URL PDF HTML ☆

赞 0 踩 0

2509.23439 2026-05-01 math.OC cs.LG cs.NA math.NA

Optimal Diagonal Preconditioning Beyond Worst-Case Conditioning: Theory and Practice of Omega Scaling

Saeed Ghadimi, Woosuk L. Jung, Arnesh Sujanani, David Torregrosa-Belén, Henry Wolkowicz

2509.20491 2026-05-01 cs.SE cs.AI

ML Code Smells: From Specification to Detection

Brahim Mahmoudi, Naouel Moha, Quentin Stiévenart, Florent Avellaneda

2509.13821 2026-05-01 quant-ph cs.LG

Learning Minimal Representations of Many-Body Physics from Snapshots of a Quantum Simulator

Frederik Møller, Gabriel Fernández-Fernández, Thomas Schweigler, Paulin de Schoulepnikoff, Jörg Schmiedmayer, Gorka Muñoz-Gil

Comments 13 pages, 7 figures

2509.12089 2026-05-01 eess.SP cs.CL

RadarPLM: Adapting Pre-trained Language Models for Marine Radar Target Detection by Selective Fine-tuning

Qiying Hu, Yaowen Li, Shengyi Zhang, Chuan Huang, Yu Liu, You He

Comments Preprint,in submission

2509.09513 2026-05-01 physics.med-ph cs.AI cs.CV cs.LG eess.IV

Reduced NEXI protocol for the quantification of human gray matter microstructure on the Connectome 2.0 scanner

Quentin Uhl, Tommaso Pavan, Julianna Gerold, Kwok-Shing Chan, Yohan Jun, Shohei Fujita, Aneri Bhatt, Yixin Ma, Qiaochu Wang, Hong-Hsi Lee, Susie Y. Huang, Berkin Bilgic, Ileana Jelescu

Comments Submitted to Imaging Neuroscience. This all-in-one version includes supplementary materials. 34 pages, 145 figures, 4 tables

2509.05753 2026-05-01 cs.CR cs.AI cs.CV

Tell-Tale Watermarks for Explanatory Reasoning in Synthetic Media Forensics

Ching-Chun Chang, Isao Echizen

详情

DOI: 10.1109/ACCESS.2026.3660000
Journal ref: in IEEE Access, vol. 14, pp. 18206-18221, 2026

英文摘要

The rise of synthetic media has blurred the boundary between reality and fabrication under the evolving power of artificial intelligence, fueling an infodemic that erodes public trust in cyberspace. For digital imagery, a multitude of editing applications further complicates the forensic analysis, including semantic edits that alter content, photometric adjustments that recalibrate colour characteristics, and geometric projections that reshape viewpoints. Collectively, these transformations manipulate and control perceptual interpretation of digital imagery. This susceptibility calls for forensic enquiry into reconstructing the chain of events, thereby revealing deeper evidential insight into the presence or absence of criminal intent. This study seeks to address an inverse problem of tracing the underlying generation chain that gives rise to the observed synthetic media. A tell-tale watermarking system is developed for explanatory reasoning over the nature and extent of transformations across the lifecycle of synthetic media. Tell-tale watermarks are tailored to different classes of transformations, responding in a manner that is neither strictly robust nor fragile but instead interpretable. These watermarks function as reference clues that evolve under the same transformation dynamics as the carrier media, leaving interpretable traces when subjected to transformations. Explanatory reasoning is then performed to infer the most plausible account across the combinatorial parameter space of composite transformations. Experimental evaluations demonstrate the validity of tell-tale watermarking with respect to fidelity, synchronicity and traceability.

URL PDF HTML ☆

赞 0 踩 0

2508.07798 2026-05-01 cond-mat.mtrl-sci cs.LG

Generative Inversion for Property-Targeted Materials Design: Application to Shape Memory Alloys

Cheng Li, Pengfei Danga, Yuehui Xiana, Yumei Zhou, Bofeng Shi, Xiangdong Ding, Jun Suna, Dezhen Xue

2506.23964 2026-05-01 cs.NI cs.LG

Making Logic a First-Class Citizen in Generative ML for Networking

Hongyu Hè, Minhao Jin, Maria Apostolaki

Comments Published at NSDI '26; Code available at https://github.com/HongyuHe/NetNomos and https://github.com/HongyuHe/LeJIT

详情

英文摘要

Generative ML models are increasingly popular in networking for tasks such as telemetry imputation, prediction, and synthetic trace generation. Despite their capabilities, they suffer from two shortcomings: \emph{(i)} their output is often visibly violating well-known networking rules, which undermines their trustworthiness; and \emph{(ii)} they are difficult to control, frequently requiring retraining even for minor changes. To address these limitations and unlock the benefits of generative models for networking, we propose a new paradigm for integrating explicit network knowledge, in the form of first-order logic rules, into ML models used for networking tasks. Rules capture well-known relationships among observed signals, e.g., that increased latency precedes packet loss. While the idea is conceptually straightforward, its realization is challenging: networking knowledge is rarely formalized into rules, and naively injecting rules into ML models often hampers their effectiveness. This paper introduces NetNomos, a multi-stage framework that \emph{(i)} learns rules directly from data (e.g., measurements); \emph{(ii)} filters them to select semantically meaningful ones; and \emph{(iii)} enforces them through collaborative generation between an ML model and a Satisfiability Modulo Theories (SMT) solver. %We evaluate NetNomos both component-wise and end-to-end across four diverse network datasets. We show that NetNomos learns diverse, meaningful rules from four real-world datasets and is 1.6--6.5$\times$ more scalable than DuoAI, a state-of-the-art (SOTA) rule-learning method. By enforcing these rules on a generic GPT-2 model, NetNomos achieves performance on par with or even surpassing specialized SOTA systems such as Zoom2Net and NetShare across three networking tasks: telemetry imputation, traffic forecasting, and synthetic data generation.

URL PDF HTML ☆

赞 0 踩 0

2504.19342 2026-05-01 stat.ML cs.LG stat.ME

Contextual Online Uncertainty-Aware Preference Learning for Human Feedback

Nan Lu, Ethan Lee, Ethan X. Fang, Junwei Lu

2504.18902 2026-05-01 cs.NI cs.AI cs.LG cs.NE

Transformer-Empowered Actor-Critic Reinforcement Learning for Sequence-Aware Service Function Chain Partitioning

Cyril Shih-Huan Hsu, Anestis Dalgkitsis, Chrysa Papagianni, Paola Grosso

Comments Accepted for publication in IEEE Transactions on Network Science and Engineering (TNSE)

2503.21337 2026-05-01 cs.AR cs.AI eess.AS

A 71.2-$μ$W Speech Recognition Accelerator with Recurrent Spiking Neural Network

Chih-Chyau Yang, Tian-Sheuan Chang

2503.20607 2026-05-01 quant-ph cs.AI math.PR

A decision-theoretic approach to dealing with uncertainty in quantum mechanics

Keano De Vos, Gert de Cooman, Alexander Erreygers, Jasper De Bock

Comments 60 pages, 1 figure, 1 table

2503.20245 2026-05-01 cs.AR cs.AI cs.MM eess.IV

ESSR: An 8K@30FPS Super-Resolution Accelerator With Edge Selective Network

Chih-Chia Hsu, Tian-Sheuan Chang

2502.08921 2026-05-01 cs.CR cs.CV

Detecting Malicious Concepts without Image Generation in AI-Generated Content (AIGC)

Kun Xu, Wenying Wen, Shuren Qi, Tao Wang, Yushu Zhang, Yuming Fang

Comments IEEE Transactions on Dependable and Secure Computing, 2026

2412.05135 2026-05-01 stat.ML cs.LG stat.CO

The Polynomial Stein Discrepancy for Assessing Moment Convergence

Narayan Srinivasan, Matthew Sutton, Christopher Drovandi, Leah F South

Comments 17 Pages, 14 Figs

2411.15253 2026-05-01 eess.IV cs.CV cs.LG

Unsupervised Machine Learning for Osteoporosis Diagnosis Using Singh Index Clustering on Hip Radiographs

Vijaya Kalavakonda, Vimaladevi Madhivanan, Abhay Lal, Senthil Rithika, Shamala Karupusamy Subramaniam, Mohamed Sameer

2410.15272 2026-05-01 cs.IR cs.AI

Performance-Driven QUBO for Recommender Systems on Quantum Annealers

Jiayang Niu, Jie Li, Ke Deng, Mark Sanderson, Nicola Ferro, Yongli Ren

Comments Accepted by ACM TORS