arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.04484 2026-04-07 eess.IV cs.CV

TM-BSN: Triangular-Masked Blind-Spot Network for Real-World Self-Supervised Image Denoising

Junyoung Park, Youngjin Oh, Nam Ik Cho

Comments Accepted to CVPR 2026

详情

英文摘要

Blind-spot networks (BSNs) enable self-supervised image denoising by preventing access to the target pixel, allowing clean signal estimation without ground-truth supervision. However, this approach assumes pixel-wise noise independence, which is violated in real-world sRGB images due to spatially correlated noise from the camera's image signal processing (ISP) pipeline. While several methods employ downsampling to decorrelate noise, they alter noise statistics and limit the network's ability to utilize full contextual information. In this paper, we propose the Triangular-Masked Blind-Spot Network (TM-BSN), a novel blind-spot architecture that accurately models the spatial correlation of real sRGB noise. This correlation originates from demosaicing, where each pixel is reconstructed from neighboring samples with spatially decaying weights, resulting in a diamond-shaped pattern. To align the receptive field with this geometry, we introduce a triangular-masked convolution that restricts the kernel to its upper-triangular region, creating a diamond-shaped blind spot at the original resolution. This design excludes correlated pixels while fully leveraging uncorrelated context, eliminating the need for downsampling or post-processing. Furthermore, we use knowledge distillation to transfer complementary knowledge from multiple blind-spot predictions into a lightweight U-Net, improving both accuracy and efficiency. Extensive experiments on real-world benchmarks demonstrate that our method achieves state-of-the-art performance, significantly outperforming existing self-supervised approaches. Our code is available at https://github.com/parkjun210/TM-BSN.

URL PDF HTML ☆

赞 0 踩 0

2604.04470 2026-04-07 eess.IV cs.AI

MC-GenRef: Annotation-free mammography microcalcification segmentation with generative posterior refinement

Hyunwoo Cho, Yeeun Kwon, Min Jung Kim, Yangmo Yoo

详情

英文摘要

Microcalcification (MC) analysis is clinically important in screening mammography because clustered puncta can be an early sign of malignancy, yet dense MC segmentation remains challenging: targets are extremely small and sparse, dense pixel-level labels are expensive and ambiguous, and cross-site shift often induces texture-driven false positives and missed puncta in dense tissue. We propose MC-GenRef, a real dense-label-free framework that combines high-fidelity synthetic supervision with test-time generative posterior refinement (TT-GPR). During training, real negative mammogram patches are used as backgrounds, and physically plausible MC patterns are injected through a lightweight image formation model with local contrast modulation and blur, yielding exact image-mask pairs without real dense annotation. Using only these synthetic labeled pairs, MC-GenRef trains a base segmentor and a seed-conditioned rectified-flow (RF) generator that serves as a controllable generative prior. During inference, TT-GPR treats segmentation as approximate posterior inference: it derives a sparse seed from the current prediction, forms seed-consistent RF projections, converts them into case-specific surrogate targets through the frozen segmentor, and iteratively refines the logits with overlap-consistent and edge-aware regularization. On INbreast, the synthetic-only initializer achieved the best Dice without real dense annotations, while TT-GPR improved miss-sensitive performance to Recall and FNR, with strong class-balanced behavior (Bal.Acc., G-Mean). On an external private Yonsei cohort ( n=50 ), TT-GPR consistently improved the synthetic-only initializer under cross-site shift, increasing Dice and Recall while reducing FNR. These results suggest that test-time generative posterior refinement is a practical route to reduce MC misses and improve robustness without additional real dense labeling.

URL PDF HTML ☆

赞 0 踩 0

2604.04442 2026-04-07 cs.CR cs.LG cs.MA

Explainable Autonomous Cyber Defense using Adversarial Multi-Agent Reinforcement Learning

Yiyao Zhang, Diksha Goel, Hussain Ahmad

详情

英文摘要

Autonomous agents are increasingly deployed in both offensive and defensive cyber operations, creating high-speed, closed-loop interactions in critical infrastructure environments. Advanced Persistent Threat (APT) actors exploit "Living off the Land" techniques and targeted telemetry perturbations to induce ambiguity in monitoring systems, causing automated defenses to overreact or misclassify benign behavior as malicious activity. Existing monolithic and multi-agent defense pipelines largely operate on correlation-based signals, lack structural constraints on response actions, and are vulnerable to reasoning drift under ambiguous or adversarial inputs. We present the Causal Multi-Agent Decision Framework (C-MADF), a structurally constrained architecture for autonomous cyber defense that integrates causal modeling with adversarial dual-policy control. C-MADF first learns a Structural Causal Model (SCM) from historical telemetry and compiles it into an investigation-level Directed Acyclic Graph (DAG) that defines admissible response transitions. This roadmap is formalized as a Markov Decision Process (MDP) whose action space is explicitly restricted to causally consistent transitions. Decision-making within this constrained space is performed by a dual-agent reinforcement learning system in which a threat-optimizing Blue-Team policy is counterbalanced by a conservatively shaped Red-Team policy. Inter-policy disagreement is quantified through a Policy Divergence Score and exposed via a human-in-the-loop interface equipped with an Explainability-Transparency Score that serves as an escalation signal under uncertainty. On the real-world CICIoT2023 dataset, C-MADF reduces the false-positive rate from 11.2%, 9.7%, and 8.4% in three cutting-edge literature baselines to 1.8%, while achieving 0.997 precision, 0.961 recall, and 0.979 F1-score.

URL PDF HTML ☆

赞 0 踩 0

2604.04427 2026-04-07 cs.IR cs.CL

FAVE: Flow-based Average Velocity Establishment for Sequential Recommendation

Ke Shi, Yao Zhang, Feng Guo, Jinyuan Zhang, JunShuo Zhang, Shen Gao, Shuo Shang

Comments Accepted by SIGIR 2026

2604.04414 2026-04-07 cs.ET cs.LG quant-ph

Eliminating Vendor Lock-In in Quantum Machine Learning via Framework-Agnostic Neural Networks

Poornima Kumaresan, Shwetha Singaravelu, Lakshmi Rajendran, Santhosh Sivasubramani

2604.04407 2026-04-07 eess.IV cs.CV cs.LG cs.MM

NAIMA: Semantics Aware RGB Guided Depth Super-Resolution

Tayyab Nasir, Daochang Liu, Ajmal Mian

2604.04354 2026-04-07 cs.HC cs.CL cs.CY

Talk2AI: A Longitudinal Dataset of Human--AI Persuasive Conversations

Alexis Carrillo, Enrique Taietta, Ali Aghazadeh Ardebili, Giuseppe Alessandro Veltri, Massimo Stella

Comments 17 pages, 2 figures, 7 tables

2604.04321 2026-04-07 math.DG cs.LG

Minimising Willmore Energy via Neural Flow

Edward Hirst, Henrique N. Sá Earp, Tomás S. R. Silva

Comments 16+5 pages, 9 figures

2604.04319 2026-04-07 cs.CY cs.AI cs.ET cs.HC cs.LG

Effects of Generative AI Errors on User Reliance Across Task Difficulty

Jacy Reese Anthis, Hannah Cha, Solon Barocas, Alexandra Chouldechova, Jake Hofman

Comments Published in CHI EA 2026

2604.04312 2026-04-07 cs.IT cs.DC cs.LG math.IT

Out-of-Air Computation: Enabling Structured Extraction from Wireless Superposition

Seyed Mohammad Azimi-Abarghouyi

详情

英文摘要

Over-the-air computation (AirComp) has traditionally been built on the principle of pre-embedding computation into transmitted waveforms or on exploiting massive antenna arrays, often requiring the wireless multiple-access channel (MAC) to operate under conditions that approximate an ideal computational medium. This paper introduces a new computation framework, termed out-of-air computation (AirCPU), which establishes a joint source-channel coding foundation in which computation is not embedded before transmission but is instead extracted from the wireless superposition by exploiting structured coding. AirCPU operates directly on continuous-valued device data, avoiding the need for a separate source quantization stage, and employs a multi-layer nested lattice architecture that enables progressive resolution by decomposing each input into hierarchically scaled components, all transmitted over a common bounded digital constellation under a fixed power constraint. We formalize the notion of decoupled resolution, showing that in operating regimes where the decoding error probability is sufficiently small, the impact of channel noise and finite constellation constraints on distortion becomes negligible, and the resulting computation error is primarily determined by the target resolution set by the finest lattice. For fading MACs, we further introduce collective and successive computation mechanisms, in addition to the proposed direct computation, which exploit multiple decoded integer-coefficient functions and side-information functions as structural representations of the wireless superposition to significantly expand the reliable operating regime; in this context, we formulate and characterize the underlying reliability conditions and integer optimization problems, and develop a structured low-complexity two-group approximation to address them.

URL PDF HTML ☆

赞 0 踩 0

2604.04302 2026-04-07 stat.ME cs.LG

CavMerge: Merging K-means Based on Local Log-Concavity

Zhili Qiao, Wangqian Ju, Peng Liu

2604.04289 2026-04-07 cs.CR cs.AI cs.SE

Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6

Luis Guzmán Lorenzo

Comments 18 pages, 1 figure, 17 references. Code and data: https://github.com/Kieleth/obfuscated-sentinel

2604.04271 2026-04-07 cs.NI cs.LG

A Family of Open Time-Series Foundation Models for the Radio Access Network

Ioannis Panitsas, Leandros Tassiulas

2604.04270 2026-04-07 cs.IR cs.LG

A Logical-Rule Autoencoder for Interpretable Recommendations

Jinhao Pan, Bowen Wei, Ziwei Zhu

2604.04265 2026-04-07 cs.CR cs.AI cs.MA

Governance-Constrained Agentic AI: Blockchain-Enforced Human Oversight for Safety-Critical Wildfire Monitoring

Ali Akarma, Toqeer Ali Syed, Salman Jan, Hammad Muneer, Abdul Khadar Jilani

Comments This paper was presented at ICETAS 2026 Bahrain

2604.04264 2026-04-07 stat.ML cs.IT cs.LG eess.SP math.IT stat.AP

Avoiding Non-Integrable Beliefs in Expectation Propagation

Zilu Zhao, Jichao Chen, Dirk Slock

2604.04263 2026-04-07 cs.CY cs.AI cs.CL

Commercial Persuasion in AI-Mediated Conversations

Francesco Salvi, Alejandro Cuevas, Manoel Horta Ribeiro

2604.04262 2026-04-07 cs.MA cs.AI cs.CR

Agents for Agents: An Interrogator-Based Secure Framework for Autonomous Internet of Underwater Things

Ali Akarma, Toqeer Ali Syed, Abdul Khadar Jilani, Salman Jan, Hammad Muneer, Muazzam A. Khan, Changli Yu

Comments This paper was presented in ICETAS 2026 in Bahrain

2604.04246 2026-04-07 cs.SI cs.LG cs.SY eess.SY math.DS

Transmission Neural Networks: Inhibitory and Excitatory Connections

Shuang Gao, Peter E. Caines

Comments 8 pages

2604.04229 2026-04-07 cs.MM cs.AI cs.CV cs.SD

Hierarchical Semantic Correlation-Aware Masked Autoencoder for Unsupervised Audio-Visual Representation Learning

Donghuo Zeng, Hao Niu, Masato Taya

Comments 6 pages, 2 tables, 4 figures. Accepted by IEEE ICME 2026

2604.04218 2026-04-07 stat.ML cs.LG math.ST stat.TH

Sharp asymptotic theory for Q-learning with LDTZ learning rate and its generalization

Soham Bonnerjee, Zhipeng Lou, Wei Biao Wu

详情

Journal ref: ICLR 2026, Main Conference Track, Poster

英文摘要

Despite the sustained popularity of Q-learning as a practical tool for policy determination, a majority of relevant theoretical literature deals with either constant ($η_{t}\equiv η$) or polynomially decaying ($η_{t} = ηt^{-α}$) learning schedules. However, it is well known that these choices suffer from either persistent bias or prohibitively slow convergence. In contrast, the recently proposed linear decay to zero (\texttt{LD2Z}: $η_{t,n}=η(1-t/n)$) schedule has shown appreciable empirical performance, but its theoretical and statistical properties remain largely unexplored, especially in the Q-learning setting. We address this gap in the literature by first considering a general class of power-law decay to zero (\texttt{PD2Z}-$ν$: $η_{t,n}=η(1-t/n)^ν$). Proceeding step-by-step, we present a sharp non-asymptotic error bound for Q-learning with \texttt{PD2Z}-$ν$ schedule, which then is used to derive a central limit theory for a new \textit{tail} Polyak-Ruppert averaging estimator. Finally, we also provide a novel time-uniform Gaussian approximation (also known as \textit{strong invariance principle}) for the partial sum process of Q-learning iterates, which facilitates bootstrap-based inference. All our theoretical results are complemented by extensive numerical experiments. Beyond being new theoretical and statistical contributions to the Q-learning literature, our results definitively establish that \texttt{LD2Z} and in general \texttt{PD2Z}-$ν$ achieve a best-of-both-worlds property: they inherit the rapid decay from initialization (characteristic of constant step-sizes) while retaining the asymptotic convergence guarantees (characteristic of polynomially decaying schedules). This dual advantage explains the empirical success of \texttt{LD2Z} while providing practical guidelines for inference through our results.

URL PDF HTML ☆

赞 0 踩 0

2604.04212 2026-04-07 eess.SP cs.LG

Relay-Assisted Activation-Integrated SIM for Wireless Physical Neural Networks

Meng Hua, Deniz Gündüz

2604.04211 2026-04-07 cs.CR cs.AI

LOCARD: An Agentic Framework for Blockchain Forensics

Xiaohang Yu, William Knottenbelt

2604.04194 2026-04-07 cond-mat.mtrl-sci cs.AI cs.LG physics.data-an

PATHFINDER: Multi-objective discovery in structural and spectral spaces

Kamyar Barakati, Boris N. Slautin, Utkarsh Pratiush, Hiroshi Funakubo, Sergei V. Kalinin

Comments 24 pages, 6 figures

2604.04160 2026-04-07 eess.AS cs.SD eess.SP

AffectSpeech: A Large-Scale Emotional Speech Dataset with Fine-Grained Textual Descriptions for Speech Emotion Captioning and Synthesis

Tianhua Qi, Wenming Zheng, Björn W. Schuller, Zhaojie Luo, Haizhou Li

Comments Submitted to IEEE Transactions

2604.04154 2026-04-07 cond-mat.stat-mech cond-mat.dis-nn cs.LG q-bio.NC

Non-Equilibrium Stochastic Dynamics as a Unified Framework for Insight and Repetitive Learning: A Kramers Escape Approach to Continual Learning

Gunn Kim

Comments 12 pages, 4 figures

详情

英文摘要

Continual learning in artificial neural networks is fundamentally limited by the stability--plasticity dilemma: systems that retain prior knowledge tend to resist acquiring new knowledge, and vice versa. Existing approaches, most notably elastic weight consolidation~(EWC), address this empirically without a physical account of why plasticity eventually collapses as tasks accumulate. Separately, the distinction between sudden insight and gradual skill acquisition through repetitive practice has lacked a unified theoretical description. Here, we show that both problems admit a common resolution within non-equilibrium statistical physics. We model the state of a learning system as a particle evolving under Langevin dynamics on a double-well energy landscape, with the noise amplitude governed by a time-dependent effective temperature $T(t)$. The probability density obeys a Fokker--Planck equation, and transitions between metastable states are governed by the Kramers escape rate $k = (ω_0ω_b/2π)\,e^{-ΔE/T}$. We make two contributions. First, we identify the EWC penalty term as an energy barrier whose height grows linearly with the number of accumulated tasks, yielding an exponential collapse of the transition rate predicted analytically and confirmed numerically. Second, we show that insight and repetitive learning correspond to two qualitatively distinct temperature protocols within the same Fokker--Planck equation: insight events produce transient spikes in $T(t)$ that drive rapid barrier crossing, whereas repetitive practice operates at a modestly elevated but fixed temperature, achieving transitions through sustained stochastic diffusion. These results establish a physically grounded framework for understanding plasticity and its failure in continual learning systems, and suggest principled design criteria for adaptive noise schedules in artificial intelligence.

URL PDF HTML ☆

赞 0 踩 0

2604.04130 2026-04-07 math.OC cs.LG cs.NA math.NA

Primal-Dual Methods for Nonsmooth Nonconvex Optimization with Orthogonality Constraints

Linglingzhi Zhu, Wentao Ding, Shangyuan Liu, Anthony Man-Cho So

2604.04121 2026-04-07 cs.CR cs.AI cs.NI cs.PF

NetSecBed: A Container-Native Testbed for Reproducible Cybersecurity Experimentation

Leonardo Bitzki, Diego Kreutz, Tiago Heinrich, Douglas Fideles, Leandro Bertholdo, Silvio Quincozes, Angelo Diniz

Comments 8 pages, including 4 figures and 2 tables, submitted to SBCUP 2026

2604.04105 2026-04-07 cs.HC cs.CL

Lexical Indicators of Mind Perception in Human-AI Companionship

Jaime Banks, Jianghui Li

2603.29403 2026-04-07 cs.CR cs.AI

Security in LLM-as-a-Judge: A Comprehensive SoK

Aiman Al Masoud, Antony Anju, Marco Arazzi, Mert Cihangiroglu, Vignesh Kumar Kembu, Serena Nicolazzo, Antonino Nocera, Vinod P., Saraga Sakthidharan