arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 1822
专题追踪
2604.27010 2026-05-01 cs.HC

Quantifying the Cost of Manual Navigation: A Comparison of Gesture-Based Magnification versus Direct Access Reading in Digital Layout-based Documents

Sebastián Gallardo, Hui-Yin Wu, Dorian Mazauric, Pierre Kornprobst, Monica Di Meo, Stéphanie Baillif, Aurelie Calabrese

Journal ref IMX 2026 - International Conference on Interactive Media Experiences, Technological University of the Shannon, Jun 2026, Athlone, Ireland

详情
英文摘要

Understanding how diverse audiences engage with structured media is critical to ensure a consistent quality of experience. In this context, we quantify the behavioral and performance cost of manual navigation (e.g., pinch and zoom) versus direct structural access in layout-based digital documents. We specifically investigate newspaper reading when visual access to structural cues (headlines as entry points) is constrained. Participants completed two tasks-reading all headlines aloud and locating target articles-under two conditions: (1) original edition with gesture-based magnification (pan and zoom), which is the industry standard for digital documents, and (2) large-print edition supporting direct-access reading. We collected performance measures (success ratio and completion time), behavioral integrity through reading path analysis, alongside perceived workload and preferences (NASA-TLX). Results from linear mixed-effects models show that the large-print condition yielded not only better performance than gesture-based magnification (18% improvement in reading speed, 30% improvement in speed to locate a target), but more importantly, restored the natural reading strategy that gesture-based magnification interaction disrupts. Readers also reported lower workload and higher preference. These findings highlight the importance of developing automated methods for generating large-print editions, where layout adaptation complements font scaling to support accessibility and quality of experience.

2604.27009 2026-05-01 quant-ph

Geometric-Phase (Pancharatnam-Berry) Correction for Time-Bin Photonic Qudits: A Calibration and Feed-Forward Algorithm

Ryan Rae-Cheng Wee, Josef Bruzzese

Comments 14 pages, 5 figures. Versioned code and numerical routines available at https://github.com/97RyanW/Geometric-Phase-Correction and https://doi.org/10.5281/zenodo.19349233

详情
英文摘要

We develop a geometric-phase framework for time-bin photonic qudits and propose a practical calibration and feed-forward algorithm for separating and compensating geometric (Pancharatnam-Berry), dynamical, and technical phase contributions. Working directly in the time-bin basis, we use a parallel-transport gauge so that geometric phases appear as experimentally identifiable interferometric offsets, while all phase contributions enter a bin-resolved diagonal transformation. We model state preparation by cascaded unbalanced Mach-Zehnder interferometers and give closed-form amplitudes for arbitrary splitting ratios and phases, noting that single-port monitoring requires post-selection and renormalization. We then give an interferometric tomography recipe based on adjacent-bin scans, with a Fourier-basis cross-check, and a multi-mode numerical case study that separates total, dynamical, and geometric phases and demonstrates feed-forward compensation. The protocol uses standard components, including tunable UMZIs, phase shifters or EOMs, and single-photon detectors, together with routine phase sweeps. It is intended for small to moderate dimensions, approximately d up to 10, and provides a scalable route toward phase-stable high-dimensional temporal encoding for quantum communication and photonic processing.

2604.27008 2026-05-01 cs.LO cs.NE

Compressing ACAS-Xu Lookup Tables with Binary Decision Diagrams

Martin Boniol, Julien Brunel, Jean-Baptiste Chaudron, Christophe Garion, Xavier Thirioux

Journal ref NASA Formal Methods (NFM) 2026, May 2026, Los Angeles (CA), United States

详情
英文摘要

The Airborne Collision Avoidance System Xu (ACAS-Xu) relies on large certified Look-Up Tables (LUTs) that encode the exact decision logic used in operation. Neural-network-based approximations have been proposed to reduce memory requirements, but they inherently introduce approximation errors and complicate formal verification. This paper presents a symbolic compression approach based on Binary Decision Diagrams (BDDs) that preserves the exact semantics of the ACAS-Xu LUTs. The resulting representation is canonical, deterministic, and fully equivalent to the original tables, enabling sound and exact reasoning over the complete decision logic. By expressing both the system behavior and domain-specific operational properties within a common Boolean framework, verification reduces to efficient BDD operations and emptiness checks, with precise counterexamples generated when properties are violated. We demonstrate that the proposed BDD-based representation significantly reduces memory usage, achieves predictable and low-latency execution, and can be deployed on embedded platforms. These results highlight BDDs as a compelling alternative for exact, verifiable, and embedded deployment of ACAS-Xu decision logic.

2604.27005 2026-05-01 physics.optics cond-mat.mtrl-sci

Phase-Transition-Driven Hyperbolic Optical Response and Directional Polaritons in Epitaxial VO2 Thin Films

Maria Chiara Paolozzi, Annalisa D Arco, Ilaria Martinelli, Lorenzo Mosesso, Jacopo Sera, Alessandro D Elia, Augusto Marcelli, Yingxue Chen, Chongwen Zou, Maria Cristina Larciprete, Marco Centini, Stefano Lupi, Salvatore Macis

详情
英文摘要

Optical anisotropy in crystalline solids enables direction-dependent light-matter interactions and underpins a variety of advanced photonic functionalities. In this context, Vanadium dioxide (VO2) represents a prototypical material that undergoes a reversible MIT near 67°C, accompanied by pronounced electronic, structural, and optical modifications. The MIT not only dramatically modifies the VO2 electrical conductivity but also reshapes its anisotropic optical response, making VO2 an exceptional platform for dynamically tunable photonic and optoelectronic devices. In this work, we investigate how the intrinsic crystalline anisotropy of VO2 induces a hyperbolic optical behavior in the metallic rutile phase. We study two epitaxial VO2 thin films of different thicknesses grown on (110) oriented MgF2 substrates. Broadband polarized spectroscopic measurements, spanning the infrared to UV spectral range, are employed to independently investigate the optical response in both the monoclinic and rutile phases. From these measurements, we extract the optical conductivity and the dielectric function, revealing a pronounced anisotropy in the rutile metallic phase, with an enhanced free-carrier response along the rutile c axis. Our data show that, within a narrow near-infrared spectral window, the real parts of the dielectric tensor components along the two principal axes acquire opposite signs, indicating the emergence of a hyperbolic type-II dispersion. The hyperbolic response is quantitatively evaluated through the quality factor and the degree of dielectric anisotropy, enabling a systematic assessment of VO2 as a thermally switchable, hyperbolic optical medium. These findings expand the understanding of anisotropy-driven optical phenomena in phase-change materials and highlight VO2 thin films as a promising platform for tunable and reconfigurable photonic applications.

2604.27002 2026-05-01 cs.CR

Membership Inference Attacks Against Video Large Language Models

Wei Song, Yuxin Cao, Ziqi Ding, Yi Liu, Gelei Deng, Yuekang Li

详情
英文摘要

Video large language models (VideoLLMs) are increasingly trained or instruction-tuned on large-scale video--text corpora collected from heterogeneous sources, raising an immediate privacy question: can an external auditor determine whether a particular video was used during training? While membership inference attacks (MIAs) have been studied extensively for classifiers and, more recently, for text and image generation models, the VideoLLM setting remains unexplored. This setting is challenging because black-box auditors observe only generated text, whereas the membership signal is entangled with video-specific factors such as motion complexity and temporal span. In this paper, we present a black-box MIA targeting VideoLLMs that couples temperature-perturbed generation with video-aware difficulty features. Our key intuition is that member samples tend to induce sharper, more brittle generation behavior across decoding temperatures, and that this signal should be interpreted jointly with the intrinsic difficulty of the queried video. Concretely, we query the target model at low and high temperatures, measure the semantic drift between the resulting texts. We evaluate the attack against \texttt{LLaVA-Video-7B-Qwen2-Video-Only} and achieve a member inference AUC of 0.68 and accuracy of 0.63. These results demonstrate that Video-LLMs are vulnerable to black-box membership inference attacks, highlighting an urgent need for the community to systematically evaluate and mitigate privacy risks in VideoLLMs.

2604.27001 2026-05-01 cs.CR cs.SE

An Empirical Security Evaluation of LLM-Generated Cryptographic Rust Code

Mohamed Elsayed, Kenneth Fulton, Jeong Yang

Comments 10 pages, 2 figures , EASE 2026-The 6th International Workshop on Software Security Engineering

详情
英文摘要

Developers and organizations are using Large Language Models (LLMs) to generate security-critical code more frequently than ever, including cryptographic solutions for their products. This study presents an empirical evaluation of cryptographic security in 240 Rust code samples for two crypto algorithms (AES-256-GCM and ChaCha20-Poly1305) generated by three LLMs (Gemini 2.5 Pro, GPT-4o, and DeepSeek Coder) using four different prompt strategies. For each successfully compiled code sample, CodeQL static analysis and our rule-based crypto-specific analyzer were used to detect vulnerabilities, which are also associated with Common Weakness Enumeration (CWE). The evaluation results revealed that only 23.3% of the generated code samples were successfully compiled. Among the compiled code, CodeQL produced only two false positives, while our rule-based crypto-specific analyzer identified vulnerabilities in 57% of the compiled samples with zero false positives. This demonstrates that general-purpose analysis tools are insufficient for code validation for the experimented crypto algorithms. The compilation success of the two algorithms varied significantly (AES-256-GCM 34.2% versus ChaCha20-Poly1305 12.5%), showing a gap in code generation capabilities. While model choice had no significant effect on compilation success, prompt strategy significantly influenced outcomes (P = 0.002), with chain-of-thought prompting performing 5 times worse than zero-shot. All three models exhibit systematic failures, including nonce reuse and API hallucinations.

2604.27000 2026-05-01 cs.SE cs.CR cs.PL

Adaptive and AI-Augmented Security Testing: A Systematic Survey of Program Analysis, Feedback-Driven Testing, and Hybrid Learning-Based Approaches

Michael Wienczkowski

Comments 29 pages, submitted for review

详情
英文摘要

Modern software systems are increasingly developed within rapid continuous integration and deployment (CI/CD) pipelines, where ensuring security prior to release presents significant technical and organizational challenges. Traditional static and dynamic analysis tools provide valuable structural and behavioral insights, yet they often operate in non-adaptive workflows and produce large volumes of warnings requiring manual triage. Feedback-driven fuzzing and search-based testing approaches have demonstrated the power of iterative input refinement guided by execution signals, while large language models (LLMs) have shown promise in automated test generation but frequently lack semantic grounding in program structure. This paper presents a systematic survey of adaptive and AI-augmented security testing research across five domains: (1) structural program analysis for vulnerability detection, (2) DevSecOps and continuous security testing, (3) feedback-driven fuzzing and search-based testing, (4) LLM-based automated test generation, and (5) emerging hybrid systems integrating program analysis with adaptive learning. We analyze fifty-five peer-reviewed studies drawn from a systematic search of four major databases yielding 22,088 raw records. Our analysis reveals a persistent disconnect between structural program representations (ASTs, CFGs, and CPGs) and adaptive testing mechanisms. We characterize this as structural-adaptive fragmentation: a systematic separation that neither paradigm individually addresses. No existing system incorporates human triage signals as feedback for refining structural models. We conclude by identifying five open research challenges and outlining a unified agenda for semantically grounded, feedback-driven, polyglot security testing frameworks.

2604.26996 2026-05-01 cs.IR

LUCid: Redefining Relevance For Lifelong Personalization

Chimaobi Okite, Anika Misra, Joyce Chai, Rada Mihalcea

Comments first version

详情
英文摘要

Current approaches to lifelong personalization operationalize relevance through semantic proximity, causing them to miss essential user information from topically unrelated interactions. To address this gap, we introduce LUCid, a benchmark designed to measure situational user-centric relevance in personalization. The benchmark consists of 1,936 realistic queries paired with interaction histories from up to 500 sessions. Across multiple architectures, our experiments show significant performance collapse when relevant context must be surfaced from semantically distant history: retrieval recall drops to near zero on the hardest instances, and response alignment remains near 50% even for state-of-the-art models such as Gemini-3-Flash, GPT-5.4, and Claude Haiku. These results expose a fundamental mismatch between the notion of relevance encoded by current systems and the situational relevance required for personalization, with direct implications for robustness and safety when critical user attributes remain undetected. LUCid enables the systematic evaluation of whether current models can surface situationally-relevant user information from previous interactions, and serves as a step toward realigning personalization with user-centered relevance.

2604.26995 2026-05-01 cond-mat.mtrl-sci

Thin film synthesis of SrZn2P2 with SrI2 post-annealing for enhanced crystallinity and optoelectronic quality

Sita Dugu, Shaham Quadir, Christopher P. Muzzillo, Zhenkun Yuan, Smitakshi Goswami, Xiaojing Hao, Jialiang Huang, Guillermo Esparza, Baptiste Julien, David Fenning, Jifeng Liu, Geoffroy Hautier, Andriy Zakutayev, Sage R. Bauers

详情
英文摘要

Ternary Zintl phosphides are promising light-absorbing semiconductors for thin-film optoelectronic applications, but strategies for controlling their microstructure and optoelectronic quality remain underexplored. Here, we report the synthesis of phase-pure SrZn2P2 thin films using radio-frequency co-sputtering in a PH3 + Ar atmosphere and investigate the impact of post-growth processing on their structural and optical properties. Grazing-incidence X-ray scattering and Raman spectroscopy confirm the formation of crystalline SrZn2P2 films over a finite compositional window. Optical measurements reveal strong absorption near the direct-band-gap energy (~1.8 eV) and near-band-edge photoluminescence. Further, we have studied the effects of chemically compatible halide-assisted annealing. It is found that SrI2 treatments lead to pronounced grain growth and reduced diffraction peak broadening while preserving phase purity, in contrast to rapid thermal or forming-gas annealing. Notably, annealing with SrI2 at 450 °C significantly enhances both the intensity and spatial uniformity of the photoluminescence, thus connecting the observed microstructural consolidation with improved radiative recombination. Our study demonstrates that halide-assisted annealing provides an effective pathway for microstructural control in SrZn2P2 thin films and highlights a generalizable processing strategy for advancing Zintl phosphide semiconductors toward optoelectronic applications.

2604.26994 2026-05-01 cs.DS

Fast and Faithful Edge Bundling using Spectral Sparsification

Xingjue Jiang, Seok-Hee Hong, Amyra Meidiana, Xianyuan Zeng

详情
英文摘要

Edge bundling reduces the visual complexity of drawings of large and complex graphs by clustering "compatible" edges. However, it often introduces distortion by bundling "unrelated" edges, resulting in misleading, ambiguous drawings. Moreover, existing edge bundling methods often have high computational complexity. We present new edge bundling methods and faithfulness metrics for edge bundling using spectral sparsification, which sparsifies a graph G into a subgraph G' with O(n log n) edges, based on the effective resistance values of edges, preserving the spectrum of G, closely related to important structural properties of G, such as connectivity, clustering, and the commute distance. We first present a new edge bundling method SEB (Spectral Edge Bundling), introducing effective resistance-based compatibility for spectral-faithful bundling, improving distortion and ambiguity. Then, we present a general framework FEB (Fast Edge Bundling) utilizing spectral sparsification to improve the efficiency of existing bundling methods while maintaining a similar quality. We also present FBQ (Fast Bundling Quality) framework for proxy bundle faithfulness metrics, for measuring how FEB faithfully preserves the ground truth structure in the original edge bundling, with two variants, FBQ_JS (utilizing Jaccard Similarity) and FBQ_SQ (utilizing sampling quality metrics). Extensive experiments using various real-world and synthetic graphs demonstrate the effectiveness of SEB for edge bundling, outperforming state-of-art bundling methods on quality metrics, with 46% and 17% average improvement in distortion and ambiguity respectively for SEB2. Furthermore, experiments successfully demonstrate the efficiency of the FEB framework, with 61% runtime improvement over the original edge bundling methods without sparsification, while maintaining a similar quality, with 74% similarity based on FBQ_SQ.

2604.26992 2026-05-01 math.ST stat.ME stat.ML stat.TH

Adaptive Robust Confidence Intervals in Efron's Gaussian Two-Groups Model

Qiaosen Wang, Shuwen Chai, Chao Gao

详情
英文摘要

Robust uncertainty quantification is increasingly important in modern data analysis and is often formalized under Huber's model, which allows an $\varepsilon$-fraction of arbitrary corruptions. In many experimental sciences, however, the measurement protocol is well controlled, and contamination is more plausibly introduced upstream. Motivated by this noise-oblivious nature of adversaries, we study confidence intervals for the null location parameter $θ$ in Efron's Gaussian two-groups model, where an unknown fraction $\varepsilon$ of observations have arbitrarily shifted means, but all samples share the same law of additive Gaussian measurement noise with variance $σ^2$. We characterize the minimax-optimal length among confidence intervals with a prescribed coverage level uniformly over the unknown contamination proportion and all noise-oblivious adversaries. Although prior work has shown that the minimax point estimation rate of theta does not deteriorate when $\varepsilon$ becomes unknown, our results reveal that, with a given $σ^2$, the minimax-optimal length of confidence intervals that are adaptive to unknown $\varepsilon$ is of order $σ(n^{-1/4}+\varepsilon^{1/2}/\max\{1, \log(en \varepsilon^2)\}^{1/2})$, which is polynomially worse than the optimal length when $\varepsilon$ is known. When the variance $σ^2$ is also unknown, we show a further degradation: no adaptive robust confidence interval can be shorter than $Ω(σn^{-1/8})$. Algorithmically, we introduce a Fourier-based certification procedure built on Carathéodory's positive-semidefiniteness constraints. By scanning candidate points and accepting those whose residual characteristic function is certifiably consistent with a Gaussian location mixture, our algorithm attains the minimax lower bound in the known-variance setting and is computable in polynomial time.

2604.26990 2026-05-01 cs.SE

UCSC-NLP at SemEval-2026 Task 13: Multi-View Generalization and Diagnostic Analysis of Machine-Generated Code Detection

Kargi Chauhan, Sadiba Nusrat Nur

详情
英文摘要

With the rapid growth of large language models for code generation, distinguishing between human-written and AI-generated code has become increasingly critical for academic integrity, hiring evaluations, and software security. We present our system for SemEval-2026 Task 13: Multilingual Machine-Generated Code Detection, participating in Subtask A (binary detection) and Subtask B (multi-class attribution across 10 LLM families). For Subtask A, we fine-tune UniXcoder-base with a multi-view training framework that promotes generator-invariant representations. The framework combines domain-specific structural prefixes, delexicalization with symmetric KL consistency loss, token dropout, and mixed-content augmentation. Our system achieves 0.993 macro F1 on validation and 0.845 macro F1 on the test set, which spans unseen languages and domains. For Subtask B, we show that severe class imbalance (88.4% human code, 221:1 majority-to-minority ratio) causes catastrophic minority-class failure under standard fine-tuning, with macro F1 collapsing to 0.086 despite 88.4% accuracy. A class-weighted extension trained for 3 epochs recovers macro F1 to 0.345 (+301% relative), confirming that multi-class attribution requires imbalance-aware training strategies.

2604.26989 2026-05-01 math.CO

Subgroups of Finite Fields As Cap Sets

Anthony Kable, Melissa Mills, David J. Wright

详情
英文摘要

We show the subgroup of 20 nonzero fourth powers in the finite field of order 81 is a cap set. Similarly, the subgroup of 9 nonzero seventh powers in the field of order 64 is a cap set. These are the cases related to the card games of SET and EvenQuads, and both are known to be maximal cap sets. A corollary is that the cosets of these subgroups form a partition by maximal caps of the multiplicative groups of their respective fields. We identify certain multiplicative subgroups of fields of orders 243 and 729 as cap sets, and show in general that the subgroup of $(2^n-1)$-th powers is a cap set in the field of order $2^{2n}$.

2604.26987 2026-05-01 cond-mat.mtrl-sci

The effect of Van der Waals interaction on the microstructure of EPD deposits: a simulation study

Rémi Martin, Sandrine Duluard, Céline Merlet

详情
英文摘要

Electrophoretic deposition is a method of choice for generating coatings thanks to its ease of implementation and its ability to produce coatings of relatively large thicknesses in a single step process. While this process also benefits from a large number of tunable parameters to adapt the coating to each application (applied electric field, particle concentration, viscosity of the suspension, etc...), such a freedom can lead to the selection of parameters being an overwhelming task. A better fundamental understanding of the microscopic phenomena and mechanisms at play during deposition can provide clues for the more efficient design of optimized coatings. Particle-based models, allowing for the simulation of deposit microstructures for various process parameters, are particularly interesting to get insights in such systems. Nevertheless, such studies are rare and usually do not involve the possibility of self-cohesion between particles, while it seems crucial for the final structure of the deposit. Here, we use particle-based simulations to study the influence of aggregation on the deposit formed for different applied electric fields. We show that the self-cohesion indeed leads to different microstructures, both in the close vicinity of the substrate and in the bulk of the deposit, and relate this to the mechanical signature of the deposits. Our results reveal that at high electric field, the influence of self-cohesion on resulting microstructures essentially vanishes beyond a critical field strength. This marks the transition between a deposition regime affected by aggregation to a regime largely dominated by volume exclusion effects.

2604.26980 2026-05-01 quant-ph physics.app-ph

Naturally Resonant Emitters: Approaching Fundamental Antenna Limits

Damir Latypov

Comments Journal of Applied Physics, May 14 2026

详情
英文摘要

Antenna miniaturization remains a critical technological challenge across frequency scales - from microwave RF links in phones and wearables to VLF for underwater-to-air communications and ionospheric probing. At deeply subwavelength scales conventional antennas require complex and lossy matching circuits due to absent intrinsic material resonances, motivating resonant electrically small emitters (ESEs) like mechanical resonators and quantum emitters. Here, we extend the theory of electrically small antennas (ESAs) to this broader ESE class, deriving the fundamental efficiency limit for a unit volume emitter at given frequency and bandwidth. Our figure of merit (FOM) - quantifying proximity to this limit - enables direct comparison across ESE types, frequencies, bandwidths and scales. We demonstrate its utility using public data from ELF and VLF Navy facilities alongside two mechanical ESEs reported in literature. The measurements reveal that mechanical antennas operate near theoretical FOM limit, questioning claims of possible further orders-of-magnitude gains. A naturally resonant emitter is still subject to the Chu-Harrington limit (CHL) under its standard assumptions. Indeed, we derive novel CHL-dictated constraints on atomic ESE properties: lower bound on excited-state lifetime and an upper bound on transition dipole moment.

2604.26975 2026-05-01 q-bio.GN

T-cell repertoire response in individuals with post-acute sequelae of COVID-19

Zachary Montague, Rhea M Grover, Andrew Baumgartner, Assya Trofimov, Jennifer Hadlock, Armita Nourmohammad

详情
英文摘要

T-cells are central to SARS-CoV-2 clearance and immunological memory, yet their contribution to the persistence of post-acute sequelae of COVID-19 (PASC) remains poorly understood. The immunological features that distinguish individuals who develop PASC from those who recover fully are unresolved, in part due to the phenotypic heterogeneity of the condition and the likely multiplicity of its underlying mechanisms. Here, we profiled longitudinal bulk TCR$β$ repertoires from 120 individuals in the INCOV cohort--71 with PASC and 49 without--sampled at two to three time points spanning the acute and post-acute phases of infection. Using robust statistical modeling of repertoire composition and clonal dynamics, we found that global statistics such as V, J gene usage and CDR3 length do not differ between groups, but that locally enriched sequence motifs and differentially dynamic clones reveal distinct T-cell signatures associated with PASC status. Clones contracting following the peak of the acute response were significantly enriched for SARS-CoV-2 specificity in both groups. Interestingly, Influenza A-specific TCRs were disproportionately enriched among contracting clones in PASC{$^+$} repertoires, implicating viral co-infection as a potential contributor to early disease severity and, possibly, PASC pathogenesis. Rare public TCR clones were markedly enriched for SARS-CoV-2 specificity, with PASC{$^+$} individuals harboring a modestly but significantly higher proportion than PASC{$^-$} individuals. Together, we identified over 1,000 candidate TCR$β$ receptors potentially discriminating PASC{$^+$} from PASC{$^-$} immune responses, opening a path toward the identification of disease-relevant T-cell specificities and the development of T-cell-based immunological biomarkers for long COVID.

2604.26974 2026-05-01 cs.CR cs.ET

C8s: A Confidential Kubernetes Architecture

Amean Asad, Patrick McClurg, João Andrade

Comments 47 pages, 21 figures. Whitepaper from Confidential.ai

详情
英文摘要

This paper presents C8s, a confidential computing architecture for Kubernetes that provides cryptographically rooted confidentiality, integrity, and verifiability guarantees for Kubernetes clusters from infrastructure operators. These guarantees are cryptographically provable to any independent third party verifier. The architecture is built on hardware Trusted Execution Environments (TEEs), specifically AMD SEV-SNP, Intel TDX, and NVIDIA Confidential Computing support, to establish an attestation-rooted trust boundary around confidential VMs. This design is compatible with managed Kubernetes services such as Amazon EKS, Google GKE, and Microsoft AKS, where the control plane cannot be attested. Under this boundary, three groups gain guarantees that are absent from conventional deployments. Data and artifact owners can deploy sensitive workloads and proprietary artifacts on third-party infrastructure without risking exfiltration. Compute providers can offer execution services without revealing workloads to cloud operators. End users can submit requests that remain opaque to all parties except the attested TEE processing them. Representative workloads include AI inference, securing AI model weights, and training or fine-tuning on sensitive data.

2604.26971 2026-05-01 cs.IR

T2S-Metrics: Unified Library for Evaluating SPARQL Queries Generated From Natural Language

Yousouf Taghzouti, Tao Jiang, Camille Juigné, Benjamin Navet, Fabien Gandon, Franck Michel, Louis-Felix Nothias

详情
英文摘要

The evaluation of Question Answering (QA) systems over Knowledge Graphs has historically suffered from fragmentation, inconsistency, and limited reproducibility. While significant progress has been made in semantic parsing and SPARQL query generation, evaluation methodologies remain diverse, ad hoc, and often incomparable across studies. Existing benchmarks typically focus on a small subset of metrics, such as query exact match or answer-level F1, neglecting syntactic validity, semantic faithfulness, execution correctness, results ranking quality, and computational efficiency. In this paper, we present t2s-metrics, an open-source, extensible, and unified evaluation library designed specifically for SPARQL query comparison and execution-based assessment. t2s-metrics provides a broad and extensible set of over 20 evaluation metrics, collected from the literature and practical evaluation needs, spanning lexical, syntactic, semantic, structural, execution-based and ranking-based dimensions. These include query-based metrics such as token-level Precision, Recall, and F1; BLEU, ROUGE, METEOR, and CodeBLEU variants; variable-normalized metrics (SP-BLEU, SP-F1); graph-and URI-based exact match metrics; as well as answer set-based metrics such as F1-QALD and Jaccard similarity; ranking metrics including MRR, NDCG, P@k, and Hit@k; and LLM-as-a-Judge metrics. Taking inspiration from the ir-metrics library for Information Retrieval, t2s-metrics provides a modular abstraction layer that decouples metric specification from implementation, enabling consistent, transparent, and reproducible evaluation of SPARQLbased QA systems. We argue that t2s-metrics constitutes a necessary step toward systematic, standardized evaluation in question answering over knowledge graphs and facilitates deeper diagnostic insights into system behavior beyond answer correctness.

2604.26966 2026-05-01 cs.AR cs.NE physics.optics

Towards Topology-Aware Very Large-Scale Photonic AI Accelerators

Belal Jahannia, Abdolah Amirany, Hamed Dalir

Comments 15 pages, 7 figures

详情
英文摘要

The rapid growth of deep neural networks (DNNs) has exposed fundamental limitations in electronic accelerators, where data movement dominates energy consumption, commonly referred to as the memory wall. Photonic accelerators offer a compelling alternative due to their inherent parallelism and high-speed matrix operations. However, existing research largely focuses on device-level innovations, leaving system-level scalability insufficiently explored. In this paper, we present a scalable photonic accelerator architecture based on a modular scale-out paradigm using 4 X 4 photonic tensor core units. We perform a systematic architectural analysis that incorporates the practical scaling limits of photonic hardware, including insertion loss, fanout penalties, and laser power limits, which restrict monolithic photonic scaling. Through evaluation on representative DNN workloads (GoogleNet, ResNet-18, MobileNet, and AlphaGo Zero) with up to 1024 processing elements, we identify a topology-dominated scaling bottleneck in the photonic domain, termed the Utilization Wall, where performance is governed by grid topology rather than hardware size. We further establish the Symmetric Grid Rule, demonstrating that symmetric topologies improve utilization by up to 6X while reducing memory access by over 40% compared to linear configurations, which reveal that topology-aware scaling is essential for achieving energy-efficient and high-performance photonic AI accelerators.

2604.26953 2026-05-01 cs.IR cs.CY

A Randomized Controlled Trial and Pilot of Scout: an LLM-Based EHR Search and Synthesis Platform

Michael Gao, Suresh Balu, William Knechtle, Kartik Pejavara, William Jeck, Matthew Ellis, Jason Thieling, Blake Cameron, Jason Tatreau, Tareq Aljurf, Henry Foote, Michael Revoir, Marshall Nichols, Matthew Gardner, William Ratliff, Bradley Hintze, Angelo Milazzo, Sreekanth Vemulapalli

详情
英文摘要

Clinical documentation and data retrieval within Electronic Health Records (EHRs) contribute substantially to clinician workload and burnout. To address this, we developed Scout, an LLM-based EHR search and synthesis platform that enables clinicians to query EHR data using natural language. Each response includes citations linking each claim to the original data source, facilitating easy verification of generated content. We conducted a prospective randomized, evaluator-blinded crossover trial across seven clinical specialties (20 participants, 200 structured cases). Participants completed realistic clinical tasks using either Scout or the EHR alone, with outcomes including time to completion, NASA Task Load Index workload scores, and blinded expert adjudication of accuracy, completeness, and relevance. Scout reduced task completion time by 37.6% and significantly decreased perceived workload, with the largest reductions in mental demand, effort, and temporal demand. Non-inferiority analyses showed that tasks completed with Scout maintained accuracy, completeness, and relevance relative to tasks completed with the EHR-only. A concurrent pilot deployment across over 200 users and more than 20 specialties generated over 6,600 interactions in three months, revealing diverse clinical and administrative use cases. Automated evaluation using an LLM-as-judge framework identified errors at low rates. Subsequent manual review of a subset of outputs revealed that most claims flagged by the automated judge as errors were in fact supported by the patient chart, demonstrating the importance of human validation. These findings provide early trial-based evidence that LLM-powered EHR tools can meaningfully reduce clinical and administrative workloads while maintaining output quality.

2604.26902 2026-05-01 econ.TH

Many-to-many stable matching in large economies

Michael Greinecker, Karolina Vocke

详情
英文摘要

We study stability notions for networked many-to-many matching markets with individually insignificant agents in distributional form. Outcomes are formulated as joint distributions over characteristics of agents and contract choices. Characteristics can lie in an arbitrary Polish space. We provide a mechanical method for transferring existence results for finite matching models to large matching models for many stability notions. In particular, we show that tree-stable and pairwise-stable outcomes exist.

2604.26817 2026-05-01 hep-ph hep-ex

When Two Loops Matter: Electroweak Precision in the SMEFT

Lukas Born, Admir Greljo, Anders Eller Thomsen

Comments 12 pages, 4 figures, 1 table

详情
英文摘要

We identify a novel next-to-leading order renormalization effect in the dimension-six SMEFT with direct phenomenological impact. The Higgs-Yukawa operator that modifies the top-Higgs coupling $κ_t$ induces a shift in the $ W $ mass at two-loop order through a large anomalous dimension, rendering electroweak precision observables a powerful indirect probe of $κ_t$. We show that this effect is essential for the consistent interpretation of data from future Tera-$Z$ and Giga-$W$ factories such as FCC-ee. The effect is realized in a simple renormalizable two-Higgs doublet model.

2604.26623 2026-05-01 math.FA

The Riemann integral on Dedekind complete $f$-algebras

Eder Kikianty, Luan Naude, Mark Roelands, Christopher Schwanke

详情
英文摘要

In this paper we develop a theory of integration for locally band preserving functions, introduced by Ercan and Wickstead, on Dedekind complete $f$-algebras. Specifically, we construct Darboux and Riemann integrals and show that they are equal. We then connect the theory of integrable functions to the theory of order differentiable functions, introduced by the third and fourth authors, by proving a Fundamental Theorem of Calculus. Furthermore, we show that a Mean Value Theorem for Integrals holds and that we can integrate by parts and substitutions.

2604.26602 2026-05-01 astro-ph.GA astro-ph.SR

The intrinsic dispersion of elemental abundance ratios in nearby metal-poor halo stars

Poul Erik Nissen, Anish Amarsi

Comments Accepted for publication in A&A, 15 pages, 12 figures

详情
英文摘要

Differential abundances of C, O, Mg, Al, Ca, Sc, Ti, V, Cr, Mn, Fe, Co, Ni, Zn, Y, and Zr were determined from high signal-to-noise VLT/UVES spectra for 25 turnoff stars with -2.4 < [Fe/H] < -1.3. Effective temperatures were obtained from profiles of the H_beta line and surface gravities via Gaia parallaxes. The analysis of the spectra were based on 1D model atmospheres assuming LTE, but 3D non-LTE corrections were applied for several elements. The dispersion in linear fits to the [X/Fe]-[Fe/H] relations is around a factor of two smaller than found in previous studies. After corrections for measurement errors, the 1-sigma intrinsic dispersion of [X/Fe] at a given metallicity is 0.09 dex for Y and Zr, 0.05-0.07 dex for C, O, and Al, 0.03-0.05 dex for Mg, Ca, Sc, Ti, V, Mn, and Zn, and <0.03 dex for Cr, Co, and Ni. Strong correlations between the residuals in the [X/Fe]-[Fe/H] fits are found for the alpha-capture elements Mg, Al, Ca, Sc, and Ti and between the residuals for Y and Zr. Correlations of the residuals in the [X/Fe]-[Fe/H] fits with effective temperature can be explained as due to differential atomic diffusion between elements, but its contribution to the scatter of [X/Fe] is of minor importance. Probably, both stochastic effects in sampling the IMF of CCSNe and differences in the TypeIa to CCSNe enrichment ratio between star-forming regions need to be considered in order to explain the intrinsic dispersion of [X/Fe].

2604.26534 2026-05-01 quant-ph

Neural and Tensor Networks in the Study of Quantum Annealing Processors

Tomasz Śmierzchalski

详情
英文摘要

Quantum annealing targets low-energy solutions of Ising/QUBO problems, but reliable assessment requires more than best-energy comparisons. This dissertation develops a benchmarking framework for D-Wave quantum annealers that combines strong classical baselines, sampling and diversity metrics, and thermodynamic cost. Its first contribution, SpinGlassPEPS$.$jl, is a topology-aware tensor-network heuristic for optimization and sampling on Pegasus/Zephyr-like graphs. It maps Ising instances to local Potts clusters, represents the partition function with PEPS, and performs branch-and-bound search in probability space. Benchmarks show that it is a physically interpretable reference solver, though approximate contractions limit its competitiveness on the largest instances. The second contribution treats quantum annealers as effective thermal machines, relating success probability and solution quality to dissipation, entropy production, and effective temperature. Carefully placed pauses can improve performance while reducing thermodynamic cost, although longitudinal fields may become harmful in paused schedules. The thesis also introduces reinforcement-learning post-processing to improve returned samples and exact small-system simulations to probe annealing dynamics. Overall, it argues for quantum-annealing benchmarks that jointly measure algorithmic performance and physical expenditure.

2604.26525 2026-05-01 cs.CR

PRAG: End-to-End Privacy-Preserving Retrieval-Augmented Generation

Zhijun Li, Minghui Xu, Huayi Qi, Wenxuan Yu, Tingchuang Zhang, Qiao Zhang, GuangYong Shang, Zhen Ma, Xiuzhen Cheng

Comments 16 pages,6 figures, journal

详情
英文摘要

Retrieval-Augmented Generation (RAG) is essential for enhancing Large Language Models (LLMs) with external knowledge, but its reliance on cloud environments exposes sensitive data to privacy risks. Existing privacy-preserving solutions often sacrifice retrieval quality due to noise injection or only provide partial encryption. We propose PRAG, an end-to-end privacy-preserving RAG system that achieves end-to-end confidentiality for both documents and queries without sacrificing the scalability of cloud-hosted RAG. PRAG features a dual-mode architecture: a non-interactive PRAG-I utilizes homomorphic-friendly approximations for low-latency retrieval, while an interactive PRAG-II leverages client assistance to match the accuracy of non-private RAG. To ensure robust semantic ordering, we introduce Operation-Error Estimation (OEE), a mechanism that stabilizes ranking against homomorphic noise. Experiments on large-scale datasets demonstrate that PRAG achieves competitive recall (72.45%-74.45%), practical retrieval latency, and strong resilience against graph reconstruction attacks while maintaining end-to-end confidentiality. This work confirms the feasibility of secure, high-performance RAG at scale.

2604.26469 2026-05-01 cs.SE

An Empirical Study of Speculative Decoding on Software Engineering Tasks

Yijia Li, Junkai Chen, Xing Hu, Xin Xia

详情
英文摘要

Large Language Models (LLMs) have become widely used for Software Engineering (SE) tasks, spanning from function-level code generation to complex repository-level workflows. However, the high latency of autoregressive inference remains a significant bottleneck, hindering their deployment in interactive environments. While Speculative Decoding (SD) offers a promising technique for lossless acceleration, prior research on long-context repository-level tasks and complex agentic interactions remains limited. To bridge this gap, we present the first systematic empirical study to evaluate the effectiveness of SD in SE tasks. We systematically benchmark a comprehensive spectrum of strategies, encompassing both model-based and model-free methods, across representative generation, editing, and repair scenarios. Our empirical results indicate that SD demonstrates clear potential for accelerating inference, particularly for smaller models that achieve higher speedups than those of their larger counterparts. We find that the effectiveness of SD methods varies across different task scenarios. Model-based approaches are well-suited for code generation, whereas model-free methods are better adapted to repository-level repair and editing scenarios. Furthermore, we observe that the repetitiveness of SE tasks improves the performance of model-free methods. In contrast to natural language tasks, the higher predictability of SE tasks allows for more aggressive hyperparameters. Our findings are summarized as guidelines to help increase inference efficiency for SE scenarios.

2604.26448 2026-05-01 math.MG

The Hausdorff dimension of sets containing circles in many directions

Antonio Córdoba

详情
英文摘要

Let us consider a sphere $S^{n-1}$ of radius $r$ in $\mathbb{R}^n$, where we have fixed poles $N$ and $S$. Suppose that $K$ is a set in $\mathbb{R}^n$ containing a translated copy of each meridian (that is an $S^{n-2}$-sphere) of $S^{n-1}$. Then the Hausdorff dimension of $K$ must be bigger than or equal to $n-1$.

2604.26327 2026-05-01 eess.AS

Dual-LoRA: Parameter-Efficient Adversarial Disentanglement for Cross-Lingual Speaker Verification

Qituan Shangguan, Junhao Du, Kunyang Peng, Feng Xue, Hui Zhang, Xinsheng Wang, Kai Yu, Shuai Wang

Comments Submitted to Interspeech 2026; 5 pages

详情
英文摘要

Cross-lingual speaker verification suffers from severe language-speaker entanglement. This causes systematic degradation in the hardest scenario: correctly accepting utterances from the same speaker across different languages while rejecting those from different speakers sharing the same language. Standard adversarial disentanglement degrades speaker discriminability; blind discriminators inadvertently penalize speaker-discriminative traits that merely correlate with language. To address this, we propose Dual-LoRA, injecting trainable task-factorized LoRA adapters into a frozen pre-trained backbone. Our core innovation is a Language-Anchored Adversary: by grounding the discriminator with an explicit language branch, adversarial gradients target true linguistic cues rather than arbitrary correlations, preserving essential speaker characteristics. Evaluated on the TidyVoice benchmark, our system achieves a 0.91% validation EER and achieves 3rd place in the official challenge.

2604.26314 2026-05-01 quant-ph physics.chem-ph

Amplitude Encoding of Slater-Type Orbitals via Matrix Product States: Efficient State Preparation and Integral Evaluation on Quantum Hardware

Sorin Bolos

详情
英文摘要

Slater-type orbitals (STOs) provide the physically correct description of atomic wavefunctions but have been largely replaced by Gaussian-type orbitals in computational chemistry due to the lack of closed-form multi-center integrals. We present a systematic study of amplitude encoding of STOs on quantum computers using matrix product states (MPS). For one-dimensional orbital functions of the form $p_d(x) e^{-ζx}$, we derive analytical MPS constructions with constant bond dimension $χ= d + 1$, requiring $O(n)$ classical and quantum resources for $n$-qubit registers with no grid sampling. We demonstrate a complete one-electron integral pipeline -- overlap, kinetic energy, and nuclear attraction -- in one dimension, validating the overlap and kinetic energy on IBM Heron processors at 5~qubits with 0.67\% hardware-induced error using Zero-Noise Extrapolation. In three dimensions, we compute multi-center overlap integrals between 1s and 2s orbitals in Cartesian coordinates with 0.02\% discretization error at 18~qubits. A systematic entanglement analysis reveals that the MPS bond dimension of three-dimensional STOs in Cartesian coordinates saturates with increasing grid resolution -- reaching $\sim$138 for the hydrogen 1s orbital at 12~qubits per coordinate -- establishing bounded encoding complexity rather than the exponential scaling initially expected. The SVD truncation threshold provides a practical resource parameter, reducing the bond dimension to 39 at threshold $10^{-6}$ with negligible accuracy loss. These results map the entanglement landscape for amplitude encoding of atomic orbitals and establish MPS-based state preparation as a viable path toward exact STO basis sets on quantum computers.