arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2393
2602.13210 2026-02-17 cs.NI cs.AI cs.LG

Large Language Model (LLM)-enabled Reinforcement Learning for Wireless Network Optimization

Jie Zheng, Ruichen Zhang, Dusit Niyato, Haijun Zhang, Jiacheng Wang, Hongyang Du, Jiawen Kang, Zehui Xiong

详情
英文摘要

Enhancing future wireless networks presents a significant challenge for networking systems due to diverse user demands and the emergence of 6G technology. While reinforcement learning (RL) is a powerful framework, it often encounters difficulties with high-dimensional state spaces and complex environments, leading to substantial computational demands, distributed intelligence, and potentially inconsistent outcomes. Large language models (LLMs), with their extensive pretrained knowledge and advanced reasoning capabilities, offer promising tools to enhance RL in optimizing 6G wireless networks. We explore RL models augmented by LLMs, emphasizing their roles and the potential benefits of their synergy in wireless network optimization. We then examine LLM-enabled RL across various protocol layers: physical, data link, network, transport, and application layers. Additionally, we propose an LLM-assisted state representation and semantic extraction to enhance the multi-agent reinforcement learning (MARL) framework. This approach is applied to service migration and request routing, as well as topology graph generation in unmanned aerial vehicle (UAV)-satellite networks. Through case studies, we demonstrate that our framework effectively performs optimization of wireless network. Finally, we outline prospective research directions for LLM-enabled RL in wireless network optimization.

2602.13207 2026-02-17 cs.NI cs.AI

A Safety-Constrained Reinforcement Learning Framework for Reliable Wireless Autonomy

Abdikarim Mohamed Ibrahim, Rosdiadee Nordin

详情
英文摘要

Artificial intelligence (AI) and reinforcement learning (RL) have shown significant promise in wireless systems, enabling dynamic spectrum allocation, traffic management, and large-scale Internet of Things (IoT) coordination. However, their deployment in mission-critical applications introduces the risk of unsafe emergent behaviors, such as UAV collisions, denial-of-service events, or instability in vehicular networks. Existing safety mechanisms are predominantly reactive, relying on anomaly detection or fallback controllers that intervene only after unsafe actions occur, which cannot guarantee reliability in ultra-reliable low-latency communication (URLLC) settings. In this work, we propose a proactive safety-constrained RL framework that integrates proof-carrying control (PCC) with empowerment-budgeted (EB) enforcement. Each agent action is verified through lightweight mathematical certificates to ensure compliance with interference constraints, while empowerment budgets regulate the frequency of safety overrides to balance safety and autonomy. We implement this framework on a wireless uplink scheduling task using Proximal Policy Optimization (PPO). Simulation results demonstrate that the proposed PCC+EB controller eliminates unsafe transmissions while preserving system throughput and predictable autonomy. Compared with unconstrained and reactive baselines, our method achieves provable safety guarantees with minimal performance degradation. These results highlight the potential of proactive safety constrained RL to enable trustworthy wireless autonomy in future 6G networks.

2602.13204 2026-02-17 cs.NI cs.AI

Hybrid Secure Routing in Mobile Ad-hoc Networks (MANETSs)

Soundes Oumaima Boufaida, Abdemadjid Benmachiche, Majda Maatallah, Chaouki Chemam

详情
英文摘要

Because wireless communication is dynamic and has inherent defects, routing algorithms are crucial in the quickly evolving field of mobile ad hoc networks, or MANETs This study looks at the many security problems that MANETs encounter. These problems, which pose major risks to network performance, include flooding, sinkholes, and black hole assaults to address these challenges. We introduce the Hybrid Secure Routing Protocol (HSRP), which enhances the security and robustness of routing operations by fusing trust-based tactics with cryptographic approaches. HSRP combines the strengths of both proactive and reactive routing strategies, enabling it to adapt dynamically to evolving network conditions while protecting against malicious activities. We use extensive simulations with Network Simulator (NS-2) and a thorough review of the literature to assess HSRP's performance under different attack scenarios. The results show that, in comparison to traditional protocols, HSRP increases throughput and decreases latency, hence improving routing efficiency while simultaneously bolstering data transfer security. With uses in vital domains including military operations and disaster response, this study provides a scalable and workable approach for safe routing in MANETs. The findings highlight how crucial it is to include cutting-edge security features in routing protocol design to guarantee the dependability and integrity of MANETs in practical situations.

2602.13200 2026-02-17 cs.NI cs.AI

Traffic Simulation in Ad Hoc Network of Flying UAVs with Generative AI Adaptation

Andrii Grekhov, Volodymyr Kharchenko, Vasyl Kondratiuk

Comments 15 pages, 10 figures

详情
英文摘要

The purpose of this paper is to model traffic in Ad Hoc network of Unmanned Aerial Vehicles and demonstrate a way for adapting communication channel using Artificial Intelligence. The modeling was based on the original model of Ad Hoc network including 20 Unmanned Aerial Vehicles. The dependences of packet loss on the packet size for different transmission powers, on the packet size for different frequencies, on Unmanned Aerial Vehicles flight area and on the number of Unmanned Aerial Vehicles were obtained and analyzed. The implementation of adaptive data transmission is presented in the program code. The dependences of packet loss, power and transaction size on time during Artificial Intelligence adaptation are shown.

2602.13199 2026-02-17 cs.NI cs.AI

Simulation-Based Study of AI-Assisted Channel Adaptation in UAV-Enabled Cellular Networks

Andrii Grekhov, Volodymyr Kharchenko, Vasyl Kondratiuk

Comments 13 pages, 8 figures

详情
英文摘要

This paper presents a simulation based study of Artificial Intelligence assisted communication channel adaptation in Unmanned Aerial Vehicle enabled cellular networks. The considered system model includes communication channel Ground Base Station Aerial Repeater UAV Base Station Cluster of Cellular Network Users. The primary objective of the study is to investigate the impact of adaptive channel parameter control on communication performance under dynamically changing interference conditions. A lightweight supervised machine learning approach based on linear regression is employed to implement cognitive channel adaptation. The AI model operates on packet level performance indicators and enables real time adjustment of Transaction Size in response to variations in Bit Error Rate and effective Data Rate. A custom simulation environment is developed to generate training and testing datasets and to evaluate system behavior under both static and adaptive channel configurations.

2602.07107 2026-02-17 cs.CR cs.AI

ShallowJail: Steering Jailbreaks against Large Language Models

Shang Liu, Hanyu Pei, Zeyan Liu

详情
英文摘要

Large Language Models(LLMs) have been successful in numerous fields. Alignment has usually been applied to prevent them from harmful purposes. However, aligned LLMs remain vulnerable to jailbreak attacks that deliberately mislead them into producing harmful outputs. Existing jailbreaks are either black-box, using carefully crafted, unstealthy prompts, or white-box, requiring resource-intensive computation. In light of these challenges, we introduce ShallowJail, a novel attack that exploits shallow alignment in LLMs. ShallowJail can misguide LLMs' responses by manipulating the initial tokens during inference. Through extensive experiments, we demonstrate the effectiveness of ShallowJail, which substantially degrades the safety of state-of-the-art LLM responses. Our code is available at https://github.com/liuup/ShallowJail.

2601.20336 2026-02-17 q-fin.CP cs.LG

Do Whitepaper Claims Predict Market Behavior? Evidence from Cryptocurrency Factor Analysis

Murad Farzulla

Comments 38 pages, 9 figures, 14 tables. JEL: G14, G12, C38, C45. Code available at https://github.com/studiofarzulla/tensor-defi

详情
英文摘要

This study investigates whether cryptocurrency whitepaper narratives align with empirically observed market factor structure. We construct a pipeline combining zero-shot NLP classification of 38 whitepapers across 10 semantic categories with CP tensor decomposition of hourly market data (49 assets, 17,543 timestamps). Using Procrustes rotation and Tucker's congruence coefficient (phi), we find weak alignment between claims and market statistics (phi = 0.246, p = 0.339) and between claims and latent factors (phi = 0.058, p = 0.751). A methodological validation comparison (statistics versus factors, both derived from market data) achieves significance (p < 0.001), confirming the pipeline detects real structure. The null result indicates whitepaper narratives do not meaningfully predict market factor structure, with implications for narrative economics and investor decision-making. Entity-level analysis reveals specialized tokens (XMR, CRV, YFI) show stronger narrative-market correspondence than broad infrastructure tokens.

2601.15518 2026-02-17 cs.IR cs.CL cs.LG

DS@GT at TREC TOT 2025: Bridging Vague Recollection with Fusion Retrieval and Learned Reranking

Wenxin Zhou, Ritesh Mehta, Anthony Miyaguchi

Comments Paper submitted to TREC 2025 (34th Text REtrieval Conference)

详情
英文摘要

We develop a two-stage retrieval system that combines multiple complementary retrieval methods with a learned reranker and LLM-based reranking, to address the TREC Tip-of-the-Tongue (ToT) task. In the first stage, we employ hybrid retrieval that merges LLM-based retrieval, sparse (BM25), and dense (BGE-M3) retrieval methods. We also introduce topic-aware multi-index dense retrieval that partitions the Wikipedia corpus into 24 topical domains. In the second stage, we evaluate both a trained LambdaMART reranker and LLM-based reranking. To support model training, we generate 5000 synthetic ToT queries using LLMs. Our best system achieves recall of 0.66 and NDCG@1000 of 0.41 on the test set by combining hybrid retrieval with Gemini-2.5-flash reranking, demonstrating the effectiveness of fusion retrieval.

2601.12522 2026-02-17 cs.SE cs.AI cs.IR cs.LG cs.MA

Improved Bug Localization with AI Agents Leveraging Hypothesis and Dynamic Cognition

Asif Mohammed Samir, Mohammad Masudur Rahman

Comments 13 pages, 7 tables, 5 figures

详情
英文摘要

Software bugs cost technology providers (e.g., AT&T) billions annually and cause developers to spend roughly 50% of their time on bug resolution. Traditional methods for bug localization often analyze the suspiciousness of code components (e.g., methods, documents) in isolation, overlooking their connections with other components in the codebase. Recent advances in Large Language Models (LLMs) and agentic AI techniques have shown strong potential for code understanding, but still lack causal reasoning during code exploration and struggle to manage growing context effectively, limiting their capability. In this paper, we present a novel agentic technique for bug localization -- CogniGent -- that overcomes the limitations above by leveraging multiple AI agents capable of causal reasoning, call-graph-based root cause analysis and context engineering. It emulates developers-inspired debugging practices (a.k.a., dynamic cognitive debugging) and conducts hypothesis testing to support bug localization. We evaluate CogniGent on a curated dataset of 591 bug reports using three widely adopted performance metrics and compare it against six established baselines from the literature. Experimental results show that our technique consistently outperformed existing traditional and LLM-based techniques, achieving MAP improvements of 23.33-38.57% at the document and method levels. Similar gains were observed in MRR, with increases of 25.14-53.74% at both granularity levels. Statistical significance tests also confirm the superiority of our technique. By addressing the reasoning, dependency, and context limitations, CogniGent advances the state of bug localization, bridging human-like cognition with agentic automation for improved performance.

2509.05311 2026-02-17 cs.CR cs.AI cs.LG

Large Language Model Integration with Reinforcement Learning to Augment Decision-Making in Autonomous Cyber Operations

Konur Tholl, François Rivest, Mariam El Mezouar, Adrian Taylor, Ranwa Al Mallah

详情
英文摘要

Reinforcement Learning (RL) has shown great potential for autonomous decision-making in the cybersecurity domain, enabling agents to learn through direct environment interaction. However, RL agents in Autonomous Cyber Operations (ACO) typically learn from scratch, requiring them to execute undesirable actions to learn their consequences. In this study, we integrate external knowledge in the form of a Large Language Model (LLM) pretrained on cybersecurity data that our RL agent can directly leverage to make informed decisions. By guiding initial training with an LLM, we improve baseline performance and reduce the need for exploratory actions with obviously negative outcomes. We evaluate our LLM-integrated approach in a simulated cybersecurity environment, and demonstrate that our guided agent achieves over 2x higher rewards during early training and converges to a favorable policy approximately 4,500 episodes faster than the baseline.

2508.21285 2026-02-17 q-fin.GN cs.AI cs.CE econ.GN q-fin.EC

A Financial Brain Scan of the LLM

Hui Chen, Antoine Didisheim, Mohammad, Pourmohammadi, Luciano Somoza, Hanqing Tian

Comments 47 pages

详情
英文摘要

Emerging techniques in computer science make it possible to "brain scan" large language models (LLMs), identify the plain-English concepts that guide their reasoning, and steer them while holding other factors constant. We show that this approach can map LLM-generated economic forecasts to concepts such as sentiment, technical analysis, and timing, and compute their relative importance without reducing performance. We also show that models can be steered to be more or less risk-averse, optimistic, or pessimistic, which allows researchers to correct or simulate biases. The method is transparent, lightweight, and replicable for empirical research in the social sciences.

2508.19278 2026-02-17 cs.CR cs.AI cs.LG

Towards Production-Worthy Simulation for Autonomous Cyber Operations

Konur Tholl, Mariam El Mezouar, Adrian Taylor, Ranwa Al Mallah

详情
英文摘要

Simulated environments have proven invaluable in Autonomous Cyber Operations (ACO) where Reinforcement Learning (RL) agents can be trained without the computational overhead of emulation. These environments must accurately represent cybersecurity scenarios while producing the necessary signals to support RL training. In this study, we present a framework where we first extend CybORG's Cage Challenge 2 environment by implementing three new actions: Patch, Isolate, and Unisolate, to better represent the capabilities available to human operators in real-world settings. We then propose a design for agent development where we modify the reward signals and the agent's feature space to enhance training performance. To validate these modifications, we train DQN and PPO agents in the updated environment. Our study demonstrates that CybORG can be extended with additional realistic functionality, while maintaining its ability to generate informative training signals for RL agents.

2507.14186 2026-02-17 cs.NI cs.AI cs.LG eess.SP

A Disentangled Representation Learning Framework for Low-altitude Network Coverage Prediction

Xiaojie Li, Zhijie Cai, Nan Qi, Chao Dong, Guangxu Zhu, Haixia Ma, Qihui Wu, Shi Jin

Comments This paper has been accepted for publication in IEEE Transactions on Mobile Computing

Journal ref IEEE Transactions on Mobile Computing, early access, 2025

详情
英文摘要

The expansion of the low-altitude economy has underscored the significance of Low-Altitude Network Coverage (LANC) prediction for designing aerial corridors. While accurate LANC forecasting hinges on the antenna beam patterns of Base Stations (BSs), these patterns are typically proprietary and not readily accessible. Operational parameters of BSs, which inherently contain beam information, offer an opportunity for data-driven low-altitude coverage prediction. However, collecting extensive low-altitude road test data is cost-prohibitive, often yielding only sparse samples per BS. This scarcity results in two primary challenges: imbalanced feature sampling due to limited variability in high-dimensional operational parameters against the backdrop of substantial changes in low-dimensional sampling locations, and diminished generalizability stemming from insufficient data samples. To overcome these obstacles, we introduce a dual strategy comprising expert knowledge-based feature compression and disentangled representation learning. The former reduces feature space complexity by leveraging communications expertise, while the latter enhances model generalizability through the integration of propagation models and distinct subnetworks that capture and aggregate the semantic representations of latent features. Experimental evaluation confirms the efficacy of our framework, yielding a 7% reduction in error compared to the best baseline algorithm. Real-network validations further attest to its reliability, achieving practical prediction accuracy with MAE errors at the 5dB level.

2506.02634 2026-02-17 cs.DC cs.AI

KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider

Jiahao Wang, Jinbo Han, Xingda Wei, Sijie Shen, Dingyan Zhang, Chenguang Fang, Rong Chen, Wenyuan Yu, Haibo Chen

Comments Accepted by USENIX ATC'25

详情
英文摘要

Serving large language models (LLMs) is important for cloud providers, and caching intermediate results (KV\$) after processing each request substantially improves serving throughput and latency. However, there is limited understanding of how LLM serving benefits from KV\$ caching, where system design decisions like cache eviction policies are highly workload-dependent. In this paper, we present the first systematic characterization of the KV\$ workload patterns from one of the leading LLM service providers. We draw observations that were not covered by previous studies focusing on synthetic workloads, including: KV\$ reuses are skewed across requests, where reuses between single-turn requests are equally important as multi-turn requests; the reuse time and probability are diverse considering all requests, but for a specific request category, the pattern tends to be predictable; and the overall cache size required for an ideal cache hit ratio is moderate. Based on the characterization, we further propose a workload-aware cache eviction policy that improves the serving performance under real-world traces, especially with limited cache capacity.

2504.21205 2026-02-17 cs.CR cs.AI

SecRepoBench: Benchmarking Code Agents for Secure Code Completion in Real-World Repositories

Chihao Shen, Connor Dilgren, Purva Chiniya, Luke Griffith, Yu Ding, Yizheng Chen

详情
英文摘要

This paper introduces SecRepoBench, a benchmark to evaluate code agents on secure code completion in real-world repositories. SecRepoBench has 318 code completion tasks in 27 C/C++ repositories, covering 15 CWEs. We evaluate 29 standalone LLMs and 15 code agents across 3 state-of-the-art agent frameworks using our benchmark. We find that state-of-the-art LLMs struggle with generating correct and secure code completions. However, code agents significantly outperform standalone LLMs. We show that SecRepoBench is more difficult than the prior state-of-the-art benchmark. Finally, our comprehensive analysis provides insights into potential directions for enhancing the ability of code agents to write correct and secure code in real-world repositories.

2504.09733 2026-02-17 cs.CG cs.LG cs.NA math.NA

Epsilon-Neighborhood Decision-Boundary Governed Estimation (EDGE) of 2D Black Box Classifier Functions

Mithun Goutham, Riccardo DalferroNucci, Stephanie Stockar, Meghna Menon, Sneha Nayak, Harshad Zade, Chetan Patel, Mario Santillo

详情
英文摘要

Accurately estimating decision boundaries in black box systems is critical when ensuring safety, quality, and feasibility in real-world applications. However, existing methods iteratively refine boundary estimates by sampling in regions of uncertainty, without providing guarantees on the closeness to the decision boundary and also result in unnecessary exploration that is especially disadvantageous when evaluations are costly. This paper presents $\varepsilon$-Neighborhood Decision-Boundary Governed Estimation (EDGE), a sample efficient and function-agnostic algorithm that leverages the intermediate value theorem to estimate the location of the decision boundary of a black box binary classifier within a user-specified $\varepsilon$-neighborhood. To demonstrate applicability, a case study is presented of an electric grid stability problem with uncertain renewable power injection. Evaluations are conducted on three test functions, where it is seen that the EDGE algorithm demonstrates superior sample efficiency and better boundary approximation than adaptive sampling techniques and grid-based searches.

2502.12581 2026-02-17 stat.ML cs.AI cs.LG

The Majority Vote Paradigm Shift: When Popular Meets Optimal

Antonio Purificato, Maria Sofia Bucarelli, Anil Kumar Nelakanti, Andrea Bacciu, Fabrizio Silvestri, Amin Mantrach

Comments 33 pages, 7 figures

详情
英文摘要

Reliably labelling data typically requires annotations from multiple human workers. However, humans are far from being perfect. Hence, it is a common practice to aggregate labels gathered from multiple annotators to make a more confident estimate of the true label. Among many aggregation methods, the simple and well known Majority Vote (MV) selects the class label polling the highest number of votes. However, despite its importance, the optimality of MV's label aggregation has not been extensively studied. We address this gap in our work by characterising the conditions under which MV achieves the theoretically optimal lower bound on label estimation error. Our results capture the tolerable limits on annotation noise under which MV can optimally recover labels for a given class distribution. This certificate of optimality provides a more principled approach to model selection for label aggregation as an alternative to otherwise inefficient practices that sometimes include higher experts, gold labels, etc., that are all marred by the same human uncertainty despite huge time and monetary costs. Experiments on both synthetic and real world data corroborate our theoretical findings.

2501.17168 2026-02-17 cs.NE cs.AI

Enabling Population-Level Parallelism in Tree-Based Genetic Programming for GPU Acceleration

Zhihong Wu, Lishuang Wang, Kebin Sun, Zhuozhao Li, Ran Cheng

Comments Accepted by IEEE TEVC

详情
英文摘要

Tree-based Genetic Programming (TGP) is a widely used evolutionary algorithm for tasks such as symbolic regression, classification, and robotic control. Due to the intensive computational demands of running TGP, GPU acceleration is crucial for achieving scalable performance. However, efficient GPU-based execution of TGP remains challenging, primarily due to three core issues: (1) the structural heterogeneity of program individuals, (2) the complexity of integrating multiple levels of parallelism, and (3) the incompatibility between high-performance CUDA execution and flexible Python-based environments. To address these issues, we propose EvoGP, a high-performance framework tailored for GPU acceleration of TGP via population-level parallel execution. First, EvoGP introduces a tensorized representation that encodes variable-sized trees into fixed-shape, memory-aligned arrays, enabling uniform memory access and parallel computation across diverse individuals. Second, EvoGP adopts an adaptive parallelism strategy that dynamically combines intra- and inter-individual parallelism based on dataset size, ensuring high GPU utilization across a broad spectrum of tasks. Third, EvoGP embeds custom CUDA kernels into the PyTorch runtime, achieving seamless integration with Python-based environments such as Gym, MuJoCo, Brax, and Genesis. Experimental results demonstrate that EvoGP achieves a peak throughput exceeding $10^{11}$ GPops/s. Specifically, this performance represents a speedup of up to $304\times$ over existing GPU-based TGP implementations and $18\times$ over state-of-the-art CPU-based libraries. Furthermore, EvoGP maintains comparable accuracy and exhibits improved scalability across large population sizes. EvoGP is open source and accessible at: https://github.com/EMI-Group/evogp.

2501.01696 2026-02-17 stat.ML cs.IT cs.LG math.IT

Guaranteed Nonconvex Low-Rank Tensor Estimation via Scaled Gradient Descent

Tong Wu

Comments This paper has been accepted for publication in the Journal of Machine Learning Research

详情
英文摘要

Tensors, which give a faithful and effective representation to deliver the intrinsic structure of multi-dimensional data, play a crucial role in an increasing number of signal processing and machine learning problems. However, tensor data are often accompanied by arbitrary signal corruptions, including missing entries and sparse noise. A fundamental challenge is to reliably extract the meaningful information from corrupted tensor data in a statistically and computationally efficient manner. This paper develops a scaled gradient descent (ScaledGD) algorithm to directly estimate the tensor factors with tailored spectral initializations under the tensor-tensor product (t-product) and tensor singular value decomposition (t-SVD) framework. With tailored variants for tensor robust principal component analysis, (robust) tensor completion and tensor regression, we theoretically show that ScaledGD achieves linear convergence at a constant rate that is independent of the condition number of the ground truth low-rank tensor, while maintaining the low per-iteration cost of gradient descent. To the best of our knowledge, ScaledGD is the first algorithm that provably has such properties for low-rank tensor estimation with the t-SVD. Finally, numerical examples are provided to demonstrate the efficacy of ScaledGD in accelerating the convergence rate of ill-conditioned low-rank tensor estimation in a number of applications.

2411.12748 2026-02-17 q-fin.TR cs.LG

FinBERT-BiLSTM: A Deep Learning Model for Predicting Volatile Cryptocurrency Market Prices Using Market Sentiment Dynamics

Mabsur Fatin Bin Hossain, Lubna Zahan Lamia, Md Mahmudur Rahman, Md Mosaddek Khan

详情
英文摘要

Time series forecasting is a key tool in financial markets, helping to predict asset prices and guide investment decisions. In highly volatile markets, such as cryptocurrencies like Bitcoin (BTC) and Ethereum (ETH), forecasting becomes more difficult due to extreme price fluctuations driven by market sentiment, technological changes, and regulatory shifts. Traditionally, forecasting relied on statistical methods, but as markets became more complex, deep learning models like LSTM, Bi-LSTM, and the newer FinBERT-LSTM emerged to capture intricate patterns. Building upon recent advancements and addressing the volatility inherent in cryptocurrency markets, we propose a hybrid model that combines Bidirectional Long Short-Term Memory (Bi-LSTM) networks with FinBERT to enhance forecasting accuracy for these assets. This approach fills a key gap in forecasting volatile financial markets by blending advanced time series models with sentiment analysis, offering valuable insights for investors and analysts navigating unpredictable markets.

2411.01629 2026-02-17 stat.ML cs.LG math.OC math.ST stat.TH

Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity

Tengyuan Liang, Kulunu Dharmakeerthi, Takuya Koriyama

Comments 30 pages, 11 figures

Journal ref Transactions on Machine Learning Research, 2026

详情
英文摘要

Adding noise is easy; what about denoising? Diffusion is easy; what about reverting a diffusion? Diffusion-based generative models aim to denoise a Langevin diffusion chain, moving from a log-concave equilibrium measure $ν$, say an isotropic Gaussian, back to a complex, possibly non-log-concave initial measure $μ$. The score function performs denoising, moving backward in time, and predicting the conditional mean of the past location given the current one. We show that score denoising is the optimal backward map in transportation cost. What is its localization uncertainty? We show that the curvature function determines this localization uncertainty, measured as the conditional variance of the past location given the current. We study in this paper the effectiveness of the diffuse-then-denoise process: the contraction of the forward diffusion chain, offset by the possible expansion of the backward denoising chain, governs the denoising difficulty. For any initial measure $μ$, we prove that this offset net contraction at time $t$ is characterized by the curvature complexity of a smoothed $μ$ at a specific signal-to-noise ratio (SNR) scale $r(t)$. We discover that the multi-scale curvature complexity collectively determines the difficulty of the denoising chain. Our multi-scale complexity quantifies a fine-grained notion of average-case curvature instead of the worst-case. Curiously, it depends on an integrated tail function, measuring the relative mass of locations with positive curvature versus those with negative curvature; denoising at a specific SNR scale is easy if such an integrated tail is light. We conclude with several non-log-concave examples to demonstrate how the multi-scale complexity probes the bottleneck SNR for the diffuse-then-denoise process.

2410.15756 2026-02-17 cs.SE cs.AI

Automated Proof Generation for Rust Code via Self-Evolution

Tianyu Chen, Shuai Lu, Shan Lu, Yeyun Gong, Chenyuan Yang, Xuheng Li, Md Rakib Hossain Misu, Hao Yu, Nan Duan, Peng Cheng, Fan Yang, Shuvendu K Lahiri, Tao Xie, Lidong Zhou

详情
英文摘要

Ensuring correctness is crucial for code generation. Formal verification offers a definitive assurance of correctness, but demands substantial human effort in proof construction and hence raises a pressing need for automation. The primary obstacle lies in the severe lack of data-there is much fewer proofs than code snippets for Large Language Models (LLMs) to train upon. In this paper, we introduce SAFE, a framework that overcomes the lack of human-written proofs to enable automated proof generation of Rust code. SAFE establishes a self-evolving cycle where data synthesis and fine-tuning collaborate to enhance the model capability, leveraging the definitive power of a symbolic verifier in telling correct proofs from incorrect ones. SAFE also re-purposes the large number of synthesized incorrect proofs to train the self-debugging capability of the fine-tuned models, empowering them to fix incorrect proofs based on the verifier's feedback. SAFE demonstrates superior efficiency and precision compared to GPT-4o. Through tens of thousands of synthesized proofs and the self-debugging mechanism, we improve the capability of open-source models, initially unacquainted with formal verification, to automatically write proofs for Rust code. This advancement leads to a significant improvement in performance, achieving a 52.52% accuracy rate in a benchmark crafted by human experts, a significant leap over GPT-4o's performance of 14.39%.

2408.10746 2026-02-17 cs.DC cs.AI cs.LG cs.NI

Resource-Efficient Personal Large Language Models Fine-Tuning with Collaborative Edge Computing

Shengyuan Ye, Bei Ouyang, Tianyi Qian, Liekang Zeng, Jingyi Li, Jiangsu Du, Xiaowen Chu, Guoliang Xing, Xu Chen

详情
英文摘要

Large language models (LLMs) have unlocked a plethora of powerful applications at the network edge, such as intelligent personal assistants. Data privacy and security concerns have prompted a shift towards edge-based fine-tuning of personal LLMs, away from cloud reliance. However, this raises issues of computational intensity and resource scarcity, hindering training efficiency and feasibility. While current studies investigate parameter-efficient fine-tuning (PEFT) techniques to mitigate resource constraints, our analysis indicates that these techniques are not sufficiently resource-efficient for edge devices. To tackle these challenges, we propose Pluto and Charon (PAC), a time and memory efficient collaborative edge AI framework for personal LLMs fine-tuning. PAC breaks the resource wall of personal LLMs fine-tuning with a sophisticated algorithm-system co-design. (1) Algorithmically, PAC implements a personal LLMs fine-tuning technique that is efficient in terms of parameters, time, and memory. It utilizes Parallel Adapters to circumvent the need for a full backward pass through the LLM backbone. Additionally, an activation cache mechanism further streamlining the process by negating the necessity for repeated forward passes across multiple epochs. (2) Systematically, PAC leverages edge devices in close proximity, pooling them as a collective resource for in-situ personal LLMs fine-tuning, utilizing a hybrid data and pipeline parallelism to orchestrate distributed training. The use of the activation cache eliminates the need for forward pass through the LLM backbone,enabling exclusive fine-tuning of the Parallel Adapters using data parallelism. Extensive evaluation based on prototype implementation demonstrates that PAC remarkably outperforms state-of-the-art approaches, achieving up to 8.64x end-to-end speedup and up to 88.16% reduction in memory footprint.

2402.06635 2026-02-17 q-fin.ST cs.CE cs.LG

Large and Deep Factor Models

Bryan Kelly, Boris Kuznetsov, Semyon Malamud, Teng Andrea Xu, Yuan Zhang

详情
英文摘要

We show that a deep neural network (DNN) trained to construct a stochastic discount factor (SDF) admits a sharp additive decomposition that separates nonlinear characteristic discovery from the pricing rule that aggregates them. The economically relevant component of this decomposition is governed by a new object, the Portfolio Tangent Kernel (PTK), which captures the features learned by the network and induces an explicit linear factor pricing representation for the SDF. In population, the PTK-implied SDF converges to a ridge-regularized version of the true SDF, with the effective strength of regularization determined by the spectral complexity of the PTK. Using U.S. equity data, we show that the PTK representation delivers large and statistically significant performance gains, while its spectral complexity has risen sharply-by roughly a factor of six since the early 2000s-imposing increasingly tight limits on finite-sample pricing performance.

2312.11797 2026-02-17 q-fin.PM cs.LG q-fin.CP

Data-Driven Merton's Strategies via Policy Randomization

Min Dai, Yuchao Dong, Yanwei Jia, Xun Yu Zhou

Comments 45 pages, 4 figures, 2 tables

详情
英文摘要

We study Merton's expected utility maximization problem in an incomplete market, characterized by a factor process in addition to the stock price process, where all the model primitives are unknown. The agent under consideration is a price taker who has access only to the stock and factor value processes and the instantaneous volatility. We propose an auxiliary problem in which the agent can invoke policy randomization according to a specific class of Gaussian distributions, and prove that the mean of its optimal Gaussian policy solves the original Merton problem. With randomized policies, we are in the realm of continuous-time reinforcement learning (RL) recently developed in Wang et al. (2020) and Jia and Zhou (2022a, 2022b, 2023), enabling us to solve the auxiliary problem in a data-driven way without having to estimate the model primitives. Specifically, we establish a policy improvement theorem based on which we design both online and offline actor-critic RL algorithms for learning Merton's strategies. A key insight from this study is that RL in general and policy randomization in particular are useful beyond the purpose for exploration -- they can be employed as a technical tool to solve a problem that cannot be otherwise solved by mere deterministic policies. At last, we carry out both simulation and empirical studies in a stochastic volatility environment to demonstrate the decisive outperformance of the devised RL algorithms in comparison to the conventional model-based, plug-in method.

2602.15027 2026-02-17 hep-ph

Complementarity of di-top and four-top searches in interpreting possible signals of new physics

Henning Bahl, Philipp Gadow, Romal Kumar, Krisztian Peters, Panagiotis Stylianou, Georg Weiglein

Comments 26 pages + appendices, 15 figures

详情
英文摘要

Final states comprising two or more top quarks are important search channels at the Large Hadron Collider for scalar particles predicted in models of physics beyond the Standard Model. While the di-top final state profits from a higher signal cross section, it can be subject to intricate interference patterns. Besides the interference with the large QCD background, in case of the presence of more than one high-mass scalar also large signal--signal interference contributions can occur. We show that in such scenarios it is crucial to account for loop-level mixing for obtaining accurate exclusion bounds. We demonstrate how the interference patterns can obscure the interpretation of possible deviations from the Standard Model expectations. We show that the four-top final state, while giving rise to a smaller signal cross section, provides important complementary information due to its much smaller signal--background interference contributions. Thus, the results obtained from the four-top final state can be instrumental for pinpointing the underlying new physics scenario.

2602.15026 2026-02-17 astro-ph.CO hep-ph

Prospects of Indirect Detection of Dark Matter via Primordial Black Hole Induced Gravitational Waves

Debarun Paul, Md Riajul Haque, Supratik Pal

Comments 38 pages, 18 figures and 1 table

详情
英文摘要

Primordial black holes (PBHs), produced in the early Universe, can source a stochastic background of induced gravitational waves (GWs) and provide a non-thermal origin for dark matter (DM). We investigate DM production in a PBH-dominated cosmological framework, including contributions from PBH evaporation, gravitational production, and thermal freeze-in and freeze-out mechanisms, and determine the regions consistent with the observed DM relic abundance. We find that thermal freeze-in can compensate for the underabundance of PBH-sourced DM, while indirect detection remains largely insensitive due to the feeble interaction strength, making future GW observatories such as LISA and the Einstein Telescope (ET) unique probes of this scenario. For freeze-out DM, indirect detection experiments constrain regions with relatively large annihilation cross-sections, whereas GW observations probe complementary regions with heavier DM masses and smaller interaction strengths. Consequently, the same DM parameter space cannot be simultaneously probed by both indirect detection searches and GW missions. These results establish GW observations as a powerful and independent probe of DM production in PBH-dominated cosmologies, opening a new observational window into DM properties and the thermal history of the pre-BBN Universe.

2602.15023 2026-02-17 astro-ph.CO hep-ph hep-th

Gravitational Wave Echoes of the First Order Phase Transition in a Kination-Induced Big Bang

Richard Casey, Katherine Freese, Evangelos I. Sfakianakis

Comments 54 pages, 18 figures

详情
英文摘要

Gravitational waves (GWs) produced during first-order phase transitions (FOPTs) in the early universe provide a powerful probe of nonstandard cosmological histories. We study GW production from a FOPT ending a kination-dominated epoch in the Kination-Induced Big Bang scenario, in which a period of kination domination terminates through a phase transition that reheats the universe into radiation domination. A rolling scalar field drives the kination epoch. In the specific model we consider, its derivative coupling to a second scalar (tunneling field) dynamically traps the latter in a false vacuum, with the phase transition triggered as the kination field slows due to Hubble friction. We compute the resulting stochastic GW background from bubble nucleation and collisions, presenting analytic estimates and numerical results for the peak amplitude and frequency. In all cases we find an upper bound $Ω_{\rm GW} h^2\lesssim 2\times10^{-7}$ from the bubble percolation condition. In the case where the false vacuum energy dominates at the transition (yet the kination field drives the FOPT), we find $Ω_{\rm GW}h^2\gtrsim 10^{-12}$. We further find that the Hubble scale during the phase transition across a broad set of model parameters is bounded by $\mathscr{O}(10^{-13})M^2/M_{\rm Pl}\lesssim H_* \lesssim \mathscr{O}(0.1)M^2/M_{\rm Pl}$, where $M$ is the mass-scale controlling the strength of the interaction between the kination and tunneling fields. The predicted signal spans frequencies from nHz to MHz, allowing the model to explain the signal reported by Pulsar Timing Array experiments and to be constrained or probed by interferometers such as LISA, Advanced LIGO, Cosmic Explorer, and BBO. Interestingly, a FOPT can occur even if the bare tunneling potential has a single minimum, as metastability is generated dynamically by the coupling between the tunneling and the kination field.

2602.15020 2026-02-17 cond-mat.str-el

Majorana Signatures in Planar Tunneling through a Kitaev Spin Liquid

Weiyao Li, Vitor Dantas, Wen-Han Kao, Natalia B. Perkins

Comments 14 pages, 5 figures

详情
英文摘要

We propose a planar tunneling setup to probe vacancy-bound Majorana modes in the chiral Kitaev spin liquid. In this geometry, the inelastic tunneling conductance can be expressed directly in terms of real-space spin correlations, establishing a link between measurable spectra and the underlying fractionalized excitations. We show that spin vacancies host localized Majorana states that generate sharp near-zero-bias features, well separated from the continuum of bulk spin excitations. Compared to local STM measurements, the planar configuration naturally enhances the signal by coherently summing over multiple vacancies, reducing spatial resolution requirements. Our results demonstrate a realistic and scalable route to detect Majorana excitations in Kitaev materials.

2602.15017 2026-02-17 math.AG math.AC math.CO

The projective coinvariant algebra, Young invariants and bigraded coordinate rings of Segre embeddings

Balázs Szendrői

Comments 18 pages

详情
英文摘要

This paper studies a flat degeneration P_n of the classical coinvariant algebra R_n, a bigraded Artinian Gorenstein algebra that arises from the coordinate ring of the Segre embedding of the n-fold self-product of the projective line. The Frobenius character of P_n is computed by a natural bigraded refinement of the classical Lusztig--Stanley formula for the character of the coinvariant algebra. Young invariants in P_n get related to coordinate rings of general Segre embeddings of products of projective spaces; their bigraded Hilbert polynomials get expressed in terms of major-descent generating functions of words in multisets. Relations to the diagonal coinvariant algebra, cohomological interpretations including quantum cohomology, and Garsia-Stanton-style bases are also explored.