arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.20852 2026-04-24 cs.IR cs.AI

DenoiseRank: Learning to Rank by Diffusion Models

Ying Wang, Preslav Nakov, Shangsong Liang

详情

英文摘要

Learning to rank (LTR) is one of the core tasks in Machine Learning. Traditional LTR models have made great progress, but nearly all of them are implemented from discriminative perspective. In this paper, we aim at addressing LTR from a novel perspective, i.e., by a deep generative model. Specifically, we propose a novel denoise rank model, DenoiseRank, which noises the relevant labels in the diffusion process and denoises them on the query documents in the reverse process to accurately predict their distribution. Our model is the first to address traditional LTR from generative perspective and is a diffusion method for LTR. Our extensive experiments on benchmark datasets demonstrated the effectiveness of DenoiseRank, and we believe it provides a benchmark for generative LTR task.

URL PDF HTML ☆

赞 0 踩 0

2604.20851 2026-04-24 cs.IR cs.AI cs.CV

Robust Test-time Video-Text Retrieval: Benchmarking and Adapting for Query Shifts

Bingqing Zhang, Zhuo Cao, Heming Du, Yang Li, Xue Li, Jiajun Liu, Sen Wang

Comments Accepted to ICLR2026

2604.20850 2026-04-24 cs.IR cs.AI cs.CL

Association Is Not Similarity: Learning Corpus-Specific Associations for Multi-Hop Retrieval

Jason Dury

Comments 10 pages, 7 appendices, 10 tables. Code: https://github.com/EridosAI/AAR

2604.20849 2026-04-24 cs.IR cs.AI cs.CL

SPIRE: Structure-Preserving Interpretable Retrieval of Evidence

Mike Rainey, Umut Acar, Muhammed Sezer

2604.20848 2026-04-24 cs.IR cs.AI

MATRAG: Multi-Agent Transparent Retrieval-Augmented Generation for Explainable Recommendations

Sushant Mehta

2604.20847 2026-04-24 cs.IR cs.AI

Revisiting Content-Based Music Recommendation: Efficient Feature Aggregation from Large-Scale Music Models

Yizhi Zhou, Jia-Qi Yang, De-Chuan Zhan, Da-Wei Zhou

2604.20846 2026-04-24 cs.IR cs.AI

ADS-POI: Agentic Spatiotemporal State Decomposition for Next Point-of-Interest Recommendation

Zhenyu Yu, Chunlei Meng, Yangchen Zeng, Mohd Yamani Idna Idris, Shuigeng Zhou

2604.20845 2026-04-24 cs.IR cs.AI

CaST-POI: Candidate-Conditioned Spatiotemporal Modeling for Next POI Recommendation

Zhenyu Yu, Chunlei Meng, Yangchen Zeng, Mohd Yamani Idna Idris, Shuigeng Zhou

2604.20844 2026-04-24 cs.IR cs.AI

AtomicRAG: Atom-Entity Graphs for Retrieval-Augmented Generation

Yanning Hou, Duanyang Yuan, Sihang Zhou, Xiaoshu Chen, Ke Liang, Siwei Wang, Xinwang Liu, Jian Huang

2604.20569 2026-04-24 cs.HC cs.AI

The Effect of Idea Elaboration on the Automatic Assessment of Idea Originality

Umberto Domanti, Moritz Mock, Sergio Agnoli, Antonella De Angeli

2604.20311 2026-04-24 cs.MM cs.AI

Seeing Further and Wider: Joint Spatio-Temporal Enlargement for Micro-Video Popularity Prediction

Dali Wang, Yunyao Zhang, Junqing Yu, Yi-Ping Phoebe Chen, Chen Xu, Zikai Song

详情

英文摘要

Micro-video popularity prediction (MVPP) aims to forecast the future popularity of videos on online media, which is essential for applications such as content recommendation and traffic allocation. In real-world scenarios, it is critical for MVPP approaches to understand both the temporal dynamics of a given video (temporal) and its historical relevance to other videos (spatial). However, existing approaches sufer from limitations in both dimensions: temporally, they rely on sparse short-range sampling that restricts content perception; spatially, they depend on flat retrieval memory with limited capacity and low efficiency, hindering scalable knowledge utilization. To overcome these limitations, we propose a unified framework that achieves joint spatio-temporal enlargement, enabling precise perception of extremely long video sequences while supporting a scalable memory bank that can infinitely expand to incorporate all relevant historical videos. Technically, we employ a Temporal Enlargement driven by a frame scoring module that extracts highlight cues from video frames through two complementary pathways: sparse sampling and dense perception. Their outputs are adaptively fused to enable robust long-sequence content understanding. For Spatial Enlargement, we construct a Topology-Aware Memory Bank that hierarchically clusters historically relevant content based on topological relationships. Instead of directly expanding memory capacity, we update the encoder features of the corresponding clusters when incorporating new videos, enabling unbounded historical association without unbounded storage growth. Extensive experiments on three widely used MVPP benchmarks demonstrate that our method consistently outperforms 11 strong baselines across mainstream metrics, achieving robust improvements in both prediction accuracy and ranking consistency.

URL PDF HTML ☆

赞 0 踩 0

2604.20279 2026-04-24 cs.HC cs.AI cs.MA

AgentLens: Adaptive Visual Modalities for Human-Agent Interaction in Mobile GUI Agents

Jeonghyeon Kim, Byeongjun Joung, Junwon Lee, Joohyung Lee, Taehoon Min, Sunjae Lee

2604.20210 2026-04-24 cs.HC cs.AI cs.LG

Vibrotactile Preference Learning: Uncertainty-Aware Preference Learning for Personalized Vibration Feedback

Rongtao Zhang, Xin Zhu, Masoume Pourebadi Khotbehsara, Warren Dao, Erdem Bıyık, Heather Culbertson

Comments Project webpage: https://isanshi.github.io/publication/vpl/

2604.19811 2026-04-24 cs.CY cs.AI

Model Capability Assessment and Safeguards for Biological Weaponization

Michael Richter

2604.19738 2026-04-24 math.PR cs.LG stat.ML

Phase Transitions in the Fluctuations of Functionals of Random Neural Networks

Simmaco Di Lillo, Leonardo Maini, Domenico Marinucci

2604.16432 2026-04-24 cs.CY cs.AI cs.LG econ.EM

Quantifying how AI Panels improve precision

Nicholas CL Beale

Comments 11 pages, 8 Figures, 13pp of Supplementary Information

2604.15468 2026-04-24 cs.SE cs.AI

The Semi-Executable Stack: Agentic Software Engineering and the Expanding Scope of SE

Robert Feldt, Per Lenberg, Julian Frattini, Dhasarathy Parthasarathy

Comments This paper is the write-up of Robert Feldt's keynote "Agentic Software Engineering Will Eat the World: AI-Based Systems as the New Operating System of Society'' given at the Agentic Engineering 2026 workshop, Rio de Janeiro, Brazil, April 14, 2026. April 23 upload fixed the reference list to be more complete, and added a few additional citations; text essentially unchanged

2604.12994 2026-04-24 cs.CR cs.AI

LogicEval: A Systematic Framework for Evaluating Automated Repair Techniques for Logical Vulnerabilities in Real-World Software

Syed Md Mukit Rashid, Abdullah Al Ishtiaq, Kai Tu, Yilu Dong, Tianwei Wu, Ali Ranjbar, Tianchang Yang, Najrin Sultana, Shagufta Mehnaz, Syed Rafiul Hussain

Comments To appear in ACL 2026 Main Conference

2604.04685 2026-04-24 quant-ph cs.CV

Unsharp Measurement with Adaptive Gaussian POVMs for Quantum-Inspired Image Processing

Debashis Saikia, Bikash K. Behera, Mayukha Pal, Prasanta K. Panigrahi

2604.02832 2026-04-24 q-fin.RM cs.LG

Transfer Learning for Loan Recovery Prediction under Distribution Shifts with Heterogeneous Feature Spaces

Christopher Gerling, Hanqiu Peng, Ying Chen, Stefan Lessmann

Comments 35 pages, 14 figures. Christopher Gerling had previously withdrawn his submission due to NDA restrictions, and that matter was resolved. We are authorized to publish the preprint now

2603.24111 2026-04-24 cs.CR cs.LG

Toward a Multi-Layer ML-Based Security Framework for Industrial IoT

Aymen Bouferroum, Valeria Loscri, Abderrahim Benslimane

2603.21697 2026-04-24 cs.CR cs.AI cs.MM

Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models

Rui Yang Tan, Yujia Hu, Roy Ka-Wei Lee

Comments Code released at: https://github.com/Social-AI-Studio/ComicJailbreak

2603.15055 2026-04-24 stat.ML cs.LG math.ST stat.TH

Spatio-temporal probabilistic forecast using MMAF-guided learning

Leonardo Bardi, Imma Valentina Curato, Lorenzo Proietti

2603.11066 2026-04-24 math.DS cs.AI cs.HC

Exploring Collatz Dynamics with Human-LLM Collaboration

Edward Y. Chang

Comments 233 pages, 11 figures, 52 tables

2603.10845 2026-04-24 eess.SP cs.AI cs.CV

Human Presence Detection via Wi-Fi Range-Filtered Doppler Spectrum on Commodity Laptops

Jessica Sanson, Rahul C. Shah, Valerio Frascolla

Comments 6 pages, Conference

详情

Journal ref: Percom 2026

英文摘要

Human Presence Detection (HPD) is key to enable intelligent power management and security features in everyday devices. In this paper we propose the first HPD solution that leverages monostatic Wi-Fi sensing and detects user position using only the built-in Wi-Fi hardware of a device, with no need for external devices, access points, or additional sensors. In contrast, existing HPD solutions for laptops require external dedicated sensors which add cost and complexity, or rely on camera-based approaches that introduce significant privacy concerns. We herewith introduce the Range-Filtered Doppler Spectrum (RF-DS), a novel Wi-Fi sensing technique for presence estimation that enables both range-selective and temporally windowed detection of user presence. By applying targeted range-area filtering in the Channel Impulse Response (CIR) domain before Doppler analysis, our method focuses processing on task-relevant spatial zones, significantly reducing computational complexity. In addition, the use of temporal windows in the spectrum domain provides greater estimator stability compared to conventional 2D Range-Doppler detectors. Furthermore, we propose an adaptive multi-rate processing framework that dynamically adjusts Channel State Information (CSI) sampling rates-operating at low frame rates (10Hz) during idle periods and high rates (100Hz) only when motion is detected. To our knowledge, this is the first low-complexity solution for occupancy detection using monostatic Wi-Fi sensing on a built-in Wi-Fi network interface controller (NIC) of a commercial off-the-shelf laptop that requires no external network infrastructure or specialized sensors. Our solution can scale across different environments and devices without calibration or retraining.

URL PDF HTML ☆

赞 0 踩 0

2603.06545 2026-04-24 eess.SP cs.AI

LiveSense: A Real-Time Wi-Fi Sensing Platform for Range-Doppler on COTS Laptop

Jessica Sanson, Rahul C. Shah, Maximilian Pinaroc, Cagri Tanriover, Valerio Frascolla

2603.03700 2026-04-24 stat.ML cs.AI cs.LG math.ST stat.TH

Generalization Properties of Score-matching Diffusion Models for Intrinsically Low-dimensional Data

Saptarshi Chakraborty, Quentin Berthet, Peter L. Bartlett

详情

英文摘要

Despite the remarkable empirical success of score-based diffusion models, their statistical guarantees remain underdeveloped. Existing analyses often provide pessimistic convergence rates that do not reflect the intrinsic low-dimensional structure common in real data, such as that arising in natural images. In this work, we study the statistical convergence of score-based diffusion models for learning an unknown distribution $μ$ from finitely many samples. Under mild regularity conditions on the forward diffusion process and the data distribution, we derive finite-sample error bounds on the learned generative distribution, measured in the Wasserstein-$p$ distance. Unlike prior results, our guarantees hold for all $p \ge 1$ and require only a finite-moment assumption on $μ$, without compact-support, manifold, or smooth-density conditions. Specifically, given $n$ i.i.d.\ samples from $μ$ with finite $q$-th moment and appropriately chosen network architectures, hyperparameters, and discretization schemes, we show that the expected Wasserstein-$p$ error between the learned distribution $\hatμ$ and $μ$ scales as $\mathbb{E}\, \mathbb{W}_p(\hatμ,μ) = \widetilde{O}\!\left(n^{-1 / d^\ast_{p,q}(μ)}\right),$ where $d^\ast_{p,q}(μ)$ is the $(p,q)$-Wasserstein dimension of $μ$. Our results demonstrate that diffusion models naturally adapt to the intrinsic geometry of data and mitigate the curse of dimensionality, since the convergence rate depends on $d^\ast_{p,q}(μ)$ rather than the ambient dimension. Moreover, our theory conceptually bridges the analysis of diffusion models with that of GANs and the sharp minimax rates established in optimal transport. The proposed $(p,q)$-Wasserstein dimension also extends the notion of classical Wasserstein dimension to distributions with unbounded support, which may be of independent theoretical interest.

URL PDF HTML ☆

赞 0 踩 0

2602.16729 2026-04-24 cs.CR cs.AI cs.CL cs.LG

Intent Laundering: AI Safety Datasets Are Not What They Seem

Shahriar Golchin, Marc Wetter

Comments v2 preprint: updated with more models and a new dataset

2602.13211 2026-04-24 cs.NI cs.AI

An Overlay Multicast Routing Method Based on Network Situational Awareness and Hierarchical Multi-Agent Reinforcement Learning

Miao Ye, Yanye Chen, Yong Wang, Cheng Zhu, Qiuxiang Jiang, Gai Huang, Feng Ding

Comments 30page, 10 figures

2602.08561 2026-04-24 cs.SE cs.CL

Automating Computational Reproducibility in Social Science: Comparing Prompt-Based and Agent-Based Approaches

Syed Mehtab Hussain Shah, Frank Hopfgartner, Arnim Bleier

Comments 12 pages, 5 figures. Submitted to ACM conference