arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2602.02980 2026-05-01 eess.AS cs.CL eess.SP

WST-X Series: Wavelet Scattering Transform for Interpretable Speech Deepfake Detection

Xi Xuan, Davide Carbone, Wenxin Zhang, Ruchi Pandey, Tomi H. Kinnunen

Comments IEEE Signal Processing Letters

详情

英文摘要

In this work, we focus on front-end design for speech deepfake detectors, the component that determines the discriminative acoustic cues provided to the classifier. Existing approaches are primarily categorized into two types. Hand-crafted filterbank features are transparent but limited in capturing higher-level information. SSL features, in turn, lack interpretability and may overlook fine-grained spectral anomalies. We propose the WST-X series, a novel family of feature extractors that combines the best of both worlds via the wavelet scattering transform (WST), which cascades wavelet convolutions with modulus nonlinearities to produce deformation-stable, multi-scale features. Experiments on the recent Deepfake-Eval-2024 benchmark, together with cross-dataset evaluations on the SpoofCeleb and In-the-Wild, show that WST-X outperforms existing front-ends by a wide margin. Our analysis reveals that a small averaging scale ($J$), combined with high-frequency and directional resolutions ($Q$, $L$), is critical for capturing subtle artifacts. This underscores the value of stable and translation-invariant features for speech deepfake detection. The code is available at https://github.com/xxuan-acoustics/WST-X-Series.

URL PDF HTML ☆

赞 0 踩 0

2602.00607 2026-05-01 cs.MM cs.SD

MTAVG-Bench: A Diagnostic Benchmark for Multi-Talker Dialogue-Centric Audio-Video Generation

Yang-Hao Zhou, Haitian Li, Rexar Lin, Heyan Huang, Jinxing Zhou, Changsen Yuan, Tian Lan, Ziqin Zhou, Yudong Li, Jiajun Xu, Jingyun Liao, Yi-Ming Cheng, Xuefeng Chen, Xian-Ling Mao, Yousheng Feng

2601.08611 2026-05-01 cs.IR cs.AI cs.CV cs.MM

VeriTaS: The First Dynamic Benchmark for Multimodal Automated Fact-Checking

Mark Rothermel, Marcus Kornmann, Marcus Rohrbach, Anna Rohrbach

Comments ACL 2026 Oral

2601.06992 2026-05-01 cs.IR cs.AI cs.CL

FinCARDS: Card-Based Analyst Reranking for Financial Document Question Answering

Yixi Zhou, Fan Zhang, Yu Chen, Haipeng Zhang, Preslav Nakov, Zhuohan Xie

Comments 17 pages, including figures and tables

2511.17372 2026-05-01 quant-ph cs.AI cs.LG

Quantum Masked Autoencoders for Vision Learning

Emma Andrews, Prabhat Mishra

2511.14070 2026-05-01 eess.IV cs.CV

ELiC: Efficient LiDAR Geometry Compression via Cross-Bit-depth Feature Propagation and Bag-of-Encoders

Junsik Kim, Gun Bang, Soowoong Kim

Comments Accepted to CVPR 2026

2510.19322 2026-05-01 cs.NI cs.AI cs.DC

Enabling Reconfiguration-Communication Overlap for Collective Communication in Optical Networks

Changbo Wu, Zhuolong Yu, Gongming Zhao, Hongli Xu

Comments Accepted at ACM CoNEXT '26. To be published in Proceedings of the ACM on Networking (PACMNET), Volume 4, CoNEXT2, June 2026

2509.21087 2026-05-01 eess.AS cs.LG cs.SD

Are Modern Speech Enhancement Systems Vulnerable to Adversarial Attacks?

Rostislav Makarov, Lea Schönherr, Timo Gerkmann

Comments Copyright 2026 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

2509.16248 2026-05-01 cs.PL cs.LG cs.SE

GraphMend: Code Transformations for Fixing Graph Breaks in PyTorch 2

Savini Kashmira, Jayanaka Dantanarayana, Thamirawaran Sathiyalogeswaran, Krisztian Flautner, Lingjia Tang, Jason Mars

2507.03478 2026-05-01 eess.IV cs.CV

PhotIQA: A photoacoustic image data set with image quality ratings

Anna Breger, Janek Gröhl, Clemens Karner, Thomas R Else, Ian Selby, Tom Rix, Lara-Sophie Witt, Merle Duchêne, Jonathan Weir-McCall, Carola-Bibiane Schönlieb

Comments 16 pages

2504.15564 2026-05-01 cs.SE cs.AI cs.LG

OpenClassGen: A Large-Scale Corpus of Real-World Python Classes for LLM Research

Musfiqur Rahman, SayedHassan Khatoonabadi, Emad Shihab

Comments This paper has been accepted for publication at the 30th International Conference on Evaluation and Assessment in Software Engineering (EASE 2026) AI models/data track

2503.04956 2026-05-01 stat.ML cs.LG

Foreclassing: A new machine learning perspective on human decision making with temporal data

Daniel Andrew Coulson, Martin T. Wells

Comments 20 pages, 1 figure, 15 tables

2604.27725 2026-05-01 cs.HC cs.AI

AgentEconomist: An End-to-end Agentic System Translating Economic Intuitions into Executable Computational Experiments

Jiaju Chen, Jinghua Piao, Xia Xu, Songwei Li, Tong Xia, Xiangnan He, Yong Li

2604.27721 2026-05-01 physics.ao-ph cs.CV physics.data-an physics.space-ph

Physically-Informed Fuzzy Clustering of Vertical Sounding Ionograms

Oleg I. Berngardt, Sergey N. Ponomarchuk

Comments 31 pages, 8 figures

2604.27685 2026-05-01 cond-mat.mtrl-sci cs.AI cs.LG physics.comp-ph

VibroML: an automated toolkit for high-throughput vibrational analysis and dynamic instability remediation of crystalline materials using machine-learned potentials

Rogério Almeida Gouvêa, Gian-Marco Rignanese

2604.27643 2026-05-01 cs.AR cs.AI

HAVEN: Hybrid Automated Verification ENgine for UVM Testbench Synthesis with LLMs

Chang-Chih Meng, Yu-Ren Lu, Guan-Yu Lin, Tsung Tai Yeh, Kai-Chiang Wu, I-Chen Wu

Comments 9 pages, 5 figures, 5 tables

详情

英文摘要

Integrated Circuit (IC) verification consumes nearly 70% of the IC development cycle, and recent research leverages Large Language Models (LLMs) to automatically generate testbenches and reduce verification overhead. However, LLMs have difficulty generating testbenches correctly. Unlike high-level programming languages, Hardware Description Languages (HDLs) are extremely rare in LLMs training data, leading LLMs to produce incorrect code. To overcome challenges when using LLMs to generate Universal Verification Methodology (UVM) testbenches and sequences, wepropose HAVEN (Hybrid Automated Verification ENgine) to prevent LLMs from writing HDL directly. For UVM testbench generation, HAVEN utilizes LLM agents to analyze design specifications to produce a structured architectural plan. The HAVEN Template Engine then combines with predefined and protocol-specific templates to generate all UVM components with correct bus-handshake timing. For UVM sequence generation, HAVEN introduces a Protocol-Aware Sequence Domain-Specific Language (DSL) that decomposes sequences into fine-grained step types. A set of predefined DSL patterns first establishes sequences that achieve a high coverage rate without LLM involvement. HAVEN continues to improve the coverage rate by iteratively leveraging LLM agents to analyze coverage gap reports and compose additional targeted DSL sequences. Unlike previous works, HAVEN is the first system that utilizes pre-defined, protocol-specific Jinja2 templates to generate all UVM components and UVM sequences using our proposed Protocol-Aware DSL and rule-based code generator. Our experimental results on 19 open-source IP designs spanning three interface protocols (Direct, Wishbone, AXI4-Lite) show that HAVEN achieves 100% compilation success, 90.6% code coverage, and 87.9% functional coverage on average, and is SOTA among LLM-assisted testbench generation systems.

URL PDF HTML ☆

赞 0 踩 0

2604.27599 2026-05-01 cs.IR cs.LG

One Pass, Any Order: Position-Invariant Listwise Reranking for LLM-Based Recommendation

Ethan Bito, Yongli Ren, Estrid He

Comments Accepted at SIGIR 2026

2604.27593 2026-05-01 astro-ph.IM cs.CV

An Extended Evaluation Split for DeepSpaceYoloDataset

Olivier Parisot

Comments 9 pages, 5 figures

2604.27583 2026-05-01 q-bio.NC cs.RO

Simulating Infant First-Person Sensorimotor Experience via Motion Retargeting from Babies to Humanoids

Francisco M. López, Hoshinori Kanazawa, Ondrej Fiala, Yakov Balashov, Valentin Marcel, Lukas Rustler, Miles Lenz, Dongmin Kim, Yasuo Kuniyoshi, Jochen Triesch, Matej Hoffmann

Comments Submitted to IEEE ICDL. 8 pages, 6 figures

2604.27576 2026-05-01 cs.LO cs.LG

BAss: Symbolic Reasoning in Abstract Dialectical Frameworks

Samuel Pastva, Van-Giang Trinh

2604.27539 2026-05-01 cs.HC cs.AI

Knowledge Affordances for Hybrid Human-AI Information Seeking

Irene Celino

Comments 10 pages, accepted at Hybrid Human Artificial Intelligence Conference (HHAI 2026)

2604.27467 2026-05-01 cs.SE cs.CL

ScaleBox: Enabling High-Fidelity and Scalable Code Verification for Large Language Models

Jiasheng Zheng, Xin Zheng, Boxi Cao, Pengbo Wang, Zhengzhao Ma, Qiming Zhu, Jiazhen Jiang, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun

Comments Accepted to ACL 2026 Demo. Our project is available at https://github.com/icip-cas/ScaleBox

2604.27464 2026-05-01 cs.CR cs.AI

Security Attack and Defense Strategies for Autonomous Agent Frameworks: A Layered Review with OpenClaw as a Case Study

Luyao Xu, Xiang Chen

Comments 14 pages, 2 figures, 6 tables

2604.27447 2026-05-01 math.OC cs.AI cs.LG q-fin.PM q-fin.RM

Sampler-Robust Optimization under Generative Models

Ziwei Zhang, Jonathan Yu-Meng Li

2604.27426 2026-05-01 cs.CR cs.AI

Secret Stealing Attacks on Local LLM Fine-Tuning through Supply-Chain Model Code Backdoors

Zi Li, Tian Zhou, Wenze Li, Jingyu Hua, Yunlong Mao, Sheng Zhong

2604.27421 2026-05-01 cs.IR cs.CL

A Reproducibility Study of LLM-Based Query Reformulation

Amin Bigdeli, Radin Hamidi Rad, Hai Son Le, Mert Incesu, Negar Arabzadeh, Charles L. A. Clarke, Ebrahim Bagheri

2604.27410 2026-05-01 cs.IR cs.CL

From Unstructured to Structured: LLM-Guided Attribute Graphs for Entity Search and Ranking

Yilun Zhu, Nikhita Vedula, Shervin Malmasi

2604.27394 2026-05-01 stat.ML cs.LG

Bayesian X-Learner: Calibrated Posterior Inference for Heterogeneous Treatment Effects under Heavy-Tailed Outcomes

Eichi Uehara

Comments 47 pages, 7 figures, 25 tables. Code: https://github.com/EichiUehara/bayesian-X-Leaner. Prepared for submission to TMLR

2604.27383 2026-05-01 eess.IV cs.CV

A Real-time Scale-robust Network for Glottis Segmentation in Nasal Transnasal Intubation

Yang Zhou, Chaoyong Zhang, Ruoyi Hao, Huilin Pan, Yang Zhang, Hongliang Ren

Comments 14 pages, 9 figures

2604.27378 2026-05-01 math.OC cs.LG cs.MA

Continuous-time q-learning for mean-field control with common noise, part-II: q-learning algorithms

Zhenjie Ren, Xiaoli Wei, Xiang Yu, Xun Yu Zhou

Comments Keywords: Mean-field control, common noise, martingale characterization, optimal q-learning algorithm, Actor-Critic q-learning algorithm