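arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.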

2604.24793 2026-04-29 eess.IV cs.CV

CRC-SAM: SAM-Based Multi-Modal Segmentation and Quantification of Colorectal Cancer in CT, Colonoscopy, and Histology Images

Daniel Lao

Comments 4 pages, 3 figures, ISBI 2026 oral presentation

2604.24790 2026-04-29 cs.CR cs.AI

Semantic Denial of Service in LLM-controlled robots

Jonathan Steinberg, Oren Gal

2604.24785 2026-04-29 cs.AR cs.AI cs.DC cs.PF

Cloud to Edge: Benchmarking LLM Inference On Hardware-Accelerated Single-Board Computers

Harri Renney, Fouad Trad, Michael Mattarock, Zena Wood

2604.24775 2026-04-29 physics.data-an cs.LG hep-ex nucl-ex physics.ins-det

Application of a Mixture of Experts-based Foundation Model to the GlueX DIRC Detector

Cristiano Fanelli, James Giroux, Cole Granger, Justin Stevens

Comments 18 pages, 10 figures

2604.24765 2026-04-29 eess.SP cs.HC cs.LG

Interpretable Fuzzy Modeling Reveals Population-Level Representation Differences in P300 Brain Computer Interfaces Across Neurodivergent and Neurotypical Cohorts

Xiaowei Jiang, Sudong Shang, Adrian Wilkinson, Michael L. Platt, Da Xiao, Bening Cao, Thomas Do

2604.24477 2026-04-29 cs.CR cs.AI cs.MA

GAMMAF: A Common Framework for Graph-Based Anomaly Monitoring Benchmarking in LLM Multi-Agent Systems

Pablo Mateo-Torrejón, Alfonso Sánchez-Macián

2604.22762 2026-04-29 cs.IR cs.AI

Behavioral Intelligence Platforms: From Event Streams to Autonomous Insight via Probabilistic Journey Graphs, Behavioral Knowledge Extraction, and Grounded Language Generation

Arun Patra, Bhushan Vadgave

Comments v2: corrected numerical values in Fig 3 and Sec 7.2 fact bundle to match published simulation scripts; clarified Markov-property identity in Sec 4.2.2; added simulate_trajectories.py for Monte Carlo reproducibility; softened confidence and path-quality presentation; added Markov-attribution citations (Anderl 2016, Shao & Li 2011, Kakalejcik 2022). Formal results unchanged

2604.21598 2026-04-29 cs.SE cs.AI

You Don't Need Public Tests to Generate Correct Code

Kaushitha Silva, Srinath Perera

Comments 9 pages, 6 figures

2604.20891 2026-04-29 cs.AR cs.AI cs.ET cs.LO

Ternary Memristive Logic: Hardware for Reasoning Realized via Domain Algebra

Chao Li

Comments 24 pages

2604.15384 2026-04-29 cs.CR cs.AI cs.SE

LinuxArena: A Control Setting for AI Agents in Live Production Software Environments

Tyler Tracy, Ram Potham, Nick Kuhn, Myles Heller, Anshul Khandelwal, Cody Rushing, Henri Lemoine, Miguel Brandao, Tomas Turlik, Adam Hanson, Josh Hills, Amy Ngo, Ram Rachum, Nik Mitchell, Falko Galperin, Oscar Sykes, Pip Arnott, Samuel Prieto Lima, Carlos Giudice, Matt Goldwater, Daniel Popp, Drew de Wet, Ruben Castaing, Qi Guo, Douw Marx, Benjamin Shaffrey, Justin Shenk, Martin Milbradt, Hannah Meagher, Shaheen Ahmed-Chowdhury, Daniel O'Connell, Chris Canal, Buck Shlegeris, Aryan Bhatt

2604.09618 2026-04-29 cs.DC cs.AI cs.CR

HearthNet: Edge Multi-Agent Orchestration for Smart Homes

Zhonghao Zhan, Krinos Li, Yefan Zhang, Hamed Haddadi

Comments (CAIS 2026) Proceedings of the ACM Conference on AI and Agentic Systems, Demo Track

2604.04978 2026-04-29 cs.SE cs.AI cs.CR

Measuring the Permission Gate: A Stress-Test Evaluation of Claude Code's Auto Mode

Zimo Ji, Zongjie Li, Wenyuan Jiang, Yudong Gao, Shuai Wang

2604.04973 2026-04-29 stat.ML cs.LG cs.SD

StrADiff: A Structured Source-Wise Adaptive Diffusion Framework for Linear and Nonlinear Blind Source Separation

Yuan-Hao Wei

2603.28886 2026-04-29 cs.IR cs.LG

Calibrated Fusion for Heterogeneous Graph-Vector Retrieval in Multi-Hop QA

Andre Bacellar

Comments 10 pages, 5 figures

2603.21942 2026-04-29 physics.chem-ph cs.AI

Suiren-1.0 Technical Report: A Family of Molecular Foundation Models

Junyi An, Xinyu Lu, Yun-Fei Shi, Li-Cheng Xu, Nannan Zhang, Chao Qu, Yuan Qi, Fenglei Cao

Comments 24 pages, 5 figures

2603.19874 2026-04-29 stat.ML cs.LG

Minimax Generalized Cross-Entropy

Kartheek Bondugula, Santiago Mazuelas, Aritz Pérez, Anqi Liu

2603.13730 2026-04-29 cs.IR cs.AI

R3-REC: Reasoning-Driven Recommendation via Retrieval-Augmented LLMs over Multi-Granular Interest Signals

Yuchen Miao, Mingxuan Cui, Yitong Zhu, Yu Wang, Siyang Xu

Comments 5 pages, 4 figures, 2 tables. Accepted to the 2026 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2026)

2602.13864 2026-04-29 cs.NE cs.LG

Evolving Multi-Channel Confidence-Aware Activation Functions for Missing Data with Channel Propagation

Naeem Shahabi Sani, Ferial Najiantabriz, Shayan Shafaei, Dean F. Hougen

Comments Accepted at GECCO 2026. 9 pages, 4 figures, 10 tables

2602.10750 2026-04-29 cs.CR cs.AI cs.CV cs.LG

SecureScan: An AI-Driven Multi-Layer Framework for Malware and Phishing Detection Using Logistic Regression and Threat Intelligence Integration

Rumman Firdos, Aman Dangi

2602.00065 2026-04-29 cs.CY cs.AI

Responsible Evaluation of AI for Mental Health

Hiba Arnaout, Anmol Goel, H. Andrew Schwartz, Steffen T. Eberhardt, Dana Atzil-Slonim, Gavin Doherty, Brian Schwartz, Wolfgang Lutz, Tim Althoff, Munmun De Choudhury, Hamidreza Jamalabadi, Raj Sanjay Shah, Flor Miriam Plaza-del-Arco, Dirk Hovy, Maria Liakata, Iryna Gurevych

2601.22597 2026-04-29 cs.SE cs.CL

TimeMachine-bench: A Benchmark for Evaluating Model Capabilities in Repository-Level Migration Tasks

Ryo Fujii, Makoto Morishita, Kazuki Yano, Jun Suzuki

Comments Accepted to EACL 2026 Main, camera-ready

2601.18612 2026-04-29 cs.CR cs.CV

Multimodal Privacy-Preserving Entity Resolution with Fully Homomorphic Encryption

Susim Roy, Nalini Ratha

Comments 5 pages, 3 figures, IEEE ICASSP'26

2601.14925 2026-04-29 eess.AS cs.AI

Fast-ULCNet: A fast and ultra low complexity network for single-channel speech enhancement

Nicolás Arrieta Larraza, Niels de Koeijer

Comments ©2026 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

2601.08816 2026-04-29 cs.IR cs.AI

MemRec: Collaborative Memory-Augmented Agentic Recommender System

Weixin Chen, Yuhan Zhao, Jingyuan Huang, Zihe Ye, Clark Mingxuan Ju, Tong Zhao, Neil Shah, Li Chen, Yongfeng Zhang

Comments Accepted at ACL 2026 Main Conference

2601.01930 2026-04-29 cs.IR cs.AI

MCGI: Manifold-Consistent Graph Indexing for Billion-Scale Disk-Resident Vector Search

Dongfang Zhao

2512.20677 2026-04-29 cs.CR cs.CL

Learning-Based Automated Adversarial Red-Teaming for Robustness Evaluation of Large Language Models

Zhang Wei, Hanxuan Chen, Peilu Hu, Zhenyuan Wei, Chenwei Liang, Jing Luo, Ziyi Ni, Hao Yan, Li Mei, Shengning Lang, Kuan Lu, Xi Xiao, Zhimo Han, Yijin Wang, Yichao Zhang, Chen Yang, Junfeng Hao, Jiayi Gu, Riyang Bao, Mu-Jiang-Shan Wang

Comments Accepted by EACL; camera-ready version

2512.05207 2026-04-29 cs.NI cs.LG cs.MA

Hierarchical Reinforcement Learning for the Dynamic VNE with Alternatives Problem

Ali Al Housseini, Cristina Rottondi, Omran Ayoub

Comments This paper was rejected from the conferences I submitted it to, and it turns out that it contains several errors; please review the MILP section

2512.05201 2026-04-29 cs.NI cs.SD

MuMeNet: A Network Simulator for Musical Metaverse Communications

Ali Al Housseini, Jaime Llorca, Luca Turchet, Tiziano Leidi, Cristina Rottondi, Omran Ayoub

Comments To appear in 2025 IEEE 6th International Symposium on the Internet of Sounds (IS2) proceedings

2511.20006 2026-04-29 eess.AS cs.AI cs.SD

BERT-APC: A Reference-free Framework for Automatic Pitch Correction via Musical Context Inference

Sungjae Kim, Kihyun Na, Jinyoung Choi, Injung Kim

Comments 14 pages, 8 figures, 8 tables. Accepted for publication in IEEE Transactions on Audio, Speech, and Language Processing

Details

English abstract

Automatic Pitch Correction (APC) enhances vocal recordings by aligning pitch deviations with intended musical notes. However, existing APC systems either rely on reference pitches, which limits practical applicability, or employ simple pitch estimation algorithms that often fail to preserve expressiveness and naturalness. We propose BERT-APC, a reference-free APC framework that corrects pitch errors while maintaining the expressiveness and naturalness of vocal performances. In BERT-APC, a stationary pitch predictor first estimates the stationary pitch of each note from the detuned singing voice, where stationary pitch is the continuous pitch from the stable region of a note and approximates its perceived pitch. A context-aware note pitch predictor then infers the intended pitch sequence using a repurposed music language model that incorporates musical context. Finally, a note-level correction algorithm fixes pitch errors while preserving intentional deviations for emotional expression. We also introduce a learnable data augmentation strategy that improves robustness by simulating realistic detuning patterns. Compared to two recent singing voice transcription models, BERT-APC demonstrated superior target note pitch prediction, outperforming the second-best model, ROSVOT, by 10.49 percentage points on highly detuned samples in raw pitch accuracy. In the MOS test, BERT-APC achieved the highest quality rating of $4.32 \pm 0.15$, significantly higher than Auto-Tune ($3.22 \pm 0.18$) and Melodyne ($3.08 \pm 0.18$), while maintaining a comparable ability to preserve expressive nuances. To the best of our knowledge, this is the first APC model that leverages a music language model to achieve reference-free pitch correction with symbolic musical context. The corrected audio samples are available at https://joshua-1995.github.io/BERT-APC-Demo/.
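
The note-level correction step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the trimming ratio, and the use of a median over the middle of the note as the "stationary pitch" are all assumptions made for illustration, and rounding to the nearest semitone stands in for the paper's context-aware note pitch predictor. Pitches are assumed to be given as MIDI semitone values, one list per note. The idea shown is that shifting a whole note by a single offset moves it to the target pitch while leaving within-note deviations such as vibrato untouched.

```python
from statistics import median


def stationary_pitch(contour, trim=0.25):
    """Estimate a note's stable pitch (hypothetical helper).

    Assumption: the stable region is the middle of the note, so the
    first and last `trim` fraction of frames are ignored and the
    median of the remaining frames is returned.
    """
    n = len(contour)
    k = int(n * trim)
    core = contour[k:n - k] or contour  # fall back for very short notes
    return median(core)


def correct_note(contour, target_pitch):
    """Shift every frame of the note by one constant offset.

    The offset moves the stationary pitch onto `target_pitch`;
    frame-to-frame differences (e.g. vibrato) are preserved.
    """
    offset = target_pitch - stationary_pitch(contour)
    return [p + offset for p in contour]


def nearest_semitone(pitch):
    """Stand-in target predictor (assumption, not the paper's model)."""
    return float(round(pitch))
```

For example, a note whose frames hover around 60.35 would be shifted down by about 0.35 semitones to sit on 60.0 when `correct_note(contour, nearest_semitone(stationary_pitch(contour)))` is called, with its onset glide and small fluctuations intact.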

2511.13595 2026-04-29 eess.SY cs.AI cs.SY

Physics-Informed Neural Networks for Nonlinear Output Regulation

Sebastiano Mengozzi, Giovanni B. Esposito, Michelangelo Bin, Andrea Acquaviva, Andrea Bartolini, Lorenzo Marconi