arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.29755 2026-04-01 cs.AI

CausalPulse: An Industrial-Grade Neurosymbolic Multi-Agent Copilot for Causal Diagnostics in Smart Manufacturing

Chathurangi Shyalika, Utkarshani Jaimini, Cory Henson, Amit Sheth

Comments 10 pages, 8 figures, 4 tables, Accepted at AAAI-MAKE 2026 (AAAI Spring Symposium on Machine Learning and Knowledge Engineering for Knowledge-Grounded Semantic Agents)

详情

英文摘要

Modern manufacturing environments demand real-time, trustworthy, and interpretable root-cause insights to sustain productivity and quality. Traditional analytics pipelines often treat anomaly detection, causal inference, and root-cause analysis as isolated stages, limiting scalability and explainability. In this work, we present CausalPulse, an industry-grade multi-agent copilot that automates causal diagnostics in smart manufacturing. It unifies anomaly detection, causal discovery, and reasoning through a neurosymbolic architecture built on standardized agentic protocols. CausalPulse is being deployed in a Robert Bosch manufacturing plant, integrating seamlessly with existing monitoring workflows and supporting real-time operation at production scale. Evaluations on both public (Future Factories) and proprietary (Planar Sensor Element) datasets show high reliability, achieving overall success rates of 98.0% and 98.73%. Per-criterion success rates reached 98.75% for planning and tool use, 97.3% for self-reflection, and 99.2% for collaboration. Runtime experiments report end-to-end latency of 50-60s per diagnostic workflow with near-linear scalability (R^2=0.97), confirming real-time readiness. Comparison with existing industrial copilots highlights distinct advantages in modularity, extensibility, and deployment maturity. These results demonstrate how CausalPulse's modular, human-in-the-loop design enables reliable, interpretable, and production-ready automation for next-generation manufacturing.

URL PDF HTML ☆

赞 0 踩 0

2603.29734 2026-04-01 cs.CV

GRVS: a Generalizable and Recurrent Approach to Monocular Dynamic View Synthesis

Thomas Tanay, Mohammed Brahimi, Michal Nazarczuk, Qingwen Zhang, Sibi Catley-Chandar, Arthur Moreau, Zhensong Zhang, Eduardo Pérez-Pellitero

Comments CVPR Findings 2026

2603.29733 2026-04-01 cs.CV

Leveraging Synthetic Data for Enhancing Egocentric Hand-Object Interaction Detection

Rosario Leonardi, Antonino Furnari, Francesco Ragusa, Giovanni Maria Farinella

2603.29732 2026-04-01 cs.CV

Compressive sensing inspired self-supervised single-pixel imaging

Jijun Lu, Yifan Chen, Libang Chen, Yiqiang Zhou, Ye Zheng, Mingliang Chen, Zhe Sun, Xuelong Li

Comments 10 pages, 9 figures, 2 algorithms, 2 tables, journal paper

2603.29723 2026-04-01 cs.AI

Reinforced Reasoning for End-to-End Retrosynthetic Planning

Chenyang Zuo, Siqi Fan, Yizhen Luo, Zaiqing Nie

2603.29715 2026-04-01 cs.LG eess.SP math.OC stat.ML

Nonnegative Matrix Factorization in the Component-Wise L1 Norm for Sparse Data

Giovanni Seraghiti, Kévin Dubrulle, Arnaud Vandaele, Nicolas Gillis

Comments 21 pages before supplementary, code available from https://github.com/giovanniseraghiti/wL1-NMF

2603.29710 2026-04-01 cs.SD eess.AS

A Comprehensive Corpus of Biomechanically Constrained Piano Chords: Generation, Analysis, and Implications for Voicing and Psychoacoustics

Mahesh Ramani

Comments 10 pages, 3 figures

2603.29709 2026-04-01 cs.AI cs.LG

Symphony for Medical Coding: A Next-Generation Agentic System for Scalable and Explainable Medical Coding

Joakim Edin, Andreas Motzfeldt, Simon Flachs, Lars Maaløe

2603.29708 2026-04-01 cs.RO cs.SY eess.SY math.DS

SafeDMPs: Integrating Formal Safety with DMPs for Adaptive HRI

Soumyodipta Nath, Pranav Tiwari, Ravi Prakash

Comments 8 pages, 8 figures and 1 table

2603.29697 2026-04-01 cs.CV

FED-Bench: A Cross-Granular Benchmark for Disentangled Evaluation of Facial Expression Editing

Fengjian Xue, Xuecheng Wu, Heli Sun, Yunyun Shi, Shi Chen, Liangyu Fu, Jinheng Xie, Dingkang Yang, Hao Wang, Junxiao Xue, Liang He

2603.29694 2026-04-01 cs.CV cs.AI

Exploring the Impact of Skin Color on Skin Lesion Segmentation

Kuniko Paxton, Medina Kapo, Amila Akagić, Koorosh Aslansefat, Dhavalkumar Thakker, Yiannis Papadopoulos

2603.29692 2026-04-01 cs.CV

SkeletonContext: Skeleton-side Context Prompt Learning for Zero-Shot Skeleton-based Action Recognition

Ning Wang, Tieyue Wu, Naeha Sharif, Farid Boussaid, Guangming Zhu, Lin Mei, Mohammed Bennamoun, zhang liang

Comments Accepted by CVPR 2026

2603.29691 2026-04-01 cs.AI

A First Step Towards Even More Sparse Encodings of Probability Distributions

Florian Andreas Marwitz, Tanya Braun, Ralf Möller

Comments Published in ILP2021. The final authenticated publication is available online at https://doi.org/10.1007/978-3-030-97454-1_13

2603.29681 2026-04-01 cs.AI cs.HC

Beyond the Steeper Curve: AI-Mediated Metacognitive Decoupling and the Limits of the Dunning-Kruger Metaphor

Christopher Koch

2603.29677 2026-04-01 cs.LG cs.AI

Mind the Gap: A Framework for Assessing Pitfalls in Multimodal Active Learning

Dustin Eisenhardt, Yunhee Jeong, Florian Buettner

2603.29676 2026-04-01 cs.LG cs.CL cs.CV

A Comprehensive Information-Decomposition Analysis of Large Vision-Language Models

Lixin Xiu, Xufang Luo, Hideki Nakayama

Comments Accepted at ICLR 2026. Project page: https://riishin.github.io/pid-lvlm-iclr26/

2603.29670 2026-04-01 cs.CV

Clinical DVH metrics as a loss function for 3D dose prediction in head and neck radiotherapy

Ruochen Gao, Marius Staring, Frank Dankers

Comments 19 pages

2603.29666 2026-04-01 cs.CV

CoRe-DA: Contrastive Regression for Unsupervised Domain Adaptation in Surgical Skill Assessment

Dimitrios Anastasiou, Razvan Caramalau, Jialang Xu, Runlong He, Freweini Tesfai, Matthew Boal, Nader Francis, Danail Stoyanov, Evangelos B. Mazomenos

2603.29664 2026-04-01 cs.CV

CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

Shifang Zhao, Yihan Hu, Ying Shan, Yunchao Wei, Xiaodong Cun

Comments Project Code: https://github.com/GVCLab/CutClaw

2603.29661 2026-04-01 cs.CL cs.AI cs.IR

Agenda-based Narrative Extraction: Steering Pathfinding Algorithms with Large Language Models

Brian Felipe Keith-Norambuena, Carolina Inés Rojas-Córdova, Claudio Juvenal Meneses-Villegas, Elizabeth Johanna Lam-Esquenazi, Angélica María Flores-Bustos, Ignacio Alejandro Molina-Villablanca, Joshua Emanuel Leyton-Vallejos

Comments Text2Story Workshop 2026 at ECIR 2026

2603.29655 2026-04-01 cs.CV

Not All Frames Are Equal: Complexity-Aware Masked Motion Generation via Motion Spectral Descriptors

Pengfei Zhou, Xiangyue Zhang, Xukun Shen, Yong Hu

2603.29654 2026-04-01 cs.LG cs.AI stat.ML

Concept frustration: Aligning human concepts and machine representations

Enrico Parisini, Christopher J. Soelistyo, Ahab Isaac, Alessandro Barp, Christopher R. S. Banerji

Comments 34 pages, 7 figures

2603.29646 2026-04-01 cs.RO

Design and Aerodynamic Modeling of MetaMorpher: A Hybrid Rotary andFixed-Wing Morphing UAV

Anja Bosak, Dorian Erić, Ana Milas, Stjepan Bogdan

Comments 8 pages, 12 figures

2603.29644 2026-04-01 cs.LG

Disentangled Graph Prompting for Out-Of-Distribution Detection

Cheng Yang, Yu Hao, Qi Zhang, Chuan Shi

Comments Accepted for publication in IEEE Transactions on Knowledge and Data Engineering (TKDE)

2603.29643 2026-04-01 cs.AI

Optimizing Donor Outreach for Blood Collection Sessions: A Scalable Decision Support Framework

André Carneiro, Pedro T. Monteiro, Rui Henriques

Comments 16 pages, 9 figures, 4 supplementary figures, 2 supplementary tables

详情

英文摘要

Blood donation centers face challenges in matching supply with demand while managing donor availability. Although targeted outreach is important, it can cause donor fatigue via over-solicitation. Effective recruitment requires targeting the right donors at the right time, balancing constraints with donor convenience and eligibility. Despite extensive work on blood supply chain optimization and growing interest in algorithmic donor recruitment, the operational problem of assigning donors to sessions across a multi-site network, taking into account eligibility, capacity, blood-type demand targets, geographic convenience, and donor safety, remains unaddressed. We address this gap with an optimization framework for donor invitation scheduling incorporating donor eligibility, travel convenience, blood-type demand targets, and penalties. We evaluate two strategies: (i) a binary integer linear programming (BILP) formulation and (ii) an efficient greedy heuristic. Evaluation uses the registry from Instituto Português do Sangue e da Transplantação (IPST) for invite planning in the Lisbon operational region using 4-month windows. A prospective pipeline integrates organic attendance forecasting, quantile-based demand targets, and residual capacity estimation for forward-looking invitation plans. Results reveal its key role in closing the supply-demand gap in the Lisbon operational region. A controlled comparison shows that the greedy heuristic achieves results comparable to the BILP, with 188x less peak memory and 115x faster runtime; trade-offs include 3.9 pp lower demand fulfillment (86.1% vs. 90.0%), larger donor-session distance, higher adverse-reaction donor exposure, and greater invitation burden per non-high-frequency donor, reflecting local versus global optimization. Experiments assess how constraint-aware scheduling can close gaps by mobilizing eligible inactive/lapsing donors.

URL PDF HTML ☆

赞 0 踩 0

2603.29640 2026-04-01 cs.AI

ASI-Evolve: AI Accelerates AI

Weixian Xu, Tiantian Mi, Yixiu Liu, Yang Nan, Zhimeng Zhou, Lyumanshan Ye, Lin Zhang, Yu Qiao, Pengfei Liu

Comments 19 pages, 6 figures, 6 tables. Code available at https://github.com/GAIR-NLP/ASI-Evolve

2603.29634 2026-04-01 cs.CV cs.AI

MacTok: Robust Continuous Tokenization for Image Generation

Hengyu Zeng, Xin Gao, Guanghao Li, Yuxiang Yan, Jiaoyang Ruan, Junpeng Ma, Haoyu Albert Wang, Jian Pu

2603.29633 2026-04-01 cs.CV

Self-Supervised Federated Learning under Data Heterogeneity for Label-Scarce Diatom Classification

Mingkun Tan, Xilu Wang, Michael Kloster, Tim W. Nattkemper

Comments 22 pages, 9 figures

详情

英文摘要

Label-scarce visual classification under decentralized and heterogeneous data is a fundamental challenge in pattern recognition, especially when sites exhibit partially overlapping class sets. While self-supervised federated learning (SSFL) offers a promising solution, existing studies commonly assume the same data heterogeneity pattern throughout pre-training and fine-tuning. Moreover, current partitioning schemes often fail to generate pure partially class-disjoint data settings, limiting controllable simulation of real-world label-space heterogeneity. In this work, we introduce SSFL for diatom classification as a representative real-world instance and systematically investigate stage-specific data heterogeneity. We study cross-site variation in unlabeled data volume during pre-training and label-space misalignment during downstream fine-tuning. To study the latter in a controllable setting, we propose PreDi, a partitioning scheme that disentangles label-space heterogeneity into two orthogonal dimensions, namely class Prevalence and class-set size Disparity, enabling separate analysis of their effects. Guided by the resulting insights, we further propose PreP-WFL (Prevalence-based Personalized Weighted Federated Learning) to adaptively strengthen rare-class representations in low-prevalence scenarios. Extensive experiments show that SSFL consistently outperforms local-only training under both homogeneous and heterogeneous settings. The pronounced heterogeneity in unlabeled data volume is associated with improved representation pre-training, whereas under label-space heterogeneity, prevalence dominates performance and disparity has a smaller effect. PreP-WFL effectively mitigates this degradation, with gains increasing as prevalence decreases. These findings provide a mechanistic basis for characterizing label-space heterogeneity in decentralized recognition systems.

URL PDF HTML ☆

赞 0 踩 0

2603.29631 2026-04-01 cs.CV cs.DC cs.IR

Storing Less, Finding More: How Novelty Filtering Improves Cross-Modal Retrieval on Edge Cameras

Sherif Abdelwahab

Comments 6 pages, 3 figures, 5 tables; supplementary video included as ancillary file

2603.29627 2026-04-01 cs.RO

Semantic Zone-Based Map Management for Stable AI-Integrated Mobile Robots

Huichang Yun, Seungho Yoo