arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.04331 2026-04-07 cs.CV cs.AI

GA-GS: Generation-Assisted Gaussian Splatting for Static Scene Reconstruction

Yedong Shen, Shiqi Zhang, Sha Zhang, Yifan Duan, Xinran Zhang, Wenhao Yu, Lu Zhang, Jiajun Deng, Yanyong Zhang

详情

英文摘要

Reconstructing static 3D scene from monocular video with dynamic objects is important for numerous applications such as virtual reality and autonomous driving. Current approaches typically rely on background for static scene reconstruction, limiting the ability to recover regions occluded by dynamic objects. In this paper, we propose GA-GS, a Generation-Assisted Gaussian Splatting method for Static Scene Reconstruction. The key innovation of our work lies in leveraging generation to assist in reconstructing occluded regions. We employ a motion-aware module to segment and remove dynamic regions, and thenuse a diffusion model to inpaint the occluded areas, providing pseudo-ground-truth supervision. To balance contributions from real background and generated region, we introduce a learnable authenticity scalar for each Gaussian primitive, which dynamically modulates opacity during splatting for authenticity-aware rendering and supervision. Since no existing dataset provides ground-truth static scene of video with dynamic objects, we construct a dataset named Trajectory-Match, using a fixed-path robot to record each scene with/without dynamic objects, enabling quantitative evaluation in reconstruction of occluded regions. Extensive experiments on both the DAVIS and our dataset show that GA-GS achieves state-of-the-art performance in static scene reconstruction, especially in challenging scenarios with large-scale, persistent occlusions.

URL PDF HTML ☆

赞 0 踩 0

2604.04325 2026-04-07 cs.CL

Benchmarking Multi-turn Medical Diagnosis: Hold, Lure, and Self-Correction

Jinrui Fang, Runhan Chen, Xu Yang, Jian Yu, Jiawei Xu, Ashwin Vinod, Wenqi Shi, Tianlong Chen, Heng Ji, ChengXiang Zhai, Ying Ding, Yuji Zhang

2604.04324 2026-04-07 cs.AI cs.SE

RESCORE: LLM-Driven Simulation Recovery in Control Systems Research Papers

Vineet Bhat, Shiqing Wei, Ali Umut Kaypak, Prashanth Krishnamurthy, Ramesh Karri, Farshad Khorrami

Comments This work has been submitted to the IEEE for possible publication

2604.04323 2026-04-07 cs.CL

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

Yujian Liu, Jiabao Ji, Li An, Tommi Jaakkola, Yang Zhang, Shiyu Chang

2604.04316 2026-04-07 cs.LG

How Long short-term memory artificial neural network, synthetic data, and fine-tuning improve the classification of raw EEG data

Albert Nasybullin, Vladimir Maksimenko, Semen Kurkin

Comments 4 pages, 4 figures, 2 tables

2604.04313 2026-04-07 cs.LG

Convolutional Neural Network and Adversarial Autoencoder in EEG images classification

Albert Nasybullin, Semen Kurkin

Comments 4 pages, 6 figures

2604.04300 2026-04-07 cs.CL cs.LG

High-Stakes Personalization: Rethinking LLM Customization for Individual Investor Decision-Making

Yash Ganpat Sawant

Comments 4 pages + 1 page references. Submitted to CustomNLP4U Workshop @ ACL 2026

2604.04299 2026-04-07 cs.CV cs.AI

A Persistent Homology Design Space for 3D Point Cloud Deep Learning

Prachi Kudeshia, Jiju Poovvancheri, Amr Ghoneim, Dong Chen

Comments 27 pages, 12 figures, 5 tables

详情

英文摘要

Persistent Homology (PH) offers stable, multi-scale descriptors of intrinsic shape structure by capturing connected components, loops, and voids that persist across scales, providing invariants that complement purely geometric representations of 3D data. Yet, despite strong theoretical guarantees and increasing empirical adoption, its integration into deep learning for point clouds remains largely ad hoc and architecturally peripheral. In this work, we introduce a unified design space for Persistent-Homology driven learning in 3D point clouds (3DPHDL), formalizing the interplay between complex construction, filtration strategy, persistence representation, neural backbone, and prediction task. Beyond the canonical pipeline of diagram computation and vectorization, we identify six principled injection points through which topology can act as a structural inductive bias reshaping sampling, neighborhood graphs, optimization dynamics, self-supervision, output calibration, and even internal network regularization. We instantiate this framework through a controlled empirical study on ModelNet40 classification and ShapeNetPart segmentation, systematically augmenting representative backbones (PointNet, DGCNN, and Point Transformer) with persistence diagrams, images, and landscapes, and analyzing their impact on accuracy, robustness to noise and sampling variation, and computational scalability. Our results demonstrate consistent improvements in topology-sensitive discrimination and part consistency, while revealing meaningful trade-offs between representational expressiveness and combinatorial complexity. By viewing persistent homology not merely as an auxiliary feature but as a structured component within the learning pipeline, this work provides a systematic framework for incorporating topological reasoning into 3D point cloud learning.

URL PDF HTML ☆

赞 0 踩 0

2604.04297 2026-04-07 cs.AI

PanLUNA: An Efficient and Robust Query-Unified Multimodal Model for Edge Biosignal Intelligence

Marija Zelic, Anna Tegon, Yawei Li, Thorir Mar Ingolfsson, Luca Benini

Comments 5 pages, 5 tables, 1 figure, preprint

2604.04291 2026-04-07 cs.LG

Correcting Source Mismatch in Flow Matching with Radial-Angular Transport

Fouad Oubari, Mathilde Mougeot

2604.04290 2026-04-07 cs.LG

DAGAF: A directed acyclic generative adversarial framework for joint structure learning and tabular data synthesis

Hristo Petkov, Calum MacLellan, Feng Dong

Comments The code for this paper is available at https://github.com/ItsyPetkov/DAGAF

2604.04286 2026-04-07 cs.RO

Real-Time Projected Adaptive Control for Closed-Chain Co-Manipulative Continuum Robots

Rana Danesh, Farrokh Janabi-Sharifi, Farhad Aghili

2604.04281 2026-04-07 cs.AI

Preservation Is Not Enough for Width Growth: Regime-Sensitive Selection of Dense LM Warm Starts

Eren Unlu

Comments 16 pages, 2 figures, 8 tables

2604.04274 2026-04-07 cs.AI cs.CE stat.AP

InferenceEvolve: Towards Automated Causal Effect Estimators through Self-Evolving AI

Can Wang, Hongyu Zhao, Yiqun Chen

2604.04261 2026-04-07 cs.LG cs.AI

APPA: Adaptive Preference Pluralistic Alignment for Fair Federated RLHF of LLMs

Mahmoud Srewa, Tianyu Zhao, Salma Elmalaki

2604.04258 2026-04-07 cs.AI cs.HC

Context Engineering: A Practitioner Methodology for Structured Human-AI Collaboration

Elias Calboreanu

Comments 39 pages, 6 figures, 10 tables, 47 references. Submitted to Springer Nature journal. Open-access extraction datasets and methodology artifacts available

2604.04255 2026-04-07 cs.LG cs.CR

Towards Unveiling Vulnerabilities of Large Reasoning Models in Machine Unlearning

Aobo Chen, Chenxu Zhao, Chenglin Miao, Mengdi Huai

2604.04250 2026-04-07 cs.CL

CAWN: Continuous Acoustic Wave Networks for Autoregressive Language Modeling

Dejan Čugalj, Aleksandar Jevremovic

Comments 13 pages, 3 figures

详情

DOI: 10.5281/zenodo.19339514

英文摘要

Modern Large Language Models (LLMs) rely on Transformer self-attention, which scales quadratically with sequence length. Recent linear-time alternatives, like State Space Models (SSMs), often suffer from signal degradation over extended contexts. We introduce the Continuous Acoustic Wave Network (CAWN), a fully continuous sequence-mixing architecture. Instead of discrete matrix-based attention, CAWN projects hidden states into multi-headed complex-domain phasors, achieving sequence mixing through a causal, $O(L)$ Phase Accumulation mechanism. To prevent signal degradation over ultra-long contexts, we introduce a dual-gated Selective Phase Resonance mechanism incorporating Frequency-Dependent Retention, Hard-Threshold Gating via Straight-Through Estimation, and a Temporal Syntax Cache to capture short-term local dependencies. We also replace standard dense linear projections with Depth-wise Harmonic Convolutions for optimal spatial frequency mixing, augmented by Block Attention Residuals for depth-wise state routing. Scaled to a 150M-parameter model, CAWN utilizes custom Triton kernels for hardware-efficient, true-complex phase accumulation in float32. Trained via a continuous streaming loop on a 100-Billion-token corpus, the prototype is evaluated at a 5-Billion-token milestone. Empirical evaluations via a Targeted Semantic Retrieval protocol demonstrate robust vocabulary acquisition and extended explicitly learned contextual denoising. By leveraging $O(1)$ state-passing via chunked prefill, the model retrieves targeted information across 2,000,000 tokens while strictly plateauing at 8.72 GB of Peak VRAM, empirically overcoming the $O(L^2)$ context memory wall.

URL PDF HTML ☆

赞 0 踩 0

2604.04247 2026-04-07 cs.AI cs.CL cs.LG

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

Hanchen Li, Runyuan He, Qizheng Zhang, Changxiu Ji, Qiuyang Mang, Xiaokun Chen, Lakshya A Agrawal, Wei-Liang Liao, Eric Yang, Alvin Cheung, James Zou, Kunle Olukotun, Ion Stoica, Joseph E. Gonzalez

2604.04241 2026-04-07 cs.LG math.OC

Learning An Interpretable Risk Scoring System for Maximizing Decision Net Benefit

Wenhao Chi, Ş. İlker Birbil

Comments 31 pages, 5 figures, and 6 tables

2604.04240 2026-04-07 cs.LG physics.soc-ph

Peoples Water Data: Enabling Reliable Field Data Generation and Microbial Contamination Screening in Household Drinking Water

Suzan Kagan, Shira Spigelman, Sankar Sudhir, Thalappil Pradeep, Hadas Mamane

2604.04239 2026-04-07 cs.LG cs.AI q-bio.QM

Good Rankings, Wrong Probabilities: A Calibration Audit of Multimodal Cancer Survival Models

Sajad Ghawami

Comments 15 pages, 5 figures

2604.04237 2026-04-07 cs.AI cs.CY cs.LG

Pedagogical Safety in Educational Reinforcement Learning: Formalizing and Detecting Reward Hacking in AI Tutoring Systems

Oluseyi Olukola, Nick Rahimi

Comments 43 pages, 5 figures. Submitted to the International Journal of Artificial Intelligence in Education (IJAIED)

2604.04233 2026-04-07 cs.RO cs.CL

Precise Robot Command Understanding Using Grammar-Constrained Large Language Models

Xinyun Huo, Raghav Gnanasambandam, Xinyao Zhang

Comments Accepted at ASME MSEC2026

2604.04231 2026-04-07 cs.LG

Subspace Control: Turning Constrained Model Steering into Controllable Spectral Optimization

Yancheng Huang, Changsheng Wang, Chongyu Fan, Yicheng Lang, Bingqi Shang, Yang Zhang, Mingyi Hong, Qing Qu, Alvaro Velasquez, Sijia Liu

2604.04230 2026-04-07 cs.LG cs.AI cs.MA

Three Phases of Expert Routing: How Load Balance Evolves During Mixture-of-Experts Training

Charafeddine Mouzouni

2604.04225 2026-04-07 cs.LG cs.AI cs.RO cs.SY eess.SY

Learning from Imperfect Demonstrations via Temporal Behavior Tree-Guided Trajectory Repair

Aniruddh G. Puranic, Sebastian Schirmer, John S. Baras, Calin Belta

Comments 12 pages, 4 figures. This work has been submitted to the IEEE for possible publication

2604.04220 2026-04-07 cs.AI

TimeSeek: Temporal Reliability of Agentic Forecasters

Hamza Mostafa, Om Shastri, Dennis Lee

Comments Workshop paper. 11 pages including references

2604.04215 2026-04-07 cs.CL

DARE: Diffusion Large Language Models Alignment and Reinforcement Executor

Jingyi Yang, Yuxian Jiang, Xuhao Hu, Shuang Cheng, Biqing Qi, Jing Shao

Comments 14 pages,3 figures,5 tables

2604.04208 2026-04-07 cs.LG

Towards Agentic Defect Reasoning: A Graph-Assisted Retrieval Framework for Laser Powder Bed Fusion

Muhammad Rizwan Awan, Volker Pickert, Muhammad Waqar Ashraf, Saleh Ali, Farshid Mahmouditabar, Shafiq Odhano