arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2509.24758 2026-03-10 cs.CV

ExGS: Extreme 3D Gaussian Compression with Diffusion Priors

Jiaqi Chen, Xinhao Ji, Yuanyuan Gao, Hao Li, Yuning Gong, Yifei Liu, Dan Xu, Zhihang Zhong, Dingwen Zhang, Xiao Sun

详情

英文摘要

Neural scene representations, such as 3D Gaussian Splatting (3DGS), have enabled high-quality neural rendering; however, their large storage and transmission costs hinder deployment in resource-constrained environments. Existing compression methods either rely on costly optimization, which is slow and scene-specific, or adopt training-free pruning and quantization, which degrade rendering quality under high compression ratios. In contrast, recent data-driven approaches provide a promising direction to overcome this trade-off, enabling efficient compression while preserving high rendering quality. We introduce ExGS, a novel feed-forward framework that unifies Universal Gaussian Compression (UGC) with GaussPainter for Extreme 3DGS compression. UGC performs re-optimization-free pruning to aggressively reduce Gaussian primitives while retaining only essential information, whereas GaussPainter leverages powerful diffusion priors with mask-guided refinement to restore high-quality renderings from heavily pruned Gaussian scenes. Unlike conventional inpainting, GaussPainter not only fills in missing regions but also enhances visible pixels, yielding substantial improvements in degraded renderings. To ensure practicality, it adopts a lightweight VAE and a one-step diffusion design, enabling real-time restoration. Our framework can even achieve over 100X compression (reducing a typical 354.77 MB model to about 3.31 MB) while preserving fidelity and significantly improving image quality under challenging conditions. These results highlight the central role of diffusion priors in bridging the gap between extreme compression and high-quality neural rendering. Our code repository will be released at: https://github.com/chenttt2001/ExGS

URL PDF HTML ☆

赞 0 踩 0

2509.21344 2026-03-10 cs.AI cs.CL cs.LG

Linear probes rely on textual evidence: Results from leakage mitigation studies in language models

Gerard Boxo, Aman Neelappa, Shivam Raval

Comments 33 pages, 22 figures

2509.14980 2026-03-10 cs.RO cs.AI cs.CV

M4Diffuser: Multi-View Diffusion Policy with Manipulability-Aware Control for Robust Mobile Manipulation

Ju Dong, Lei Zhang, Liding Zhang, Yao Ling, Yu Fu, Kaixin Bai, Zoltán-Csaba Márton, Zhenshan Bing, Zhaopeng Chen, Alois Christian Knoll, Jianwei Zhang

Comments Project page: https://sites.google.com/view/m4diffuser, 10 pages, 9 figures

2509.14191 2026-03-10 cs.RO cs.CV

MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping

Zhihao Cao, Hanyu Wu, Li Wa Tang, Zizhou Luo, Wei Zhang, Marc Pollefeys, Zihan Zhu, Martin R. Oswald

Comments Accepted to IEEE International Conference on Robotics and Automation (ICRA) 2026

2509.13965 2026-03-10 cs.RO cs.CV

MetricNet: Recovering Metric Scale in Generative Navigation Policies

Abhijeet Nayak, Débora Oliveira Makowski, Samiran Gode, Cordelia Schmid, Wolfram Burgard

Comments Accepted to ICRA'26

2509.08612 2026-03-10 cs.CL cs.AI

OTESGN: Optimal Transport-Enhanced Syntactic-Semantic Graph Networks for Aspect-Based Sentiment Analysis

Xinfeng Liao, Xuanqi Chen, Lianxi Wang, Jiahuan Yang, Zhuowei Chen, Ziying Rong

Comments This paper accepted by ICDM 2025 proposes OTESGN for ABSA, fusing syntactic-semantic signals via optimal transport and attention mechanisms. It achieves SOTA on Rest14, Laptop14 and Twitter (up to +1.30 Macro-F1 on Laptop14), with strong noise suppression and fine-grained sentiment capture capabilities. https://ieeexplore.ieee.org/document/11392054

2509.01613 2026-03-10 cs.LG cs.AI

Entropy-Driven Curriculum for Multi-Task Training in Human Mobility Prediction

Tianye Fang, Xuanshu Luo, Martin Werner

Comments Accepted to 2025 IEEE International Conference on Big Data (BigData); camera-ready version

2508.00923 2026-03-10 cs.LG

Beyond Benchmarks: Dynamic, Automatic And Systematic Red-Teaming Agents For Trustworthy Medical Language Models

Jiazhen Pan, Bailiang Jian, Paul Hager, Yundi Zhang, Che Liu, Friedrike Jungmann, Hongwei Bran Li, Chenyu You, Junde Wu, Jiayuan Zhu, Fenglin Liu, Yuyuan Liu, Niklas Bubeck, Christian Wachinger, Chen, Chen, Zhenyu Gong, Cheng Ouyang, Georgios Kaissis, Benedikt Wiestler, Daniel Rueckert

2507.23273 2026-03-10 cs.RO cs.CV

LIVE-GS: Online LiDAR-Inertial-Visual State Estimation and Globally Consistent Mapping with 3D Gaussian Splatting

Jaeseok Park, Chanoh Park, Minsu Kim, Minkyoung Kim, Soohwan Kim

2507.19098 2026-03-10 cs.CV cs.AI

MedSymmFlow: Bridging Generative Modeling and Classification in Medical Imaging through Symmetrical Flow Matching

Francisco Caetano, Lemar Abdi, Christiaan Viviers, Amaan Valiuddin, Fons van der Sommen

Comments DGM4MICCAI 2025

2507.06967 2026-03-10 cs.LG cs.AI

Noisy PDE Training Requires Bigger PINNs

Sebastien Andre-Sloan, Anirbit Mukherjee, Matthew Colbrook

2506.11024 2026-03-10 cs.LG cs.AI cs.DC

Co-LoRA: Collaborative Model Personalization on Heterogeneous Multi-Modal Clients

Minhyuk Seo, Taeheon Kim, Hankook Lee, Jonghyun Choi, Tinne Tuytelaars

Comments ICLR 2026

2506.01941 2026-03-10 cs.RO

FreeTacMan: Robot-free Visuo-Tactile Data Collection System for Contact-rich Manipulation

Longyan Wu, Checheng Yu, Jieji Ren, Li Chen, Yufei Jiang, Ran Huang, Guoying Gu, Hongyang Li

2506.00661 2026-03-10 cs.CV

Elytra: A Flexible Framework for Securing Large Vision Systems

Richard E. Neddo, Emmanuel Atindama, Zander W. Blasingame, Chen Liu

Comments Updated pre-print. Under review

2505.19719 2026-03-10 cs.LG cs.AI

OCN: Effectively Utilizing Higher-Order Common Neighbors for Better Link Prediction

Juntong Wang, Xiyuan Wang, Muhan Zhang

Comments 39th Conference on Neural Information Processing Systems (NeurIPS 2025)

2505.16321 2026-03-10 cs.CV

Efficient Motion Prompt Learning for Robust Visual Tracking

Jie Zhao, Xin Chen, Yongsheng Yuan, Michael Felsberg, Dong Wang, Huchuan Lu

Comments Accepted by ICML2025

2505.16187 2026-03-10 cs.RO cs.AI

EasyInsert: A Data-Efficient and Generalizable Insertion Policy

Guanghe Li, Junming Zhao, Shengjie Wang, Yang Gao

2505.10822 2026-03-10 cs.LG

Distilled Circuits: A Mechanistic Study of Internal Restructuring in Knowledge Distillation

Reilly Haskins, Benjamin Adams

2505.06046 2026-03-10 cs.CL cs.LG

Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information

Joshua Harris, Fan Grayson, Felix Feldman, Timothy Laurence, Toby Nonnenmacher, Oliver Higgins, Leo Loman, Selina Patel, Thomas Finnie, Samuel Collins, Michael Borowitz

Comments 27 pages, 9 pages main text

2505.05295 2026-03-10 cs.LG

Performance Estimation in Binary Classification Using Calibrated Confidence

Juhani Kivimäki, Jakub Białek, Wojtek Kuberski, Jukka K. Nurminen

Comments Accepted for publication in Machine Learning, (ACML 2025 Journal Track). Presented at the 17th Asian Conference on Machine Learning

2505.04095 2026-03-10 cs.RO cs.CV

Scalable Aerial GNSS Localization for Marine Robots

Shuo Wen, Edwin Meriaux, Mariana Sosa Guzmán, Charlotte Morissette, Chloe Si, Bobak Baghi, Gregory Dudek

Comments International Conference on Robotics and Automation 2025 Workshop Robots in the Wild

2505.01440 2026-03-10 cs.LG cs.AI cs.RO

Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving

Alkis Sygkounas, Ioannis Athanasiadis, Andreas Persson, Michael Felsberg, Amy Loutfi

Comments Accepted at IEEE Intelligent Vehicles Symposium (IV) 2025, 8 pages

2505.01315 2026-03-10 cs.CL cs.AI

Helping Large Language Models Protect Themselves: An Enhanced Filtering and Summarization System

Sheikh Samit Muhaimin, Spyridon Mastorakis

详情

DOI: 10.1109/TPS-ISA67132.2025.00036
Journal ref: Proceedings of the 2025 IEEE 7th International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (TPS-ISA), Pittsburgh, PA, USA, November 12-14, 2025. IEEE

英文摘要

The recent growth in the use of Large Language Models has made them vulnerable to sophisticated adversarial assaults, manipulative prompts, and encoded malicious inputs. Existing countermeasures frequently necessitate retraining models, which is computationally costly and impracticable for deployment. Without the need for retraining or fine-tuning, this study presents a unique defense paradigm that allows LLMs to recognize, filter, and defend against adversarial or malicious inputs on their own. There are two main parts to the suggested framework: (1) A prompt filtering module that uses sophisticated Natural Language Processing (NLP) techniques, including zero-shot classification, keyword analysis, and encoded content detection (e.g. base64, hexadecimal, URL encoding), to detect, decode, and classify harmful inputs; and (2) A summarization module that processes and summarizes adversarial research literature to give the LLM context-aware defense knowledge. This approach strengthens LLMs' resistance to adversarial exploitation by fusing text extraction, summarization, and harmful prompt analysis. According to experimental results, this integrated technique has a 98.71% success rate in identifying harmful patterns, manipulative language structures, and encoded prompts. By employing a modest amount of adversarial research literature as context, the methodology also allows the model to react correctly to harmful inputs with a larger percentage of jailbreak resistance and refusal rate. While maintaining the quality of LLM responses, the framework dramatically increases LLM's resistance to hostile misuse, demonstrating its efficacy as a quick and easy substitute for time-consuming, retraining-based defenses.

URL PDF HTML ☆

赞 0 踩 0

2504.15920 2026-03-10 cs.LG

ScaleGNN: Towards Scalable Graph Neural Networks via Adaptive High-order Neighboring Feature Fusion

Xiang Li, Jianpeng Qi, Haobing Liu, Yuan Cao, Guoqing Chao, Zhongying Zhao, Junyu Dong, Xinwang Liu, Yanwei Yu

2504.05089 2026-03-10 cs.CV

Climplicit: Climatic Implicit Embeddings for Global Ecological Tasks

Johannes Dollinger, Damien Robert, Elena Plekhanova, Lukas Drees, Jan Dirk Wegner

Comments Published as a workshop paper at "Tackling Climate Change with Machine Learning", ICLR 2025

2502.17262 2026-03-10 cs.CL cs.AI cs.LG

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective

Chengyin Xu, Kaiyuan Chen, Xiao Li, Ke Shen, Chenggang Li

Comments Accepted by The Fourteenth International Conference on Learning Representations (ICLR2026)

2502.16610 2026-03-10 cs.CV cs.AI

AdverX-Ray: Ensuring X-Ray Integrity Through Frequency-Sensitive Adversarial VAEs

Francisco Caetano, Christiaan Viviers, Lena Filatova, Peter H. N. de With, Fons van der Sommen

Comments SPIE Medical Imaging 2025 Runner-up 2025 Robert F. Wagner All-Conference Best Student Paper Award

详情

DOI: 10.1117/12.3045893

英文摘要

Ensuring the quality and integrity of medical images is crucial for maintaining diagnostic accuracy in deep learning-based Computer-Aided Diagnosis and Computer-Aided Detection (CAD) systems. Covariate shifts are subtle variations in the data distribution caused by different imaging devices or settings and can severely degrade model performance, similar to the effects of adversarial attacks. Therefore, it is vital to have a lightweight and fast method to assess the quality of these images prior to using CAD models. AdverX-Ray addresses this need by serving as an image-quality assessment layer, designed to detect covariate shifts effectively. This Adversarial Variational Autoencoder prioritizes the discriminator's role, using the suboptimal outputs of the generator as negative samples to fine-tune the discriminator's ability to identify high-frequency artifacts. Images generated by adversarial networks often exhibit severe high-frequency artifacts, guiding the discriminator to focus excessively on these components. This makes the discriminator ideal for this approach. Trained on patches from X-ray images of specific machine models, AdverX-Ray can evaluate whether a scan matches the training distribution, or if a scan from the same machine is captured under different settings. Extensive comparisons with various OOD detection methods show that AdverX-Ray significantly outperforms existing techniques, achieving a 96.2% average AUROC using only 64 random patches from an X-ray. Its lightweight and fast architecture makes it suitable for real-time applications, enhancing the reliability of medical imaging systems. The code and pretrained models are publicly available.

URL PDF HTML ☆

赞 0 踩 0

2502.05087 2026-03-10 cs.LG cs.AI cs.CL

Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs

Thierry Bossy, Julien Vignoud, Tahseen Rabbani, Juan R. Troncoso Pastoriza, Martin Jaggi

2502.03569 2026-03-10 cs.LG q-bio.GN q-bio.PE

Controllable Sequence Editing for Biological and Clinical Trajectories

Michelle M. Li, Kevin Li, Yasha Ektefaie, Ying Jin, Yepeng Huang, Shvat Messica, Tianxi Cai, Marinka Zitnik

Comments ICLR 2026

2502.01406 2026-03-10 cs.LG cs.AI cs.CL

GRADIEND: Feature Learning within Neural Networks Exemplified through Biases

Jonathan Drechsel, Steffen Herbold

Comments Accepted at ICLR 2026