arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.26437 2026-04-30 cs.CV

Are Data Augmentation and Segmentation Always Necessary? Insights from COVID-19 X-Rays and a Methodology Thereof

Aman Swaraj, Arnav Agarwal, Hitendra Singh Bhadouria, Sandeep Kumar, Karan Verma

详情

英文摘要

Purpose: Rapid and reliable diagnostic tools are crucial for managing respiratory diseases like COVID-19, where chest X-ray analysis coupled with artificial intelligence techniques has proven invaluable. However, most existing works on X-ray images have not considered lung segmentation, raising concerns about their reliability. Additionally, some have employed disproportionate and impractical augmentation techniques, making models less generalized and prone to overfitting. This study presents a critical analysis of both issues and proposes a methodology (SDL-COVID) for more reliable classification of chest X-rays for COVID-19 detection. Methods: We use class activation mapping to obtain a visual understanding of the predictions made by Convolutional Neural Networks (CNNs), validating the necessity of lung segmentation. To analyze the effect of data augmentation, deep learning models are implemented on two levels: one for an augmented dataset and another for a non-augmented dataset. Results: Careful analysis of X-ray images and their corresponding heat maps under expert medical supervision reveals that lung segmentation is necessary for accurate COVID-19 prediction. Regarding data augmentation, test accuracy significantly drops beyond a certain threshold with additional augmented images, indicating model overfitting. Conclusion: Our proposed methodology, SDL-COVID, achieves a precision of 95.21% and a lower false negative rate, ensuring its reliability for COVID-19 detection using chest X-rays.

URL PDF HTML ☆

赞 0 踩 0

2604.26435 2026-04-30 cs.CV cs.AI cs.ET

QYOLO: Lightweight Object Detection via Quantum Inspired Shared Channel Mixing

Garvit Kumar Mittal, Sahil Tomar, Sandeep Kumar

2604.26422 2026-04-30 cs.LG cs.AI

STLGT: A Scalable Trace-Based Linear Graph Transformer for Tail Latency Prediction in Microservices

Yongliang Ding, Qigong Bi, Peng Pu

Comments 12 pages, 5 figures, 4 tables, conference

2604.26419 2026-04-30 cs.CV cs.AI

Delineating Knowledge Boundaries for Honest Large Vision-Language Models

Junru Song, Yimeng Hu, Yijing Chen, Huining Li, Qian Li, Lizhen Cui, Yuntao Du

2604.26417 2026-04-30 cs.CL cs.SD

EmoTransCap: Dataset and Pipeline for Emotion Transition-Aware Speech Captioning in Discourses

Shuhao Xu, Yifan Hu, Jingjing Wu, Zhihao Du, Zheng Lian, Rui Liu

Comments 15 pages, 5 figures, including appendix

2604.26411 2026-04-30 cs.LG

Unifying Runtime Monitoring Approaches for Safety-Critical Machine Learning: Application to Vision-Based Landing

Mathieu Dario, Florent Chenevier, Kévin Delmas, Joris Guerin, Jérémie Guiochet

Comments 15 pages, 5 figures, 3 tables, submitted to ICPR 2026

2604.26409 2026-04-30 cs.CV

Sparsity as a Key: Unlocking New Insights from Latent Structures for Out-of-Distribution Detection

Ahyoung Oh, Wonseok Shin, Songkuk Kim

Comments 8 pages, 6 figures, supplementary material included, CVPR 2026

2604.26404 2026-04-30 cs.CV

Decoupled Prototype Matching with Vision Foundation Models for Few-Shot Industrial Object Detection

Hari Prasanth S. M., Nilusha Jayawickrama, Risto Ojala

Comments This article is submitted to Journal of Intelligent Manufacturing, and is currently in under review

2604.26382 2026-04-30 cs.CL cs.AI cs.IR

Benchmarking Complex Multimodal Document Processing Pipelines: A Unified Evaluation Framework for Enterprise AI

Saurabh K. Singh, Sachin Raj

Comments 16 pages, 4 tables. Code, metrics, and pilot data to be released upon publication

2604.26379 2026-04-30 cs.CV

A Multimodal Pre-trained Network for Integrated EEG-Video Seizure Detection

Tong Lu, Ke Xu, Zimo Zhang, Zitong Zhao, Danwei Weng, Ruiyu Wang, Miao Liu, Zizuo Zhang, Jingyi Yao, Yixuan Zhao, Wenchao Zhang, Min Wang, Guoming Luan, Minmin Luo, Zhifeng Yue

2604.26378 2026-04-30 cs.LG

CoQuant: Joint Weight-Activation Subspace Projection for Mixed-Precision LLMs

Zhe Ding, Su Pan, Duowei Pan

Comments 14 pages, 3 figures

2604.26375 2026-04-30 cs.CL cs.AI cs.LG

SG-UniBuc-NLP at SemEval-2026 Task 6: Multi-Head RoBERTa with Chunking for Long-Context Evasion Detection

Gabriel Stefan, Sergiu Nisioi

Comments Accepted to SemEval-2026 (Task 6: CLARITY: Unmasking Political Question Evasions)

2604.26374 2026-04-30 cs.RO cs.MA

Split over $n$ resource sharing problem: Are fewer capable agents better than many simpler ones?

Karthik Soma, Mohamed S. Talamali, Genki Miyauchi, Giovanni Beltrame, Heiko Hamann, Roderich Gross

Comments Short paper presented at the 15th International Conference on Swarm Intelligence (ANTS 2026)

2604.26370 2026-04-30 cs.CV cs.LG math.AT

Topology-Aware Representation Alignment for Semi-Supervised Vision-Language Learning

Junwon You, Mihyun Jang, Sangwoo Mo, Jae-Hun Jung

Comments 30 pages, 10 figures, 24 tables

2604.26368 2026-04-30 cs.CV

Seamless Indoor-Outdoor Mapping for INGENIOUS First Responders

Jürgen Wohlfeil, Henry Meißner, Adrian Schischmanow, Thomas Kraft, Dirk Baumbach, Ines Ernst, Dennis Dahlke

2604.26365 2026-04-30 cs.CV cs.LG

Beyond Fixed Formulas: Data-Driven Linear Predictor for Efficient Diffusion Models

Zhirong Shen, Rui Huang, Jiacheng Liu, Chang Zou, Peiliang Cai, Shikang Zheng, Zhengyi Shi, Liang Feng, Linfeng Zhang

Comments Accepted by CVPR 2026

2604.26363 2026-04-30 cs.CV cs.LG

CO-EVO: Co-evolving Semantic Anchoring and Style Diversification for Federated DG-ReID

Fengchun Zhang, Qiang Ma, Liuyu Xiang, Jinshan Lai, Tingxuan Huang, Jianwei Hu

Comments Accepted at ACL 2026 (Main Conference)

2604.26361 2026-04-30 cs.CL cs.AI

Text Style Transfer with Machine Translation for Graphic Designs

Deergh Singh Budhauria, Sanyam Jain, Rishav Agarwal, Tracy King

2604.26360 2026-04-30 cs.LG cs.AI

Uncertainty-Aware Reward Discounting for Mitigating Reward Hacking

Disha Singha

Comments 31 pages, 18 figures, 3 tables

2604.26353 2026-04-30 cs.CV

GateMOT: Q-Gated Attention for Dense Object Tracking

Mingjin Lv, Zelin Liu, Feifei Shao, Yi-Ping Phoebe Chen, Junqing Yu, Wei Yang, Zikai Song

2604.26351 2026-04-30 cs.CL

A Dual-Task Paradigm to Investigate Sentence Comprehension Strategies in Language Models

Rei Emura, Saku Sugawara

2604.26348 2026-04-30 cs.CV cs.AI

ACPO: Anchor-Constrained Perceptual Optimization for Diffusion Models with No-Reference Quality Guidance

Yang Yang, Feifan Meng, Han Fang, Weiming Zhang

Comments 14 pages, 9 figures, 11 tables

2604.26342 2026-04-30 cs.CV

Which Face and Whose Identity? Solving the Dual Challenge of Deepfake Proactive Forensics in Multi-Face Scenarios

Lei Zhang, Zhiqing Guo, Dan Ma, Gaobo Yang

2604.26341 2026-04-30 cs.CV

SpatialFusion: Endowing Unified Image Generation with Intrinsic 3D Geometric Awareness

Haiyi Qiu, Kaihang Pan, Jiacheng Li, Juncheng Li, Siliang Tang, Yueting Zhuang

2604.26340 2026-04-30 cs.LG

Adaptive and Fine-grained Module-wise Expert Pruning for Efficient LoRA-MoE Fine-Tuning

Weihang Li, Jianchun Liu, Hongli Xu

2604.26337 2026-04-30 cs.LG

AlphaJet: Automated Conceptual Aircraft Synthesis via Disentangled Generative Priors and Topology-Preserving Evolutionary Search

Boris Kriuk

Comments 10 pages, 2 figures, 1 table

2604.26328 2026-04-30 cs.CL cs.AI

DSIPA: Detecting LLM-Generated Texts via Sentiment-Invariant Patterns Divergence Analysis

Siyuan Li, Aodu Wulianghai, Guangyan Li, Xi Lin, Qinghua Mao, Yuliang Chen, Jun Wu, Jianhua Li

2604.26324 2026-04-30 cs.CV

Federated Medical Image Classification under Class and Domain Imbalance exploiting Synthetic Sample Generation

Martina Pavan, Matteo Caligiuri, Francesco Barbato, Pietro Zanuttigh

Comments Accepted at ICPR 2026, 13 pages, 3 figures, 5 tables

2604.26321 2026-04-30 cs.CV

Motion-Driven Multi-Object Tracking of Model Organisms in Space Science Experiments

Jianing You, Han Wang, Kang Liu, Jiale Ding, Fengjie Chu, Zihan Guo, Shengyang Li

Comments 2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

2604.26319 2026-04-30 cs.CL

A Systematic Comparison of Prompting and Multi-Agent Methods for LLM-based Stance Detection

Genan Dai, Zini Chen, Yi Yang, Bowen Zhang