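arXivDaily: a daily academic digest synchronized with the full arXiv feed, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and other fields.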

2405.03420 2026-04-09 cs.CV cs.AI

Implantable Adaptive Cells: A Novel Enhancement for Pre-Trained U-Nets in Medical Image Segmentation

Emil Benedykciuk, Marcin Denkowski, Grzegorz Wójcik

Abstract

This paper introduces a novel approach to enhance the performance of pre-trained neural networks in medical image segmentation using gradient-based Neural Architecture Search (NAS) methods. We present the concept of the Implantable Adaptive Cell (IAC): small modules, identified through a Partially-Connected DARTS-based approach, designed to be injected into the skip connections of an existing, already trained U-shaped model. Unlike traditional NAS methods, our approach refines existing architectures without full retraining. Experiments on four medical datasets with MRI and CT images show consistent accuracy improvements across various U-Net configurations, with segmentation accuracy gains of approximately 5 percentage points across all validation datasets and improvements of up to 11 percentage points in the best-performing cases. The findings of this study not only offer a cost-effective alternative to the complete overhaul of complex models for performance upgrades but also indicate the potential applicability of our method to other architectures and problem domains.

2403.15152 2026-04-09 cs.CV

Caption-Matching: A Multimodal Approach for Cross-Domain Image Retrieval

Lucas Iijima, Nikolaos Giakoumoglou, Tania Stathaki

2307.03571 2026-04-09 cs.LG math.OC stat.ML

Smoothing the Edges: Smooth Optimization for Sparse Regularization using Hadamard Overparametrization

Chris Kolb, Christian L. Müller, Bernd Bischl, David Rügamer

2008.01574 2026-04-09 cs.CV

A Robust 3D Registration Method via Simultaneous Inlier Identification and Model Estimation

Xianyun Qian, Fei Wen, Peilin Liu

2604.06558 2026-04-09 cs.LG q-bio.MN

When Does Context Help? A Systematic Study of Target-Conditional Molecular Property Prediction

Bryan Cheng, Jasper Zhang

Comments 9 pages, 5 figures. Accepted at Workshop on AI for Accelerated Materials Design and Foundation Models for Science: Real-World Impact and Science-First Design at ICLR 2026

2604.07069 2026-04-09 eess.SY cs.LG cs.SY math.DS

Controller Design for Structured State-space Models via Contraction Theory

Muhammad Zakwan, Vaibhav Gupta, Alireza Karimi, Efe C. Balta, Giancarlo Ferrari-Trecate

Comments The first and second authors contributed equally. The paper has been accepted at the 24th European Control Conference (ECC) in Reykjavik, Iceland, 2026

2604.07041 2026-04-09 cs.DB cs.AI cs.ET cs.HC cs.IR

AV-SQL: Decomposing Complex Text-to-SQL Queries with Agentic Views

Minh Tam Pham, Trinh Pham, Tong Chen, Hongzhi Yin, Quoc Viet Hung Nguyen, Thanh Tam Nguyen

2604.07037 2026-04-09 hep-ex cs.CV

Towards foundation-style models for energy-frontier heterogeneous neutrino detectors via self-supervised pre-training

Saúl Alonso-Monsalve, Fabio Cufino, Umut Kose, Anna Mascellani, André Rubbia

Comments 18 pages, 6 figures

Abstract

Accelerator-based neutrino physics is entering an energy-frontier regime in which interactions reach the TeV scale and produce exceptionally dense, overlapping detector signatures. In this regime, event interpretation becomes impractical for conventional reconstruction approaches, particularly when labelled data are scarce and the analysis spans diverse downstream objectives. We present a sparse ViT framework for learning reusable representations from heterogeneous detector data. Self-supervised pre-training combines masked autoencoder reconstruction with relational voxel-level objectives for hierarchy, ghost and particle identification, and the resulting shared encoder is then jointly fine-tuned across classification and regression tasks. Evaluated on simulated events from the proposed FASERCal concept at the LHC, we find that pre-training consistently improves neutrino flavour and charm-quark identification, momentum regression, and vertex reconstruction over training from scratch, with the addition of relational objectives yielding further gains in the most topologically complex channels. Interpretability analyses further show that pre-training yields a more structured latent space, while detector-subsystem ablations recover physically plausible channel-dependent roles for the heterogeneous inputs. A data-efficiency study shows that, with roughly $10^3$ labelled events, the pre-trained encoder already matches the flavour-classification performance of a randomly initialised model trained on an order of magnitude more data. The learned representations also transfer effectively to publicly available benchmarks spanning different detector technologies and energy scales, matching or exceeding published baselines. These results support self-supervised pre-training on multimodal detector data as a scalable route towards reusable representations for neutrino and particle-detector analysis.

2604.07025 2026-04-09 math.DS cs.LG cs.NA math.NA

Physics-Informed Functional Link Constrained Framework with Domain Mapping for Solving Bending Analysis of an Exponentially Loaded Perforated Beam

Iswari Sahu, Ramanath Garai, S. Chakraverty

2604.07013 2026-04-09 quant-ph cs.LG

QNAS: A Neural Architecture Search Framework for Accurate and Efficient Quantum Neural Networks

Kooshan Maleki, Alberto Marchisio, Muhammad Shafique

Comments To appear at the IEEE International Joint Conference on Neural Networks (IJCNN), Maastricht, The Netherlands, June 2026

2604.07007 2026-04-09 cs.MA cs.AI cs.CY

AgentCity: Constitutional Governance for Autonomous Agent Economies via Separation of Power

Anbang Ruan, Xing Zhang

Comments 111 pages, 11 figures, 19 tables, 67 references. Pre-registered experimental design

2604.06958 2026-04-09 eess.SP cs.LG

ELC: Evidential Lifelong Classifier for Uncertainty Aware Radar Pulse Classification

Mohamed Rabie, Chinthana Panagamuwa, Konstantinos G. Kyriakopoulos

Comments IEEE RadarConf'26 Submission. 6 pages; 3 figures; 1 table

2604.06956 2026-04-09 cs.DC cs.LG

NestPipe: Large-Scale Recommendation Training on 1,500+ Accelerators via Nested Pipelining

Zhida Jiang, Zhaolong Xing, Huichao Chai, Tianxing Sun, Qiang Peng, Baopeng Yuan, Jiaxing Wang, Hua Du, Zhixin Wu, Xuemiao Li, Yikui Cao, Xinyu Liu, Yongxiang Feng, Zhen Chen, Ke Zhang

2604.06946 2026-04-09 cs.SE cs.AI

An empirical study of LoRA-based fine-tuning of large language models for automated test case generation

Milad Moradi, Ke Yan, David Colwell, Rhona Asgari

2604.06942 2026-04-09 cs.CR cs.IT cs.LG cs.NE eess.SP math.IT

Evaluating PQC KEMs, Combiners, and Cascade Encryption via Adaptive IND-CPA Testing Using Deep Learning

Simon Calderon, Niklas Johansson, Onur Günlü

Abstract

Ensuring ciphertext indistinguishability is fundamental to cryptographic security, but empirically validating this property in real implementations and hybrid settings presents practical challenges. The transition to post-quantum cryptography (PQC), with its hybrid constructions combining classical and quantum-resistant primitives, makes empirical validation approaches increasingly valuable. By modeling IND-CPA games as binary classification tasks and training on labeled ciphertext data with BCE loss, we study deep neural network (DNN) distinguishers for ciphertext indistinguishability. We apply this methodology to PQC KEMs. We specifically test the public-key encryption (PKE) schemes used to construct examples such as ML-KEM, BIKE, and HQC. Moreover, a novel extension of this DNN modeling for empirical distinguishability testing of hybrid KEMs is presented. We implement and test this on combinations of PQC KEMs with plain RSA, RSA-OAEP, and plaintext. Finally, methodological generality is illustrated by applying the DNN IND-CPA classification framework to cascade symmetric encryption, where we test combinations of AES-CTR, AES-CBC, AES-ECB, ChaCha20, and DES-ECB. In our experiments on PQC algorithms, KEM combiners, and cascade encryption, no algorithm or combination of algorithms demonstrates a significant advantage (two-sided binomial test, significance level $\alpha = 0.01$), consistent with theoretical guarantees that hybrids including at least one IND-CPA-secure component preserve indistinguishability, and with the absence of exploitable patterns under the considered DNN adversary model. These results illustrate the potential of using deep learning as an adaptive, practical, and versatile empirical estimator for indistinguishability in more general IND-CPA settings, allowing data-driven validation of implementations and compositions and complementing the analytical security analysis.
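
The abstract's significance check (a two-sided binomial test at level 0.01 on a distinguisher's guesses) can be illustrated with a small sketch. This is not the authors' code: the function names and the example counts are hypothetical, and the null hypothesis is assumed to be random guessing with accuracy 0.5.

```python
from math import comb


def binomial_two_sided_p(correct: int, trials: int) -> float:
    """Exact two-sided p-value for H0: accuracy = 0.5 (random guessing).

    Under p = 0.5 the binomial distribution is symmetric, so the two-sided
    p-value is twice the smaller tail probability, capped at 1.
    """
    total = 2 ** trials
    lower = sum(comb(trials, i) for i in range(0, correct + 1))
    upper = sum(comb(trials, i) for i in range(correct, trials + 1))
    return min(1.0, 2 * min(lower, upper) / total)


def distinguisher_has_advantage(correct: int, trials: int,
                                alpha: float = 0.01) -> bool:
    """True if the guess accuracy differs significantly from chance."""
    return binomial_two_sided_p(correct, trials) < alpha


# Hypothetical counts: 520 correct guesses out of 1000 test ciphertexts.
print(distinguisher_has_advantage(520, 1000))
```

A result of `False` here corresponds to the outcome the abstract reports: accuracy that is statistically indistinguishable from guessing under the tested adversary.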

2604.06926 2026-04-09 math.OC cs.LG math.DS

Continuous-Time Dynamics of the Difference-of-Convex Algorithm

Yi-Shuai Niu

Comments 22 pages

2604.06901 2026-04-09 cs.CE cs.AI cs.CV cs.CY cs.ET

XR-CareerAssist: An Immersive Platform for Personalised Career Guidance Leveraging Extended Reality and Multimodal AI

N. D. Tantaroudas, A. J. McCracken, I. Karachalios, E. Papatheou, V. Pastrikakis

Comments 21

2604.06900 2026-04-09 cs.CE cs.AI cs.CR cs.CY

SentinelSphere: Integrating AI-Powered Real-Time Threat Detection with Cybersecurity Awareness Training

Nikolaos D. Tantaroudas, Ilias Karachalios, Andrew J. McCracken

Comments 21

2604.06899 2026-04-09 cs.CR cs.LG cs.SE

Data Leakage in Automotive Perception: Practitioners' Insights

Md Abu Ahammed Babu, Sushant Kumar Pandey, Darko Durisic, Andras Balint, Miroslaw Staron

2604.06876 2026-04-09 cs.DC cs.MA cs.RO

Exploiting Aggregate Programming in a Multi-Robot Service Prototype

Giorgio Audrito, Andrea Basso, Daniele Bortoluzzi, Ferruccio Damiani, Giordano Scarso, Gianluca Torta

Comments In Proceedings PLACES 2026, arXiv:2604.05737

2604.06864 2026-04-09 stat.ML cs.LG

A Data-Informed Variational Clustering Framework for Noisy High-Dimensional Data

Wan Ping Chen

2604.06863 2026-04-09 cs.SI cs.AI cs.CL cs.HC

Digital Skin, Digital Bias: Uncovering Tone-Based Biases in LLMs and Emoji Embeddings

Mingchen Li, Wajdi Aljedaani, Yingjie Liu, Navyasri Meka, Xuan Lu, Xinyue Ye, Junhua Ding, Yunhe Feng

Comments Accepted at WWW'26

2604.06833 2026-04-09 cs.CR cs.LG

FedDetox: Robust Federated SLM Alignment via On-Device Data Sanitization

Shunan Zhu, Jiawei Chen, Yonghao Yu, Hideya Ochiai

2604.06831 2026-04-09 cs.CR cs.AI

Towards Privacy-Preserving Large Language Model: Text-free Inference Through Alignment and Adaptation

Jeongho Yoon, Chanhee Park, Yongchan Chun, Hyeonseok Moon, Heuiseok Lim

2604.06808 2026-04-09 cs.AR cs.LG

CBM-Dual: A 65-nm Fully Connected Chaotic Boltzmann Machine Processor for Dual Function Simulated Annealing and Reservoir Computing

Kanta Yoshioka, Soshi Hirayae, Yuichiro Tanaka, Yuichi Katori, Takashi Morie, Hakaru Tamukoh

Comments 3 pages, 9 figures

2604.06793 2026-04-09 cs.SE cs.AI

Evaluating Repository-level Software Documentation via Question Answering and Feature-Driven Development

Xinchen Wang, Ruida Hu, Cuiyun Gao, Pengfei Gao, Chao Peng

2604.06742 2026-04-09 cs.SE cs.AI

Evaluating LLM-Based 0-to-1 Software Generation in End-to-End CLI Tool Scenarios

Ruida Hu, Xinchen Wang, Chao Peng, Cuiyun Gao, David Lo

2604.06724 2026-04-09 cs.NE cs.AI

The Traveling Thief Problem with Time Windows: Benchmarks and Heuristics

Helen Yuliana Angmalisang, Frank Neumann

Comments 13 pages

2604.06723 2026-04-09 cs.SE cs.AI

Fine-grained Approaches for Confidence Calibration of LLMs in Automated Code Revision

Hong Yi Lin, Chunhua Liu, Haoyu Gao, Patanamon Thongtanunam, Christoph Treude

Abstract

In today's AI-assisted software engineering landscape, developers increasingly depend on LLMs that are highly capable, yet inherently imperfect. The tendency of these models to produce incorrect outputs can reduce developer productivity. To this end, a canonical mitigation method is to provide calibrated confidence scores that faithfully reflect their likelihood of correctness at the instance-level. Such information allows users to make immediate decisions regarding output acceptance, abstain from error-prone outputs, and better align their expectations with the model's capabilities. Since post-trained LLMs do not inherently produce well-calibrated confidence scores, researchers have developed post-hoc calibration methods, with global Platt-scaling of sequence-level confidence scores proving effective in many generative software engineering tasks but remaining unreliable or unexplored for automated code revision (ACR) tasks such as program repair, vulnerability repair, and code refinement. We hypothesise that the coarse-grained nature of this conventional method makes it ill-suited for ACR tasks, where correctness is often determined by local edit decisions and miscalibration can be sample-dependent, thereby motivating fine-grained confidence calibration. To address this, our study proposes local Platt-scaling applied separately to three different fine-grained confidence scores. Through experiments across 3 separate tasks and correctness metrics, as well as 14 different models of various sizes, we find that fine-grained confidence scores consistently achieve lower calibration error across a broader range of probability intervals, and this effect is further amplified when global Platt-scaling is applied. Our proposed approaches offer a practical solution to eliciting well-calibrated confidence scores, enabling more trustworthy and streamlined usage of imperfect models in ACR tasks.
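
For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of global Platt scaling: a two-parameter sigmoid fitted to raw confidence scores against binary correctness labels. It does not reproduce the paper's local variant or its fine-grained scores, which the abstract does not specify; the function names, hyperparameters, and toy data are assumptions made for illustration.

```python
import math


def _sigmoid(z: float) -> float:
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_platt(scores, labels, lr=0.5, steps=5000):
    """Fit p = sigmoid(a * score + b) by gradient descent on mean BCE loss."""
    a, b = 1.0, 0.0
    n = len(scores)
    for _ in range(steps):
        grad_a = grad_b = 0.0
        for s, y in zip(scores, labels):
            err = _sigmoid(a * s + b) - y  # derivative of BCE w.r.t. the logit
            grad_a += err * s
            grad_b += err
        a -= lr * grad_a / n
        b -= lr * grad_b / n
    return a, b


def calibrate(score: float, a: float, b: float) -> float:
    """Map a raw confidence score to a calibrated probability."""
    return _sigmoid(a * score + b)


# Toy data (hypothetical): raw confidences and whether each output was correct.
scores = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
labels = [0, 0, 1, 0, 1, 0, 1, 1, 1]
a, b = fit_platt(scores, labels)
print(round(calibrate(0.9, a, b), 3))
```

Because one `(a, b)` pair is shared by every sample, this is the coarse-grained setting the abstract contrasts with its fine-grained proposal.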

2604.06722 2026-04-09 cs.CY cs.RO

Infrastructure First: Enabling Embodied AI for Science in the Global South

Shaoshan Liu, Jie Tang, Marwa S. Hassan, Mohamed H. Sharkawy, Moustafa M. G. Fouda, Tiewei Shang, Zixin Wang