arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2403.12193 2026-04-29 cs.RO

Continual Domain Randomization

Josip Josifovski, Sayantan Auddy, Mohammadhossein Malmir, Justus Piater, Alois Knoll, Nicolás Navarro-Guerrero

Comments Accepted at IROS 2024. Equal contribution from first two authors

详情

DOI: 10.1109/IROS58592.2024.10802060

英文摘要

Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at https://continual-dr.github.io/.

URL PDF HTML ☆

赞 0 踩 0

2402.09034 2026-04-29 cs.LG cs.AI

Contrast-Enhanced Gating in GRUs for Robust Low-Data Sequence Learning

Barathi Subramanian, Rathinaraja Jeyaraj, Anand Paul

Comments 43rd The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2026)

2309.15039 2026-04-29 cs.LG cs.AI stat.AP

Can-SAVE: Deploying Low-Cost and Population-Scale Cancer Screening via Survival Analysis Variables and EHR

Petr Philonenko, Vladimir Kokh, Pavel Blinov

Comments Accepted to the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2026)

详情

DOI: 10.1145/3770854.3783930
Journal ref: Proc. 32nd ACM SIGKDD Conf. on Knowledge Discovery and Data Mining (KDD '26), 2026, 11 pages

英文摘要

Conventional medical cancer screening methods are costly, labor-intensive, and extremely difficult to scale. Although AI can improve cancer detection, most systems rely on complex or specialized medical data, making them impractical for large-scale screening. We introduce Can-SAVE, a lightweight AI system that ranks population-wide cancer risks solely based on medical history events. By integrating survival model outputs into a gradient-boosting framework, our approach detects subtle, long-term patient risk patterns - often well before clinical symptoms manifest. Can-SAVE was rigorously evaluated on a real-world dataset of 2.5 million adults spanning five Russian regions, marking the study as one of the largest and most comprehensive deployments of AI-driven cancer risk assessment. In a retrospective oncologist-supervised study over 1.9M patients, Can-SAVE achieves a 4-10x higher detection rate at identical screening volumes and an Average Precision (AP) of 0.228 vs. 0.193 for the best baseline (LoRA-tuned Qwen3-Embeddings via DeepSeek-R1 summarization). In a year-long prospective pilot (426K patients), our method almost doubled the cancer detection rate (+91%) and increased population coverage by 36% over the national screening protocol. The system demonstrates practical scalability: a city-wide population of 1 million patients can be processed in under three hours using standard hardware, enabling seamless clinical integration. This work proves that Can-SAVE achieves nationally significant cancer detection improvements while adhering to real-world public healthcare constraints, offering immediate clinical utility and a replicable framework for population-wide screening. Code for training and feature engineering is available at https://github.com/sb-ai-lab/Can-SAVE.

URL PDF HTML ☆

赞 0 踩 0

2309.06768 2026-04-29 cs.RO

Hierarchical Time-Optimal Planning for Multi-Vehicle Racing

Georg Jank, Matthias Rowold, Boris Lohmann

Comments 6 pages, accepted to be published as part of the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC 2023), Bilbao, Bizkaia, Spain, September 24-28, 2023

2307.00937 2026-04-29 cs.RO

A Biomimetic Fingerprint for Robotic Tactile Sensing

Oscar Alberto Juiña Quilachamín, Nicolás Navarro-Guerrero

Comments 56th International Symposium on Robotics (ISR Europe) | September 26-27, 2023, Stuttgart, Germany

2306.06213 2026-04-29 cs.LG math.OC

A Robust Twin Parametric Margin Support Vector Machine for Multiclass Classification

Renato De Leone, Francesca Maggioni, Andrea Spinelli

2305.06709 2026-04-29 cs.LG cs.MS stat.ML

NUBO: A Transparent Python Package for Bayesian Optimization

Mike Diessner, Kevin J. Wilson, Richard D. Whalley

详情

DOI: 10.18637/jss.v114.i01
Journal ref: Journal of Statistical Software, 114(1), 1-28 (2025)

英文摘要

NUBO, short for Newcastle University Bayesian Optimisation, is a Bayesian optimization framework for the optimization of expensive-to-evaluate black-box functions, such as physical experiments and computer simulators. Bayesian optimization is a costefficient optimization strategy that uses surrogate modelling via Gaussian processes to represent an objective function and acquisition functions to guide the selection of candidate points to approximate the global optimum of the objective function. NUBO itself focuses on transparency and user experience to make Bayesian optimization easily accessible to researchers from all disciplines. Clean and understandable code, precise references, and thorough documentation ensure transparency, while user experience is ensured by a modular and flexible design, easy-to-write syntax, and careful selection of Bayesian optimization algorithms. NUBO allows users to tailor Bayesian optimization to their specific problem by writing the optimization loop themselves using the provided building blocks. It supports sequential single-point, parallel multi-point, and asynchronous optimization of bounded, constrained, and/or mixed (discrete and continuous) parameter input spaces. Only algorithms and methods that are extensively tested and validated to perform well are included in NUBO. This ensures that the package remains compact and does not overwhelm the user with an unnecessarily large number of options. The package is written in Python but does not require expert knowledge of Python to optimize your simulators and experiments. NUBO is distributed as open-source software under the BSD 3-Clause license.

URL PDF HTML ☆

赞 0 踩 0

2212.14245 2026-04-29 cs.CV

Practical exposure correction via compensation

Long Ma, Nan An, Jinyuan Liu, Xin Fan, Zhongxuan Luo, Deyu Meng, Risheng Liu

Comments Project Page: https://rsliu.tech/PEC

2207.09845 2026-04-29 cs.RO cs.AI cs.HC cs.LG

Quantifying the Effect of Feedback Frequency in Interactive Reinforcement Learning for Robotic Tasks

Daniel Harnack, Julie Pivin-Bachler, Nicolás Navarro-Guerrero

Comments Neural Computing and Applications (2022). Special Issue on Human-aligned Reinforcement Learning for Autonomous Agents and Robots

2206.06282 2026-04-29 cs.RO cs.AI

Analysis of Randomization Effects on Sim2Real Transfer in Reinforcement Learning for Robotic Manipulation Tasks

Josip Josifovski, Mohammadhossein Malmir, Noah Klarmann, Bare Luka Žagar, Nicolás Navarro-Guerrero, Alois Knoll

Comments Accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022

2203.11544 2026-04-29 cs.RO cs.AI

Visuo-Haptic Object Perception for Robots: An Overview

Nicolás Navarro-Guerrero, Sibel Toprak, Josip Josifovski, Lorenzo Jamone

Comments published in Autonomous Robots

2104.05565 2026-04-29 cs.CL cs.AI cs.LG

Survey on reinforcement learning for language processing

Victor Uc-Cetina, Nicolas Navarro-Guerrero, Anabel Martin-Gonzalez, Cornelius Weber, Stefan Wermter

2011.02121 2026-04-29 cs.CL

PheMT: A Phenomenon-wise Dataset for Machine Translation Robustness on User-Generated Contents

Ryo Fujii, Masato Mita, Kaori Abe, Kazuaki Hanawa, Makoto Morishita, Jun Suzuki, Kentaro Inui

Comments 15 pages, 4 figures, accepted at COLING 2020

2604.25903 2026-04-29 cs.SE cs.LG

Carbon-Taxed Transformers: A Green Compression Pipeline for Overgrown Language Models

Ajmain Inqiad Alam, Palash Roy, Chanchal K. Roy, Banani Roy, Kevin A. Schneider

详情

DOI: 10.1145/3797075
Journal ref: Proceedings of ACM Software Engineering 3, FSE, Article FSE047, 2026

英文摘要

The accelerating adoption of Large Language Models (LLMs) in software engineering (SE) has brought with it a silent crisis: unsustainable computational cost. While these models demonstrate remarkable capabilities in different SE tasks, they are unmanageably large, slow to deploy, memory-intensive, and carbon-heavy. This reality threatens not only the scalability and accessibility of AI-powered SE, but also its long-term environmental sustainability. The research challenge is clear: we must go beyond accuracy and address efficiency and environmental cost as first-class design constraints. To meet this challenge, we introduce Carbon-Taxed Transformers (CTT), a systematic multi-architectural compression principled pipeline ordering inspired by economic carbon taxation principles. Drawing from the economic concept of carbon pricing, CTT operationalizes a computational carbon tax that penalizes architectural inefficiencies and rewards deployment-ready compression. We evaluate CTT across three core SE tasks: code clone detection, code summarization, and code generation, with models spanning encoder-only, encoder-decoder, and decoder-only architecture. Our results show that CTT delivers on inference: (1) up to 49x memory reduction, (2) time reduction up to 8-10x for clone detection, up to 3x for summarization, and 4-7x for generation, (3) up to 81% reduction in CO2 emissions and (4) CTT retains around 98% accuracy on clone detection, around 89% on summarization, and up to 91% (textual metrics) and 68% (pass@1) for generation. Two ablation studies show that pipeline ordering and individual component contributions are both essential, providing empirical justification for CTT's design and effectiveness. This work establishes a viable path toward responsible AI in SE through aggressive yet performance-preserving compression.

URL PDF HTML ☆

赞 0 踩 0

2604.25895 2026-04-29 cs.CY cs.AI cs.CL

Three Models of RLHF Annotation: Extension, Evidence, and Authority

Steve Coyne

Comments 17 pages. Accepted to ACM FAccT '26, June 25-28, Montreal

2604.25885 2026-04-29 hep-ph cs.LG hep-ex

Explainable AI for Jet Tagging: A Comparative Study of GNNExplainer, GNNShap, and GradCAM for Jet Tagging in the Lund Jet Plane

Pahal D. Patel, Sanmay Ganguly

Comments 25 pages, 9 figures. Comments are welcome

2604.25884 2026-04-29 quant-ph cs.CV

QCalEval: Benchmarking Vision-Language Models for Quantum Calibration Plot Understanding

Shuxiang Cao, Zijian Zhang, Abhishek Agarwal, Grace Bratrud, Niyaz R. Beysengulov, Daniel C. Cole, Alejandro Gómez Frieiro, Elena O. Glen, Hao Hsu, Gang Huang, Raymond Jow, Greshma Shaji, Tom Lubowe, Ligeng Zhu, Luis Mantilla Calderón, Nicola Pancotti, Joel Pendleton, Brandon Severin, Charles Etienne Staub, Sara Sussman, Antti Vepsäläinen, Neel Rajeshbhai Vora, Yilun Xu, Varinia Bernales, Daniel Bowring, Elica Kyoseva, Ivan Rungger, Giulia Semeghini, Sam Stanwyck, Timothy Costa, Alán Aspuru-Guzik, Krysta Svore

2604.25862 2026-04-29 cs.SE cs.AI

RESTestBench: A Benchmark for Evaluating the Effectiveness of LLM-Generated REST API Test Cases from NL Requirements

Leon Kogler, Stefan Hangler, Maximilian Ehrhart, Benedikt Dornauer, Roland Wuersching, Peter Schrammel

Comments Accepted for EASE 2026

2604.25847 2026-04-29 math.OC cs.AI cs.LG

From Soliloquy to Agora: Memory-Enhanced LLM Agents with Decentralized Debate for Optimization Modeling

Jianghao Lin, Zi Ling, Chenyu Zhou, Tianyi Xu, Ruoqing Jiang, Zizhuo Wang, Dongdong Ge

Comments Working Paper

2604.25846 2026-04-29 cs.CR cs.AI

Towards Agentic Investigation of Security Alerts

Even Eilertsen, Vasileios Mavroeidis, Gudmund Grov

Comments 10 pages, 3 figures, 4 tables. Accepted at the 2025 IEEE International Conference on Big Data (BigData)

2604.25799 2026-04-29 cs.AR cs.AI

At the Edge of the Heart: ULP FPGA-Based CNN for On-Device Cardiac Feature Extraction in Smart Health Sensors for Astronauts

Kazi Mohammad Abidur Rahman, Davis Rakhshan, Philipp Lütke, Laura Harms, Ulf Kulau

Comments 9 pages, 7 figures, To be published in: The 22nd Annual International Conference on Distributed Computing in Smart Systems and the Internet of Things (DCOSS-IoT 2026)

2604.25782 2026-04-29 cs.NI cs.RO

EOS-Bench: A Comprehensive Benchmark for Earth Observation Satellite Scheduling

Qian Yin, Jiaxing Li, Jiaqi Cheng, Qizhang Luo, Annalisa Riccardi, Abhijit Chatterjee, Rafael Vazquez, Carlo Novara, Michalis Mavrovouniotis, Ponnuthurai Nagaratnam Suganthan, Shengzhou Bai, Xiaoxuan Hu, Lining Xing, Ming Xu, Shuang Li, Zixuan Zheng, Xin Shen, Xiaoyu Chen, Yi Gu, Yanjie Song, Witold Pedrycz, Evan L. Kramer, Laio Oriel Seman, Cletah Shoko, Guohua Wu, Xinwei Wang

2604.25778 2026-04-29 cs.SE cs.AI cs.IR

Can Code Evaluation Metrics Detect Code Plagiarism?

Fahad Ebrahim, Mike Joy

Comments 10 pages, 5 figures, accepted at LEARNER 2026 workshop (associated with EASE 2026)

2604.25757 2026-04-29 cs.CR cs.AI cs.RO cs.SY eess.SY

Threat-Oriented Digital Twinning for Security Evaluation of Autonomous Platforms

Thomas J. Neubert, Laxima Niure Kandel, Berker Peköz

Comments Camera ready accepted for presentation at and publication in the proceedings of 2026 56st Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W): Dependable and Secure Autonomous Systems (DSAS)

2604.25737 2026-04-29 cs.SE cs.AI

SAFEdit: Does Multi-Agent Decomposition Resolve the Reliability Challenges of Instructed Code Editing?

Noam Tarshish, Nofar Selouk, Daniel Hodisan, Bar Ezra Gafniel, Yuval Elovici, Asaf Shabtai, Eliya Nachmani

Comments Accepted to the EQUISA (Evaluation of Qualitative Aspects of Intelligent Software Assistants) workshop at EASE (Evaluation and Assessment in Software Engineering) 2026

2604.25733 2026-04-29 cs.LO cs.AI cs.FL

Verification of Neural Networks (Lecture Notes)

Benedikt Bollig

Comments 72 pages

2604.25710 2026-04-29 stat.AP cs.LG stat.ME stat.ML

Adaptive Meta-Learning Stochastic Gradient Hamiltonian Monte Carlo Simulation for Bayesian Updating of Structural Dynamic Models

Xianghao Meng, James L. Beck, Yong Huang, Hui Li

2604.25689 2026-04-29 cs.SE cs.AI

Spreadsheet Modeling Experiments Using GPTs on Small Problem Statements and the Wall Task

Thomas A. Grossman, Yuan Chen, Sopiko Datuashvili

2604.25685 2026-04-29 eess.IV cs.CV

Robustness Evaluation of a Foundation Segmentation Model Under Simulated Domain Shifts in Abdominal CT: Implications for Health Digital Twin Deployment

Sanghati Basu

Comments 8 Pages, 5 Tables, 2 Figures

2604.25664 2026-04-29 stat.ML cs.LG math.OC

Deflation-Free Optimal Scoring

Sharmin Afroz, Brendan Ames