arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.01346 2026-03-03 cs.LG stat.ML

Relatively Smart: A New Approach for Instance-Optimal Learning

Shaddin Dughmi, Alireza F. Pour

详情

英文摘要

We revisit the framework of Smart PAC learning, which seeks supervised learners which compete with semi-supervised learners that are provided full knowledge of the marginal distribution on unlabeled data. Prior work has shown that such marginal-by-marginal guarantees are possible for "most" marginals, with respect to an arbitrary fixed and known measure, but not more generally. We discover that this failure can be attributed to an "indistinguishability" phenomenon: There are marginals which cannot be statistically distinguished from other marginals that require different learning approaches. In such settings, semi-supervised learning cannot certify its guarantees from unlabeled data, rendering them arguably non-actionable. We propose relatively smart learning, a new framework which demands that a supervised learner compete only with the best "certifiable" semi-supervised guarantee. We show that such modest relaxation suffices to bypass the impossibility results from prior work. In the distribution-free setting, we show that the OIG learner is relatively smart up to squaring the sample complexity, and show that no supervised learning algorithm can do better. For distribution-family settings, we show that relatively smart learning can be impossible or can require idiosyncratic learning approaches, and its difficulty can be non-monotone in the inclusion order on distribution families.

URL PDF HTML ☆

赞 0 踩 0

2603.01343 2026-03-03 cs.CL cs.AI

PanCanBench: A Comprehensive Benchmark for Evaluating Large Language Models in Pancreatic Oncology

Yimin Zhao, Sheela R. Damle, Simone E. Dekker, Scott Geng, Karly Williams Silva, Jesse J Hubbard, Manuel F Fernandez, Fatima Zelada-Arenas, Alejandra Alvarez, Brianne Flores, Alexis Rodriguez, Stephen Salerno, Carrie Wright, Zihao Wang, Pang Wei Koh, Jeffrey T. Leek

2603.01335 2026-03-03 cs.LG cs.AI

Provable and Practical In-Context Policy Optimization for Self-Improvement

Tianrun Yu, Yuxiao Yang, Zhaoyang Wang, Kaixiang Zhao, Porter Jenkins, Xuchao Zhang, Chetan Bansal, Huaxiu Yao, Weitong Zhang

Comments 34 pages, 8 tables, 4 figures, Accepted by ICLR 2026

2603.01328 2026-03-03 cs.CV cs.AI

You Only Need One Stage: Novel-View Synthesis From A Single Blind Face Image

Taoyue Wang, Xiang Zhang, Xiaotian Li, Huiyuan Yang, Lijun Yin

2603.01326 2026-03-03 cs.CL cs.LG

Truth as a Trajectory: What Internal Representations Reveal About Large Language Model Reasoning

Hamed Damirchi, Ignacio Meza De la Jara, Ehsan Abbasnejad, Afshar Shamsi, Zhen Zhang, Javen Shi

2603.01324 2026-03-03 cs.CV

Open-Vocabulary vs Supervised Learning Methods for Post-Disaster Visual Scene Understanding

Anna Michailidou, Georgios Angelidis, Vasileios Argyriou, Panagiotis Sarigiannidis, Georgios Th. Papadopoulos

Comments 7 pages, 2 figures

2603.01309 2026-03-03 cs.LG stat.ML

PAC Guarantees for Reinforcement Learning: Sample Complexity, Coverage, and Structure

Joshua Steier

Comments 43 pages

2603.01304 2026-03-03 cs.LG stat.ML

Nonconvex Latent Optimally Partitioned Block-Sparse Recovery via Log-Sum and Minimax Concave Penalties

Takanobu Furuhashi, Hiroki Kuroda, Masahiro Yukawa, Qibin Zhao, Hidekata Hontani, Tatsuya Yokota

Comments 13 pages, 11 figures

2603.01301 2026-03-03 cs.CV

When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains

Ahmadreza Jeddi, Kimia Shaban, Negin Baghbanzadeh, Natasha Sharan, Abhishek Moturu, Elham Dolatabadi, Babak Taati

2603.01297 2026-03-03 cs.LG cs.CL

I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift

Subramanyam Sahoo, Vinija Jain, Divya Chaudhary, Aman Chadha

Comments Accepted at the ICBINB: Where LLMs Need to Improve workshop at ICLR 2026. 12 pages and 3 Figures

2603.01295 2026-03-03 cs.CV cs.AI

Multi-Level Bidirectional Decoder Interaction for Uncertainty-Aware Breast Ultrasound Analysis

Abdullah Al Shafi, Md Kawsar Mahmud Khan Zunayed, Safin Ahmmed, Sk Imran Hossain, Engelbert Mephu Nguifo

Comments 10 pages, 3 figures, 2 tables. The code is available at: https://github.com/C-loud-Nine/Uncertainty-Aware-Multi-Level-Decoder-Interaction

2603.01294 2026-03-03 cs.RO

Spherical Latent Motion Prior for Physics-Based Simulated Humanoid Control

Jing Tan, Weisheng Xu, Xiangrui Jiang, Jiaxi Zhang, Kun Yang, Kai Wu, Jiaqi Xiong, Shiting Chen, Yangfan Li, Yixiao Feng, Yuetong Fang, Yujia Zou, Yiqun Song, Renjing Xu

2603.01293 2026-03-03 cs.LG cs.AI stat.ML

Theoretical Perspectives on Data Quality and Synergistic Effects in Pre- and Post-Training Reasoning Models

Adel Javanmard, Baharan Mirzasoleiman, Vahab Mirrokni

Comments 35 pages, 5 figures

2603.01292 2026-03-03 cs.LG cs.AI cs.LO cs.RO

Integrating LTL Constraints into PPO for Safe Reinforcement Learning

Maifang Zhang, Hang Yu, Qian Zuo, Cheng Wang, Vaishak Belle, Fengxiang He

2603.01291 2026-03-03 cs.LG cs.CL

JailNewsBench: Multi-Lingual and Regional Benchmark for Fake News Generation under Jailbreak Attacks

Masahiro Kaneko, Ayana Niwa, Timothy Baldwin

Comments ICLR 2026

2603.01289 2026-03-03 cs.CL

Individual Turing Test: A Case Study of LLM-based Simulation Using Longitudinal Personal Data

Minghao Guo, Ziyi Ye, Wujiang Xu, Xi Zhu, Wenyue Hua, Dimitris N. Metaxas

Comments 5 pages, 2 figures

2603.01288 2026-03-03 cs.CL

Efficient Extractive Summarization with MAMBA-Transformer Hybrids for Low-Resource Scenarios

Nisrine Ait Khayi

2603.01286 2026-03-03 cs.AI cs.RO

Information-Theoretic Framework for Self-Adapting Model Predictive Controllers

Wael Hafez, Amir Nazeri

Comments 9 pages, 5 figures

2603.01285 2026-03-03 cs.LG cs.AI cs.CL

Attention Smoothing Is All You Need For Unlearning

Saleh Zare Zade, Xiangyu Zhou, Sijia Liu, Dongxiao Zhu

Comments Accepted by ICLR 2026

2603.01284 2026-03-03 cs.CV

FoSS: Modeling Long Range Dependencies and Multimodal Uncertainty in Trajectory Prediction via Fourier State Space Integration

Yizhou Huang, Gengze Jiang, Yihua Cheng, Kezhi Wang

Comments Accepted by CVPR 2026

2603.01281 2026-03-03 cs.CL cs.AI

Spectral Attention Steering for Prompt Highlighting

Weixian Waylon Li, Yuchen Niu, Yongxin Yang, Keshuang Li, Tiejun Ma, Shay B. Cohen

Comments Accepted to ICLR 2026 (Poster, Top 4%)

2603.01275 2026-03-03 cs.LG

The Impact of Battery Cell Configuration on Electric Vehicle Performance: An XGBoost-Based Classification with SHAP Interpretability

Santanam Wishal, Louis Filiepe Tio Jansel, Matthew Abednego Inkiriwang, Jason Sebastian

Comments 12 pages, 7 figures, 3 tables

2603.01274 2026-03-03 cs.LG cs.AI

GlassMol: Interpretable Molecular Property Prediction with Concept Bottleneck Models

Oscar Rivera, Ziqing Wang, Matthieu Dagommer, Abhishek Pandey, Kaize Ding

2603.01264 2026-03-03 cs.LG

S2O: Enhancing Adversarial Training with Second-Order Statistics of Weights

Gaojie Jin, Xinping Yi, Wei Huang, Sven Schewe, Xiaowei Huang

Comments Accepted to TPAMI 2025

2603.01260 2026-03-03 cs.LG cs.AI

MOSAIC: A Unified Platform for Cross-Paradigm Comparison and Evaluation of Homogeneous and Heterogeneous Multi-Agent RL, LLM, VLM, and Human Decision-Makers

Abdulhamid M. Mousa, Yu Fu, Rakhmonberdi Khajiev, Jalaledin M. Azzabi, Abdulkarim M. Mousa, Peng Yang, Yunusa Haruna, Ming Liu

Comments 13 pages, 2 figures

2603.01254 2026-03-03 cs.CL cs.AI

LLM Self-Explanations Fail Semantic Invariance

Stefan Szeider

2603.01253 2026-03-03 cs.CV

Cross-Modal Guidance for Fast Diffusion-Based Computed Tomography

Timofey Efimov, Singanallur Venkatakrishnan, Maliha Hossain, Haley Duba-Sullivan, Amirkoushyar Ziabari

Comments Accepted at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2026

2603.01252 2026-03-03 cs.CL cs.AI

Linking Knowledge to Care: Knowledge Graph-Augmented Medical Follow-Up Question Generation

Liwen Sun, Xiang Yu, Ming Tan, Zhuohao Chen, Anqi Cheng, Ashutosh Joshi, Chenyan Xiong

Comments Short paper published in the Findings of EACL 2026

2603.01243 2026-03-03 cs.CL

Suffix-Constrained Greedy Search Algorithms for Causal Language Models

Ayoub Hammal, Pierre Zweigenbaum, Caio Corro

2603.01239 2026-03-03 cs.CL cs.AI

Self-Anchoring Calibration Drift in Large Language Models: How Multi-Turn Conversations Reshape Model Confidence

Harshavardhan