arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.22167 2026-03-24 cs.LG cs.AI cs.GT econ.TH

Calibeating Made Simple

Yurong Chen, Zhiyi Huang, Michael I. Jordan, Haipeng Luo

详情

英文摘要

We study calibeating, the problem of post-processing external forecasts online to minimize cumulative losses and match an informativeness-based benchmark. Unlike prior work, which analyzed calibeating for specific losses with specific arguments, we reduce calibeating to existing online learning techniques and obtain results for general proper losses. More concretely, we first show that calibeating is minimax-equivalent to regret minimization. This recovers the $O(\log T)$ calibeating rate of Foster and Hart [FH23] for the Brier and log losses and its optimality, and yields new optimal calibeating rates for mixable losses and general bounded losses. Second, we prove that multi-calibeating is minimax-equivalent to the combination of calibeating and the classical expert problem. This yields new optimal multi-calibeating rates for mixable losses, including Brier and log losses, and general bounded losses. Finally, we obtain new bounds for achieving calibeating and calibration simultaneously for the Brier loss. For binary predictions, our result gives the first calibrated algorithm that at the same time also achieves the optimal $O(\log T)$ calibeating rate.

URL PDF HTML ☆

赞 0 踩 0

2603.22165 2026-03-24 cs.CV

ACPO: Counteracting Likelihood Displacement in Vision-Language Alignment with Asymmetric Constraints

Kaili Huang, Hongming Zhang, Rui Shen, Linjun Dai, Jiahao Wang, Hanming Deng, Lewei Lu

2603.22158 2026-03-24 cs.LG cs.AI

Multimodal Survival Analysis with Locally Deployable Large Language Models

Moritz Gögl, Christopher Yau

Comments NeurIPS 2025 Workshop on Multi-modal Foundation Models and Large Language Models for Life Sciences

2603.22154 2026-03-24 cs.LG cs.CV

dynActivation: A Trainable Activation Family for Adaptive Nonlinearity

Alois Bachmann

Comments 22 pages, 15 figures

2603.22148 2026-03-24 cs.CV

OpenEarth-Agent: From Tool Calling to Tool Creation for Open-Environment Earth Observation

Sijie Zhao, Feng Liu, Xueliang Zhang, Hao Chen, Xinyu Gu, Zhe Jiang, Fenghua Ling, Ben Fei, Wenlong Zhang, Junjue Wang, Weihao Xuan, Pengfeng Xiao, Naoto Yokoya, Lei Bai

Comments 15 pages, 4 figures

详情

英文摘要

Earth Observation (EO) is essential for perceiving dynamic land surface changes, yet deploying autonomous EO in open environments is hindered by the immense diversity of multi-source data and heterogeneous tasks. While remote sensing agents have emerged to streamline EO workflows, existing tool-calling agents are confined to closed environments. They rely on pre-defined tools and are restricted to narrow scope, limiting their generalization to the diverse data and tasks. To overcome these limitations, we introduce OpenEarth-Agent, the first tool-creation agent framework tailored for open-environment EO. Rather than calling predefined tools, OpenEarth-Agent employs adaptive workflow planning and tool creation to generalize to unseen data and tasks. This adaptability is bolstered by an open-ended integration of multi-stage tools and cross-domain knowledge bases, enabling robust execution in the entire EO pipeline across multiple application domains. To comprehensively evaluate EO agents in open environments, we propose OpenEarth-Bench, a novel benchmark comprising 596 real-world, full-pipeline cases across seven application domains, explicitly designed to assess agents' adaptive planning and tool creation capabilities. Only essential pre-trained model tools are provided in this benchmark, devoid of any other predefined task-specific tools. Extensive experiments demonstrate that OpenEarth-Agent successfully masters full-pipeline EO across multiple domains in the open environment. Notably, on the cross-benchmark Earth-Bench, our tool-creating agent equipped with 6 essential pre-trained models achieves performance comparable to tool-calling agents relying on 104 specialized tools, and significantly outperforms them when provided with the complete toolset. In several cases, the created tools exhibit superior robustness to data anomalies compared to human-engineered counterparts.

URL PDF HTML ☆

赞 0 踩 0

2603.22136 2026-03-24 cs.CL cs.DB

The Semantic Ladder: A Framework for Progressive Formalization of Natural Language Content for Knowledge Graphs and AI Systems

Lars Vogt

2603.22125 2026-03-24 cs.CV

DA-VAE: Plug-in Latent Compression for Diffusion via Detail Alignment

Xin Cai, Zhiyuan You, Zhoutong Zhang, Tianfan Xue

Comments CVPR 2026

2603.22123 2026-03-24 cs.CV

Biophysics-Enhanced Neural Representations for Patient-Specific Respiratory Motion Modeling

Jan Boysen, Hristina Uzunova, Heinz Handels, Jan Ehrhardt

Comments Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org/2026:008

详情

DOI: 10.59275/j.melba.2026-1ba1
Journal ref: Machine.Learning.for.Biomedical.Imaging. 2026 (2026)

英文摘要

A precise spatial delivery of the radiation dose is crucial for the treatment success in radiotherapy. In the lung and upper abdominal region, respiratory motion introduces significant treatment uncertainties, requiring special motion management techniques. To address this, respiratory motion models are commonly used to infer the patient-specific respiratory motion and target the dose more efficiently. In this work, we investigate the possibility of using implicit neural representations (INR) for surrogate-based motion modeling. Therefore, we propose physics-regularized implicit surrogate-based modeling for respiratory motion (PRISM-RM). Our new integrated respiratory motion model is free of a fixed reference breathing state. Unlike conventional pairwise registration techniques, our approach provides a trajectory-aware spatio-temporally continuous and diffeomorphic motion representation, improving generalization to extrapolation scenarios. We introduce biophysical constraints, ensuring physiologically plausible motion estimation across time beyond the training data. Our results show that our trajectory-aware approach performs on par in interpolation and improves the extrapolation ability compared to our initially proposed INR-based approach. Compared to sequential registration-based approaches both our approaches perform equally well in interpolation, but underperform in extrapolation scenarios. However, the methodical features of INRs make them particularly effective for respiratory motion modeling, and with their performance steadily improving, they demonstrate strong potential for advancing this field.

URL PDF HTML ☆

赞 0 踩 0

2603.22118 2026-03-24 cs.RO

Programming Manufacturing Robots with Imperfect AI: LLMs as Tuning Experts for FDM Print Configuration Selection

Ekta U. Samani, Christopher G. Atkeson

2603.22117 2026-03-24 cs.LG cs.AI

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Kexin Huang, Haoming Meng, Junkang Wu, Jinda Lu, Chiyu Ma, Ziqian Chen, Xue Wang, Bolin Ding, Jiancan Wu, Xiang Wang, Xiangnan He, Guoyin Wang, Jingren Zhou

2603.22103 2026-03-24 cs.CL

Multiperspectivity as a Resource for Narrative Similarity Prediction

Max Upravitelev, Veronika Solopova, Jing Yang, Charlott Jakob, Premtim Sahitaj, Ariana Sahitaj, Vera Schmitt

2603.22102 2026-03-24 cs.CV cs.GR cs.RO

FreeArtGS: Articulated Gaussian Splatting Under Free-moving Scenario

Hang Dai, Hongwei Fan, Han Zhang, Duojin Wu, Jiyao Zhang, Hao Dong

Comments Accepted to CVPR 2026

2603.22097 2026-03-24 cs.AI cs.LG

SpecTM: Spectral Targeted Masking for Trustworthy Foundation Models

Syed Usama Imtiaz, Mitra Nasr Azadani, Nasrin Alamdari

Comments Accepted to IEEE IGARSS 2026

2603.22096 2026-03-24 cs.AI

GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning

Xiao Han, Yuzheng Fan, Sendong Zhao, Haochun Wang, Bing Qin

2603.22091 2026-03-24 cs.CV

P-Flow: Prompting Visual Effects Generation

Rui Zhao, Mike Zheng Shou

2603.22083 2026-03-24 cs.AI

A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP

Xi Yang, Aurelie Lozano, Naoki Abe, Bhavya, Saurabh Jha, Noah Zheutlin, Rohan R. Arora, Yu Deng, Daby M. Sow

2603.22075 2026-03-24 cs.CL

Autoregressive vs. Masked Diffusion Language Models: A Controlled Comparison

Caio Vicentino

Comments 10 pages, 2 figures, 4 tables. Code and checkpoints at https://github.com/caiovicentino/arche

2603.22074 2026-03-24 cs.LG

MIHT: A Hoeffding Tree for Time Series Classification using Multiple Instance Learning

Aurora Esteban, Amelia Zafra, Sebastián Ventura

2603.22061 2026-03-24 cs.LG cs.AI

On the Failure of Topic-Matched Contrast Baselines in Multi-Directional Refusal Abliteration

Valentin Petrov

2603.22057 2026-03-24 cs.CV

SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning

Byungwoo Jeon, Dongyoung Kim, Huiwon Jang, Insoo Kim, Jinwoo Shin

Comments 35 pages; 7 figures

2603.22053 2026-03-24 cs.SD cs.LG

AnimalCLAP: Taxonomy-Aware Language-Audio Pretraining for Species Recognition and Trait Inference

Risa Shinoda, Kaede Shiohara, Nakamasa Inoue, Hiroaki Santo, Fumio Okura

Comments ICASSP 2026

2603.22039 2026-03-24 cs.RO cs.LG

RAFL: Generalizable Sim-to-Real of Soft Robots with Residual Acceleration Field Learning

Dong Heon Cho, Boyuan Chen

2603.22036 2026-03-24 cs.CV

GTSR: Subsurface Scattering Awared 3D Gaussians for Translucent Surface Reconstruction

Youwen Yuan, Xi Zhao

2603.22035 2026-03-24 cs.AI

Future-Interactions-Aware Trajectory Prediction via Braid Theory

Caio Azevedo, Stefano Sabatini, Sascha Hornauer, Fabien Moutarde

Comments To be published in IEEE Intelligent Vehicles Symposium (IV) 2026

2603.22031 2026-03-24 cs.RO

MEVIUS2: Practical Open-Source Quadruped Robot with Sheet Metal Welding and Multimodal Perception

Kento Kawaharazuka, Keita Yoneda, Shintaro Inoue, Temma Suzuki, Jun Oda, Kei Okada

Comments Accepted to IEEE Robotics and Automation Practice, Website - https://haraduka.github.io/mevius2-hardware/

2603.22030 2026-03-24 cs.LG stat.ML

On the Interplay of Priors and Overparametrization in Bayesian Neural Network Posteriors

Julius Kobialka, Emanuel Sommer, Chris Kolb, Juntae Kwon, Daniel Dold, David Rügamer

Comments Accepted at the 29th International Conference on Artificial Intelligence and Statistics (AISTATS) 2026

2603.22027 2026-03-24 cs.CV

Tuning Real-World Image Restoration at Inference: A Test-Time Scaling Paradigm for Flow Matching Models

Purui Bai, Junxian Duan, Pin Wang, Jinhua Hao, Ming Sun, Chao Zhou, Huaibo Huang

Comments 27 pages, 10 figures

2603.22012 2026-03-24 cs.CV cs.RO

6D Robotic OCT Scanning of Curved Tissue Surfaces

Suresh Guttikonda, Maximilian Neidhardt, Vidas Raudonis, Alexander Schlaefer

Comments Accepted at IEEE ISBI 2026

2603.22002 2026-03-24 cs.CV cs.AI

SegMaFormer: A Hybrid State-Space and Transformer Model for Efficient Segmentation

Duy D. Nguyen, Phat T. Tran-Truong

2603.21999 2026-03-24 cs.CV

STENet: Superpixel Token Enhancing Network for RGB-D Salient Object Detection

Jianlin Chen, Gongyang Li, Zhijiang Zhang, Liang Chang, Dan Zeng

Comments 12 pages, 8 figures, accepted by IEEE TMM