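arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and other fields.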

2509.19665 2026-03-12 cs.CV cs.LG

Deep Learning for Clouds and Cloud Shadow Segmentation in Methane Satellite and Airborne Imaging Spectroscopy

Manuel Perez-Carrasco, Maya Nasr, Sebastien Roche, Chris Chan Miller, Zhan Zhang, Core Francisco Park, Eleanor Walker, Cecilia Garraffo, Douglas Finkbeiner, Sasha Ayvazov, Jonathan Franklin, Bingkun Luo, Xiong Liu, Ritesh Gautam, Steven Wofsy

DOI: 10.1109/TGRS.2026.3672371
Journal ref: IEEE Transactions on Geoscience and Remote Sensing 2026

English abstract

Effective cloud and cloud shadow detection is a critical prerequisite for accurate retrieval of concentrations of atmospheric methane (CH4) or other trace gases in hyperspectral remote sensing. This challenge is especially pertinent for MethaneSAT, a satellite mission launched in March 2024 to fill a significant data gap in terms of resolution, precision and swath between coarse-resolution global mappers and fine-scale point-source imagers of methane, and for its airborne companion mission, MethaneAIR. MethaneSAT delivers hyperspectral data at an intermediate spatial resolution (approx. 100 x 400 m), whereas MethaneAIR provides even finer resolution (approx. 25 m), enabling the development of highly detailed maps of concentrations that enable quantification of both the sources and rates of emissions. In this study, we use machine learning methods to address the cloud and cloud shadow detection problem for sensors with these high spatial resolutions. Clouds and cloud shadows in remote sensing data need to be effectively screened out as they bias methane retrievals in remote sensing imagery and impact the quantification of emissions. We deploy and evaluate conventional techniques, including Iterative Logistic Regression (ILR) and Multilayer Perceptron (MLP), alongside advanced deep learning architectures, namely U-Net and a Spectral Channel Attention Network (SCAN) method. Our results show that conventional methods struggle with spatial coherence and boundary definition, affecting the detection of clouds and cloud shadows. Deep learning models substantially improve detection quality: U-Net performs best in preserving spatial structure, while SCAN excels at capturing fine boundary details... Our data and code are publicly available at: https://doi.org/10.7910/DVN/IKLZOJ

2509.18552 2026-03-12 cs.LG cs.AI

Global Minimizers of Sigmoid Contrastive Loss

Kiril Bangachev, Guy Bresler, Iliyas Noman, Yury Polyanskiy

Comments Author names listed in alphabetical order. NeurIPS 2025. New version includes some results on the geometry of CLIP in addition to geometry of SigLIP

2509.05729 2026-03-12 cs.CL

QCSE: A Pretrained Quantum Context-Sensitive Word Embedding for Natural Language Processing

Charles M. Varmantchaonala, Niclas Götting, Nils-Erik Schütte, Jean Louis E. K. Fendji, Christopher Gies

English abstract

Quantum Natural Language Processing (QNLP) offers a novel approach to encoding and understanding the complexity of natural languages through the power of quantum computation. This paper presents a pretrained quantum context-sensitive embedding model, called QCSE, that captures context-sensitive word embeddings, leveraging the unique properties of quantum systems to learn contextual relationships in languages. The model introduces quantum-native context learning, enabling the utilization of quantum computers for linguistic tasks. Central to the proposed approach are innovative context matrix computation methods, designed to create unique representations of words based on their surrounding linguistic context. Five distinct methods are proposed and tested for computing the context matrices, incorporating techniques such as exponential decay, sinusoidal modulation, phase shifts, and hash-based transformations. These methods ensure that the quantum embeddings retain context sensitivity, thereby making them suitable for downstream language tasks where the expressibility and properties of quantum systems are valuable resources. To evaluate the effectiveness of the model and the associated context matrix methods, evaluations are conducted on both a small corpus of Fulani, a low-resource African language, and a slightly larger English corpus. The results demonstrate that QCSE not only captures context sensitivity but also leverages the expressibility of quantum systems for representing rich, context-aware language information. The use of Fulani further highlights the potential of QNLP to mitigate the lack of data for this category of languages. This work underscores the power of quantum computation in natural language processing (NLP) and opens new avenues for applying QNLP to real-world linguistic challenges across various tasks and domains.

2508.18791 2026-03-12 cs.CL

LaTeXTrans: Structured LaTeX Translation with Multi-Agent Coordination

Ziming Zhu, Chenglong Wang, Haosong Xv, Shunjie Xing, Yifu Huo, Fengning Tian, Quan Du, Di Yang, Chunliang Zhang, Tong Xiao, Jingbo Zhu

2508.12480 2026-03-12 cs.AI cs.LG cs.MA

The Yokai Learning Environment: Tracking Beliefs Over Space and Time

Constantin Ruhdorfer, Matteo Bortoletto, Johannes Forkel, Jakob Foerster, Andreas Bulling

Comments A previous version was presented as an oral presentation at the ToM IJCAI 2025 Workshop

2508.03542 2026-03-12 cs.CV

Speech-to-LaTeX: New Models and Datasets for Converting Spoken Equations and Sentences

Dmitrii Korzh, Dmitrii Tarasov, Artyom Iudin, Elvir Karimov, Matvey Skripkin, Nikita Kuzmin, Andrey Kuznetsov, Oleg Y. Rogov, Ivan Oseledets

Comments 22 pages, 2 figures, 16 Tables

2507.20859 2026-03-12 cs.CL

Leveraging Open-Source Large Language Models for Clinical Information Extraction in Resource-Constrained Settings

Luc Builtjes, Joeran Bosma, Mathias Prokop, Bram van Ginneken, Alessa Hering

Comments 34 pages, 5 figures

2507.11099 2026-03-12 cs.CV

A Survey on Interpretability in Visual Recognition

Qiyang Wan, Chengzhi Gao, Ruiping Wang, Xilin Chen

Comments 20 pages, 8 figures, 7 tables. Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

2507.01957 2026-03-12 cs.CV cs.AI

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

Zhuoyang Zhang, Luke J. Huang, Chengyue Wu, Shang Yang, Kelly Peng, Yao Lu, Song Han

Comments ICLR 2026 Oral. The first two authors contributed equally to this work

2506.06658 2026-03-12 cs.RO cs.AI

Self-Improving Loops for Visual Robotic Planning

Calvin Luo, Zilai Zeng, Mingxi Jia, Yilun Du, Chen Sun

Comments ICLR 2026. Project Page: https://diffusion-supervision.github.io/silvr/

2506.05941 2026-03-12 cs.LG cs.AI

Comparative Analysis of Modern Machine Learning Models for Retail Sales Forecasting

Luka Hobor, Mario Brcic, Lidija Polutnik, Ante Kapetanovic

Comments 12 total pages, 12 pages article

2506.02811 2026-03-12 cs.LG

CARTGen-IR: Synthetic Tabular Data Generation for Imbalanced Regression

António Pedro Pinheiro, Rita P. Ribeiro

Comments 14 pages, 6 figures, 2 tables, 1 algorithm

2505.18011 2026-03-12 cs.CL cs.AI

Training with Pseudo-Code for Instruction Following

Prince Kumar, Rudra Murthy, Riyaz Bhat, Danish Contractor

Comments Under Review

2505.17862 2026-03-12 cs.AI cs.CL cs.CV

Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities

Ziwei Zhou, Rui Wang, Zuxuan Wu, Yu-Gang Jiang

2505.13913 2026-03-12 cs.CL

Word length predicts word order: "Min-max"-ing drives language evolution

Hiram Ring

2505.13755 2026-03-12 cs.LG cs.NE nlin.CD stat.ML

Panda: A pretrained forecast model for chaotic dynamics

Jeffrey Lai, Anthony Bao, William Gilpin

2505.08245 2026-03-12 cs.CL cs.AI cs.HC

Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement

Haoran Ye, Jing Jin, Yuhang Xie, Xin Zhang, Guojie Song

Comments 400+ references

2504.11106 2026-03-12 cs.CV cs.CR

Token-Level Constraint Boundary Search for Jailbreaking Text-to-Image Models

Jiangtao Liu, Zhaoxin Wang, Handing Wang, Cong Tian, Yaochu Jin

2504.07997 2026-03-12 cs.CL

BiasCause: Evaluate Socially Biased Causal Reasoning of Large Language Models

Tian Xie, Tongxin Yin, Vaishakh Keshava, Xueru Zhang, Siddhartha Reddy Jonnalagadda

Comments This work was done while the first author was at Google. The first author is a student at the Ohio State University

2504.04371 2026-03-12 cs.LG stat.ML

An Algorithm to perform Covariance-Adjusted Support Vector Classification in Non-Euclidean Spaces

Satyajeet Sahoo, Jhareswar Maiti

2503.16107 2026-03-12 cs.LG cs.SY eess.SY

Learn to Bid as a Price-Maker Wind Power Producer

Shobhit Singhal, Marta Fochesato, Liviu Aolaritei, Florian Dörfler

2503.14255 2026-03-12 cs.RO

A Chain-Driven, Sandwich-Legged Quadruped Robot: Design and Experimental Analysis

Aman Singh, Bhavya Giri Goswami, Ketan Nehete, Shishir N. Y. Kolathaya

Comments 6 pages, 9 figures

2503.12918 2026-03-12 cs.CL

ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs

Pengcheng Wen, Jiaming Ji, Chi-Min Chan, Juntao Dai, Donghai Hong, Yaodong Yang, Sirui Han, Yike Guo

2503.10705 2026-03-12 cs.CV

Enhanced Continual Learning of Vision-Language Models with Model Fusion

Haoyuan Gao, Zicong Zhang, Yuqi Wei, Linglan Zhao, Guilin Li, Yexin Li, Bo Wang, Linghe Kong, Weiran Huang

Comments Published as a conference paper at ICLR 2026

2503.07516 2026-03-12 cs.CV

Rethinking Two-Stage Referring-by-Tracking in Referring Multi-Object Tracking: Make it Strong Again

Weize Li, Yunhao Du, Qixiang Yin, Zhicheng Zhao, Fei Su

Comments Accepted to CVPR 2026

2503.05170 2026-03-12 cs.CV

Leveraging Spatial Context for Positive Pair Sampling in Histopathology Image Representation Learning

Willmer Rafell Quinones Robles, Sakonporn Noree, Jongwoo Kim, Young Sin Ko, Bryan Wong, Mun Yong Yi

2503.01783 2026-03-12 cs.RO cs.CV

vS-Graphs: Tightly Coupling Visual SLAM and 3D Scene Graphs Exploiting Hierarchical Scene Understanding

Ali Tourani, Saad Ejaz, Hriday Bavle, Miguel Fernandez-Cortizas, David Morilla-Cabello, Jose Luis Sanchez-Lopez, Holger Voos

Comments 20 pages, 10 figures, 5 tables

2502.18928 2026-03-12 cs.AI

Talking like Piping and Instrumentation Diagrams (P&IDs)

Achmad Anggawirya Alimin, Dominik P. Goldstein, Lukas Schulze Balhorn, Artur M. Schweidtmann

2502.12188 2026-03-12 cs.LG cs.AI

Boosting Cross-problem Generalization in Diffusion-Based Neural Combinatorial Solver via Inference Time Adaptation

Haoyu Lei, Kaiwen Zhou, Yinchuan Li, Zhitang Chen, Farzan Farnia

2502.07460 2026-03-12 cs.LG stat.ML

Logarithmic Regret for Online KL-Regularized Reinforcement Learning

Heyang Zhao, Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang