arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.00322 2026-05-04 cs.LG

Federated Weather Modeling on Sensor Data

Shengchao Chen, Guodong Long

Comments Accepted by Encyclopedia of GIS, this is an unedited version. Published version: https://link.springer.com/rwe/10.1007/978-3-319-23519-6_1719-1

详情

DOI: 10.1007/978-3-319-23519-6_1719-1
Journal ref: In: Shekhar, S., Xiong, H., Zhou, X. (eds) Encyclopedia of GIS. Springer, Cham (2026)

英文摘要

Federated weather modeling on sensor data is a distributed system underpinned by federated learning, enabling multiple sensor data sources, including ground weather stations, satellites and IoT devices, to collaboratively train deep learning models without sharing raw data. This method safeguards data privacy and security while leverages diverse, geographically distributed datasets to improve the accuracy and robustness of global/regional weather modeling tasks such as forecasting and anomaly detection.

URL PDF HTML ☆

赞 0 踩 0

2605.00318 2026-05-04 cs.CL cs.IR

Structure-Aware Chunking for Tabular Data in Retrieval-Augmented Generation

Pooja Guttal, Varun Magotra, Vasudeva Mahavishnu, Natasha Chanto, Sidharth Sivaprasad, Manas Gaur

Comments 5 Pages, 1 figure, 4 Tables, 1 Algorithm, Work In Progress

2605.00307 2026-05-04 cs.RO cs.CV

A Model-based Visual Contact Localization and Force Sensing System for Compliant Robotic Grippers

Kaiwen Zuo, Shuyuan Yang, Zonghe Chua

Comments 8 pages, 6 figures, IEEE Robotics and Automation Letters

2605.00300 2026-05-04 cs.AI cs.DC cs.LG cs.PF

Token Arena: A Continuous Benchmark Unifying Energy and Cognition in AI Inference

Yuxuan Gao, Megan Wang, Yi Ling Yu

Comments 14 pages, 1 figure, 8 tables

2605.00298 2026-05-04 cs.LG math.OC

Data Deletion Can Help in Adaptive RL

Param Budhraja, Aditya Gangrade, Alex Olshevsky, Venkatesh Saligrama

详情

英文摘要

Deploying reinforcement learning policies in the real world requires adapting to time-varying environments. We study this problem in the contextual Markov Decision Process (cMDP) framework, where a family of environments is indexed by a low-dimensional context unknown at test time. The standard approach decomposes the problem: train a so-called "universal policy" which assumes knowledge of the true context, then pair it with a context estimator which approximates context using the observed trajectory. We identify a simple, counterintuitive trick that substantially improves the estimator: randomly delete a fraction of the training buffer after each round. This works because data is collected across multiple rounds using progressively better policies, and older trajectories come from a different distribution than what the estimator will face at deployment time; random deletion creates an implicit exponential decay on older data while preserving diversity without requiring any explicit identification of which samples are stale. This reduces robustness gap by 30% for MLPs and by 6% on average for recurrent networks. Strikingly, it allows a narrow MLP with 5x fewer parameters to outperform a wide MLP trained without deletion. To understand when and why deletion helps, we analyze regularized empirical risk minimization with a mismatch between the train distribution and the distribution at deployment; in this idealized setting, we prove that removing a single uniformly random training point decreases expected test loss in expectation under mild conditions. For ridge regression we make this quantitative: deletion helps when the regularization coefficient is moderate and the signal-to-noise ratio (SNR) is sufficiently low, and, crucially, this SNR threshold gives a direct measure of how large the distribution mismatch between training and deployment must be for deletion to be beneficial.

URL PDF HTML ☆

赞 0 踩 0

2605.00296 2026-05-04 cs.CV

Efficient Spatio-Temporal Vegetation Pixel Classification with Vision Transformers

Alan Gomes, Anderson Gonçalves, Samuel Felipe dos Santos, Nathan Felipe Alves, Magna Soelma Beserra de Moura, Bruna de Costa Alberton, Leonor Patricia C. Morellato, Ricardo da Silva Torres, Jurandy Almeida

2605.00294 2026-05-04 cs.CL

What Don't You Understand? Using Large Language Models to Identify and Characterize Student Misconceptions About Challenging Topics

Michael J. Parker, Maria G. Zavala-Cerna

Comments 60 pages. Education and Information Technologies (2026)

详情

DOI: 10.1007/s10639-026-13902-8

英文摘要

This study presents a systematic approach to identifying and characterizing student misconceptions in online learning environments through a novel combination of quantitative performance analysis and large language model (LLM) assessment. We analyzed data from 9 course periods across 5 online biomedical science courses, encompassing 3,802 medical student enrollments. Using data from 40-50 topic-focused quizzes per course, we developed a two-stage methodology. First, we identified challenging central topics using quiz-level performance metrics. Second, we employed LLMs to characterize the underlying misconceptions in these high-priority areas. By examining student performance on first attempts across primarily multiple-choice questions (MCQs), we identified consistently challenging topics that were also central to course objectives. We then leveraged recent advances in generative AI to analyze three distinct data sources in combination: quiz question content, student response patterns, and lecture transcripts. This approach revealed actionable insights about student misconceptions that were not apparent from performance data alone. The quality of the LLM-identified misconceptions was rated as excellent by subject matter experts. We also conducted teacher interviews to assess the perceived utility of our topic identification method. Faculty found that data-driven identification of challenging topics was valuable and corroborated their own classroom observations. This methodology provides a scalable approach to characterizing student difficulties in learning environments where quizzes are used. Our findings demonstrate the potential for targeted and potentially personalized interventions in future course iterations, with clear pathways for measuring intervention effectiveness through follow-up quiz performance.

URL PDF HTML ☆

赞 0 踩 0

2605.00291 2026-05-04 cs.CV cs.RO

An End-to-End Decision-Aware Multi-Scale Attention-Based Model for Explainable Autonomous Driving

Maryam Sadat Hosseini Azad, Shahriar Baradaran Shokouhi, Amir Abbas Hamidi Imani, Shahin Atakishiyev, Randy Goebel

2605.00284 2026-05-04 cs.LG cs.NA math.NA stat.ML

A Dirac-Frenkel-Onsager principle: Instantaneous residual minimization with gauge momentum for nonlinear parametrizations of PDE solutions

Matteo Raviola, Benjamin Peherstorfer

2605.00281 2026-05-04 cs.LG cs.MA math.OC

High-Probability Convergence in Decentralized Stochastic Optimization with Gradient Tracking

Aleksandar Armacki, Haoyuan Cai, Ali H. Sayed

Comments 49 pages, 4 figures. arXiv admin note: text overlap with arXiv:2510.06141

2605.00276 2026-05-04 cs.AI

Agentic AI for Trip Planning Optimization Application

Tiejin Chen, Ahmadreza Moradipari, Kyungtae Han, Hua Wei, Nejib Ammar

Comments Accepted to IV 2026

2605.00271 2026-05-04 cs.CV cs.AI cs.RO

REALM: An RGB and Event Aligned Latent Manifold for Cross-Modal Perception

Vincenzo Polizzi, David B. Lindell, Jonathan Kelly

2605.00270 2026-05-04 cs.CL cs.AI cs.CY cs.HC

Are You the A-hole? A Fair, Multi-Perspective Ethical Reasoning Framework

Sheza Munir, Ahanaf Rodoshi, Sumin Lee, Feiran Chang, Xujie Si, Syed Ishtiaque Ahmed

2605.00269 2026-05-04 cs.CL cs.LG

How Language Models Process Out-of-Distribution Inputs: A Two-Pathway Framework

Hamidreza Saghir

Comments 30 pages, 3 figures, 30+ tables. Submitted to COLM 2026

2605.00267 2026-05-04 cs.LG cs.AI cs.CR

Jailbroken Frontier Models Retain Their Capabilities

Daniel Zhu, Zihan Wang, Jenny Bao, Jerry Wei

2605.00261 2026-05-04 cs.RO

Task-Conditioned Uncertainty Costmaps for Legged Locomotion

Kartikeya Singh, Christo Aluckal, Romeo Orsolino, Karthik Dantu

2605.00260 2026-05-04 cs.LG

NLPOpt-Net: A Learning Method for Nonlinear Optimization with Feasibility Guarantees

Bimol Nath Roy, Rahul Golder, MM Faruque Hasan

2605.00257 2026-05-04 cs.CL cs.AI cs.IR

Retrieval-Augmented Reasoning for Chartered Accountancy

Jatin Gupta, Akhil Sharma, Saransh Singhania, Ali Imam Abidi

Comments 9 pages, 2 figures, and 3 tables

2605.00256 2026-05-04 cs.CV cs.AI

Remote SAMsing: From Segment Anything to Segment Everything

Osmar Luiz Ferreira de Carvalho, Osmar Abílio de Carvalho Júnior, Anesmar Olino de Albuquerque, Daniel Guerreiro e Silva

Comments 31 pages, 8 figures, 7 tables

2605.00253 2026-05-04 cs.CL cs.LG

Lost in State Space: Probing Frozen Mamba Representations

Bhagyashree Wagh, Akash Singh

Comments 8 pages, 2 figures

2605.00251 2026-05-04 cs.SD cs.CL eess.AS

Alethia: A Foundational Encoder for Voice Deepfakes

Yi Zhu, Brahmi Dwivedi, Jayaram Raghuram, Surya Koppisetti

Comments Accepted to ICML 2026

2605.00248 2026-05-04 cs.AI cs.GT cs.MA

Causal Foundations of Collective Agency

Frederik Hytting Jørgensen, Sebastian Weichwald, Lewis Hammond

Comments CLeaR 2026

2605.00245 2026-05-04 cs.AI

ARMOR 2025: A Military-Aligned Benchmark for Evaluating Large Language Model Safety Beyond Civilian Contexts

Sydney Johns, Heng Jin, Chaoyu Zhang, Y. Thomas Hou, Wenjing Lou

2605.00244 2026-05-04 cs.RO cs.CV

Lucid-XR: An Extended-Reality Data Engine for Robotic Manipulation

Yajvan Ravan, Adam Rashid, Alan Yu, Kai McClennen, Gio Huh, Kevin Yang, Zhutian Yang, Qinxi Yu, Xiaolong Wang, Phillip Isola, Ge Yang

Comments Project website: https://lucidxr.github.io

2605.00237 2026-05-04 cs.LG

Bayesian Optimization in Linear Time

Jesse Schneider, William J. Welch

Comments 25 pages, 25 figures; code available at https://github.com/jsa378/bo_partition

2605.00233 2026-05-04 cs.CV

Adaptive Geodesic Conformal Prediction for Egocentric Camera Pose Estimation

Aishani Pathak, Hasti Seifi

2605.00227 2026-05-04 cs.CL

Persona-Grounded Safety Evaluation of AI Companions in Multi-Turn Conversations

Prerna Juneja, Lika Lomidze

2605.00226 2026-05-04 cs.CL cs.AI cs.GT

Why Do LLMs Struggle in Strategic Play? Broken Links Between Observations, Beliefs, and Actions

Jan Sobotka, Mustafa O. Karabag, Ufuk Topcu

2605.00224 2026-05-04 cs.AI

TUR-DPO: Topology- and Uncertainty-Aware Direct Preference Optimization

Abdulhady Abas Abdullah, Fatemeh Daneshfar, Seyedali Mirjalili, Mourad Oussalah

Comments Proceedings of the 43rd International Conference on Machine Learning (ICML 2026)

2605.00219 2026-05-04 cs.CV

VkSplat: High-Performance 3DGS Training in Vulkan Compute

Jingxiang Chen, Mohamed Ibrahim, Yang Liu

Comments Submitted to Eurographics 2026 - Short Papers