arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2309.14857 2026-04-28 cs.LG cs.HC

Cluster Exploration using Informative Manifold Projections

Stavros Gerolymatos, Xenophon Evangelopoulos, Vladimir Gusev, John Y. Goulermas

Comments This paper has been accepted in the 27th European Conference on Artificial Intelligence (ECAI) 2024

详情

DOI: 10.3233/FAIA240717

英文摘要

Dimensionality reduction (DR) is one of the key tools for the visual exploration of high-dimensional data and uncovering its cluster structure in two- or three-dimensional spaces. The vast majority of DR methods in the literature do not take into account any prior knowledge a practitioner may have regarding the dataset under consideration. We propose a novel method to generate informative embeddings which not only factor out the structure associated with different kinds of prior knowledge but also aim to reveal any remaining underlying structure. To achieve this, we employ a linear combination of two objectives: firstly, contrastive PCA that discounts the structure associated with the prior information, and secondly, kurtosis projection pursuit which ensures meaningful data separation in the obtained embeddings. We formulate this task as a manifold optimization problem and validate it empirically across a variety of datasets considering three distinct types of prior knowledge. Lastly, we provide an automated framework to perform iterative visual exploration of high-dimensional data.

URL PDF HTML ☆

赞 0 踩 0

2307.10803 2026-04-28 cs.LG cs.AI physics.ao-ph

Spatial-Temporal Data Mining for Ocean Science: Data, Methodologies, and Opportunities

Hanchen Yang, Wengen Li, Shuyu Wang, Hui Li, Jihong Guan, Shuigeng Zhou, Jiannong Cao

2211.09619 2026-04-28 cs.LG cs.RO cs.SY eess.SY math.OC stat.ML

Introduction to Online Control

Elad Hazan, Karan Singh

Comments Draft; comments/suggestions welcome at nonstochastic.control@gmail.com

2209.14742 2026-04-28 cs.LG

Learning Gradient-based Mixup with Extrapolation toward Flatter Minima for Domain Generalization

Danni Peng, Sinno Jialin Pan

Comments 45 pages, 9 figures

2604.23539 2026-04-28 cs.AI

MetaGAI: A Large-Scale and High-Quality Benchmark for Generative AI Model and Data Card Generation

Haoxuan Zhang, Ruochi Li, Yang Zhang, Zhenni Liang, Junhua Ding, Ting Xiao, Haihua Chen

2604.23536 2026-04-28 cs.CV

$Z^2$-Sampling: Zero-Cost Zigzag Trajectories for Semantic Alignment in Diffusion Models

Haosen Li, Wenshuo Chen, Shaofeng Liang, Lei Wang, Kaishen Yuan, Yutao Yue

2604.23532 2026-04-28 cs.CV cs.AI

Emotion-Conditioned Short-Horizon Human Pose Forecasting with a Lightweight Predictive World Model

Jingni Huang, Peter Bloodsworth

2604.23530 2026-04-28 cs.CL cs.AI

MTRouter: Cost-Aware Multi-Turn LLM Routing with History-Model Joint Embeddings

Yiqun Zhang, Hao Li, Zihan Wang, Shi Feng, Xiaocui Yang, Daling Wang, Bo Zhang, Lei Bai, Shuyue Hu

Comments This work has accepted by ACL 2026

2604.23528 2026-04-28 cs.LG

When PINNs Go Wrong: Pseudo-Time Stepping Against Spurious Solutions

Sifan Wang, Shawn Koohy, Yiping Lu, Paris Perdikaris

Comments 41 pages, 18 figures

2604.23518 2026-04-28 cs.LG cs.AI

Autocorrelation Reintroduces Spectral Bias in KANs for Time Series Forecasting

Chen Zeng, Jiahui Wang, Qiao Wang

2604.23513 2026-04-28 cs.RO

Large Language Model based Interactive Decision-Making for Autonomous Driving

Xinwei Dong, Jiyang Li, Jiabin Xie, Yang Yi, Tianshang Jia, Shiyu Fang, Ye Tian, Peng Hang

Comments Accepted by Journal of Traffic and Transportation Engineering (English Edition)

详情

英文摘要

In high-conflict mixed-traffic scenarios involving human-driven and autonomous vehicles, most existing autonomous driving systems default to overly conservative behaviors, lack proactive interaction, and consequently suffer from limited public acceptance. To mitigate intent misunderstandings and decision failures, we present a Large Language Model based interactive decision-making framework that augments scene understanding and intent-aware interaction to jointly improve safety and efficiency. The approach uses Object-Process Methodology to semantically model complex multi-vehicle scenes, abstracting low-level perceptual data into objects, processes, and relations, thereby streamlining reasoning over latent causal structure. Building on this representation, the Large Language Model parses both explicit and implicit intents of surrounding agents and, under jointly enforced safety and efficiency constraints, selects candidate maneuvers. We further generate perturbed trajectory candidates via Monte Carlo sampling and evaluate them to obtain an optimized executable trajectory. To foster transparency and coordination with nearby road users, the final decision is translated by the Large Language Model into concise natural-language messages and broadcast through an external Human-Machine Interface, completing a closed loop from scene understanding to action to language. Experiments in a cluster driving simulator demonstrate that the proposed method outperforms traditional baselines across safety, comfort, and efficiency metrics, while a Turing-test-style evaluation indicates a high degree of human-likeness in decision making. Besides, these results suggest that coupling semantic scene abstraction with Large Language Model mediated intent reasoning and language-based eHMI communication offers a practical pathway toward interactive, trustworthy autonomous driving in dense mixed traffic.

URL PDF HTML ☆

赞 0 踩 0

2604.23508 2026-04-28 cs.CV

BurstGP: Enhancing Raw Burst Image Super Resolution with Generative Priors

Dong Huo, Tristan Aumentado-Armstrong, Samrudhdhi B. Rangrej, Maitreya Suin, Angela Ning Ye, Zhiming Hu, Amanpreet Walia, Amirhossein Kazerouni, Konstantinos G. Derpanis, Iqbal Mohomed, Alex Levinshtein

Comments 37 pages, 13 figures

2604.23500 2026-04-28 cs.LG cs.AI

Interpretable Physics-Informed Load Forecasting for U.S. Grid Resilience: SHAP-Guided Ensemble Validation in Hybrid Deep Learning Under Extreme Weather

Md Abubakkar, Sajib Debnath, Md. Uzzal Mia

2604.23494 2026-04-28 cs.AI cs.LG

Do Transaction-Level and Actor-Level AML Queues Agree? An Empirical Evaluation of Granularity Effects on the Elliptic++ Graph

Ankur Malik

Comments 20 pages, 9 tables, 4 appendices

2604.23488 2026-04-28 cs.LG

Do Synthetic Trajectories Reflect Real Reward Hacking? A Systematic Study on Monitoring In-the-Wild Hacking in Code Generation

Lichen Li, Hengguang Zhou, Yijun Liang, Tianyi Zhou, Cho-Jui Hsieh

2604.23486 2026-04-28 cs.CL cs.CY cs.HC

Your Students Don't Use LLMs Like You Wish They Did

Sebastian Kobler, Matthew Clemson, Angela Sun, Jonathan K. Kummerfeld

Comments To appear at ACL 2026 (Main Conference)

2604.23483 2026-04-28 cs.AI

Agentic Adversarial Rewriting Exposes Architectural Vulnerabilities in Black-Box NLP Pipelines

Mazal Bethany, Kim-Kwang Raymond Choo, Nishant Vishwamitra, Peyman Najafirad

Comments Submitted to IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY

2604.23481 2026-04-28 cs.CV cs.LG

Leveraging Spatial Transcriptomics as Alternative to Manual Annotations for Deep Learning-Based Nuclei Analysis

Kazuya Nishimura, Ryoma Bise, Haruka Hirose, Yasuhiro Kojima

2604.23475 2026-04-28 cs.LG cs.CL

Supernodes and Halos: Loss-Critical Hubs in LLM Feed-Forward Layers

Audrey Cherilyn, Houman Safaai

2604.23474 2026-04-28 cs.LG

GeoCert: Certified Geometric AI for Reliable Forecasting

Regina Zhang, Zongru Li, Honggang Wen, Xiaofeng Liu, Siu-Ming Yiu, Pietro Liò, Kwok-Yan Lam

Comments 15 pages, 4 figures

2604.23467 2026-04-28 cs.LG cs.AI cs.AR

Hybrid JIT-CUDA Graph Optimization for Low-Latency Large Language Model Inference

Divakar Kumar Yadav, Tian Zhao

2604.23465 2026-04-28 cs.LG

Machine learning models for estimating counterfactuals in a single-arm inflammatory bowel disease study

Dan Liu, Fida K. Dankar, Jennifer C. deBruyn, Amanda Ricciuto, Anne M. Griffiths, Thomas D. Walters, Khaled EI Emam

2604.23460 2026-04-28 cs.AI cs.CL cs.LG

Ulterior Motives: Detecting Misaligned Reasoning in Continuous Thought Models

Sharan Ramjee

Comments 15 pages with 2 figures

2604.23458 2026-04-28 cs.CL cs.IR cs.LG

A Benchmark Suite of Reddit-Derived Datasets for Mental Health Detection

Khalid Hasan, Jamil Saquer

Comments In the proceedings of 12th Annual Conference on Computational Science & Computational Intelligence (CSCI'25)

2604.23452 2026-04-28 cs.CV cs.LG

From Edges to Depth: Probing the Spatial Hierarchy in Vision Transformers

Jainum Sanghavi

Comments 12 pages, 6 figures. Code available at https://github.com/JainumSanghavi/ProbingViTs

2604.23449 2026-04-28 cs.AI cs.HC

ArguAgent: AI-Supported Real-Time Grouping for Productive Argumentation in STEM Classrooms

Jennifer Kleiman, Yizhu Gao, Xin Xia, Zhaoji Wang, Zipei Zhu, Jongchan Park, Xiaoming Zhai

Comments Full paper accepted to the 27th International Conference on AI in Education (AIED 2026). AIED Proceedings to be released Summer 2026

详情

Journal ref: International Conference on Artificial Intelligence in Education, AIED 2026

英文摘要

Argumentation is a core practice in STEM education, but its productivity depends on who participates and how they interact. Higher-achieving students often dominate the talk and decision-making, while lower-achieving peers may disengage, defer, or comply without contributing substantive reasoning. Forming groups strategically based on students' stances and argumentation skills could help foster inclusive, evidence-based discourse. In practice, however, teachers are constrained in implementing this grouping strategy because it requires real-time insight into students' positions and the quality of their argumentation, information that is difficult to assess reliably and at scale during instruction. We present a generative AI-powered system, ArguAgent, that creates groups optimizing for stance heterogeneity while constraining argumentation quality differences to +/-1 level on a validated learning progression. ArguAgent uses a two-component assessment pipeline: first scoring student arguments on a 0-4 rubric, then clustering positions via semantic analysis. We validated the scoring component against human expert consensus (Krippendorff's ααα = 0.817) using 200 expert-generated scores. Testing three OpenAI models (GPT-4o-mini, GPT-5.1, GPT-5.2) with identical calibrated prompts, we found that systematic prompt engineering informed by human disagreement analysis contributed 89% of scoring improvement (QWK: 0.531 to 0.686), while model upgrades contributed an additional 11% (QWK: 0.686 to 0.708). Simulation testing across 100 classes demonstrated that the grouping algorithm achieves 95.4% of groups that meet both design criteria, a 3.2x improvement over random assignment. These results suggest ArguAgent can enable real-time, theoretically grounded grouping that promotes productive STEM argumentation in classrooms.

URL PDF HTML ☆

赞 0 踩 0

2604.23446 2026-04-28 cs.AI

IndustryAssetEQA: A Neurosymbolic Operational Intelligence System for Embodied Question Answering in Industrial Asset Maintenance

Chathurangi Shyalika, Dhaval Patel, Amit Sheth

Comments 20 pages, 4 figures, 4 tables, Accepted for the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026) Industry Track

2604.23445 2026-04-28 cs.CL cs.AI cs.CY cs.LG

AI Safety Training Can be Clinically Harmful

Suhas BN, Andrew M. Sherrill, Rosa I. Arriaga, Chris W. Wiese, Saeed Abdullah

Comments 26 pages, 5 figures, 10 tables

2604.23442 2026-04-28 cs.CV

Resource-Constrained UAV-Based Weed Detection for Site-Specific Management on Edge Devices

Linyuan Wang, Haibo Yao, Te-Ming Tseng, Kelvin Betitame, Xin Sun, Hanbo Huang, Dong Chen

2604.23434 2026-04-28 cs.LG cs.CL

When Does Removing LayerNorm Help? Activation Bounding as a Regime-Dependent Implicit Regularizer

Lucky Verma

Comments 28 pages, 7 figures, includes appendices. Code and artifacts: https://github.com/lucky-verma/dyt-composition-study