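arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and more.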

2605.05598 2026-05-08 cs.AI cs.HC

Prober.ai: Gated Inquiry-Based Feedback via LLM-Constrained Personas for Argumentative Writing Development

Ran Bi, Shiyao Wei, Yuanyiyi Zhou

Comments Prototype awarded second place at the NYEdTech Hackathon (March 2026) https://www.nyedtechhackathon.com/2026-submissions

English abstract

The proliferation of large language models (LLMs) in educational settings has paradoxically undermined the cognitive processes they purport to support. Students increasingly outsource critical thinking to AI assistants that generate polished text on demand, resulting in measurable cognitive debt and diminished argumentative reasoning skills. We present Prober.ai, a web-based writing environment that inverts the conventional AI-tutoring paradigm: rather than generating or rewriting student text, the system constrains an LLM (Gemini 3 Flash Preview) through persona-specific system prompts and structured JSON output schemas to produce only targeted, inquiry-based questions about argumentative weaknesses. A two-phase interaction architecture -- Challenge and Unlock -- implements a pedagogical friction mechanism whereby revision suggestions are gated behind mandatory student reflection. The system's design is grounded in Toulmin's argumentation theory, research on peer feedforward questioning mechanisms, and evidence on AI-supported feedback in writing instruction. A functional prototype was developed in 36 hours during the NY EdTech Hackathon (March 2026), where it was awarded second place. We describe the system architecture, the prompt engineering methodology for constraining LLM output to pedagogically aligned JSON schemas, and discuss implications for scalable, cognition-preserving AI integration in writing education.

2605.05594 2026-05-08 cs.CL cs.CV cs.LG

The Cost of Context: Mitigating Textual Bias in Multimodal Retrieval-Augmented Generation

Hoin Jung, Xiaoqian Wang

2605.05593 2026-05-08 cs.AI

Causal Probing for Internal Visual Representations in Multimodal Large Language Models

Zehao Deng, Tianjie Ju, Zheng Wu, Liangbo He, Jun Lan, Huijia Zhu, Weiqiang Wang, Zhuosheng Zhang

2605.05592 2026-05-08 cs.LG cs.IT math.IT

When Can Voting Help, Hurt, or Change Course? Exact Structure of Binary Test-Time Aggregation

Yi Liu

2605.05590 2026-05-08 cs.CV

Uncertainty-Guided Edge Learning for Deep Image Regression in Remote Sensing

Anh Vu Nguyen, Dino Sejdinovic, Tat-Jun Chin

Comments AI4Space @ CVPR 2026

2605.05586 2026-05-08 cs.LG

AeroJEPA: Learning Semantic Latent Representations for Scalable 3D Aerodynamic Field Modeling

Francisco Giral, Abhijeet Vishwasrao, Andrea Arroyo Ramo, Mahmoud Golestanian, Federica Tonti, Adrian Lozano-Duran, Steven L. Brunton, Sergio Hoyas, Hector Gomez, Soledad Le Clainche, Ricardo Vinuesa

2605.05580 2026-05-08 cs.AI

AlphaCrafter: A Full-Stack Multi-Agent Framework for Cross-Sectional Quantitative Trading

Yishuo Yuan, Jiayi Sheng, Sirui Zeng, Jiaqi Wang, Jiaheng Liu

Comments Submitted to NeurIPS 2026. 26 pages, 8 figures

English abstract

Financial markets are inherently non-stationary, driven by complex interactions among macroeconomic regimes, microstructural frictions, and behavioral dynamics. Building quantitative strategies that remain profitable demands the continuous coupling of factor discovery, regime-adaptive selection, and risk-constrained execution. Prevailing approaches, however, optimize these components under static or isolated assumptions. Factor mining frameworks typically treat alpha discovery as a one-time search process, implicitly assuming that factor efficacy persists across market regimes. Execution-oriented systems often adopt role-playing agent architectures that simulate anthropomorphic trading committees, introducing behavioral noise rather than systematic rationality. Consequently, a fully automated, rationality-driven framework unifying a coherent quantitative pipeline remains absent. We introduce AlphaCrafter, a full-stack multi-agent framework that closes this gap through a continuously adaptive factor-to-execution pipeline, designed to track and respond to evolving market conditions without manual intervention. AlphaCrafter operates via three specialized agents: a Miner that continuously expands the factor pool via LLM-guided search, a Screener that assesses prevailing market conditions to construct regime-conditioned factor ensembles, and a Trader that translates these ensembles into quantitative strategies under explicit risk constraints. Together, these three agents form a closed-loop cross-sectional trading system that adapts holistically to evolving market dynamics. Extensive experiments on CSI 300 and S&P 500 demonstrate that AlphaCrafter consistently outperforms state-of-the-art baselines in risk-adjusted returns while exhibiting the lowest cross-trial variance, confirming that integrated and adaptive factor-to-execution design yields robust trading performance.

2605.05577 2026-05-08 cs.LG cs.AI

Accelerating LMO-Based Optimization via Implicit Gradient Transport

Won-Jun Jang, Si-Hyeon Lee

2605.05572 2026-05-08 cs.CV

Text-to-CAD Retrieval: a Strong Baseline

Honghu Pan, Zibo Du, Daxiang Liu, Chengliang Liu, Xiaoling Luo

2605.05567 2026-05-08 cs.AI

Locality-aware Private Class Identification for Domain Adaptation with Extreme Label Shift

Chuan-Xian Ren, Cheng-Jun Guo, Hong Yan

English abstract

Domain adaptation aims to transfer knowledge from a labeled source domain to an unlabeled target domain with different distributions. In real-world scenarios, the label spaces of the two domains often have an inclusion relationship, where some classes exist only in one domain but not the other. These non-overlapping classes are referred to as private classes. Identifying private class samples and mitigating their adverse effects is critical in the literature. Existing methods rely on the assumption that shifts in private classes are large enough to be considered outliers. However, the variance within a single shared class can be significantly larger than the difference between a private class and another shared class, challenging this assumption. Consequently, private classes substantially increase the difficulty of cross-domain classification. To address these issues, based on local transportation and metric properties of optimal transport (OT), a locality-aware private class identification approach is proposed in the form of a score function on transport mass. The effectiveness of the proposed approach is theoretically proven, highlighting the score function's strong ability to distinguish between shared and private class samples. Building on this, we introduce a reliable OT-based method (ReOT) for domain adaptation under severe label shift. ReOT minimizes classification risk while learning the separated cluster structure between the identified shared classes and private classes, effectively avoiding mismatch between shared-private sample pairs, thus ensuring that important knowledge is reliably transported intra-class to mitigate class-conditional discrepancy. Furthermore, a generalization upper bound of the target risk is provided for extreme label shift scenarios, which can be minimized by ReOT. Extensive experiments on benchmarks validate the effectiveness of ReOT.

2605.05566 2026-05-08 cs.AI cs.CL cs.LG

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Langlin Huang, Chengsong Huang, Jinyuan Li, Donghong Cai, Yuyi Yang, Jiaxin Huang

2605.05561 2026-05-08 cs.AI

BitCal-TTS: Bit-Calibrated Test-Time Scaling for Quantized Reasoning Models

Sai Babu Patarlapalli, Surya Teja Avvaru

Comments 17 pages, 5 figures, 4 tables. Code and reproducibility materials at https://github.com/Saibabu7770/bitcal-tts

2605.05556 2026-05-08 cs.CV

An extremely coarse feedback signal is sufficient for learning human-aligned visual representations

Yash Mehta, Michael F. Bonner

Comments 21 Pages, 6 Figures

2605.05553 2026-05-08 cs.LG

FedeKD: Energy-Based Gating for Robust Federated Knowledge Distillation under Heterogeneous Settings

Quang-Huy Nguyen, Jiaqi Wang, Wei-shinn Ku

2605.05549 2026-05-08 cs.CV

A Novel Graph-Regulated Disentangling Mamba Model with Sparse Tokens for Enhanced Tree Species Classification from MODIS Time Series

Motasem Alkayid, Zhengsen Xu, Saeid Taleghanidoozdoozan, Yimin Zhu, Megan Greenwood, Quinn Ledingham, Zack Dewis, Mabel Heffring, Naser El-Sheimy, Lincoln Linlin Xu

2605.05546 2026-05-08 cs.AI

SPARK: Self-Play with Asymmetric Reward from Knowledge Graphs

Hyobin Park, Taeseop Kim, Dong-Geol Choi

2605.05544 2026-05-08 cs.LG cs.RO

Adaptive Q-Chunking for Offline-to-Online Reinforcement Learning

Nandiraju Gireesh, Yuanliang Ju, He Wang

2605.05541 2026-05-08 cs.RO

Real-world Latency Analysis of Vehicular Visible Light Communication with Multiple LED Transmitters and an Event-Based Camera

Ryota Soga, Tsukasa Shimizu, Shintaro Shiba, Quan Kong, Shan Lu, Takaya Yamazato

Comments 5 pages, IEEE VTC2026-Spring

2605.05540 2026-05-08 cs.LG physics.flu-dyn

Towards Scalable One-Step Generative Modeling for Autoregressive Dynamical System Forecasting

Tianyue Yang, Xiao Xue

Comments 42 pages, 15 figures

English abstract

Fast surrogate modeling for high-dimensional physical dynamics requires more than low short-term error: useful models must roll out efficiently while preserving the statistical structure of long trajectories. Neural operators provide inexpensive autoregressive forecasts but can drift in turbulent regimes, whereas rolling diffusion and latent generative surrogates can represent stochastic transitions at the cost of multi-step denoising, noise-schedule design, or auxiliary compression models. We propose MeanFlow Long-term Invariant Spatiotemporal Consistency Autoregressive Models (MeLISA), a latent-free autoregressive generative surrogate built on pixel-space MeanFlow. MeLISA defines a blockwise stochastic transition kernel that generates each forecast block with a single model evaluation, avoiding latent encoders and iterative diffusion solvers at inference time. To stabilize long-horizon rollouts, MeLISA combines a Window-Consistency MeanFlow objective that learns conditional spatiotemporal generation from partially observed temporal windows with a Time Increment Consistency loss that constrains multi-lag finite increments and targets temporal-correlation structure. We evaluate MeLISA with compact UNet and scalable DiT backbones on two high-resolution benchmarks, extended 2D Kolmogorov flow at $256 \times 256$ and turbulent channel-flow slice at $192 \times 192$. MeLISA outperforms neural-operator baselines on short-term forecasting accuracy and long-horizon statistical metrics, including energy spectra, turbulent kinetic energy, and mixing-rate-related dynamics, while achieving inference speeds comparable to, and in some cases faster than, neural operators. Compact 3.7-5.7M-parameter variants already deliver strong parameter efficiency, and DiT variants provide a scalable path up to 150M parameters. Overall, MeLISA benefits both rollout efficiency and long-horizon statistical accuracy.

2605.05538 2026-05-08 cs.AI cs.IR

AgenticRAG: Agentic Retrieval for Enterprise Knowledge Bases

Susheel Suresh, Hazel Mak, Shangpo Chou, Fred Kroon, Sahil Bhatnagar

Comments 14 pages, 5 figures

2605.05535 2026-05-08 cs.AI

Housing Potential Common Data Model and City Digital Twin

Megan Katsumi, Mark Fox, Anderson Wong, Divnoor Chatha

2605.05534 2026-05-08 cs.LG

Adversarial Graph Neural Network Benchmarks: Towards Practical and Fair Evaluation

Tran Gia Bao Ngo, Zulfikar Alom, Federico Errica, Murat Kantarcioglu, Cuneyt Gurcan Akcora

Comments 49 pages, 6 figures

2605.05532 2026-05-08 cs.CL cs.CY

A Few Good Clauses: Comparing LLMs vs Domain-Trained Small Language Models on Structured Contract Extraction

Nicole Lincoln, Nick Whitehouse, Jaron Mar, Rivindu Perera

2605.05530 2026-05-08 cs.LG

Energy Generative Modeling: A Lyapunov-based Energy Matching Perspective

Yixuan Wang, Wenqian Xue, Warren E. Dixon

Comments 11 pages, 2 figures

2605.05524 2026-05-08 cs.LG cs.AI

MOSAIC: Module Discovery via Sparse Additive Identifiable Causal Learning for Scientific Time Series

Shicheng Fan, Nour Elhendawy, Jianle Sun, Ke Fang, Kun Zhang, Yihang Wang, Lu Cheng

2605.05519 2026-05-08 cs.LG cs.DC

OpenG2G: A Simulation Platform for AI Datacenter-Grid Runtime Coordination

Jae-Won Chung, Zhirui Liang, Yanyong Mao, Jiasi Chen, Mosharaf Chowdhury, Vladimir Dvorkin

Comments Open-source at https://github.com/gpu2grid/openg2g

2605.05511 2026-05-08 cs.LG stat.ML

Non-Myopic Active Feature Acquisition via Pathwise Policy Gradients

Linus Aronsson, Morteza Haghir Chehreghani

2605.05510 2026-05-08 cs.CV

The First Controllable Bokeh Rendering Challenge at NTIRE 2026

Tim Seizinger, Florin-Alexandru Vasluianu, Jeffrey Chen, Zhuyun Zhou, Zongwei Wu, Radu Timofte, Dafeng Zhang, Yipeng Lin, Qi Yan, Junhao Chen, Yang Yang, Divyavardhan Singh, Hariom Thacker, Hammad Mohammad, Aanchal Maurya, Kishor Upla, Kiran Raja, Wei Zhou, Hongyu Huang, Yujin Cho, Grigory Malivenko, Jiachen Tu, Yaokun Shi, Guoyi Xu, Yaoxin Jiang, Jiajia Liu

Comments Challenge report paper from NTIRE Workshop at CVPR 2026

2605.05503 2026-05-08 cs.CL

Chainwash: Multi-Step Rewriting Attacks on Diffusion Language Model Watermarks

Mohd Ruhul Ameen, Akif Islam, Nadim Mahmud, Md. Ekramul Hamid

Comments 13 pages, 5 figures, 3 tables

2605.05499 2026-05-08 cs.AI

FoodCHA: Multi-Modal LLM Agent for Fine-Grained Food Analysis

Woojin Lee, Pranav Mekkoth, Ye Tian, Onat Gungor, Tajana Rosing