arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.18107 2026-03-20 cs.LG cs.AI cs.CE q-fin.ST

ARTEMIS: A Neuro Symbolic Framework for Economically Constrained Market Dynamics

Rahul D Ray

详情

英文摘要

Deep learning models in quantitative finance often operate as black boxes, lacking interpretability and failing to incorporate fundamental economic principles such as no-arbitrage constraints. This paper introduces ARTEMIS (Arbitrage-free Representation Through Economic Models and Interpretable Symbolics), a novel neuro-symbolic framework combining a continuous-time Laplace Neural Operator encoder, a neural stochastic differential equation regularised by physics-informed losses, and a differentiable symbolic bottleneck that distils interpretable trading rules. The model enforces economic plausibility via two novel regularisation terms: a Feynman-Kac PDE residual penalising local no-arbitrage violations, and a market price of risk penalty bounding the instantaneous Sharpe ratio. We evaluate ARTEMIS against six strong baselines on four datasets: Jane Street, Optiver, Time-IMM, and DSLOB (a synthetic crash regime). Results demonstrate ARTEMIS achieves state-of-the-art directional accuracy, outperforming all baselines on DSLOB (64.96%) and Time-IMM (96.0%). A comprehensive ablation study confirms each component's contribution: removing the PDE loss reduces directional accuracy from 64.89% to 50.32%. Underperformance on Optiver is attributed to its long sequence length and volatility-focused target. By providing interpretable, economically grounded predictions, ARTEMIS bridges the gap between deep learning's power and the transparency demanded in quantitative finance.

URL PDF HTML ☆

赞 0 踩 0

2603.18101 2026-03-20 cs.CV cs.AI cs.LG

Training-Only Heterogeneous Image-Patch-Text Graph Supervision for Advancing Few-Shot Learning Adapters

Mohammed Rahman Sherif Khan Mohammad, Ardhendu Behera, Sandip Pradhan, Swagat Kumar, Amr Ahmed

Comments Accepted at The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026

2603.18095 2026-03-20 cs.CV cs.LG

Q-Drift: Quantization-Aware Drift Correction for Diffusion Model Sampling

Sooyoung Ryu, Mathieu Salzmann, Saqib Javed

Comments 29 pages, 6 figures

2603.18091 2026-03-20 cs.CV cs.RO

Action Draft and Verify: A Self-Verifying Framework for Vision-Language-Action Model

Chen Zhao, Zhuoran Wang, Haoyang Li, Shifeng Bao, Guanlin Li, Youhe Feng, Yang Li, Jie Tang, Jing Zhang

2603.18089 2026-03-20 cs.CV cs.AI cs.LG

CytoSyn: a Foundation Diffusion Model for Histopathology -- Tech Report

Thomas Duboudin, Xavier Fontaine, Etienne Andrier, Lionel Guillou, Alexandre Filiot, Thalyssa Baiocco-Rodrigues, Antoine Olivier, Alberto Romagnoni, John Klein, Jean-Baptiste Schiratti

Comments 21 pages, 5 figures, tech report, model page: https://huggingface.co/Owkin-Bioptimus/CytoSyn

2603.18088 2026-03-20 cs.LG cs.AI

Enhancing Reinforcement Learning Fine-Tuning with an Online Refiner

Hao Ma, Zhiqiang Pu, Yang Liu, Xiaolin Ai

2603.18086 2026-03-20 cs.CV

SSP-SAM: SAM with Semantic-Spatial Prompt for Referring Expression Segmentation

Wei Tang, Xuejing Liu, Yanpeng Sun, Zechao Li

2603.18085 2026-03-20 cs.AI

Multi-Trait Subspace Steering to Reveal the Dark Side of Human-AI Interaction

Xin Wei Chia, Swee Liang Wong, Jonathan Pan

2603.18084 2026-03-20 cs.RO cs.AI

Uncovering Latent Phase Structures and Branching Logic in Locomotion Policies: A Case Study on HalfCheetah

Daisuke Yasui, Toshitaka Matsuki, Hiroshi Sato

Comments Accepted at XAI-2026: The 4th World Conference on eXplainable Artificial Intelligence

2603.18083 2026-03-20 cs.LG cs.AI

Probabilistic Federated Learning on Uncertain and Heterogeneous Data with Model Personalization

Ratun Rahman, Dinh C. Nguyen

Comments Accepted at IEEE Transactions on Emerging Topics in Computational Intelligence

2603.18079 2026-03-20 cs.LG cs.AI

SLEA-RL: Step-Level Experience Augmented Reinforcement Learning for Multi-Turn Agentic Training

Prince Zizhuang Wang, Shuli Jiang

2603.18078 2026-03-20 cs.LG

Variational Phasor Circuits for Phase-Native Brain-Computer Interface Classification

Dibakar Sigdel

2603.18074 2026-03-20 cs.LG cs.AI cs.IR stat.AP

Lightweight Adaptation for LLM-based Technical Service Agent: Latent Logic Augmentation and Robust Noise Reduction

Yi Yu, Junzhuo Ma, Chenghuang Shen, Xingyan Liu, Jing Gu, Hangyi Sun, Guangquan Hu, Jianfeng Liu, Weiting Liu, Mingyue Pu, Yu Wang, Zhengdong Xiao, Rui Xie, Longjiu Luo, Qianrong Wang, Gurong Cui, Honglin Qiao, Wenlian Lu

2603.18073 2026-03-20 cs.AI

Continually self-improving AI

Zitong Yang

Comments PhD thesis

2603.18046 2026-03-20 cs.LG cs.AI cs.CR

NANOZK: Layerwise Zero-Knowledge Proofs for Verifiable Large Language Model Inference

Zhaohui Geoffrey Wang

Comments 11 pages. Accepted at the VerifAI Workshop at ICLR 2026 (camera-ready version)

2603.18045 2026-03-20 cs.CV

RARE disease detection from Capsule Endoscopic Videos based on Vision Transformers

X. Gao, C. Chien, G. Liu, A. Manullang

2603.18041 2026-03-20 cs.LG cs.SI cs.SY eess.SY math.AT

Quotient Geometry and Persistence-Stable Metrics for Swarm Configurations

Mark M. Bailey

Comments 20 pages

2603.18037 2026-03-20 cs.LG

Adapting Methods for Domain-Specific Japanese Small LMs: Scale, Architecture, and Quantization

Takato Yasuno

Comments 16 pages, 11 figures, 6 tables

2603.18036 2026-03-20 cs.LG

MST-Direct: Matching via Sinkhorn Transport for Multivariate Geostatistical Simulation with Complex Non-Linear Dependencies

Tchalies Bachmann Schmitz

2603.18035 2026-03-20 cs.LG

Taming Epilepsy: Mean Field Control of Whole-Brain Dynamics

Ming Li, Ting Gao, Jingqiao Dua

Comments 22 pages, 7 figures

2603.18032 2026-03-20 cs.LG cs.AI stat.ML

Towards Differentiating Between Failures and Domain Shifts in Industrial Data Streams

Natalia Wojak-Strzelecka, Szymon Bobek, Grzegorz J. Nalepa, Jerzy Stefanowski

2603.18031 2026-03-20 cs.LG cs.AI

InfoMamba: An Attention-Free Hybrid Mamba-Transformer Model

Youjin Wang, Jiaqiao Zhao, Rong Fu, Run Zhou, Ruizhe Zhang, Jiani Liang, Suisuai Cao, Feng Zhou

2603.18029 2026-03-20 cs.LG cs.AI

Engineering Verifiable Modularity in Transformers via Per-Layer Supervision

J. Clayton Kerce

详情

英文摘要

Transformers resist surgical control. Ablating an attention head identified as critical for capitalization produces minimal behavioral change because distributed redundancy compensates for damage. This Hydra effect renders interpretability illusory: we may identify components through correlation, but cannot predict or control their causal role. We demonstrate that architectural interventions can expose hidden modularity. Our approach combines dual-stream processing separating token and contextual representations, per-layer supervision providing independent gradient signal at each depth, and gated attention regularizing toward discrete activation patterns. When trained with per-layer supervision, models produce ablation effects 5 to 23 times larger than architecturally identical controls trained with standard objectives. This enables 4 times greater control leverage on targeted behaviors: scaling identified attention heads produces smooth, predictable changes in model output. The key finding is architectural. Without per-layer supervision, ablation damage concentrates near zero with low variance (Winograd standard deviation 0.63%). With per-layer supervision, effects spread widely (standard deviation 6.32%), revealing which predictions depend on which circuits. The larger variance is not measurement noise but the signature of unmasked modularity. We validate our approach through three components: engineered features that capture computational dynamics rather than vocabulary structure (validated by near-zero correlation with raw activation clustering), an architecture providing positive control for modularity, and causal experiments demonstrating functional reorganization where different tasks route through different attention heads. This es tablishes a methodology for transforming interpretability from passive observation to active control.

URL PDF HTML ☆

赞 0 踩 0

2603.18018 2026-03-20 cs.CL cs.DB

An Agentic System for Schema Aware NL2SQL Generation

David Onyango, Naseef Mansoor

2603.18017 2026-03-20 cs.LG cs.CL

Frayed RoPE and Long Inputs: A Geometric Perspective

Davis Wertheimer, Aozhong Zhang, Derrick Liu, Penghang Yin, Naigang Wang

Comments Accepted by ICLR 2026

2603.18015 2026-03-20 cs.CL cs.AI

Beyond Accuracy: An Explainability-Driven Analysis of Harmful Content Detection

Trishita Dhara, Siddhesh Sheth

Comments This paper has been accepted at TrustNet 2026 (https://trustnetcon.in/). The final version will appear in Springer (LNNS), 2026

2603.18013 2026-03-20 cs.CL

Learned but Not Expressed: Capability-Expression Dissociation in Large Language Models

Toshiyuki Shigemura

Comments 12 pages, 3 figures

2603.18012 2026-03-20 cs.CL cs.AI cs.IR

DynaRAG: Bridging Static and Dynamic Knowledge in Retrieval-Augmented Generation

Penghao Liang, Mengwei Yuan, Jianan Liu, Jing Yang, Xianyou Li, Weiran Yan, Yichao Wu

2603.18011 2026-03-20 cs.CL cs.IR

Controllable Evidence Selection in Retrieval-Augmented Question Answering via Deterministic Utility Gating

Victor P. Unda

Comments 21 pages, 1 figures, 4 tables

2603.18010 2026-03-20 cs.CL cs.AI cs.CY

Agentic Framework for Political Biography Extraction

Yifei Zhu, Songpo Yang, Jiangnan Zhu, Junyan Jiang

Comments 70 pages, 14 figures