arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.21448 2026-03-24 cs.AI

Safety as Computation: Certified Answer Reuse via Capability Closure in Task-Oriented Dialogue

Cosimo Spera

详情

英文摘要

We introduce a new paradigm for task-oriented dialogue systems: safety certification as a computational primitive for answer reuse. Current systems treat each turn independently, recomputing answers via retrieval or generation even when they are already derivable from prior state. We show that in capability-based systems, the safety certification step computes a fixed-point closure cl(At) that already contains every answer reachable from the current configuration. We operationalize this insight with a Certified Answer Store (CAS) augmented by Pre-Answer Blocks (PAB): at each certified turn, the system materializes all derivable follow-up answers together with minimal provenance witnesses. Subsequent queries are answered in sub-millisecond time via formal containment checks, eliminating redundant retrieval and generation.

URL PDF HTML ☆

赞 0 踩 0

2603.21438 2026-03-24 cs.CL

PROMPT2BOX: Uncovering Entailment Structure among LLM Prompts

Neeladri Bhuiya, Shib Sankar Dasgupta, Andrew McCallum, Haw-Shiuan Chang

2603.21436 2026-03-24 cs.CV

PAS3R: Pose-Adaptive Streaming 3D Reconstruction for Long Video Sequences

Lanbo Xu, Liang Guo, Caigui Jiang, Cheng Wang

2603.21435 2026-03-24 cs.AI econ.GN q-fin.EC

Behavioural feasible set: Value alignment constraints on AI decision support

Taejin Park

2603.21432 2026-03-24 cs.CV

Image-Based Structural Analysis Using Computer Vision and LLMs: PhotoBeamSolver

Altamirano-Muñiz Emilio Fernando

Comments 10 pages

2603.21426 2026-03-24 cs.CV

Uncertainty-Aware Knowledge Distillation for Multimodal Large Language Models

Jingchen Sun, Shaobo Han, Deep Patel, Wataru Kohno, Can Jin, Changyou Chen

Comments Accepted to CVPR 2026

2603.21419 2026-03-24 cs.AI

Is the future of AI green? What can innovation diffusion models say about generative AI's environmental impact?

Robert Viseur, Nicolas Jullien

2603.21418 2026-03-24 cs.CL cs.AI cs.LG

Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs

Mariela M. Nina, Caio Veloso Costa, Lilian Berton, Didier A. Vega-Oliveros

Comments 10 pages, 2 figures, PROPOR 2026

2603.21416 2026-03-24 cs.SD

Enterprise Sales Copilot: Enabling Real-Time AI Support with Automatic Information Retrieval in Live Sales Calls

Jielin Qiu, Liangwei Yang, Ming Zhu, Wenting Zhao, Zhiwei Liu, Juntao Tan, Zixiang Chen, Roshan Ram, Akshara Prabhakar, Rithesh Murthy, Shelby Heinecke, Caiming Xiong, Silvio Savarese, Huan Wang

2603.21415 2026-03-24 cs.AI cs.CR cs.LG

Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Divergence Across Architectures

Gregory M. Ruddell

Comments 39 pages, 5 figures, 5 tables. Preprint. Submitted to NIST CAISI (Docket NIST-2025-0035, March 2026). Also available on Zenodo: https://doi.org/10.5281/zenodo.18971110

2603.21410 2026-03-24 cs.RO

Bayesian Active Object Recognition and 6D Pose Estimation from Multimodal Contact Sensing

Haodong Zheng, Gabriele M. Caddeo, Andrei C. Jalba, Wijnand A. IJsselsteijn, Lorenzo Natale, Raymond H. Cuijpers

2603.21404 2026-03-24 cs.CL

Multi-Perspective LLM Annotations for Valid Analyses in Subjective Tasks

Navya Mehrotra, Adam Visokay, Kristina Gligorić

2603.21399 2026-03-24 cs.AI

The Myhill-Nerode Theorem for Bounded Interaction: Canonical Abstractions via Agent-Bounded Indistinguishability

Anthony T. Nixon

Comments 43 pages, 4 figures, 23 tables. Code: https://github.com/alch3mistdev/finite-pomdp-abstraction (v0.1.1)

2603.21398 2026-03-24 cs.AI cs.GT

Persona Vectors in Games: Measuring and Steering Strategies via Activation Vectors

Johnathan Sun, Andrew Zhang

Comments 8 pages, 6 figures

2603.21393 2026-03-24 cs.LG stat.ML

A Generalised Exponentiated Gradient Approach to Enhance Fairness in Binary and Multi-class Classification Tasks

Maryam Boubekraoui, Giordano d'Aloisio, Antinisca Di Marco

2603.21389 2026-03-24 cs.CL cs.LG

Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models

Jinghan Cao, Yu Ma, Xinjin Li, Qingyang Ren, Xiangyun Chen

Comments Accepted for publication at ESANN 2025. This is a task-specific efficiency analysis comparing small language models

2603.21386 2026-03-24 cs.CV

Mitigating Objectness Bias and Region-to-Text Misalignment for Open-Vocabulary Panoptic Segmentation

Nikolay Kormushev, Josip Šarić, Matej Kristan

2603.21383 2026-03-24 cs.AI

PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost

Junkeun Yi, Damon Mosk-Aoyama, Baihe Huang, Ritu Gala, Charles Wang, Sugam Dipak Devare, Khushi Bhardwaj, Abhibha Gupta, Oleksii Kuchaiev, Jiantao Jiao, Jian Zhang, Venkat Srinivasan

Comments 22 pages, 5 figures, 6 tables

2603.21378 2026-03-24 cs.CV cs.AI physics.geo-ph

An InSAR Phase Unwrapping Framework for Large-scale and Complex Events

Yijia Song, Juliet Biggs, Alin Achim, Robert Popescu, Simon Orrego, Nantheera Anantrasirichai

2603.21377 2026-03-24 cs.CV cs.LG

HamVision: Hamiltonian Dynamics as Inductive Bias for Medical Image Analysis

Mohamed A Mabrok

2603.21375 2026-03-24 cs.LG stat.ML

Constrained Online Convex Optimization with Memory and Predictions

Mohammed Abdullah, George Iosifidis, Salah Eddine Elayoubi, Tijani Chahed

Comments accepted to AAAI 2026

2603.21368 2026-03-24 cs.CL

Conspiracy Frame: a Semiotically-Driven Approach for Conspiracy Theories Detection

Heidi Campana Piva, Shaina Ashraf, Maziar Kianimoghadam Jouneghani, Arianna Longo, Rossana Damiano, Lucie Flek, Marco Antonio Stranisci

2603.21366 2026-03-24 cs.CV

Relax Forcing: Relaxed KV-Memory for Consistent Long Video Generation

Zengqun Zhao, Yanzuo Lu, Ziquan Liu, Jifei Song, Jiankang Deng, Ioannis Patras

Comments Project page: see https://zengqunzhao.github.io/Relax-Forcing

详情

英文摘要

Autoregressive (AR) video diffusion has recently emerged as a promising paradigm for long video generation, enabling causal synthesis beyond the limits of bidirectional models. To address training-inference mismatch, a series of self-forcing strategies have been proposed to improve rollout stability by conditioning the model on its own predictions during training. While these approaches substantially mitigate exposure bias, extending generation to minute-scale horizons remains challenging due to progressive temporal degradation. In this work, we show that this limitation is not primarily caused by insufficient memory, but by how temporal memory is utilised during inference. Through empirical analysis, we find that increasing memory does not consistently improve long-horizon generation, and that the temporal placement of historical context significantly influences motion dynamics while leaving visual quality largely unchanged. These findings suggest that temporal memory should not be treated as a homogeneous buffer. Motivated by this insight, we introduce Relax Forcing, a structured temporal memory mechanism for AR diffusion. Instead of attending to the dense generated history, Relax Forcing decomposes temporal context into three functional roles: Sink for global stability, Tail for short-term continuity, and dynamically selected History for structural motion guidance, and selectively incorporates only the most relevant past information. This design mitigates error accumulation during extrapolation while preserving motion evolution. Experiments on VBench-Long demonstrate that Relax Forcing improves motion dynamics and overall temporal consistency while reducing attention overhead. Our results suggest that structured temporal memory is essential for scalable long video generation, complementing existing forcing-based training strategies.

URL PDF HTML ☆

赞 0 踩 0

2603.21365 2026-03-24 cs.LG cs.CL

TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference

Jaber Jaber, Osama Jaber

Comments 9 pages, 5 tables, 2 figures. Code: https://github.com/RightNow-AI/TIDE

2603.21359 2026-03-24 cs.CL cs.AI cs.CY

Benchmarking Bengali Dialectal Bias: A Multi-Stage Framework Integrating RAG-Based Translation and Human-Augmented RLAIF

K. M. Jubair Sami, Dipto Sumit, Ariyan Hossain, Farig Sadeque

Comments 12 pages, 1 figure, 5 tables

2603.21349 2026-03-24 cs.CV

Respiratory Status Detection with Video Transformers

Thomas Savage, Evan Madill

2603.21348 2026-03-24 cs.CV

Efficient Coarse-to-Fine Diffusion Models with Time Step Sequence Redistribution

Yu-Shan Tai, An-Yeu, Wu

2603.21344 2026-03-24 cs.AI

The AI Scientific Community: Agentic Virtual Lab Swarms

Ulisses Braga-Neto

2603.21341 2026-03-24 cs.AI

RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models

Dongyoung Kim, Sumin Park, Woomin Song, Seungku Kim, Taeyoung Kim, Huiwon Jang, Jinwoo Shin, Jaehyung Kim, Younggyo Seo

Comments 15 pages, 7 figures, 9 Tables

2603.21340 2026-03-24 cs.AI cs.DC

ARYA: A Physics-Constrained Composable & Deterministic World Model Architecture

Seth Dobrin, Lukasz Chmiel

详情

英文摘要

This paper presents ARYA, a composable, physics-constrained, deterministic world model architecture built on five foundational principles: nano models, composability, causal reasoning, determinism, and architectural AI safety. We demonstrate that ARYA satisfies all canonical world model requirements, including state representation, dynamic prediction, causal and physical awareness, temporal consistency, generalization, learnability, and planning and control. Unlike monolithic foundation models, the ARYA foundation model implements these capabilities through a hierarchical system-of-system-of-systems of specialized nano models, orchestrated by AARA (ARYA Autonomous Research Agent), an always-on cognitive daemon that executes a continuous sense-decide-act-learn loop. The nano model architecture provides linear scaling, sparse activation, selective untraining, and sub-20-second training cycles, resolving the traditional tension between capability and computational efficiency. A central contribution is the Unfireable Safety Kernel: an architecturally immutable safety boundary that cannot be disabled or circumvented by any system component, including its own self-improvement engine. This is not a social or ethical alignment statement; it is a technical framework ensuring human control persists as autonomy increases. Safety is an architectural constraint governing every operation, not a policy layer applied after the fact. We present formal alignment between ARYA's architecture and canonical world model requirements, and report summarizing its state-of-the-art performance across 6 of 9 competitive benchmarks head-to-head with GPT-5.2, Opus 4.6, and V-JEPA-2. All with zero neural network parameters, across seven active industry domain nodes spanning aerospace, pharma manufacturing, oil and gas, smart cities, biotech, defense, and medical devices.

URL PDF HTML ☆

赞 0 踩 0