arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.15586 2026-03-24 cs.AI cs.SY eess.SY

Computational Concept of the Psyche

Anton Kolonin, Vladimir Krykov

Comments 19 pages, 5 figures

详情

英文摘要

This article presents an overview of approaches to modeling the human psyche in the context of constructing an artificial one. Based on this overview, a concept of cognitive architecture is proposed, in which the psyche is viewed as the operating system of a living or artificial subject, comprising a space of states, including the state of needs that determine the meaning of a subject's being in relation to stimuli from the external world, and intelligence as a decision-making system regarding actions in this world to satisfy these needs. Based on this concept, a computational formalization is proposed for creating artificial general intelligence systems for an agent through experiential learning in a state space that includes agent's needs, taking into account their biological or existential significance for the intelligent agent, along with agent's sensations and actions. Thus, the problem of constructing artificial general intelligence is formalized as a system for making optimal decisions in the space of specific agent needs under conditions of uncertainty, maximizing success in achieving goals, minimizing existential risks, and maximizing energy efficiency. A minimal experimental implementation of the model is presented.

URL PDF HTML ☆

赞 0 踩 0

2603.15232 2026-03-24 cs.LG math.ST stat.ML stat.TH

Decomposing Probabilistic Scores: Reliability, Information Loss and Uncertainty

Arthur Charpentier, Agathe Fernandes Machado

2603.14846 2026-03-24 cs.LG cs.CC

Lost in Aggregation: On a Fundamental Expressivity Limit of Message-Passing Graph Neural Networks

Eran Rosenbluth

2603.14602 2026-03-24 cs.CL cs.AI cs.LG

PA3: Policy-Aware Agent Alignment through Chain-of-Thought

Shubhashis Roy Dipta, Daniel Bis, Kun Zhou, Lichao Wang, Benjamin Z. Yao, Chenlei Guo, Ruhi Sarikaya

2603.14469 2026-03-24 cs.RO cs.LG

Physics-Informed Policy Optimization via Analytic Dynamics Regularization

Namai Chandra, Liu Mohan, Zhihao Gu, Lin Wang

Comments 11 pages, 8 figures

2603.14217 2026-03-24 cs.CL

Rethinking Evaluation in Retrieval-Augmented Personalized Dialogue: A Cognitive and Linguistic Perspective

Tianyi Zhang, David Traum

2603.14188 2026-03-24 cs.CV

Joint Segmentation and Grading with Iterative Optimization for Multimodal Glaucoma Diagnosis

Zhiwei Wang, Yuxing Li, Meilu Zhu, Defeng He, Edmund Y. Lam

2603.13733 2026-03-24 cs.RO cs.AI cs.LG

Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control

Grayson Lee, Minh Bui, Shuzi Zhou, Yankai Li, Mo Chen, Ke Li

Comments Accepted to IEEE International Conference on Robotics and Automation (ICRA) 2026

2603.13275 2026-03-24 cs.LG cs.AI

PREBA: Surgical Duration Prediction via PCA-Weighted Retrieval-Augmented LLMs and Bayesian Averaging Aggregation

Wanyin Wu, Kanxue Li, Baosheng Yu, Haoyun Zhao, Yibing Zhan, Dapeng Tao, Hua Jin

Comments 13 pages

详情

英文摘要

Accurate prediction of surgical duration is pivotal for hospital resource management. Although recent supervised learning approaches-from machine learning (ML) to fine-tuned large language models (LLMs)-have shown strong performance, they remain constrained by the need for high-quality labeled data and computationally intensive training. In contrast, zero-shot LLM inference offers a promising training-free alternative but it lacks grounding in institution-specific clinical context (e.g., local demographics and case-mix distributions), making its predictions clinically misaligned and prone to instability. To address these limitations, we present PREBA, a retrieval-augmented framework that integrates PCA-weighted retrieval and Bayesian averaging aggregation to ground LLM predictions in institution-specific clinical evidence and statistical priors. The core of PREBA is to construct an evidence-based prompt for the LLM, comprising (1) the most clinically similar historical surgical cases and (2) clinical statistical priors. To achieve this, PREBA first encodes heterogeneous clinical features into a unified representation space enabling systematic retrieval. It then performs PCA-weighted retrieval to identify clinically relevant historical cases, which form the evidence context supplied to the LLM. Finally, PREBA applies Bayesian averaging to fuse multi-round LLM predictions with population-level statistical priors, yielding calibrated and clinically plausible duration estimates. We evaluate PREBA on two real-world clinical datasets using three state-of-the-art LLMs, including Qwen3, DeepSeek-R1, and HuatuoGPT-o1. PREBA significantly improves performance-for instance, reducing MAE by up to 40% and raising R^2 from -0.13 to 0.62 over zero-shot inference-and it achieves accuracy competitive with supervised ML methods, demonstrating strong effectiveness and generalization.

URL PDF HTML ☆

赞 0 踩 0

2603.12918 2026-03-24 cs.CV

VIRD: View-Invariant Representation through Dual-Axis Transformation for Cross-View Pose Estimation

Juhye Park, Wooju Lee, Dasol Hong, Changki Sung, Youngwoo Seo, Dongwan Kang, Hyun Myung

Comments Accepted to CVPR 2026

2603.12055 2026-03-24 cs.CV cs.LG

Continual Learning with Vision-Language Models via Semantic-Geometry Preservation

Chiyuan He, Zihuan Qiu, Fanman Meng, Runtong Zhang, Linfeng Xu, Qingbo Wu, Hongliang Li

Comments 14 pages, 11 figures, under review

2603.11721 2026-03-24 cs.AI

When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows

Wenxian Yang, Hanzheng Qiu, Bangqun Zhang, Chengquan Li, Zhiyong Huang, Xiaobin Feng, Rongshan Yu, Jiahong Dong

详情

英文摘要

Large language model (LLM) agents extend generative models with reasoning, tool use, and persistent memory, thereby enabling the automation of complex tasks. In healthcare, such systems could support documentation, care coordination, and clinical decision making. Their reliable deployment in hospitals, however, remains constrained by safety risks, limited transparency, and inadequate mechanisms for handling longitudinal clinical context. Here we propose an architecture that adapts LLM agents to hospital environments. The design comprises four components: a restricted execution environment inspired by multi-user operating systems, a document-centric interaction model linking patient and clinician agents, a page-indexed memory architecture for longitudinal context management, and a curated library of composable medical skills. Implemented on top of OpenClaw, an open-source agent orchestration framework, this design provides the basis for an Agentic Operating System for Hospitals: a computing layer for coordinating clinical workflows while preserving safety, transparency, and auditability. To evaluate the memory component, we introduce manifest-guided retrieval for hierarchical navigation of longitudinal patient records. In a benchmark derived from the MIMIC-IV dataset (v2.2) comprising 100 de-identified patient records and 300 clinical queries stratified across three difficulty tiers (100 per tier), manifest-guided retrieval matched a metadata-filtered RAG baseline on overall recall (0.877 versus 0.876) while achieving 2.2x higher precision (0.779 versus 0.352) and retrieving fewer documents; on tier-3 longitudinal queries, manifest recall was 21% higher (0.846 versus 0.701), confirming that LLM-guided hierarchical navigation is most valuable when queries span multiple care episodes. These results outline a practical path toward hospital-scale agentic infrastructure.

URL PDF HTML ☆

赞 0 踩 0

2603.11461 2026-03-24 cs.RO

CoViLLM: An Adaptive Human-Robot Collaborative Assembly Framework Using Large Language Models

Jiabao Zhao, Jonghan Lim, Hongliang Li, Ilya Kovalenko

Comments 6 pages, 7 figures. Accepted to ASME MSEC 2026

2603.11428 2026-03-24 cs.LG cs.AI

A Stable Neural Statistical Dependence Estimator for Autoencoder Feature Analysis

Bo Hu, Jose C Principe

2603.10878 2026-03-24 cs.RO

RL-Augmented MPC for Non-Gaited Legged and Hybrid Locomotion

Andrea Patrizi, Carlo Rizzardo, Arturo Laurenzi, Francesco Ruscelli, Luca Rossini, Nikos G. Tsagarakis

2603.10748 2026-03-24 cs.CV

Event-based Photometric Stereo via Rotating Illumination and Per-Pixel Learning

Hyunwoo Kim, Won-Hoe Kim, Sanghoon Lee, Jianfei Cai, Giljoo Nam, Jae-Sang Hyun

2603.10568 2026-03-24 cs.CV

UniStitch: Unifying Semantic and Geometric Features for Image Stitching

Yuan Mei, Lang Nie, Kang Liao, Yunqiu Xu, Chunyu Lin, Bin Xiao

Comments Project Page: http://mmelodyy.github.io/projects/unistitch

2603.10448 2026-03-24 cs.RO

DiT4DiT: Jointly Modeling Video Dynamics and Actions for Generalizable Robot Control

Teli Ma, Jia Zheng, Zifan Wang, Chunli Jiang, Andy Cui, Junwei Liang, Shuo Yang

Comments https://dit4dit.github.io/

2603.09313 2026-03-24 cs.AI

Curveball Steering: The Right Direction To Steer Isn't Always Linear

Shivam Raval, Hae Jin Song, Linlin Wu, Abir Harrasse, Jeff M. Phillips, Fazl Barez, Amirali Abdullah

2603.09051 2026-03-24 cs.RO

Cutting the Cord: System Architecture for Low-Cost, GPU-Accelerated Bimanual Mobile Manipulation

Artemis Shaw, Chen Liu, Justin Costa, Rane Gray, Alina Skowronek, Kevin Diaz, Nam Bui, Nikolaus Correll

2603.08536 2026-03-24 cs.CV

SWIFT: Sliding Window Reconstruction for Few-Shot Training-Free Generated Video Attribution

Chao Wang, Zijin Yang, Yaofei Wang, Yuang Qi, Weiming Zhang, Nenghai Yu, Kejiang Chen

Comments 8 pages. Accepted by CVPR 2026

2603.08104 2026-03-24 cs.LG

Invisible Safety Threat: Malicious Finetuning for LLM via Steganography

Guangnian Wan, Xinyin Ma, Gongfan Fang, Xinchao Wang

Comments Accepted at ICLR 2026

2603.06917 2026-03-24 cs.CV

PaQ-DETR: Learning Pattern and Quality-Aware Dynamic Queries for Object Detection

Zhengjian Kang, Jun Zhuang, Kangtong Mo, Qi Chen, Rui Liu, Ye Zhang

Comments 10 pages, 6 figures, Accepted at CVPR2026

2603.06887 2026-03-24 cs.RO

VertiAdaptor: Online Kinodynamics Adaptation for Vertically Challenging Terrain

Tong Xu, Chenhui Pan, Aniket Datar, Xuesu Xiao

2603.06866 2026-03-24 cs.RO

CAR: Cross-Vehicle Kinodynamics Adaptation via Mobility Representation

Tong Xu, Chenhui Pan, Xuesu Xiao

2603.03744 2026-03-24 cs.CV

DAGE: Dual-Stream Architecture for Efficient and Fine-Grained Geometry Estimation

Tuan Duc Ngo, Jiahui Huang, Seoung Wug Oh, Kevin Blackburn-Matzen, Evangelos Kalogerakis, Chuang Gan, Joon-Young Lee

Comments CVPR 2026. Project page: https://ngoductuanlhp.github.io/dage-site/

2603.03710 2026-03-24 cs.CV cs.AI

MPFlow: Multi-modal Posterior-Guided Flow Matching for Zero-Shot MRI Reconstruction

Seunghoi Kim, Chen Jin, Henry F. J. Tregidgo, Matteo Figini, Daniel C. Alexander

2603.03251 2026-03-24 cs.LG

Speculative Speculative Decoding

Tanishq Kumar, Tri Dao, Avner May

Comments ICLR 2026

2603.01498 2026-03-24 cs.CV

Tri-path DINO: Feature Complementary Learning for Remote Sensing Multi-Class Change Detection

Kai Zheng, Hang-Cheng Dong, Shoulei Liu, Zhenkai Wu, Fupeng Wei, Lei Ding, Wei Zhang

2603.01164 2026-03-24 cs.CV

FREE-Edit: Using Editing-aware Injection in Rectified Flow Models for Zero-shot Image-Driven Video Editing

Maomao Li, Yunfei Liu, Yu Li

Comments 13 pages