arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.21601 2026-03-24 cs.LG cs.AI

Riemannian Geometry Speaks Louder Than Words: From Graph Foundation Model to Next-Generation Graph Intelligence

Philip S. Yu, Li Sun

Comments 7 pages

详情

英文摘要

Graphs provide a natural description of the complex relationships among objects, and play a pivotal role in communications, transportation, social computing, the life sciences, etc. Currently, there is strong agreement that Graph Foundation Models (GFMs) are essential for advancing graph learning, yet considerable disagreement persists on how to build a powerful, general-purpose GFM analogous to Large Language Models (LLMs). Graph Neural Networks (GNNs) exhibit limitations in memory retention and principled interpretability when confronted with multi-domain pretraining and adaptation. The challenge of graph serialization hinders the direct application of LLMs, as the words struggle to capture the structural complexity and diversity inherent in graphs. In contrast, Riemannian geometry offers an elegant mathematical framework for modeling structures, while remaining compatible with graph semantic learning, even with LLMs. In this paper, we argue that, for graphs, Riemannian geometry speaks louder than words, and lay out the foundational principles for GFM. Reimagining with Riemannian geometry, we introduce a blue sky idea-Riemannian Foundation Model (RFM)-that opens a new pathway for capturing complex structural patterns and uncovering cross-domain generalities. RFM emphasizes intrinsic graph geometry and embodies endogenous capacities for structural inference and generation, moving beyond mere representation-space switching. Accordingly, we outline a progressive agenda that begins with universal structural understanding through intrinsic geometry, and then rebuilds LLM with a Riemannian engine for general-purpose graph modeling and beyond. Thus, RFM enables a paradigm shift from designing graph models to solving graph-structured applications with RFM agents, unlocking the next-generation graph intelligence.

URL PDF HTML ☆

赞 0 踩 0

2603.21596 2026-03-24 cs.LG cs.CR

In-network Attack Detection with Federated Deep Learning in IoT Networks: Real Implementation and Analysis

Devashish Chaudhary, Sutharshan Rajasegarar, Shiva Raj Pokhrel, Lei Pan, Ruby D

Comments This paper has been accepted at the IEEE Conference on Engineering Informatics 2025

2603.21584 2026-03-24 cs.LG cs.CV

SSAM: Singular Subspace Alignment for Merging Multimodal Large Language Models

Md Kaykobad Reza, Ameya Patil, Edward Ayrapetian, M. Salman Asif

Comments 25 Pages, 9 Figures, 5 Tables

2603.21580 2026-03-24 cs.RO cs.SY eess.SY

Conformal Koopman for Embedded Nonlinear Control with Statistical Robustness: Theory and Real-World Validation

Koki Hirano, Hiroyasu Tsukamoto

Comments 8 pages, 6 figures. Accepted to the 2026 IEEE International Conference on Robotics and Automation (ICRA). The final published version will be available via IEEE Xplore

2603.21577 2026-03-24 cs.AI

Mind over Space: Can Multimodal Large Language Models Mentally Navigate?

Qihui Zhu, Shouwei Ruan, Xiao Yang, Hao Jiang, Yao Huang, Shiji Zhao, Hanwei Fan, Hang Su, Xingxing Wei

2603.21574 2026-03-24 cs.AI

Adaptive Robust Estimator for Multi-Agent Reinforcement Learning

Zhongyi Li, Wan Tian, Jingyu Chen, Kangyao Huang, Huiming Zhang, Hui Yang, Tao Ren, Jinyang Jiang, Yijie Peng, Yikun Ban, Fuzhen Zhuang

2603.21573 2026-03-24 cs.CV

Rethinking Visual Privacy: A Compositional Privacy Risk Framework for Severity Assessment with VLMs

Efthymios Tsaprazlis, Tiantian Feng, Anil Ramakrishna, Sai Praneeth Karimireddy, Rahul Gupta, Shrikanth Narayanan

2603.21571 2026-03-24 cs.CL

DATASHI: A Parallel English-Tashlhiyt Corpus for Orthography Normalization and Low-Resource Language Processing

Nasser-Eddine Monir, Zakaria Baou

Comments This paper has been accepted for presentation at LREC 2026

2603.21567 2026-03-24 cs.LG

Kolmogorov Complexity Bounds for LLM Steganography and a Perplexity-Based Detection Proxy

Andrii Shportko

2603.21566 2026-03-24 cs.CV cs.AI cs.DB cs.LG cs.RO

CataractSAM-2: A Domain-Adapted Model for Anterior Segment Surgery Segmentation and Scalable Ground-Truth Annotation

Mohammad Eslami, Dhanvinkumar Ganeshkumar, Saber Kazeminasab, Michael G. Morley, Michael V. Boland, Michael M. Lin, John B. Miller, David S. Friedman, Nazlee Zebardast, Lucia Sobrin, Tobias Elze

2603.21565 2026-03-24 cs.CV cs.AI

Rethinking SAR ATR: A Target-Aware Frequency-Spatial Enhancement Framework with Noise-Resilient Knowledge Guidance

Yansong Lin, Zihan Cheng, Jielei Wang, Guoming Lua, Zongyong Cui

2603.21562 2026-03-24 cs.CV

Exploring Multimodal Prompts For Unsupervised Continuous Anomaly Detection

Mingle Zhou, Jiahui Liu, Jin Wan, Gang Li, Min Li

2603.21559 2026-03-24 cs.CV

Revisiting Weakly-Supervised Video Scene Graph Generation via Pair Affinity Learning

Minseok Kang, Minhyeok Lee, Minjung Kim, Jungho Lee, Donghyeong Kim, Sungmin Woo, Inseok Jeon, Sangyoun Lee

Comments 28 pages, 11 figures

2603.21557 2026-03-24 cs.CV

From Part to Whole: 3D Generative World Model with an Adaptive Structural Hierarchy

Bi'an Du, Daizong Liu, Pufan Li, Wei Hu

Comments Accepted to ICME 2026

2603.21547 2026-03-24 cs.CV

PROBE: Diagnosing Residual Concept Capacity in Erased Text-to-Video Diffusion Models

Yiwei Xie, Zheng Zhang, Ping Liu

Comments This preprint was posted after submission to IEEE Transactions

2603.21546 2026-03-24 cs.LG cs.AI

What Do World Models Learn in RL? Probing Latent Representations in Learned Environment Simulators

Xinyu Zhang

Comments 5 pages, 3 figures, 1 table

2603.21545 2026-03-24 cs.RO cs.SY eess.SY

Auction-Based Task Allocation with Energy-Conscientious Trajectory Optimization for AMR Fleets

Jiachen Li, Soovadeep Bakshi, Jian Chu, Shihao Li, Dongmei Chen

2603.21541 2026-03-24 cs.LG cs.AI

Sharper Generalization Bounds for Transformer

Yawen Li, Tao Hu, Zhouhui Lian, Wan Tian, Yijie Peng, Huiming Zhang, Zhongyi Li

2603.21534 2026-03-24 cs.LG cs.NA math.NA

Generalization Limits of In-Context Operator Networks for Higher-Order Partial Differential Equations

Jamie Mahowald, Tan Bui-Thanh

Comments 16 pages, 9 figures

2603.21529 2026-03-24 cs.CL

SynSym: A Synthetic Data Generation Framework for Psychiatric Symptom Identification

Migyeong Kang, Jihyun Kim, Hyolim Jeon, Sunwoo Hwang, Jihyun An, Yonghoon Kim, Haewoon Kwak, Jisun An, Jinyoung Han

详情

DOI: 10.1145/3770854.3785698 10.1145/3770854.3785698 10.1145/3770854.3785698

英文摘要

Psychiatric symptom identification on social media aims to infer fine-grained mental health symptoms from user-generated posts, allowing a detailed understanding of users' mental states. However, the construction of large-scale symptom-level datasets remains challenging due to the resource-intensive nature of expert labeling and the lack of standardized annotation guidelines, which in turn limits the generalizability of models to identify diverse symptom expressions from user-generated text. To address these issues, we propose SynSym, a synthetic data generation framework for constructing generalizable datasets for symptom identification. Leveraging large language models (LLMs), SynSym constructs high-quality training samples by (1) expanding each symptom into sub-concepts to enhance the diversity of generated expressions, (2) producing synthetic expressions that reflect psychiatric symptoms in diverse linguistic styles, and (3) composing realistic multi-symptom expressions, informed by clinical co-occurrence patterns. We validate SynSym on three benchmark datasets covering different styles of depressive symptom expression. Experimental results demonstrate that models trained solely on the synthetic data generated by SynSym perform comparably to those trained on real data, and benefit further from additional fine-tuning with real data. These findings underscore the potential of synthetic data as an alternative resource to real-world annotations in psychiatric symptom modeling, and SynSym serves as a practical framework for generating clinically relevant and realistic symptom expressions.

URL PDF HTML ☆

赞 0 踩 0

2603.21528 2026-03-24 cs.CV

PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation

Gensheng Pei, Xiruo Jiang, Xinhao Cai, Tao Chen, Yazhou Yao, Byeungwoo Jeon

Comments accepted by CVPR 2026

2603.21526 2026-03-24 cs.CV

VIGIL: Part-Grounded Structured Reasoning for Generalizable Deepfake Detection

Xinghan Li, Junhao Xu, Jingjing Chen

Comments Project Page: https://vigil.best

2603.21525 2026-03-24 cs.LG cs.AI

BOxCrete: A Bayesian Optimization Open-Source AI Model for Concrete Strength Forecasting and Mix Optimization

Bayezid Baten, M. Ayyan Iqbal, Sebastian Ament, Julius Kusuma, Nishant Garg

Comments Code and dataset are available at https://github.com/facebookresearch/SustainableConcrete

2603.21524 2026-03-24 cs.CL cs.AI

CatRAG: Functor-Guided Structural Debiasing with Retrieval Augmentation for Fair LLMs

Ravi Ranjan, Utkarsh Grover, Mayur Akewar, Xiaomin Lin, Agoritsa Polyzou

Comments 9 pages, 4 figures, and accepted in IJCNN 2026 (part of IEEE WCCI 2026)

2603.21523 2026-03-24 cs.RO cs.AI

SafePilot: A Framework for Assuring LLM-enabled Cyber-Physical Systems

Weizhe Xu, Mengyu Liu, Fanxin Kong

Comments 12 pages, 8 figures

2603.21520 2026-03-24 cs.CL

Generalizable Self-Evolving Memory for Automatic Prompt Optimization

Guanbao Liang, Yuanchen Bei, Sheng Zhou, Yuheng Qin, Huan Zhou, Bingxin Jia, Bin Li, Jiajun Bu

2603.21508 2026-03-24 cs.LG cs.AI cs.HC

Optimizing Feature Extraction for On-device Model Inference with User Behavior Sequences

Chen Gong, Zhenzhe Zheng, Yiliu Chen, Sheng Wang, Fan Wu, Guihai Chen

2603.21504 2026-03-24 cs.CV

Parameter-efficient Prompt Tuning and Hierarchical Textual Guidance for Few-shot Whole Slide Image Classification

Jayanie Bogahawatte, Sachith Seneviratne, Saman Halgamuge

Comments Accepted for publication at CVPR 2026 Workshop on Medical Reasoning with Vision Language Foundation Models (Med-Reasoner)

2603.21502 2026-03-24 cs.LG cs.AI

Quotient Geometry, Effective Curvature, and Implicit Bias in Simple Shallow Neural Networks

Hang-Cheng Dong, Pengcheng Cheng

2603.21496 2026-03-24 cs.RO cs.AI physics.optics

A Framework for Closed-Loop Robotic Assembly, Alignment and Self-Recovery of Precision Optical Systems

Seou Choi, Sachin Vaidya, Caio Silva, Shiekh Zia Uddin, Sajib Biswas Shuvo, Shrish Choudhary, Marin Soljačić