2605.29591
2026-05-29
cs.AI
Mind-Omni: A Unified Multi-Task Framework for Brain-Vision-Language Modeling via Discrete Diffusion
Yizhuo Lu, Changde Du, Qingyu Shi, Hang Chen, Jie Peng, Liuyun Jiang, Shuangchen Zhao, Huiguang He
Affiliations
NeuBCI Lab, State Key Laboratory of Brain Cognition and Brain-inspired Intelligence Technology, Institute of Automation, Chinese Academy of Sciences, Beijing, China
School of Future Technology, University of Chinese Academy of Sciences, Beijing, China
School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
Zhongguancun Academy, Beijing, China
Peking University, Beijing, China
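AI Summary
Proposes Mind-Omni, a framework that uses the discrete diffusion paradigm to unify seven encoding and decoding tasks. A brain tokenizer converts continuous brain signals into discrete tokens, enabling multimodal interaction, and the authors construct a brain question-answering instruction-tuning dataset. Across multiple tasks, the framework matches or exceeds the performance of specialized models.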