arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2602.09477 2026-02-13 cs.CV

Weakly Supervised Contrastive Learning for Histopathology Patch Embeddings

Bodong Zhang, Xiwen Li, Hamid Manoochehri, Xiaoya Tang, Deepika Sirohi, Beatrice S. Knudsen, Tolga Tasdizen

2602.09349 2026-02-13 cs.LG

Large Language Models for Designing Participatory Budgeting Rules

Nguyen Thach, Xingchen Sha, Hau Chan

Comments Accepted as full paper to AAMAS 2026

2602.09316 2026-02-13 cs.LG

Effective MoE-based LLM Compression by Exploiting Heterogeneous Inter-Group Experts Routing Frequency and Information Density

Zhendong Mi, Yixiao Chen, Pu Zhao, Xiaodong Yu, Hao Wang, Yanzhi Wang, Shaoyi Huang

2602.09255 2026-02-13 cs.RO cs.AI

STaR: Scalable Task-Conditioned Retrieval for Long-Horizon Multimodal Robot Memory

Mingfeng Yuan, Hao Zhang, Mahan Mohammadi, Runhao Li, Jinjun Shan, Steven L. Waslander

2602.09070 2026-02-13 cs.SD cs.AI eess.AS

NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control

Yufan Wen, Zhaocheng Liu, YeGuo Hua, Ziyi Guo, Lihua Zhang, Chun Yuan, Jian Wu

2602.09013 2026-02-13 cs.RO cs.CV

Dexterous Manipulation Policies from RGB Human Videos via 3D Hand-Object Trajectory Reconstruction

Hongyi Chen, Tony Dong, Tiancheng Wu, Liquan Wang, Yash Jangir, Yaru Niu, Yufei Ye, Homanga Bharadhwaj, Zackory Erickson, Jeffrey Ichnowski

2602.08930 2026-02-13 cs.SD

No Word Left Behind: Mitigating Prefix Bias in Open-Vocabulary Keyword Spotting

Yi Liu, Chuan-Che Huang, Xiao Quan

Comments Published in ICASSP 2026

2602.08907 2026-02-13 cs.LG stat.ML

Positive Distribution Shift as a Framework for Understanding Tractable Learning

Marko Medvedev, Idan Attias, Elisabetta Cornacchia, Theodor Misiakiewicz, Gal Vardi, Nathan Srebro

Comments Added acknowledgments. Expanded the summary section

2602.08711 2026-02-13 cs.CV

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

Linli Yao, Yuancheng Wei, Yaojie Zhang, Lei Li, Xinlong Chen, Feifan Song, Ziyue Wang, Kun Ouyang, Yuanxin Liu, Lingpeng Kong, Qi Liu, Pengfei Wan, Kun Gai, Yuanxing Zhang, Xu Sun

2602.08520 2026-02-13 cs.AI cs.LG

Reinforcement Inference: Leveraging Uncertainty for Self-Correcting Language Model Reasoning

Xinhai Sun

2602.08517 2026-02-13 cs.AI cs.SE

TreeTensor: Boost AI System on Nested Data with Constrained Tree-Like Tensor

Shaoang Zhang, Yazhe Niu

2602.08322 2026-02-13 cs.CL

A Generative Model for Joint Multiple Intent Detection and Slot Filling

Liz Li, Wei Zhu

2602.08126 2026-02-13 cs.CV

MambaFusion: Adaptive State-Space Fusion for Multimodal 3D Object Detection

Venkatraman Narayanan, Bala Sai, Rahul Ahuja, Pratik Likhar, Varun Ravi Kumar, Senthil Yogamani

2602.07837 2026-02-13 cs.RO

RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI

Hongzhi Zang, Shu'ang Yu, Hao Lin, Tianxing Zhou, Zefang Huang, Zhen Guo, Xin Xu, Jiakai Zhou, Yuze Sheng, Shizhe Zhang, Feng Gao, Wenhao Tang, Yufeng Yue, Quanlu Zhang, Xinlei Chen, Chao Yu, Yu Wang

2602.07512 2026-02-13 cs.CV

Adaptive Image Zoom-in with Bounding Box Transformation for UAV Object Detection

Tao Wang, Chenyu Lin, Chenwei Tang, Jizhe Zhou, Deng Xiong, Jianan Li, Jian Zhao, Jiancheng Lv

Comments paper accepted by ISPRS Journal of Photogrammetry and Remote Sensing ( IF=12.2)

2602.07497 2026-02-13 cs.CL

From Native Memes to Global Moderation: Cross-Cultural Evaluation of Vision-Language Models for Hateful Meme Detection

Mo Wang, Kaixuan Ren, Pratik Jalan, Ahmed Ashraf, Tuong Vy Vu, Rahul Seetharaman, Shah Nawaz, Usman Naseem

Comments 12 pages, 5 figures, Proceedings of the ACM Web Conference 2026 (WWW '26)

2602.07432 2026-02-13 cs.AI cs.HC

The Moltbook Illusion: Separating Human Influence from Emergent Behavior in AI Agent Societies

Ning Li

2602.07011 2026-02-13 cs.CV cs.AI eess.IV

MAU-GPT: Enhancing Multi-type Industrial Anomaly Understanding via Anomaly-aware and Generalist Experts Adaptation

Zhuonan Wang, Zhenxuan Fan, Siwen Tan, Yu Zhong, Yuqian Yuan, Haoyuan Li, Hao Jiang, Wenqiao Zhang, Feifei Shao, Hongwei Wang, Jun Xiao

Comments 9 pages, 5 figures

2602.05148 2026-02-13 cs.LG cs.AI

CoSA: Compressed Sensing-Based Adaptation of Large Language Models

Songtao Wei, Yi Li, Bohan Zhang, Zhichun Guo, Ying Huang, Yuede Ji, Miao Yin, Guanpeng Li, Bingzhe Li

2602.05014 2026-02-13 cs.AI cs.CL cs.IR

DeepRead: Document Structure-Aware Reasoning to Enhance Agentic Search

Zhanli Li, Huiwen Tian, Lvzhou Luo, Yixuan Cao, Ping Luo

Comments This version has significantly enhanced the clarity of our research

2602.03563 2026-02-13 cs.CL

ACL: Aligned Contrastive Learning Improves BERT and Multi-exit BERT Fine-tuning

Liz Li, Wei Zhu

2602.03507 2026-02-13 cs.CL

FaithRL: Learning to Reason Faithfully through Step-Level Faithfulness Maximization

Runquan Gui, Yafu Li, Xiaoye Qu, Ziyan Liu, Yeqiu Cheng, Yu Cheng

2602.03368 2026-02-13 cs.CL

Pursuing Best Industrial Practices for Retrieval-Augmented Generation in the Medical Domain

Liz Li, Wei Zhu

2602.02182 2026-02-13 cs.CL

Evaluating Metalinguistic Knowledge in Large Language Models across the World's Languages

Tjaša Arčon, Matej Klemen, Marko Robnik-Šikonja, Kaja Dobrovoljc

详情

英文摘要

LLMs are routinely evaluated on language use, yet their explicit knowledge about linguistic structure remains poorly understood. Existing linguistic benchmarks focus on narrow phenomena, emphasize high-resource languages, and rarely test metalinguistic knowledge - explicit reasoning about language structure. We present a multilingual evaluation of metalinguistic knowledge in LLMs, based on the World Atlas of Language Structures (WALS), documenting 192 linguistic features across 2,660 languages. We convert WALS features into natural-language multiple-choice questions and evaluate models across documented languages. Using accuracy and macro F1, and comparing to chance and majority-class baselines, we assess performance and analyse variation across linguistic domains and language-related factors. Results show limited metalinguistic knowledge: GPT-4o performs best but achieves moderate accuracy (0.367), while open-source models lag. Although all models perform above chance, they fail to outperform the majority-class baseline, suggesting they capture broad cross-linguistic patterns but lack fine-grained distinctions. Performance varies by domain, partly reflecting differences in online visibility. At the language level, accuracy correlates with digital language status: languages with greater digital presence and resources are evaluated more accurately, while low-resource languages perform worse. Analysis of predictive factors confirms that resource-related indicators (Wikipedia size, corpus availability) are more informative than geographic, genealogical, or sociolinguistic factors. Overall, LLM metalinguistic knowledge appears fragmented and shaped mainly by data availability, rather than broadly generalizable grammatical competence. We release the benchmark as an open-source dataset to support evaluation across languages and encourage greater global linguistic diversity in future LLMs.

URL PDF HTML ☆

赞 0 踩 0

2602.01511 2026-02-13 cs.CL cs.LG

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

Ran Xu, Tianci Liu, Zihan Dong, Tony Yu, Ilgee Hong, Carl Yang, Linjun Zhang, Tao Zhao, Haoyu Wang

Comments The first two authors contributed equally

2602.00148 2026-02-13 cs.CV cs.AI

Learning Physics-Grounded 4D Dynamics with Neural Gaussian Force Fields

Shiqian Li, Ruihong Shen, Junfeng Ni, Chang Pan, Chi Zhang, Yixin Zhu

Comments 43 pages, ICLR 2026

2601.22548 2026-02-13 cs.CL cs.AI cs.LG

Are LLM Evaluators Really Narcissists? Sanity Checking Self-Preference Evaluations

Dani Roytburg, Matthew Bozoukov, Matthew Nguyen, Jou Barzdukas, Mackenzie Puig-Hall, Narmeen Oozeer

2601.22257 2026-02-13 cs.LG hep-th

Symmetry Breaking in Transformers for Efficient and Interpretable Training

Eva Silverstein, Daniel Kunin, Vasudev Shyam

Comments 22 pages, 3 figures

2601.17311 2026-02-13 cs.AI

Phase Transition for Budgeted Multi-Agent Synergy

Bang Liu, Linglong Kong, Jian Pei

Comments 55 pages, 12 figures

2601.03192 2026-02-13 cs.CL

MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory

Shengtao Zhang, Jiaqian Wang, Ruiwen Zhou, Junwei Liao, Yuchen Feng, Zhuo Li, Yujie Zheng, Weinan Zhang, Ying Wen, Zhiyu Li, Feiyu Xiong, Yutao Qi, Bo Tang, Muning Wen

Comments 41 pages, 11 figures

AI 大模型

视觉与机器人

科学与医疗

Weakly Supervised Contrastive Learning for Histopathology Patch Embeddings

Large Language Models for Designing Participatory Budgeting Rules

Effective MoE-based LLM Compression by Exploiting Heterogeneous Inter-Group Experts Routing Frequency and Information Density

STaR: Scalable Task-Conditioned Retrieval for Long-Horizon Multimodal Robot Memory

NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control

Dexterous Manipulation Policies from RGB Human Videos via 3D Hand-Object Trajectory Reconstruction

No Word Left Behind: Mitigating Prefix Bias in Open-Vocabulary Keyword Spotting

Positive Distribution Shift as a Framework for Understanding Tractable Learning

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

Reinforcement Inference: Leveraging Uncertainty for Self-Correcting Language Model Reasoning

TreeTensor: Boost AI System on Nested Data with Constrained Tree-Like Tensor

A Generative Model for Joint Multiple Intent Detection and Slot Filling

MambaFusion: Adaptive State-Space Fusion for Multimodal 3D Object Detection

RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI

Adaptive Image Zoom-in with Bounding Box Transformation for UAV Object Detection

From Native Memes to Global Moderation: Cross-Cultural Evaluation of Vision-Language Models for Hateful Meme Detection

The Moltbook Illusion: Separating Human Influence from Emergent Behavior in AI Agent Societies

MAU-GPT: Enhancing Multi-type Industrial Anomaly Understanding via Anomaly-aware and Generalist Experts Adaptation

CoSA: Compressed Sensing-Based Adaptation of Large Language Models

DeepRead: Document Structure-Aware Reasoning to Enhance Agentic Search

ACL: Aligned Contrastive Learning Improves BERT and Multi-exit BERT Fine-tuning

FaithRL: Learning to Reason Faithfully through Step-Level Faithfulness Maximization

Pursuing Best Industrial Practices for Retrieval-Augmented Generation in the Medical Domain

Evaluating Metalinguistic Knowledge in Large Language Models across the World's Languages

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

Learning Physics-Grounded 4D Dynamics with Neural Gaussian Force Fields

Are LLM Evaluators Really Narcissists? Sanity Checking Self-Preference Evaluations

Symmetry Breaking in Transformers for Efficient and Interpretable Training

Phase Transition for Budgeted Multi-Agent Synergy

MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory