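arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and related fields.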

2604.00947 2026-04-02 cs.CL cond-mat.stat-mech stat.ML

Phase transition on a context-sensitive random language model with short range interactions

Yuma Toji, Jun Takahashi, Vwani Roychowdhury, Hideyuki Miyahara

Abstract

Since the random language model was proposed by E. DeGiuli [Phys. Rev. Lett. 122, 128301], language models have been investigated intensively from the viewpoint of statistical mechanics. Recently, the existence of a Berezinskii–Kosterlitz–Thouless transition was numerically demonstrated in models with long-range interactions between symbols. In statistical mechanics, it has long been known that long-range interactions can induce phase transitions. Therefore, it has remained unclear whether phase transitions observed in language models originate from genuinely linguistic properties that are absent in conventional spin models. In this study, we construct a random language model with short-range interactions and numerically investigate its statistical properties. Our model belongs to the class of context-sensitive grammars in the Chomsky hierarchy and allows explicit reference to contexts. We find that a phase transition occurs even when the model refers only to contexts whose length remains constant with respect to the sentence length. This result indicates that finite-temperature phase transitions in language models are genuinely induced by the intrinsic nature of language, rather than by long-range interactions.

2604.00942 2026-04-02 cs.LG cs.CR math.ST stat.TH

Differentially Private Manifold Denoising

Jiaqi Wu, Yiqing Sun, Zhigang Yao

Comments 59 pages

2604.00940 2026-04-02 cs.CV

YieldSAT: A Multimodal Benchmark Dataset for High-Resolution Crop Yield Prediction

Miro Miranda, Deepak Pathak, Patrick Helber, Benjamin Bischke, Hiba Najjar, Francisco Mena, Cristhian Sanchez, Akshay Pai, Diego Arenas, Matias Valdenegro-Toro, Marcela Charfuelan, Marlon Nuske, Andreas Dengel

2604.00938 2026-04-02 cs.LG cs.AI

WARP: Guaranteed Inner-Layer Repair of NLP Transformers

Hsin-Ling Hsu, Min-Yu Chen, Nai-Chia Chen, Yan-Ru Chen, Yi-Ling Chang, Fang Yu

2604.00933 2026-04-02 cs.CV

EmoScene: A Dual-space Dataset for Controllable Affective Image Generation

Li He, Longtai Zhang, Wenqiang Zhang, Yan Wang, Lizhe Qi

2604.00928 2026-04-02 cs.CV cs.GR

Autoregressive Appearance Prediction for 3D Gaussian Avatars

Michael Steiner, Zhang Chen, Alexander Richard, Vasu Agrawal, Markus Steinberger, Michael Zollhöfer

Comments Project Page: https://steimich96.github.io/AAP-3DGA/

2604.00927 2026-04-02 cs.CV cs.AI

Learning Quantised Structure-Preserving Motion Representations for Dance Fingerprinting

Arina Kharlamova, Bowei He, Chen Ma, Xue Liu

2604.00923 2026-04-02 cs.CL

Positional Cognitive Specialization: Where Do LLMs Learn To Comprehend and Speak Your Language?

Luis Frentzen Salim, Lun-Wei Ku, Hsing-Kuo Kenneth Pao

Comments Accepted to AAAI26 Main

2604.00921 2026-04-02 cs.CV cs.AI

Representation Selection via Cross-Model Agreement using Canonical Correlation Analysis

Dylan B. Lewis, Jens Gregor, Hector Santos-Villalobos

Comments 9 pages, 5 figures, 6 tables

2604.00920 2026-04-02 cs.CL

GPT-NL Public Corpus: A Permissively Licensed, Dutch-First Dataset for LLM Pre-training

Jesse van Oort, Frank Brinkkemper, Erik de Graaf, Bram Vanroy, Saskia Lensink

Comments Accepted at LREC 2026

2604.00918 2026-04-02 cs.LG

Generalization Bounds for Spectral GNNs via Fourier Domain Analysis

Vahan A. Martirosyan, Daniele Malitesta, Hugues Talbot, Jhony H. Giraldo, Fragkiskos D. Malliaros

Comments Accepted to AISTATS 2026

2604.00911 2026-04-02 cs.LG

Event Embedding of Protein Networks: Compositional Learning of Biological Function

Antonin Sulc

Comments Machine Learning for Genomics Explorations (MLGenX) ICLR 2026 Workshop

2604.00897 2026-04-02 cs.LG cs.CV

Super-Resolving Coarse-Resolution Weather Forecasts With Flow Matching

Aymeric Delefosse, Anastase Charantonis, Dominique Béréziat

Comments Accepted to Climate Informatics 2026

2604.00892 2026-04-02 cs.CL

When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web Navigation

Henry Peng Zou, Chunyu Miao, Wei-Chieh Huang, Yankai Chen, Yue Zhou, Hanrong Zhang, Yaozu Wu, Liancheng Fang, Zhengyao Gu, Zhen Zhang, Kening Zheng, Fangxin Wang, Yi Nian, Shanghao Li, Wenzhe Fan, Langzhou He, Weizhi Zhang, Xue Liu, Philip S. Yu

2604.00890 2026-04-02 cs.AI cs.CL cs.CV

Beyond Symbolic Solving: Multi Chain-of-Thought Voting for Geometric Reasoning in Large Language Models

Md. Abu Bakor Siddique, Shahrin Hossain, Sadman Ahmed Siam, Syed Rifat Raiyan, Hasan Mahmud, Md Kamrul Hasan

Comments Under review, 4 figures, 7 tables

2604.00886 2026-04-02 cs.CV cs.AI cs.CL

PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding

Nan Wang, Zhiwei Jin, Chen Chen, Haonan Lu

2604.00878 2026-04-02 cs.CL cs.AI cs.LG

KUET at StanceNakba Shared Task: StanceMoE: Mixture-of-Experts Architecture for Stance Detection

Abdullah Al Shafi, Md. Milon Islam, Sk. Imran Hossain, K. M. Azharul Hasan

Comments Accepted for workshop proceedings of the 15th International Conference on Language Resources and Evaluation (LREC'26)

2604.00867 2026-04-02 cs.CV

A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparoscopic Video

Maximilian Fehrentz, Nicolas Stellwag, Robert Wiebe, Nicole Thorisch, Fabian Grob, Patrick Remerscheid, Ken-Joel Simmoteit, Benjamin D. Killeen, Christian Heiliger, Nassir Navab

2604.00862 2026-04-02 cs.CV

Shape Representation using Gaussian Process mixture models

Panagiotis Sapoutzoglou, George Terzakis, Georgios Floros, Maria Pateraki

Comments To appear in ISPRS 2026

2604.00857 2026-04-02 cs.CV

Sparkle: A Robust and Versatile Representation for Point Cloud based Human Motion Capture

Yiming Ren, Yujing Sun, Aoru Xue, Kwok-Yan Lam, Yuexin Ma

Comments Accepted at ICLR 2026

2604.00854 2026-04-02 cs.CV

Perturb-and-Restore: Simulation-driven Structural Augmentation Framework for Imbalance Chromosomal Anomaly Detection

Yilan Zhang, Hanbiao Chen, Changchun Yang, Yuetan Chu, Siyuan Chen, Jing Wu, Jingdong Hu, Na Li, Junkai Su, Yuxuan Chen, Ao Xu, Xin Gao, Aihua Yin

Comments This preprint version of the manuscript has been submitted to the IEEE Journal of Biomedical and Health Informatics (JBHI) for review

2604.00853 2026-04-02 cs.CV

MotionGrounder: Grounded Multi-Object Motion Transfer via Diffusion Transformer

Samuel Teodoro, Yun Chen, Agus Gunawan, Soo Ye Kim, Jihyong Oh, Munchurl Kim

Comments Please visit our project page at https://kaist-viclab.github.io/motiongrounder-site/

2604.00852 2026-04-02 cs.RO

PanoAir: A Panoramic Visual-Inertial SLAM with Cross-Time Real-World UAV Dataset

Yiyang Wu, Xiaohu Zhang, Yanjin Du, Tongsu Zhang, Chujun Li, Siyang Chen, Guoyi Zhang, Xiangpeng Xu

2604.00849 2026-04-02 cs.CV

Disentangling to Re-couple: Resolving the Similarity-Controllability Paradox in Subject-Driven Text-to-Image Generation

Shuang Li, Chao Deng, Hang Chen, Liqun Liu, Zhenyu Hu, Te Cao, Mengge Xue, Yuan Chen, Peng Shu, Huan Yu, Jie Jiang

Comments Accepted by CVPR 2026 (Main)

2604.00842 2026-04-02 cs.AI cs.LG cs.MA

Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants

Deepak Nathani, Cheng Zhang, Chang Huan, Jiaming Shan, Yinfei Yang, Alkesh Patel, Zhe Gan, William Yang Wang, Michael Saxon, Xin Eric Wang

Comments 34 pages, 8 figures, 5 tables

2604.00835 2026-04-02 cs.CL

Agentic Tool Use in Large Language Models

Jinchao Hu, Meizhi Zhong, Kehai Chen, Xuefeng Bai, Min Zhang

2604.00827 2026-04-02 cs.CV

Video Patch Pruning: Efficient Video Instance Segmentation via Early Token Reduction

Patrick Glandorf, Thomas Norrenbrock, Bodo Rosenhahn

Comments CVPR'26 Workshops

2604.00821 2026-04-02 cs.LG

Optimal Brain Decomposition for Accurate LLM Low-Rank Approximation

Yuhang Li, Donghyun Lee, Ruokai Yin, Priyadarshini Panda

2604.00820 2026-04-02 cs.CV

Continual Vision-Language Learning for Remote Sensing: Benchmarking and Analysis

Xingxing Weng, Ruifeng Ni, Chao Pang, XiangYu Hao, Yishan Wang, Xiaokang Zhang, Wei Xu, Gui-Song Xia

Comments 23 pages, 7 figures, 9 tables

2604.00817 2026-04-02 cs.CV math.OC

Multicentric thrombus segmentation using an attention-based recurrent network with gradual modality dropout

Sofia Vargas-Ibarra, Vincent Vigneron, Hichem Maaref, Sonia Garcia-Salicetti