arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2085
2605.07492 2026-05-11 cs.CV

How Far Is Document Parsing from Solved? PureDocBench: A Source-TraceableBenchmark across Clean, Degraded, and Real-World Settings

Zhiheng Li, Zongyang Ma, Jiaxian Chen, Jianing Zhang, Zhaolong Su, Yutong Zhang, Zhiyin Yu, Ruiqi Liu, Xiaolei Lv, Bo Li, Jun Gao, Ziqi Zhang, Chunfeng Yuan, Bing Li, Weiming Hu

AI总结 尽管已有超过20个开源文档解析模型,但现有基准OmniDocBench存在标注质量不高和数据污染的问题,其排名可靠性受到质疑。为此,研究者提出了PureDocBench,一个可追溯来源的基准,涵盖10个领域、66个子类和1,475页文档,分别生成清晰、数字退化和真实退化三个版本,共计4,425张图像。实验表明,当前最佳模型的性能仅为74/100,模型间性能差距显著,且通用视觉语言模型在退化场景下表现更稳健,凸显了现有文档解析任务仍面临诸多挑战。

Comments 42 pages, 20 figures, 16 tables

详情
英文摘要

The past year has seen over 20 open-source document parsing models, yet thefield still benchmarks almost exclusively on OmniDocBench, a 1,355-pagemanually annotated dataset whose top scores have saturated above 90%. Athree-stage audit pipeline we run on OmniDocBench screens its 21,353evaluator-scored blocks and confirms 2,580 errors (12.08%); combined with overa year of public availability, both annotation quality and contamination riskcall its rankings into question. To address these issues, we presentPureDocBench, a programmatically generated, source-traceable benchmark thatrenders document images from HTML/CSS and produces verifiable annotations fromthe same source, covering 10 domains, 66 subcategories, and 1,475 pages, eachin three versions: clean, digitally degraded, and real-degraded (4,425 imagestotal). Evaluating 40 models spanning pipeline specialists, end-to-endspecialists, and general-purpose VLMs, we find: (i) document parsing is farfrom solved: the best model scores only ~74 out of 100, with a 44.6-point gapbetween the strongest and weakest models; (ii) specialist parsers with <=4Bparameters rival or surpass general VLMs that are 5-100x larger, yet formularecognition remains a shared bottleneck where no model exceeds 67% whenaveraging the formula metric across all three tracks; (iii) general VLMs loseonly 0.99/8.52 Overall points under digital/real degradation versus 4.90/14.21for pipeline specialists, producing ranking reversals that make clean-onlyevaluation misleading for deployment. All data, code, and artifacts arepublicly released.

2605.07491 2026-05-11 cs.CV

Implicit Multi-Camera System Calibration Using Gaussian Processes

Ivan De Boi, Bart Ribbens, Veronika Golanova, Ursula Kapov, Simon Verspeek

AI总结 本文提出了一种基于高斯过程(GP)回归的隐式多相机系统标定新框架。与传统依赖刚性数学模型的显式标定方法以及数据需求大且缺乏不确定性量化的神经网络方法不同,该方法直接学习所有相机图像坐标到三维世界坐标的复杂非线性映射,无需显式估计内在和外在参数。通过引入主动学习策略,进一步提升数据效率和实用性,使标定过程更加鲁棒、高效且可靠,特别适用于难以获取大量标定数据的实际场景。

详情
英文摘要

This paper proposes a novel framework for implicit multi-camera system calibration utilizing Gaussian Process (GP) regression. Conventional explicit calibration methods are constrained by rigid mathematical models and struggle with complex, non-linear distortions from unconventional optics, while existing neural network-based implicit approaches are typically data-hungry and lack inherent uncertainty quantification (UQ). Our GP-based model directly learns the complex, non-linear mapping from 2D image coordinates across all cameras to a 3D world coordinate, completely bypassing time-consuming estimation of explicit intrinsic and extrinsic parameters. Moreover, the inherent UQ is critical for transforming a simple 3D point prediction into a verifiable 3D measurement, complete with statistically-sound confidence bounds. To further enhance data efficiency and practical deployment, we integrate Active Learning (AL), which intelligently leverages the GP's predictive uncertainty to strategically guide the acquisition of new calibration data. This approach results in a robust, data-efficient, and reliable calibration solution, proving particularly effective in practical scenarios where collecting extensive calibration data is a dominant constraint. Our experiments show that the uncertainty for the 3D predictions is higher closer to the cameras. The data points in $uv$-coordinate space are more sparse in that region, even though they are not in 3D space. This work is relevant for anyone who is tasked with the calibration of complex multi-camera systems.

2605.07489 2026-05-11 cs.SD cs.MM eess.SP

A Decomposed Retrieval-Edit-Rerank Framework for Chord Generation

Qiqi He, Dichucheng Li, Xiaoheng Sun, Anqi Huang

AI总结 该论文提出了一种用于和弦生成的分解式检索-编辑-重排序(RER)框架,旨在解决在保持音乐理论可行性的同时提升风格多样性这一挑战。该方法将生成过程分解为三个明确阶段:检索候选和弦、编辑以确保理论可行性、重排序以优化偏好。通过这种结构化流程,系统实现了更高的可控性和可解释性,并在客观指标和主观评估中优于现有端到端方法。

Comments Accepted by the 2026 ACM International Conference on Multimedia Retrieval (ICMR 2026)

详情
英文摘要

Chord generation is an inherently constrained creative task that requires balancing stylistic diversity with music-theoretic feasibility. Existing approaches typically entangle candidate generation and constraint enforcement within a single model, making the diversity-feasibility trade-off difficult to control and interpret. In this work, we approach chord generation from a system-level perspective, introducing a Retrieval-Edit-Rerank (RER) framework that decomposes the task into three explicit stages: i) retrieval, which defines a stylistically plausible candidate space; ii) editing, which enforces music-theoretic feasibility through minimal modifications; and iii) reranking, which resolves soft preferences among feasible candidates. This separation provides a controllable pipeline, where each component addresses a distinct aspect of the generation process, thereby enhancing both the interpretability and adjustability of the output chords. Through objective metrics and subjective evaluation, our decomposed system outperforms all end-to-end chord generation baselines in balancing chord diversity and music-theoretic feasibility. Ablation studies further confirm the complementary roles of each stage in creative exploration and constraint satisfaction.

2605.07488 2026-05-11 cs.AI cs.LG

Efficient Data Selection for Multimodal Models via Incremental Optimization Utility

Jinhao Jing, Qiannian Zhao, Chao Huang, Zhan Su

AI总结 本文针对大型多模态模型(LMMs)在合成数据质量与数量之间的权衡问题,提出了一种高效的数据选择方法One-Step-Train(OST)。该方法通过增量优化效用排名问题重新定义数据选择过程,利用轻量代理模型模拟单步更新来估计每个样本的边际效用,从而避免了传统方法的高计算成本和可解释性不足的问题。实验表明,OST在多模态数学推理任务中实现了帕累托最优效率,大幅降低了训练成本并提升了模型性能。

详情
英文摘要

The scaling of Large Multimodal Models (LMMs) is constrained by the quality-quantity trade-off inherent in synthetic data. Previous approaches, such as LLM-as-a-Judge, have proven their effectiveness in addressing this but suffer from prohibitive computational costs and lack of interpretability. To bridge this gap, we propose One-Step-Train (OST), a framework that reformulates data selection as an incremental optimization utility ranking problem. Instead of relying on semantic heuristics, OST estimates the marginal utility of each sample via a simulated single-step update on a lightweight proxy. Experiments on the Qwen series across multimodal mathematical reasoning benchmarks demonstrate that OST achieves Pareto-optimal efficiency. By selecting the top-50 subset, OST reduces training costs by 43% (and total time consumption by 17) while surpassing the strong LLM-as-a-Judge baseline by 1.8 points. Furthermore, under a fixed compute budget, our method using only the top-20 subset achieves a 5.6 point gain over LLM-as-a-Judge, improves upon heuristic scoring baselines like DEITA, and outperforms the Full-SFT baseline by 8.8 points. Notably, while Full-SFT suffers from performance degradation due to noise, our optimization-grounded approach effectively identifies toxic samples, successfully reversing the negative transfer frequently observed in complex reasoning tasks.

2605.07485 2026-05-11 cs.LG cs.AI

Excluding the Target Domain Improves Extrapolation: Deconfounded Hierarchical Physics Constraints

Tsuyoshi Okita

AI总结 本文研究了如何提升物理约束深度生成模型在分布外条件下的外推能力。作者提出了一种去混淆分层门控机制(DHG),通过识别温度混淆对各层次物理约束的影响,分离出真正的物理不一致性,从而更有效地应用分层物理约束。实验表明,在预训练阶段排除目标域数据反而提升了外推性能,作者在锂离子电池温度预测任务中实现了比基线方法高46%的性能提升。

Comments 16 pages, 2 figures

详情
英文摘要

Extrapolation to out-of-distribution conditions is a fundamental challenge for physics-constrained deep generative models. Existing methods apply physical constraints as a single static regularization term uniformly across the generation process, and address neither the hierarchical structure of physical laws and the confounding variable problem. We propose the Deconfounded Hierarchical Gate (DHG), which serves as a diagnostic and control mechanism: it identifies when and how strongly temperature confounding contaminates each constraint level, so that hierarchical gates reflect intrinsic physical inconsistency rather than spurious temperature effects. DHG combines counterfactual estimation via the do-operator with backdoor adjustment to remove confounding, then applies Coarse-to-Fine physical constraints progressively. We report a counter-intuitive finding in pretraining: excluding the target-domain data from pretraining outperforms including it by 39% in extrapolation performance (RMSE 0.224 vs. 0.324). This occurs because FNO learns domain-agnostic physical patterns that transfer more effectively when the target domain is withheld. On a lithium-ion battery temperature extrapolation benchmark (trained at 24 degrees Celsius, evaluated at 4.0--43.0 degrees Celsius), our method achieves RMSE = 0.215, a 46% improvement over the unconstrained baseline (Pure CFM: 0.397).

2605.07478 2026-05-11 cs.CV

AudioFace: Language-Assisted Speech-Driven Facial Animation with Multimodal Language Models

Kai Zheng, Zejian Kang, Rui Mao, Hongyuan Zou, Yuanchen Fei, Xuanyang Xu, Xiangru Huang

AI总结 本文提出了一种名为AudioFace的语言辅助语音驱动面部动画框架,旨在解决语音信号与面部运动之间精确对应的问题,特别是与发音相关的口部动作。该方法通过引入多模态大语言模型的先验知识,结合语音转录和音素级别的语言线索,将口部相关面部参数的生成建模为由语言和发音信息引导的结构化生成过程。实验表明,AudioFace在多个评估指标上均表现出色,验证了语言辅助和多模态先验引导方法在语音驱动面部动画中的有效性。

详情
英文摘要

Speech-driven facial animation requires accurate correspondence between acoustic signals and facial motion, especially for articulation-related mouth movements. However, directly mapping speech audio to facial coefficients often overlooks the linguistic and phonetic structure underlying speech production. In this paper, we propose AudioFace, a language-assisted framework for speech-driven blendshape generation that treats mouth-related facial coefficient prediction as a structured generation problem guided by linguistic and articulatory information. Instead of relying solely on acoustic features, our method leverages the prior knowledge of multimodal large language models and introduces transcript- and phoneme-level cues to bridge speech signals with interpretable facial actions. Extensive experiments show that AudioFace achieves superior performance across multiple evaluation metrics, validating the effectiveness of language-assisted and multimodal-prior-guided speech-driven facial animation.

2605.07477 2026-05-11 cs.CV

ReasonEdit: Towards Interpretable Image Editing Evaluation via Reinforcement Learning

Honghua Chen, Zitong Xu, Huiyu Duan, Xinyun Zhang, Xiongkuo Min, Guangtao Zhai

AI总结 近年来,文本引导的图像编辑模型虽然取得了显著进展,但生成结果仍常存在伪影、非预期修改和审美不足等问题。为解决现有评估方法缺乏可解释性的问题,本文提出了ReasonEdit-22K数据集,包含22,000张编辑图像和113,000个链式推理样本,并配有130万个人类评估结果。基于该数据集,研究者构建了RE-Reward奖励模型和ReasonEdit评估模型,通过强化学习方法实现了对图像编辑可解释性的高效评估,实验表明该方法在对齐人类偏好和跨基准泛化能力方面表现优异。

详情
英文摘要

Recent text-guided image editing (TIE) models have achieved remarkable progress, however, many edited results still suffer from artifacts, unintended modifications, and suboptimal aesthetics. Although several benchmarks and evaluation methods have been proposed, most existing approaches rely on scalar scores and lack interpretability. This limitation largely stems from the absence of high-quality interpretation datasets for TIE and effective reward models to train interpretable evaluators. To address these challenges, we introduce ReasonEdit-22K, the first dataset that combines 22K edited images with 113K Chain-of-Thought (CoT) samples, along with 1.3M human judgments assessing these interpretations in terms of logicality, accuracy, and usefulness. Building upon this dataset, we propose RE-Reward, a multimodal large language model (MLLM)-based reward model designed to provide human-aligned feedback for evaluating interpretable reasoning in image editing. Furthermore, we develop ReasonEdit, which is trained using reward signals derived from RE-Reward and the Group Relative Policy Optimization (GRPO) algorithm to learn an interpretable evaluation model. Extensive experiments demonstrate that ReasonEdit achieves superior alignment with human preferences and exhibits strong generalization across public benchmarks. In addition, it is capable of generating high-quality interpretable evaluation text, enabling more transparent and trustworthy assessment for image editing. The code is available at https://github.com/IntMeGroup/ReasonEdit.

2605.07476 2026-05-11 cs.LG

NPMixer: Hierarchical Neighboring Patch Mixing for Time Series Forecasting

Jung Min Choi, Vijaya Krishna Yalavarthi, Lars Schmidt-Thieme

AI总结 多变量时间序列预测面临局部时间动态和多变量全局依赖关系的复杂性挑战。本文提出了一种分层结构的NPMixer模型,通过可学习的平稳小波变换对信号进行数据依赖的时频分解,并引入邻域混合块以捕捉局部时间模式和跨尺度依赖关系。实验表明,NPMixer在七个基准数据集上表现优异,在28个实验设置中的20个(71.4%)上优于现有先进方法。

详情
英文摘要

Multivariate time series forecasting remains a challenge due to the complexity of local temporal dynamics and global dependencies across multiple variables. In this paper, we propose \textbf{N}eighboring \textbf{P}atching \textbf{Mixer} (\textbf{NPMixer}), a hierarchical architecture featuring a Learnable Stationary Wavelet Transform that adaptively learns filter coefficients to decompose signals into trend and detail components in a data-dependent manner. Our framework introduces a Neighboring Mixer Block that captures local temporal dynamics through a series of hierarchical MLP layers operating on non-overlapping patches. Specifically, the mixer block utilizes MLPs to learn temporal patterns within and across these patches, expanding the receptive field to capture multi-scale dependencies. A Channel-Mixing Encoder is applied to high-frequency components to learn channel correlations while preserving the stability of the underlying global trend. Extensive experiments on seven benchmark datasets demonstrate that NPMixer consistently outperforms state-of-the-art models, achieving better performance in 20 out of 28 ($71.4\%$) evaluated experimental setups for MSE.

2605.07474 2026-05-11 cs.CV cs.AI

ForgeVLA: Federated Vision-Language-Action Learning without Language Annotations

Yuhao Zhou, Yunpeng Zhu, Yang Zhou, Jindi Lyu, Jian Lan, Zhangyuan Wang, Dan Si, Thomas Seidl, Qing Ye, Jiancheng Lyu

AI总结 本文提出了一种名为ForgeVLA的联邦视觉-语言-动作学习框架,旨在在无需语言标注和中央数据聚合的情况下,利用分布式视觉-动作对训练通用机器人智能模型。每个客户端通过内嵌的指令分类器将视觉-动作对映射到预定义指令集,从而恢复缺失的语言模态,形成完整的三元组。为了解决联邦VLA中常被忽视的视觉-语言特征坍缩问题,ForgeVLA结合了客户端对比规划损失和服务器端自适应聚合策略,有效提升了模型的表示能力。实验表明,ForgeVLA在多个基准上显著优于现有方法。

Comments 26 pages

详情
英文摘要

Vision-Language-Action (VLA) models hold great promise for general-purpose robotic intelligence, yet scaling up such models is severely bottlenecked by the high cost of acquiring annotated training data. Fortunately, vision-equipped robots deployed across various domains already produce abundant vision-action pairs that can be leveraged to scale up VLA training more efficiently. However, these raw data cannot be centrally aggregated due to various constraints and also exhibit severe heterogeneity. To address these challenges, in this paper, we propose ForgeVLA, a federated VLA training framework that learns VLA models from distributed vision-action pairs without centralizing raw data or requiring manual annotations. Specifically, each client in ForgeVLA is equipped with an embodied instruction classifier that maps vision-action pairs to a predefined instruction set, recovering the missing language modality and forming complete vision-language-action triplets. Beyond triplet construction, we also identify vision-language feature collapse as a critical challenge that has been largely overlooked in prior federated VLA research. To mitigate this issue, ForgeVLA combines a client-side contrastive planning loss with a server-side adaptive aggregation strategy to learn task-discriminative representations efficiently. Extensive experiments across multiple benchmarks show that ForgeVLA significantly outperforms other baselines, and ablation studies further validate the contribution of each component.

2605.07471 2026-05-11 cs.LG hep-ex

Transfer Learning Across Fast- and Full-Simulation Domains in High-Energy Physics

Matthias Schott, Lucie Flek

AI总结 本文研究了在高能物理实验中,如何在快速模拟与全模拟数据之间进行迁移学习。作者在真实的LHC环境中,系统性地探讨了迁移学习方法,并应用于信号-背景分类、夸克-胶子喷注识别和缺失横向能量重建等任务,使用了密集神经网络、图神经网络和基于Transformer的架构。结果表明,基于快速模拟预训练的模型在全模拟数据上表现优于从头训练的模型,且所需的目标域训练数据量可减少约一半,展示了快速模拟在学习鲁棒可复用表示方面的潜力。

Comments 16 pages, 8 figures

详情
英文摘要

Machine-learning models in high-energy physics are often trained on simulated data, where fully simulated samples are computationally expensive while fast simulation provides large statistics at reduced realism. In this work, we systematically study transfer learning between fast-simulated and fully simulated datasets in a realistic LHC environment. We consider three representative tasks, signal-background classification, quark-gluon jet tagging, and missing transverse energy reconstruction, using dense neural networks, graph neural networks, and transformer-based architectures. Models are pretrained on ATLAS-like fast simulation and adapted to CMS-like fast simulation and to fully simulated ATLAS Open Data. Across all tasks, pretrained models consistently outperform independently trained baselines and require significantly less target-domain training data, typically reducing the needed statistics by about a factor of two. These results demonstrate that fast simulation can be used to learn robust, reusable representations and motivate publishing trained models as reusable scientific assets beyond large foundation models.

2605.07470 2026-05-11 cs.LG hep-ex

Uncovering Hidden Systematics in Neural Network Models for High Energy Physics

Lucie Flek, Philipp Alexander Jungs, Akbar Karimi, Timo Saala, Alexander Schmid, Matthias Schott, Philipp Soldin, Christopher Wiebusch, Ulrich Willemsen

AI总结 本文研究了高能物理分析中神经网络模型对输入变量细微变化的隐藏系统性敏感性问题。作者受对抗攻击研究的启发,发现即使输入变量的分布保持不变,神经网络的输出仍可能因实验不确定性的微小扰动而发生显著变化。为此,他们提出了一种量化框架,用于探测和测量神经网络在真实实验条件下的系统性不确定性,为高能物理分析中模型不确定性的评估和控制提供了实用方法。

Comments 18 pages, 9 figures

详情
英文摘要

Neural networks (NNs) are inherently multidimensional classifiers that learn complex, non-linear relationships among input observables. While their flexibility enables unprecedented performance in high-energy physics (HEP) analyses, it also makes them sensitive to small variations in their inputs. Consequently, the propagation and estimation of systematic uncertainties in NN-based models remain an open challenge. There are indications that uncertainties derived in control regions or from nominal variations of input features can underestimate the true model uncertainty, potentially leaving biases unaccounted for. Inspired by insights from adversarial-attack studies in machine learning, we explore how subtle perturbations, fully consistent with the experimental uncertainties on the input observables, can lead to substantial changes in NN outputs, while keeping the one-dimensional and correlated input distributions nearly unchanged. Using a set of representative HEP tasks, including event classification and object identification, and testing across a variety of network architectures, we demonstrate that networks can be systematically "fooled" at significant rates within the allowed uncertainty envelopes. Building on this observation, we introduce a quantitative framework to probe and measure the hidden sensitivity of neural networks to realistic experimental variations, providing a practical path to evaluate and control their systematic uncertainty in physics analyses.

2605.07467 2026-05-11 cs.LG cs.AI cs.ET

Physical Simulators as Do-Operators: Causal Discovery under Latent Confounders for AI-for-Science

Tsuyoshi Okita

AI总结 该研究针对AI-for-Science领域中普遍存在的潜在混杂因素问题,提出了一种新的因果发现方法CFM-SD,利用第一性原理的物理模拟器作为干预操作符,有效处理真实干预数据和潜在混杂因素。理论上,该方法仅需$O(d)$次单变量干预即可识别$d$变量因果结构,实验表明其在合成数据和真实科学数据上的表现显著优于现有方法,展现出在分子毒性预测和电池电解液优化等实际任务中的应用价值。

Comments 17 pages, 1 figure

详情
英文摘要

Existing interventional causal discovery methods -- IGSP, DCDI, ENCO -- assume causal sufficiency (no latent confounders) and rely on virtual interventions in synthetic simulators. In AI-for-Science settings such as molecular design and materials science, latent confounders are ubiquitous and real interventions (e.g., physics-based simulations) require hours to days per data point. We propose CFM-SD (Causal Flow Matching with Simulation Data), which uses first-principles physical simulators as do-operators in Pearl's interventional calculus to simultaneously handle latent confounders and real interventional data. Theoretically, $d$-variable causal structure is identifiable with $O(d)$ single-variable interventions -- the minimum under physical realizability constraints. In Intrinsic Evaluation on synthetic data ($γ=0.2$--$0.8$), CFM-SD achieves average F1$=0.800$ vs. F1$=0.127$--$0.562$ for all baselines. In Extrinsic Evaluation on real scientific data, CFM-SD achieves 57--58\% bias reduction in molecular toxicity prediction and battery electrolyte optimization, demonstrating practical value beyond synthetic benchmarks.

2605.07466 2026-05-11 cs.CV

A Unified Framework for the Detection and Classification of Fatty Pancreas in Ultrasound Images

Ioan-Tudor-Alexandru Anghel, Ciprian-Mihai Ceausescu, Elena Dana Nedelcu, Elena Raluca Stirban, Camelia Croitoru, Despina Ungureanu, Ana Maria Palan, Gabriela Pop

AI总结 本文提出了一种统一的端到端框架,用于从腹部超声图像中自动检测和分类脂肪性胰腺。该方法基于TransUNet架构,结合ResNet编码器和Transformer瓶颈模块进行胰腺和脾静脉的分割,随后通过解剖引导的图像块提取和纹理对比实现分类,模拟了临床判断过程。实验在包含214例超声图像的临床数据集上验证,结果显示该方法在无监督条件下仍能有效捕捉临床信号,分类准确率和F1值均优于基线方法,为脂肪性胰腺的自动化诊断提供了新的解决方案。

详情
英文摘要

Non-alcoholic fatty pancreas disease (NAFPD) is an underdiagnosed condition associated with metabolic syndrome, insulin resistance, and increased risk of pancreatic cancer. Diagnosis typically relies on subjective visual assessment of ultrasound images by clinicians. We propose an end-to-end framework for automatically classifying normal versus fatty pancreas from abdominal ultrasound images. Our method employs a TransUNet-based segmentation architecture with a ResNet encoder and transformer bottleneck to delineate the pancreas and the splenic vein, followed by anatomically-guided patch extraction and patient-level classification through pairwise texture comparison. The feature engineering mimics clinical reasoning by comparing the echogenicity of peri-venous fat to the pancreatic parenchyma, providing an interpretable signal for classification. The segmentation models are initialized via domain-specific transfer learning from a liver segmentation task. We validate the full pipeline on a clinical dataset of 214 abdominal ultrasound images with 107 expert-labeled cases using 5-fold cross-validation. SVM with RBF kernel achieves a mean cross-validated accuracy of 89.7\%\,$\pm$\,1.8\% and F1 of 0.898\,$\pm$\,0.019, while the unsupervised K-Means baseline reaches 87.8\% accuracy, demonstrating that the proposed features capture the relevant clinical signal even without labeled training data. To our knowledge, this is the first end-to-end automated framework for fatty pancreas classification from ultrasound using segmentation-guided texture analysis.

2605.07465 2026-05-11 cs.CL

SEIF: Self-Evolving Reinforcement Learning for Instruction Following

Qingyu Ren, Qianyu He, Jiajie Zhu, Xingzhou Chen, Jingwen Chang, Zeye Sun, Han Xia, Fei Yu, Jiaqing Liang, Yanghua Xiao

AI总结 本文提出了一种名为SEIF的自进化强化学习框架,旨在提升大语言模型的指令遵循能力。该方法通过构建一个包含生成指令、过滤指令、遵循指令和评判奖励的闭环系统,使指令难度与模型能力相互促进、共同进化。实验表明,SEIF在多种模型规模和架构上均能有效提升指令遵循性能,具有良好的通用性,并揭示了在开放任务中实现自进化训练的有效策略。

详情
英文摘要

Instruction following is a fundamental capability of large language models (LLMs), yet continuously improving this capability remains challenging. Existing methods typically rely either on costly external supervision from humans or strong teacher models, or on self-play training with static-difficulty instructions that cannot evolve as the model's capabilities improve. To address these limitations, we propose SEIF (Self-Evolving Reinforcement Learning for Instruction Following), a self-evolving framework for enhancing the instruction-following ability of LLMs. SEIF forms a closed self-evolution loop that improves the model's instruction-following ability, where instruction difficulty evolution and model capability evolution reinforce each other. SEIF consists of four roles: an Instructor that generates increasingly challenging instructions, a Filter that removes conflicting or invalid instructions to ensure data quality, a Follower that learns to follow evolved instructions, and a Judger that provides reward signals for reinforcement learning. The Instructor and Follower are alternately trained and co-evolve throughout the process. Experiments across multiple model scales and architectures show that SEIF consistently improves instruction-following performance, suggesting strong generality. Further analyses reveal the sources of improvement and identify an effective training strategy for self-evolution on open-ended tasks: sufficient early-stage training to build a solid foundation, followed by moderate late-stage training to mitigate overfitting and achieve better final performance. The code and data are publicly available at https://github.com/Rainier-rq1/SEIF.

2605.07463 2026-05-11 cs.LG

Approximation Error Upper and Lower Bounds for Hölder Class with Transformers

Xin He, Yuling Jiao, Xiliang Lu, Jerry Zhijian Yang

AI总结 本文研究了Transformer模型在逼近Hölder类函数时的表达能力,给出了精确的上界和下界误差分析。通过引入Softmax操作符、ReLU激活函数和残差连接,推导出标准Transformer架构的逼近上界,并证明在给定精度下,仅需$\mathcal{O}(\varepsilon^{-{d_{0}}/α})$个块即可逼近任意有界Hölder函数。同时,利用VC维上界首次严格证明了Transformer实现$\varepsilon$精度所需的块数下界为$Ω(\varepsilon^{-{d_{0}}/({4α})})$,并进一步将结果推广到一般回归任务,展示了Transformer在实际应用中的有效性。

Comments 31 pages, 2 figures. Accepted by ICML2026

详情
英文摘要

We explore the expressive power of Transformers by establishing precise approximation error upper and lower bounds for Hölder class. Specifically, a new approximation upper bound is derived for the standard Transformer architecture equipped with Softmax operators, ReLU activation functions, and residual connections. We prove that a Transformer network composed of at most $\mathcal{O}(\varepsilon^{-{d_{0}}/α})$ blocks can approximate any bounded Hölder function with $d_{0}$-dimensional input and smoothness $α\in(0,1]$ under any accuracy $\varepsilon>0$. In the case of approximation lower bounds, leveraging the VC-dimension upper bound, we are the first to rigorously prove that Transformers demand for at least $Ω(\varepsilon^{-{d_{0}}/({4α})})$ blocks to achieve the $\varepsilon$ approximation accuracy. As a final step, we extend the derived results for standard Transformers to a general regression task and establish the corresponding excess risk rates demonstrating Transformers' empirical effectiveness in real-world settings.

2605.07462 2026-05-11 cs.CL cs.AI

The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment

William Brach, Federico Torrielli, Stine Lyngsø Beltoft, Annemette Brok Pirchert, Peter Schneider-Kamp, Lukas Galke Poech

AI总结 《The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment》研究了一个类似Reddit的平台Moltbook上OpenClaw智能体的大规模发帖、评论和投票行为,分析其社区结构、语言特征、情感倾向及交互模式,并通过去隐私化处理后构建了一个包含23万篇帖子和220万条评论的数据集。研究还探讨了该数据对语言模型的影响,发现基于Moltbook微调的模型在真实性指标上有所下降,但与同规模的Reddit数据集效果相近,表明Moltbook可能更多是“无害的混乱”而非严重风险。研究强调了在评估新兴对齐问题时,控制基线的重要性。

详情
英文摘要

Moltbook is a Reddit-like platform where OpenClaw agents post, comment, and vote at scale - a so far unprecedented incident that comes with serious safety concerns. With the aim of studying emergent behavior in populations, we release the Moltbook Files, a dataset of 232k posts and 2.2M comments covering the platform's first 12 days, processed through a pipeline to identify and remove Personally-Identifiable Information (PII). We analyze community structure, authorship, lexical properties, sentiment, topics, semantic geometry, and comment interaction. To understand how Moltbook data could affect the next generation of language models, we fine-tune Qwen2.5-14B-Instruct on Moltbook Files with three adaptation levels. Our PII pipeline reveals that agents post API keys, passwords, BIP39 seed phrases on Moltbook, a publicly indexed platform. The overall sentiment is mostly neutral and mildly positive (66.6% neutral, 19.5% positive) and shows a tendency for self-referential linking. We find that fine-tuning on Moltbook data reduces truthfulness from 0.366 to 0.187. However, a model fine-tuned on a size-matched Reddit dataset produces a comparable decrease. Moltbook thus seems to be more of a harmless slopocalypse. However, tail risks remain, including agent affordances, contamination of future crawls through self-links, and potential transfer of traits to the next generation of language models. More broadly, our findings highlight the importance of control baselines in emergent misalignment evaluations.

2605.07461 2026-05-11 cs.CL

Think-with-Rubrics: From External Evaluator to Internal Reasoning Guidance

Jiachen Yu, Zhihao Xu, Junjie Wang, Yujiu Yang

AI总结 该研究提出了一种名为“Think-with-Rubrics”的新范式,旨在将评分标准(rubrics)从传统的外部评估工具转变为指导大语言模型生成过程的内部推理依据。通过在训练过程中让模型依次生成评分标准并根据其生成回答,同时利用评分验证器进行联合监督,该方法有效提升了模型的生成质量与一致性。实验表明,该方法在多个基准任务上优于基于黄金评分标准的奖励机制,平均提升了3.87个点。

详情
英文摘要

Rubrics have been extensively utilized for evaluating unverifiable, open-ended tasks, with recent research incorporating them into reward systems for reinforcement learning. However, existing frameworks typically treat rubrics only as external evaluator disjointed from the policy's primary reasoning trace. Such design confines rubrics to post-hoc measurement, leaving them unable to actively guide the model's generation process. In this work, we introduce Think-with-Rubrics, a novel paradigm for instruction following tasks. Think-with-Rubrics integrates rubric generation into the reasoning context, transforming the rubric from an independent artifact into an internal guidance of LLM's generation. During training, LLM sequentially generates a rubric followed by a response, while a trained rubric verifier provides joint supervision by evaluating the consistency between the answer and the self-generated / golden rubrics. Experiments across multiple benchmarks demonstrate that Think-with-Rubrics consistently outperforms the Rubric-as-Reward baseline supervised by golden rubrics by an average of 3.87 points. We have also discussed the mechanism by which Think-with-Rubrics enhances model performance. Experimental results demonstrate that supervision from golden rubrics and self-generated rubrics enhances the performance of Think-with-Rubrics by improving the quality of self-generated rubrics and increasing the internal consistency of responses respectively.

2605.07460 2026-05-11 cs.LG hep-ex

Learning Minimal-Deviation Corrections for Multi-Dimensional Mismodelling in HEP Simulations

Matthias Schott, Lucie Flek

AI总结 在高能物理实验中,精确的蒙特卡洛模拟面临多维建模误差的挑战,而实验数据通常仅提供一维分布信息。本文提出了一种基于神经网络的方法,在仅利用一维目标分布的情况下,学习对模拟事件进行最小偏差修正,从而在保持原始模拟全局相关性的同时,修正多维建模误差。该方法在控制实验中表现出对目标分布的良好拟合能力,为高维复杂分析提供了一种高效且可扩展的修正方案。

Comments 12 pages, 6 figures

详情
英文摘要

Accurate Monte Carlo (MC) modelling in high-energy physics is challenging, particularly in complex scenarios where simulations fail to reproduce observed data. In practice, experimental information is often limited to one-dimensional (1D) distributions, while mismodelling arises in a multidimensional feature space. This restricts traditional correction methods, as one-dimensional reweighting ignores correlations and fully multidimensional approaches require large target datasets. We propose a neural network-based method that operates under these constraints by learning a transformation of simulated events that reproduces the available 1D target distributions while remaining close to the original simulation. This minimal-deviation principle preserves the global correlation structure of the baseline model while enabling targeted corrections of mismodelled features. Using controlled studies with simulated pseudo-data, we show that the method improves agreement with target distributions and maintains a consistent multidimensional structure. The approach is designed for complex, high-dimensional analyses where traditional techniques are insufficient, providing a scalable way to enhance MC modelling under limited information.

2605.07458 2026-05-11 cs.LG

Estimation of Motor Unit Parameters from Surface Electromyograms using an Informed Autoencoder

Kaja Balzereit, Malte Mechtenberg, Axel Schneider

AI总结 该研究旨在从非侵入式表面肌电信号中同时估计多个个体特异的运动单元参数,如支配区中心和电位传导速度,以提升神经机械模型的预测精度。为解决这一非线性逆问题,作者提出了一种结合物理规律的有监督自编码器,能够在重构表面肌电信号的同时学习参数特征。实验表明,该方法在合成数据上实现了较高的参数估计精度,展示了其在减少人工建模工作量方面的潜力。

详情
英文摘要

Motor unit parameters such as the innervation zone centre or the conduction velocity of the electrical potential harbour the potential to improve the fidelity of neuromechanical models used for movement and force prediction. Determining these parameters in a non-invasive way is challenging, as they are subject-specific and may vary with muscle contraction. Existing work on the estimation of motor unit parameters mainly relies on white-box modelling and therefore requires substantial manual modelling effort. This work targets the simultaneous estimation of multiple subject-specific motor unit parameters from electromyography (EMG) recordings measured non-invasively at the skin surface. This results in an inverse problem with a nonlinear loss function. To address this problem, an informed autoencoder is developed. This autoencoder reconstructs the surface EMG recordings while learning the parameters in its latent space and adhering to physical laws that relate the parameters to the EMG signals. In experiments on synthetic data, innervation zone centres are estimated with a mean absolute error of 2.5989 $\mathrm{mm}$, and conduction velocities of the electric potential are estimated with a mean absolute error of 0.1697 $\mathrm{m}\mathrm{s}^{-1}$. These results demonstrate the plausibility of this novel approach, which enables the simultaneous estimation of several motor unit parameters while reducing manual modelling effort through the integration of data-driven machine learning.

2605.07457 2026-05-11 cs.CV

EditRefiner: A Human-Aligned Agentic Framework for Image Editing Refinement

Zitong Xu, Huiyu Duan, Yifei Nie, Mingda Du, Sijing Wu, Xiongkuo Min, Tianyi Zheng, Jian Zhang, Shusong Xu, Jinwei Chen, Bo Li, Guangtao Zhai

AI总结 本文提出了一种名为EditRefiner的人机对齐智能框架,用于解决文本引导图像编辑中的细粒度问题,如物体不自然、光照不匹配等。该方法基于一个包含15,000张图像和大量标注信息的EditFHF-15K数据集,构建了一个分层的感知-推理-行动-评估循环系统,实现了对编辑结果的精准诊断与局部修正。实验表明,EditRefiner在定位失真、诊断准确性和人类感知一致性方面均优于现有方法,为自纠正、感知可靠的图像编辑提供了新范式。

详情
英文摘要

Recent text-guided image editing (TIE) models have made remarkable progress, yet edited images still frequently suffer from fine-grained issues such as unnatural objects, lighting mismatch, and unexpected changes. Existing refinement approaches either rely on costly iterative regeneration or employ vision-language models (VLMs) with weak spatial grounding, often resulting in semantic drift and unreliable local corrections. To address these limitations, we first construct EditFHF-15K, a dataset of fine-grained human feedback for edited images, comprising (1) 15K images from 12 TIE models spanning 43 editing tasks, (2) 60K annotated artifact regions and 80K editing failure regions, each accompanied by textual reasoning, and (3) 45K mean opinion scores (MOSs) assessing perceptual quality, instruction following, and visual consistency. Based on EditFHF-15K, we propose EditRefiner, a hierarchical, interpretable, and human-aligned agentic framework that reformulates post-editing correction as a human-like perception-reasoning-action-evaluation loop. Specifically, we introduce: (1) a perception agent that detects contextual saliency maps of artifacts and editing failures, (2) a reasoning agent that interprets these perceptual cues to perform human-aligned diagnostic inference, (3) an action agent that uses the reasoning output to plan and execute localized re-editing, and (4) an evaluation agent that assesses the re-edited image and guides the action agent on whether further refinements are required. Extensive experiments demonstrate that EditRefiner consistently outperforms state-of-the-art methods in distortion localization, diagnose accuracy and human perception alignment, establishing a new paradigm for self-corrective and perceptually reliable image editing. The code is available at https://github.com/IntMeGroup/EditRefiner.

2605.07456 2026-05-11 cs.LG

Inference-Time Attribute Distribution Alignment for Unconditional Diffusion

Hao Luan, See-Kiong Ng, Chun Kai Ling

AI总结 本文研究了无条件扩散模型在推理阶段生成可控样本以满足特定属性分布的问题。为解决现有方法在群体属性分布对齐方面的不足,作者将问题建模为对反向扩散过程的最优控制问题,并通过添加时间依赖的扰动作为控制变量进行优化。实验表明,该方法无需重新训练模型即可在图像生成任务中更有效地实现多样且灵活的属性分布对齐。

Comments Preprint. 35 pages, 13 figures

详情
英文摘要

Inference-time controllable generation is essential for real-world applications of unconditional diffusion models. However, most existing techniques focus on individual samples, struggling in applications that require the sample population to follow specific attribute distributions (e.g., demographic balance or semantic proportions). We formalize this setting as the inference-time attribute distributional alignment problem for pretrained unconditional diffusion models. To address this, we cast inference-time attribute distributional alignment as an optimal control problem over the reverse diffusion process, viewing the process as the rollout of a dynamical system and augmenting it with additive, time-dependent perturbations as control. We solve for the perturbations using an optimal-control-based algorithm to optimize a differentiable distribution-matching objective while penalizing control effort to preserve data fidelity. Experiment results in image generation demonstrate that our proposed plug-and-play approach can better align attribute distributions to diverse and flexible test-time targets compared to baselines, without retraining or finetuning the pretrained diffusion model.

2605.07455 2026-05-11 cs.CV

EditTransfer++: Toward Faithful and Efficient Visual-Prompt-Guided Image Editing

Lan Chen, Qi Mao, Yiren Song, Yuchao Gu, Siwei Ma

AI总结 EditTransfer++ 是一种面向视觉提示引导图像编辑的方法,旨在通过示例对直接学习图像变换,实现比纯文本驱动方法更精确和可控的编辑效果。该方法通过解耦文本条件训练、引入对比优化机制以及条件压缩策略,有效提升了视觉提示的忠实度和推理效率,尤其在高分辨率图像编辑任务中表现出色。实验表明,EditTransfer++ 在多个基准测试中取得了最先进的性能,显著优于现有方法。

详情
英文摘要

Visual-prompt-guided edit transfer aims to learn image transformations directly from example pairs, offering more precise and controllable editing than purely text-driven approaches. However, existing diffusion transformer-based methods often fail to faithfully reproduce the demonstrated edits due to structural mismatches between the task and the backbone, including a pretrained bias toward textual conditioning and inherent stochastic instability during sampling. To bridge this gap, we present EditTransfer++, a framework that combines progressively structured training with an efficient conditioning scheme to improve both visual prompt faithfulness and inference efficiency. We first mitigate textual dominance with a text-decoupled training strategy that removes text conditioning during fine-tuning, compelling the model to infer transformations solely from visual evidence while still supporting optional text guidance at inference. On top of this visually grounded model, a best-worst contrastive refinement mechanism reshapes the denoising trajectories to suppress unfaithful generations and improve consistency across random seeds. To alleviate the computational bottleneck of high-resolution in-context editing, we further introduce a condition compression and reuse strategy that reduces token redundancy and enables efficient generation of images with a 1024-pixel long edge. Extensive experiments on existing benchmarks and the proposed EditTransfer-Bench show that EditTransfer++ achieves state-of-the-art visual prompt faithfulness with substantially faster inference than prior methods, suggesting a promising direction for scalable prompt-guided image editing and broader visual in-context learning.

2605.07454 2026-05-11 cs.CL

GRaSp: Automatic Example Optimization for In-Context Learning in Low-Data Tasks

Simen Bihaug-Frøyland, Henrik Brådland

AI总结 GRaSp 是一种用于低数据任务中上下文学习的自动示例优化框架,旨在提升大语言模型在领域特定任务中的表现。该方法采用三阶段策略,通过生成合成示例池、聚类结构化处理以及遗传算法优化,有效提升了命名实体识别任务的效果。研究还引入了一种自适应变异机制,增强了进化过程中的多样性控制,并在金融实体识别任务中验证了其优越性,显著优于零样本和随机少样本基线方法。

Comments 12 pages, 5 figures

详情
英文摘要

In-context learning enables large language models to adapt to new tasks, but their performance is highly sensitive to the selected examples. Finding effective demonstrations is particularly difficult in domain-specific, low-data settings where high-quality examples are scarce. We propose GRaSp, a three-stage framework for automatic in-context example optimization. By first generating a large synthetic candidate pool, then structuring it with clustering and dimensionality reduction, and finally using genetic algorithms to find the optimal in-context examples, the framework shows consistent improvements on the NER task. We also introduce a custom diversity-adaptive mutation mechanism, allowing it to transition from the initial broad inter-cluster exploration to focused intra-cluster refinement as the population converges. We evaluate GRaSp on financial named entity recognition (FiNER-139), comparing synthetic and human-annotated candidate pools across pool sizes of 500 and 5000. With non-synthetic data, GRaSp achieves 45.84% micro-F1, consistently outperforming both zero-shot and random few-shot baselines. Synthetic data matches the random baseline but does not exceed it, suggesting that distributional variety in the candidate pool is critical for generalization.

2605.07453 2026-05-11 cs.CL

Data Contamination in Neural Hieroglyphic Translation: A Reproducibility Study

Ammar Toutou, Abdelrahman Harb, Christine Basta

AI总结 本文研究了神经机器翻译在处理濒危语言——埃及圣书体文字到德语翻译任务中的数据污染问题。作者复现了先前使用M2M-100模型取得的61.5 BLEU分数,但发现实际仅能达到37.0 BLEU,差距源于测试集中2%的目标句子在训练数据中重复出现。研究进一步表明,数据污染显著高估了模型性能,并提出通过目标级去重的方法来缓解问题,最终发布了去污染后的测试集并提供了更准确的性能基准。

Comments Accepted to NLP4DH 2026 Conference

详情
英文摘要

Ancient and endangered languages pose a unique challenge for NLP: their datasets are inherently scarce, difficult to expand, and built from formulaic corpora -- making data-quality issues especially consequential yet rarely audited. Motivated by the need to understand what current NMT can realistically achieve for such languages, we investigate hieroglyphic-to-German translation, where a recent study reported 61.5 BLEU using fine-tuned M2M-100. Our reproduction yields only 37.0 BLEU with the released model. Investigating this gap, we find 2\% of test targets appear identically in training (16/50; 50\% under 8-gram overlap at 70\% threshold). This contamination inflates scores dramatically: contaminated samples achieve up to 83.8 BLEU / 0.924 COMET-22 versus 30.9--39.2 BLEU / 0.622--0.676 COMET-22 on clean samples across five model configurations spanning two architectures. Document-level decontamination reduces contaminated BLEU by only 4.6 points because 8/16 targets persist via other source documents -- target-level deduplication is required. We release a decontaminated 34-sample test set and establish corrected baselines (30.9--39.2 BLEU), providing a realistic assessment of NMT capability for this endangered writing system.

2605.07452 2026-05-11 cs.AI

Bounded Fitting for Expressive Description Logics

Maurice Funk, Jean Christoph Jung, Tom Voellmer

AI总结 本文研究了在扩展的描述逻辑中使用有界拟合方法学习概念的问题,该逻辑包含逆角色、限定数量限制和特征比较等复杂构造。作者分析了在这些扩展条件下有界拟合方法能否保持其理论优势,并基于SAT求解器实现了该方法。实验结果表明,该方法在实际应用中表现良好,是一种有效的表达性概念学习方案。

Comments 16 pages, full version of paper accepted at IJCAI-ECAI 2026

详情
英文摘要

Bounded fitting is an attractive paradigm for learning logical formulas from labeled data examples that offers PAC-style generalization guarantees and can often be implemented leveraging SAT solvers. It has been successfully applied to learning concepts of the description logic ALC. We study bounded fitting for learning concepts in expressive description logics that extend ALC with inverse roles, qualified number restrictions, and feature comparisons. We investigate under which conditions bounded fitting keeps its favorable theoretical properties in this setting, and implement it using a SAT solver. We compare our tool with state-of-the-art concept learners with encouraging results, demonstrating that it is a practical approach to expressive concept learning.

2605.07451 2026-05-11 cs.LG

VNN-LIB 2.0: Rigorous Foundations for Neural Network Verification

Ann Roy, Allen Antony, Andrea Gimelli, Matthew L. Daggitt

AI总结 本文介绍了 VNN-LIB 2.0,旨在为神经网络验证提供更严谨的理论基础。针对原版 VNN-LIB 在语法、语义和类型系统方面的不足,作者提出了“网络理论”的概念,抽象地定义了神经网络模型格式所需的最小语义接口,从而实现了与 ONNX 等模型表示的兼容性与独立性。基于这一理论,论文构建了更表达力强的查询语言形式语法、类型系统和形式语义,并通过 Agda 定理证明器进行机械化验证,为可信的神经网络验证提供了坚实基础。

详情
英文摘要

Neural network verification is an active and rapidly maturing research area, with a growing ecosystem of solvers and tools. The VNN-LIB standard was introduced to support interoperability in this ecosystem, but Version~1.0 has several serious short-comings as a formal foundation: it lacks a precise syntax, semantics, and type system, offers limited expressivity, and relies on externally defined ONNX models whose semantics are informal and constantly evolving. The latter distinguishes VNN-LIB from established standards such as SMT-LIB, where queries are self-contained and have fixed semantics. In this paper we address these challenges by developing the theoretical foundations of VNN-LIB~2.0. Our key contribution is the introduction of the notion of a \emph{network theory}, which abstractly characterises the minimal semantic interface required from a neural network model format. This abstraction enables VNN-LIB to be defined independently of any specific ONNX version while remaining compatible with evolving model representations. Building on this foundation, we present a formal syntax for a more expressive query language, a type system for it over the numeric domains provided by the network theory, and finally a formal semantics. To ensure internal consistency, the standard is mechanised in the Agda theorem prover. VNN-LIB~2.0 therefore provides robust and rigorous foundations for trustworthy neural network verification.

2605.07447 2026-05-11 cs.CV cs.AI cs.CL cs.LG

Sparse Autoencoders as Plug-and-Play Firewalls for Adversarial Attack Detection in VLMs

Hao Wang, Yiqun Sun, Pengfei Wei, Lawrence B. Hsieh, Daisuke Kawahara

AI总结 该研究提出了一种基于稀疏自编码器(SAE)的轻量级对抗攻击检测框架SAEgis,用于提升视觉语言模型(VLM)的安全性。通过将SAE模块嵌入预训练的VLM中,并利用标准重构目标进行训练,所学的稀疏潜在特征能够自然捕捉与攻击相关的信号,从而有效识别输入图像是否受到对抗性扰动。实验表明,SAEgis在领域内、跨领域和跨攻击场景下均表现出色,尤其在跨领域泛化能力方面显著优于现有方法,且无需额外对抗训练,具有较高的实用价值。

详情
英文摘要

Vision-language models (VLMs) have advanced rapidly and are increasingly deployed in real-world applications, especially with the rise of agent-based systems. However, their safety has received relatively limited attention. Even the latest proprietary and open-weight VLMs remain highly vulnerable to adversarial attacks, leaving downstream applications exposed to significant risks. In this work, we propose a novel and lightweight adversarial attack detection framework based on sparse autoencoders (SAEs), termed SAEgis. By inserting an SAE module into a pretrained VLM and training it with standard reconstruction objectives, we find that the learned sparse latent features naturally capture attack-relevant signals. These features enable reliable classification of whether an input image has been adversarially perturbed, even for previously unseen samples. Extensive experiments show that SAEgis achieves strong performance across in-domain, cross-domain, and cross-attack settings, with particularly large improvements in cross-domain generalization compared to existing baselines. In addition, combining signals from multiple layers further improves robustness and stability. To the best of our knowledge, this is the first work to explore SAE as a plug-and-play mechanism for adversarial attack detection in VLMs. Our method requires no additional adversarial training, introduces minimal overhead, and provides a practical approach for improving the safety of real-world VLM systems.

2605.07446 2026-05-11 cs.CL cs.LG

SSP-based construction of evaluation-annotated data for fine-grained aspect-based sentiment analysis

Suwon Choi, Shinwoo Kim, Changhoe Hwang, Gwanghoon Yoo, Eric Laporte, Jeesun Nam

AI总结 本文介绍了一种基于符号传播(SSP)的半自动标注方法,用于构建细粒度方面级情感分析的评价标注语料库EVAD,专门用于电商评论中的情感和非情感语言模式分析。研究通过构建有限状态转换器(FST)等语言资源,扩展了传统ABSA方法,引入了方面值的概念并根据其类型(单一、二元或多值)进行分类,从而更准确地提取目标特征。实验表明,基于该语料库训练的KoBERT和KcBERT模型在方面-值对识别任务中取得了较高的F1分数(分别为0.88和0.90)。

详情
Journal ref
29th International Conference on Computational Linguistics (COLING). Workshop on Pattern-based Approaches to NLP in the Age of Deep Learning (Pan-DL), Oct 2022, Gyeongju, South Korea, pp.38-44
英文摘要

We report the construction of a Korean evaluation-annotated corpus, hereafter called 'Evaluation Annotated Dataset (EVAD)', and its use in Aspect-Based Sentiment Analysis (ABSA) extended in order to cover e-commerce reviews containing sentiment and non-sentiment linguistic patterns. The annotation process uses Semi-Automatic Symbolic Propagation (SSP). We built extensive linguistic resources formalized as a Finite-State Transducer (FST) to annotate corpora with detailed ABSA components in the fashion e-commerce domain. The ABSA approach is extended, in order to analyze user opinions more accurately and extract more detailed features of targets, by including aspect values in addition to topics and aspects, and by classifying aspectvalue pairs depending whether values are unary, binary, or multiple. For evaluation, the KoBERT and KcBERT models are trained on the annotated dataset, showing robust performances of F1 0.88 and F1 0.90, respectively, on recognition of aspect-value pairs.

2605.07442 2026-05-11 cs.LG

GameGen-Verifier: Parallel Keypoint-Based Verification for LLM-Generated Games via Runtime State Injection

Chaobo Jia, Ruipeng Wan, Ting Sun, Weihao Tan, Borui Wan, Yuxuan Tong, Guangming Sheng, Hong Xu

AI总结 本文提出了一种名为GameGen-Verifier的自动化验证框架,用于验证基于大语言模型生成的游戏是否符合自然语言规范。该方法通过将游戏规范分解为可验证的关键点,并将其转化为独立的验证单元,在运行时注入目标状态并执行有限交互以判断是否符合规范。实验表明,该方法在准确性上显著优于现有方法,同时大幅减少了验证所需的时间。

详情
英文摘要

LLM-based game generation promises to turn natural-language specifications into executable games, but progress is limited by the lack of reliable automated verification. Unlike conventional code generation, game correctness is defined over long-horizon interaction: a game may appear correct while violating core mechanics such as state updates, interaction rules, and phase transitions. Existing Agent-as-a-Verifier approaches collapse verification into open-ended gameplay, making verdicts reachability-bound, time-consuming, coverage-limited, and sensitive to the agent's gameplay ability. We present GameGen-Verifier, an automated verification paradigm for LLM-generated games that decomposes a specification into verifiable keypoints and grounds them into independent verification units. Each unit patches the game runtime into a concrete target state, executes a bounded interaction, and judges the outcome against the keypoint assertion. We implement GGV-Harness, a scalable agentic harness providing concurrency management, runtime isolation, and fault recovery. On VeriGame, our dataset of 100 games across seven genres, GameGen-Verifier achieves up to 92.2% accuracy against human judgments versus 58.8% for the coverage-enforced Agent-as-a-Verifier baseline, while reducing wall-clock time by up to 16.6x.

2605.07432 2026-05-11 cs.CL cs.LG

Generating training datasets for legal chatbots in Korean

Changhoe Hwang, Jee-Sun Nam, Eric Laporte

AI总结 本研究旨在解决法律聊天机器人训练数据多样性与标注成本高的问题,提出了一种基于本地语法图(LGG)的语言资源生成方法,能够同时生成大量对话文本及其高质量标签。该方法通过结合领域特定的分类体系,有效提升了数据的标注效率与质量。研究实现了韩国法律聊天机器人LIGA,其在处理用户法律咨询时能够准确匹配相关案例,实验表明所训练的模型在F1分数上达到了91%。

详情
Journal ref
International conference on Law and Society, Feb 2023, Hanoi, Vietnam. pp.1-4
英文摘要

Chatbots are robots that can communicate with humans using text or voice signals. Legal chatbots improve access to justice, since legal representation and legal advice by lawyers come with a high cost that excludes disadvantaged and vulnerable people. However, capturing the diversity of actual user input in datasets for deep-learning dialog systems (chatbots) is a technical challenge. Diversity requires large volumes of data, which must also be labelled in order to classify the user's intent, while the cost of labelling datasets increases with volume. Instead of labelling large volumes of authentic data from users, our approach consists in jointly generating large volumes of utterances and high-quality labels. The generator of labelled datasets is based on language resources that take the form of local grammar graphs (LGG), which capture and generalize the vocabulary and local syntax observed by linguists in text. The LGGs associate labels to the utterances according to a domain-specific classification system. We tested this approach by implementing LIGA, a legal chatbot in Korean. The chatbot answers users' conversational queries on legal situations by providing information on similar legal cases, made publicly available by the Korean government. We generated labelled utterances from the LGGs with the aid of the open-source Unitex platform. This process produced 700 million utterances. We trained a DIET classifier on a dataset made of these utterances, and the trained model reached 91% f1-score performance. We implemented a chatbot called LIGA, which uses the results of the model to select a link to a web page that documents similar legal cases.