arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2605.13434 2026-05-14 cs.LG cs.DC math.OC stat.ML

Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity

Ammar Mahran, Artavazd Maranjyan, Peter Richtárik

AI总结本文研究了在数据和系统异构环境下分布式学习中的异步随机梯度下降（ASGD）方法。传统ASGD因未考虑不同工作节点的计算速度差异，导致模型更新偏向于局部目标的频率加权平均，而非全局目标。本文提出了一种名为Rescaled ASGD的新方法，通过按各节点计算时间比例调整步长，使得每个节点在周期内对模型的总学习率贡献相同，从而恢复对全局目标的正确优化。理论分析表明，该方法在非凸设置下能够收敛到全局目标的平稳点，且时间复杂度达到已知下界，实验验证了其有效性与先进性。

2605.13402 2026-05-14 cs.CV cs.DS

Fast and Compact Graph Cuts for the Boykov-Kolmogorov Algorithm

Christian Møller Mikkelstrup, Anders Bjorholm Dahl, Philip Bille, Vedrana Andersen Dahl, Inge Li Gørtz

AI总结本文研究了Boykov-Kolmogorov（BK）算法在计算最小$s$-$t$割问题中的性能优化，提出了改进的理论分析和新的快速紧凑算法（fcBK），将时间复杂度从$O(mn|C|)$降低至$O(m|C|)$。此外，作者设计了一种紧凑的图表示方法，使得算法能够在有限内存下处理包含数十亿顶点和万亿边的大规模图。实验表明，该实现是目前BK算法中最高效的实现，突显了内存效率在大规模图割计算中的重要性。

Comments 15 pages, 6 figures, submitted to the IEEE for possible publication

2604.28045 2026-05-14 cs.CV

TAFA-GSGC: Group-wise Scalable Point Cloud Geometry Compression with Progressive Residual Refinement

Xiumei Li, Alexander Kopte, André Kaup

AI总结本文提出了一种名为TAFA-GSGC的可扩展点云几何压缩方法，能够在单一比特流和单一训练模型下实现多质量解码。该方法结合了分层残差细化与通道组熵编码，并引入了目标对齐特征聚合模块以减少增强残差中的跨层冗余。实验表明，TAFA-GSGC在保持良好压缩效率的同时，支持多达9个解码质量等级，并在D1-PSNR和D2-PSNR指标上分别实现了4.99%和5.92%的比特率降低。

Comments Accepted at IEEE International Conference on Image Processing (ICIP) 2026

2604.10720 2026-05-14 cs.AI cs.CL cs.CY

Teaching Language Models How to Code Like Learners: Conversational Serialization for Student Simulation

Charles Koutcheme, Juho Leinonen, Arto Hellas

AI总结本文提出了一种训练开放权重的编程学习模拟模型的新框架，通过将真实学生的学习过程数据转化为对话形式，模拟学生与自动评估系统之间的交互过程。该方法结合了监督微调和偏好优化，使模型能够更贴近真实学生的调试行为。实验表明，该方法在功能对齐和代码相似性方面优于传统仅基于代码的模型和提示生成的大语言模型。

Comments 8 pages, 2 figures, 2 tables. Accepted to Educational Data Mining 2026

2604.10634 2026-05-14 cs.CV

NTIRE 2026 The Second Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

Xin Li, Yeying Jin, Suhang Yao, Beibei Lin, Zhaoxin Fan, Wending Yan, Xin Jin, Zongwei Wu, Bingchen Li, Peishu Shi, Yufei Wang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Runzhe Li, Kui Jiang, Zhaocheng Yu, Yiang Chen, Junjun Jiang, Xianming Liu, Hongde Gu, Zeliang Li, Mache You, Jiangxin Dong, Jinshan Pan, Qiyu Rong, Bowen Shao, Hongyuan Jing, Mengmeng Zhang, Bo Ding, Hui Zhang, Yi Ren, Mohab Kishawy, Jun Chen, Anh-Kiet Duong, Petra Gomez-Kramer, Jean-Michel Carozza, Wangzhi Xing, Xin Lu, Enxuan Gu, Jingxi Zhang, Diqi Chen, Qiaosi Yi, Bingcai Wei, Wenjie Li, Bowen Tie, Heng Guo, Zhanyu Ma, Jiachen Tu, Guoyi Xu, Yaoxin Jiang, Cici Liu, Yaokun Shi, Paula Garrido Mellado, Daniel Feijoo, Alvaro Garcia Lara, Marcos V. Conde, Zhidong Zhu, Bangshu Xiong, Qiaofeng Ou, Zhibo Rao, Wei Li, Zida Zhang, Hui Geng, Qisheng Xu, Xuyao Deng, Changjian Wang, Kele Xu, Guanglu Dong, Qiyao Zhao, Tianheng Zheng, Chunlei Li, Lichao Mou, Chao Ren, Chang-De Peng, Chieh-Yu Tsai, Guan-Cheng Liu, Li-Wei Kang, Abhishek Rajak, Milan Kumar Singh, Ankit Kumar, Dimple Sonone, Kishor Upla, Kiran Raja, Huilin Zhao, Xing Xu, Chuan Chen, Yeming Lao, Wenjing Xun, Li Yang, Bilel Benjdira, Anas M. Ali, Wadii Boulila, Hao Yang, Ruikun Zhang, Liyuan Pan

AI总结本文介绍了NTIRE 2026第二届昼夜雨滴去除双焦点图像挑战赛的整体情况。该挑战基于真实场景下的Raindrop Clarity数据集，旨在建立一个在不同光照和对焦条件下具有良好实用性的雨滴去除基准。本次挑战吸引了168支队伍参与，其中17支队伍提交了最终方案，并在测试集上取得了较好的性能，展示了该领域技术的持续进步。

Comments Accepted by CVPR2026 Workshop; NTIRE 2026 Challenge Report

2402.15415 2026-05-14 cs.LG math.DS stat.ML

Understanding Catastrophic Forgetting In LoRA via Mean-Field Attention Dynamics

Hugo Koubbi, Louis Hernandez, Matthieu Boussard

AI总结本文研究了LoRA（低秩适配）方法在微调过程中出现的灾难性遗忘问题，通过构建一个可解析的均场自注意力玩具模型，将令牌视为相互作用的粒子系统，并将LoRA视为低秩扰动。利用偏微分方程和动力系统理论，揭示了遗忘行为与非遗忘行为之间的相变机制，并分析了扰动大小和模型深度对遗忘的影响，同时通过实验验证了理论预测。

Comments New version accepted at ICML 2026, with new results and without previous results

2210.09114 2026-05-14 cs.RO

INSANE: Cross-Domain UAV Data Sets with Increased Number of Sensors for developing Advanced and Novel Estimators

Christian Brommer, Alessandro Fornasier, Martin Scheiber, Jeff Delaune, Roland Brockers, Jan Steinbrener, Stephan Weiss

AI总结本文提出了一种名为INSANE的跨领域无人机数据集，旨在支持自主移动机器人在复杂动态环境中的高精度定位研究。该数据集包含多种场景和不同难度级别的飞行轨迹，涵盖室内运动捕捉环境、室内外过渡飞行以及模拟火星环境的挑战性任务，提供了丰富的传感器数据和高精度真实值。数据集配备了多种传感器，包括多个惯性测量单元和摄像头，并支持基于机器学习的传感器信号增强方法研究。

Comments V2 with added dataset comparison tables

Journal ref Int. J. Robot. Res. 43 (2024) 1083-1113

详情

DOI: 10.1177/02783649241227245

英文摘要

For real-world applications, autonomous mobile robotic platforms must be capable of navigating safely in a multitude of different and dynamic environments with accurate and robust localization being a key prerequisite. To support further research in this domain, we present the INSANE data sets - a collection of versatile Micro Aerial Vehicle (MAV) data sets for cross-environment localization. The data sets provide various scenarios with multiple stages of difficulty for localization methods. These scenarios range from trajectories in the controlled environment of an indoor motion capture facility, to experiments where the vehicle performs an outdoor maneuver and transitions into a building, requiring changes of sensor modalities, up to purely outdoor flight maneuvers in a challenging Mars analog environment to simulate scenarios which current and future Mars helicopters would need to perform. The presented work aims to provide data that reflects real-world scenarios and sensor effects. The extensive sensor suite includes various sensor categories, including multiple Inertial Measurement Units (IMUs) and cameras. Sensor data is made available as raw measurements and each data set provides highly accurate ground truth, including the outdoor experiments where a dual Real-Time Kinematic (RTK) Global Navigation Satellite System (GNSS) setup provides sub-degree and centimeter accuracy (1-sigma). The sensor suite also includes a dedicated high-rate IMU to capture all the vibration dynamics of the vehicle during flight to support research on novel machine learning-based sensor signal enhancement methods for improved localization. The data sets and post-processing tools are available at: https://sst.aau.at/cns/datasets

URL PDF HTML ☆

赞 0 踩 0

1903.00745 2026-05-14 cs.AI cs.LO cs.RO

A Formal Framework for Robot Construction Problems: A Hybrid Planning Approach

Faseeh Ahmad, Esra Erdem, Volkan Patoglu

AI总结本文研究了由多个自主机器人协作堆叠预制模块构建稳定结构的机器人建造问题，该问题因动作的连锁效应、真正的并发操作以及结构稳定性和模块支撑性要求而具有挑战性。作者提出了一种基于答案集编程的混合规划框架，能够同时确定最终稳定结构配置并规划多机器人操作顺序，确保每一步部分结构的稳定性与支撑性。该方法在理论上有严格的正确性与完备性保证，并通过多个具有挑战性的建造实例验证了其有效性与实用性。

Comments 8 pages (double-column), 7 figures

2605.12805 2026-05-14 cs.LG cs.AI

Discrete MeanFlow: One-Step Generation via Conditional Transition Kernels

Fairoz Nower Khan, Nabuat Zaman Nahim, Md Sajid Ahmed, Ruiquan Huang, Peizhong Ju

AI总结该论文提出了一种名为 Discrete MeanFlow 的新方法，用于在离散状态空间中实现一步生成。与连续空间中的 MeanFlow 不同，它通过连续时间马尔可夫链的条件转移核来建模概率质量的转移，并定义了一个平均离散速率来衡量转移概率在时间区间内的变化。该方法通过边界构建设计直接参数化转移核，确保生成过程无需迭代去噪或微分方程求解，只需一次前向传播和分类采样即可完成生成，实验表明其在有限状态马尔可夫链和合成序列生成任务中具有高精度。

2605.12785 2026-05-14 cs.LG cs.SY eess.SY math.DS

Identifying the nonlinear string dynamics with port-Hamiltonian neural networks

Maximino Linares, Guillaume Doras, Thomas Hélie

AI总结本文研究如何利用端口-哈密顿神经网络（PHNN）从数据中学习非线性弦动力学，提出了一种将物理知识融入神经网络结构的方法，用于识别由偏微分方程（PDE）描述的哈密顿系统。该方法通过构建基于端口-哈密顿系统（PHS）的结构化网络架构，能够同时恢复弦的哈密顿量和耗散项，相比非物理感知的基线方法，在准确性和可解释性方面均有显著提升。实验表明，该模型能够有效识别和模拟非线性弦的动态行为，在音乐声学等需要PDE建模的领域具有重要应用价值。

2605.12718 2026-05-14 cs.AI cs.LG cs.MA

CHAL: Council of Hierarchical Agentic Language

Tommaso Giovannelli, Griffin D. Kent

AI总结本文提出了一种名为CHAL的多智能体辩论框架，旨在通过可反驳的论证优化信念系统，解决当前多智能体辩论在结构上的局限性。CHAL引入了基于图结构的信念表示和梯度引导的动态更新机制，并将元认知价值系统作为可配置参数，以指导智能体的推理与裁决过程。该框架在多个领域展示了良好的泛化能力，并为构建透明、可审计的AI系统提供了基础。

详情

英文摘要

Multi-agent debate has emerged as a promising approach for improving LLM reasoning on ground-truth tasks, yet current methodologies face certain structural limitations: debate tends to induce a martingale over belief trajectories, majority voting accounts for most observed gains, and LLMs exhibit confidence escalation rather than calibration across rounds. We argue that the genuine value of debate, and dialectic systems as a whole, lies not in ground-truth tasks but in defeasible domains, where every position can in principle be defeated by better reasoning. We present the Council of Hierarchical Agentic Language (CHAL), a multi-agent dialectic framework that treats defeasible argumentation as an engine for belief optimization. Each agent maintains a CHAL Belief Schema (CBS), a graph-structured belief representation with a Bayesian-inspired architecture, that facilitates belief revision through a gradient-informed dynamic mechanism by leveraging the strength of the belief's thesis as a differentiable objective. Meta-cognitive value systems spanning epistemology, logic, and ethics are elevated to configurable hyperparameters governing agent reasoning and adjudication outcomes. We provide a series of ablation experiments that demonstrate systematic and interpretable effects: the adjudicator's value system determines the debate's overall trajectories in latent belief space, council diversity refines beliefs for all participants, and the framework generalizes across broad fields. CHAL is, to our knowledge, the first framework to treat multi-agent debate as structured belief optimization over defeasible domains. Further, the auditable belief artifacts it produces establish the foundation for dedicated evaluation suites for defeasible argumentation, with broader implications for building AI systems whose reasoning and value commitments are transparent, aligned, and subject to human oversight.

URL PDF HTML ☆

赞 0 踩 0

2605.12701 2026-05-14 cs.LG cs.AI cs.CE cs.CY

Do Fair Models Reason Fairly? Counterfactual Explanation Consistency for Procedural Fairness in Credit Decisions

Gideon Popoola, John Sheppard

AI总结在信用决策等社会敏感领域，现有公平机器学习模型虽然能够实现预测结果的公平性，但仍可能在推理过程中对不同群体采用不同的逻辑，形成“隐藏的过程性偏差”。本文提出一种名为反事实解释一致性（CEC）的框架，通过对齐个体与其反事实样本的特征归因，检测并缓解这种偏差，并引入新的过程性公平度量与训练损失函数。实验表明，CEC能有效减少模型的隐藏偏差，且对模型性能的影响较小。

2605.12628 2026-05-14 cs.RO

Multistep Belief Space Dynamics Learning For Risk-Aware Control

Jason Gibson, Bogdan Vlahov, Patrick Spieler, Evangelos A. Theodorou

AI总结本文研究了如何在自动驾驶系统中实现风险感知的控制，针对动态不确定性随时间演变的问题，提出了一种用于模型预测控制（MPC）的分布动态学习框架。该方法通过学习环境动力学的分布特性，能够在保证安全性的前提下优化控制策略，避免过于保守。实验表明，该方法在真实复杂的越野环境中表现出良好的适应性和智能行为。

2605.10127 2026-05-14 cs.CV

Fashion130K: An E-commerce Fashion Dataset for Outfit Generation with Unified Multi-modal Condition

Yu He, Ting Zhu, Yichun Liu, Lichen Ma, Xinyuan Shan, Jingling Fu, Yu Shi, Junshi Huang, Yan Li

AI总结本文提出一个名为Fashion130K的新电商时尚数据集，包含多种场合、模特和服装类型，旨在推动服装搭配生成的研究。为实现服装生成的视觉一致性，作者设计了统一多模态条件（UMC）框架，通过融合文本和图像提示的嵌入信息，并引入融合变换器对齐多模态特征，进而引导生成模型关注提示与噪声图像之间的关键关联。该数据集和框架为多模态提示在生成模型中的应用提供了全面而细致的探索，并在多个实际应用和基准测试中表现出优于现有方法的视觉一致性效果。

Comments Accepted to CVPR 2026 Findings

2605.10040 2026-05-14 cs.CV

Only Train Once: Uncertainty-Aware One-Class Learning for Face Authenticity Detection

Qingchao Jiang, Zhenxuan Hou, Zhiying Zhu, Zhenxing Qian, Xinpeng Zhang, Zaiwang Gu

AI总结随着生成式模型的快速发展，生成高度逼真的图像带来了身份欺诈和虚假信息传播的风险。现有方法大多将人脸伪造检测视为全监督的二分类问题，难以应对新型生成方法带来的挑战。本文提出FADNet，将人脸真实性检测重新建模为一类分类任务，仅使用真实人脸数据进行训练，通过引入证据深度学习和伪伪造图像生成器，有效提升了模型的泛化能力和检测精度，在多个基准测试中取得了优于现有方法的优异性能。

Comments The sole reason for our withdrawal application is that we have identified critical areas in our manuscript that require substantial revision and improvement to meet rigorous scientific standards. Our only intention is to retract the current draft to revise and enhance it, with no plans to replace it with a different version or redirect readers to other sources at this time

2605.09935 2026-05-14 cs.CV cs.CR

Evidence-based Decision Modeling for Synthetic Face Detection with Uncertainty-driven Active Learning

Qingchao Jiang, Zhenxuan Hou, Zhiying Zhu, Zhenxing Qian, Xinpeng Zhang, Zaiwang Gu

AI总结随着深度生成模型的快速发展，伪造人脸图像被广泛用于非法活动。现有合成人脸检测方法虽取得进展，但因依赖Softmax激活函数而存在过度自信的问题，导致在面对未知分布图像时预测不可靠。为此，本文提出EMSFD方法，通过狄利克雷分布建模类别证据并显式引入模型不确定性，提升检测可靠性与泛化能力；同时利用不确定性指导主动学习，减少标注成本，实验表明该方法在检测准确率上比现有最优方法提升了15%。

2605.09923 2026-05-14 cs.AI

expo: Exploration-prioritized policy optimization via adaptive kl regulation and gaussian curriculum sampling

Mingxiong Lin, Zhangquan Gong, Maowen Tang, Qian Li, Chuangchuang Wang, Jian Ma, Sutian Huang, Kai Tang, Haonan Lu

AI总结该论文针对基于可验证奖励的强化学习（RLVR）中主流算法Group Relative Policy Optimization（GRPO）存在的探索效率不足问题，提出了探索优先策略优化方法EXPO。EXPO通过引入动态调整的KL正则化模块和基于高斯分布的课程采样策略，有效提升了模型在数学推理任务中的探索能力和训练效率。实验表明，EXPO在多个基准测试中显著优于原始GRPO，尤其在高难度问题上的性能提升更为明显。

Comments Duplicate submission of arXiv:2605.11403

2605.01457 2026-05-14 cs.AI

CoFlow: Coordinated Few-Step Flow for Offline Multi-Agent Decision Making

Guowei Zou, Haitao Wang, Beiwen Zhang, Boning Zhang, Hejun Wu

AI总结本文提出了一种名为CoFlow的协调少步流方法，用于离线多智能体决策问题。该方法通过引入协调速度注意力机制和自适应协调门控，实现了在单次生成过程中保持智能体间协调性的目标，从而克服了现有少步生成方法在协调性上的不足。实验表明，CoFlow在多种任务中表现出色，能够在仅需1到3步去噪的情况下达到最先进的协调质量，且其性能提升主要归因于智能体间的协调能力增强。

Comments 34 pages, 15 figures, 10 tables. Project page: https://guowei-zou.github.io/coflow/

2603.25340 2026-05-14 cs.CL

Large Language Model as Token Compressor and Decompressor

Wenbing Li, Yiran Wang, Zikai Song, Jielei Zhang, Tianhao Zhao, Junkai Lin, Wei Yang

AI总结本文研究了如何将现成的大语言模型（LLM）适配为用于长文本处理的离散可变长度编码器和解码器。作者设计了一种自表达的自编码框架，通过轻量的LoRA适配器对预训练LLM进行微调，将长文本映射为紧凑的潜在编码序列（Z-tokens），并能将其解码回自然语言或任务输出。该方法在保持重建质量和下游任务性能的同时，有效减少了上下文长度、生成阶段的内存使用和端到端延迟，为高效长文本推理提供了实用的接口。

2602.09724 2026-05-14 cs.CL

Targum -- A Multilingual New Testament Translation Corpus

Maciej Rapacz, Aleksander Smywiński-Pohl

AI总结本文介绍了一个名为 Targum 的多语种新约圣经翻译语料库，旨在弥补现有语料库在语言深度上的不足。该语料库包含 651 个新约翻译版本，其中 334 个为独家版本，涵盖英语、法语、意大利语、波兰语和西班牙语五种语言，每种语言的翻译数量均远超以往任何语料库。每个翻译版本都附有标准化元数据，便于研究者进行多层次的翻译分析，为圣经翻译史的量化研究提供了重要资源。

Comments v3 - fixed duplicated references section heading, fixed reference v2 - camera ready version

2511.00066 2026-05-14 cs.LG

Sharpness-Guided Group Relative Policy Optimization via Probability Shaping

Tue Le, Linh Ngo Van, Trung Le

AI总结本文研究了可验证奖励强化学习（RLVR）中策略优化的泛化问题，提出了一种基于梯度范数的锐度代理来上界泛化损失，并在此基础上改进了组相对策略优化（GRPO）算法。通过引入锐度引导的GRPO（GRPO-SG），该方法对可能引发过大梯度的token进行降权处理，从而减少剧烈更新，提升优化稳定性与模型泛化能力。实验表明，GRPO-SG在数学推理、逻辑谜题和工具增强问答任务中均优于原始GRPO，且梯度轨迹更平稳。

2508.01049 2026-05-14 cs.LG

Centralized Adaptive Sampling for Reliable Co-Training of Independent Multi-Agent Policies

Nicholas E. Corrado, Josiah P. Hanna

AI总结在多智能体强化学习中，独立策略梯度算法在合作且无冲突的游戏中广泛应用，但其收敛性能受限于联合策略分布的采样误差。本文提出了一种集中式自适应采样方法CoSER，通过协调各智能体的动作选择，减少联合采样误差，从而提升策略梯度学习的可靠性。实验表明，CoSER相比独立采样方法更有效地降低采样误差，并提高了算法收敛到最优联合策略的概率。

Comments RLC 2026

2308.10058 2026-05-14 cs.CV

R-C-P Method: An Autonomous Volume Calculation Method Using Image Processing and Machine Vision

MA Muktadir, Sydney Parker, Sun Yi

AI总结本文提出了一种基于图像处理和机器视觉的自主体积计算方法——R-C-P方法，旨在替代传统深度传感器（如LiDAR）以适应复杂环境下的应用需求。该方法利用两台2D摄像头实时测量矩形物体的尺寸，通过行-列-像素（R-C-P）策略结合边缘检测技术，实现了对物体表面积及不连续边缘或体积的检测。实验验证了该方法的有效性，并提供了基于摄像头与物体距离的尺寸计算公式，为实际物体的自主测量提供了可行的视觉解决方案。

Journal ref Communications in Computer and Information Science, vol. 2939, Springer, Cham (2026)

2605.12550 2026-05-14 cs.CV cs.AI

SSDA: Bridging Spectral and Structural Gaps via Dual Adaptation for Vision-Based Time Series Forecasting

Mingrui Zhang, Hanchen Yang, Wengen Li, Xudong Jiang, Yichao Zhang, Jihong Guan, Shuigeng Zhou

AI总结该论文研究了基于视觉模型的时间序列预测问题，指出将时间序列渲染为图像后，仍存在光谱和结构上的差距，限制了预训练视觉模型的性能。为此，作者提出SSDA方法，通过光谱幅度对齐和结构引导的低秩适配，分别在数据和模型层面弥补这些差距，从而显著提升时间序列预测效果。实验表明，SSDA在多个真实数据集上优于现有方法，表现出良好的泛化能力。

2507.22095 2026-05-14 stat.ML cs.LG math.PR

Posterior Bayesian Neural Networks with Dependent Weights

Nicola Apollonio, Giovanni Franzina, Giovanni Luca Torrisi

AI总结本文研究具有依赖权重和可能重尾分布的全连接前馈深度神经网络，旨在克服标准高斯先验的局限性。通过引入高斯似然的后验分布视角，论文分析了在网络宽度趋于无穷时输出的后验分布行为，并在先验下随机协方差矩阵正定的条件下，确定了输出的后验分布。研究还给出了确保协方差矩阵可逆的温和条件，并展示了某些模型参数（如激活函数和相关Lévy测度）对极限独立性的影响，扩展了已有研究成果。

Comments 2 figures

2605.12524 2026-05-14 cs.LO cs.AI

Stress-Testing the Reasoning Competence of LLMs With Proofs Under Minimal Formalism

Konstantine Arkoudas, Serafim Batzoglou

AI总结本文提出ProofGrid，一个用于评估大语言模型（LLM）推理能力的基准测试套件，通过机器可验证的证明而非仅最终答案来衡量模型能力。ProofGrid包含15个任务，涵盖证明生成、验证、掩码和补全，使用简洁的自然演绎语言NDL进行表达，支持精确且可审计的验证。该基准测试具有可重复、细粒度的评估机制，并覆盖从基础推理到复杂挑战任务的难度范围，揭示了当前模型在全局组合推理和低级证明合成等方面的显著局限。

详情

英文摘要

We introduce ProofGrid, a benchmark suite for evaluating LLM reasoning through machine-checkable proofs rather than final answers alone. ProofGrid contains 15 tasks spanning proof writing, proof checking, proof masking, and proof gap-filling. Tasks are expressed in minimal formal notation, especially NDL, a compact natural-deduction language that fits in short prompts and supports precise, auditable verification. This yields mechanical, reproducible, and fine-grained evaluation rather than judgments by humans or LLMs. ProofGrid covers a calibrated difficulty spectrum, from foundational reasoning tests to structurally rich challenge tasks that no current model solves, while minimizing reliance on domain knowledge, solver delegation, and long-context artifacts. We also develop a comparative framework for reasoning benchmarks and use it to situate ProofGrid relative to existing work in terms of representation, verification guarantees, and reasoning depth. Methodologically, we introduce an instrumented proof-checking pipeline that tolerates minor surface deviations while locating the first substantive reasoning failure, improving measurement resolution and separating proof planning from low-level execution noise. Using this pipeline, we evaluate a broad range of open and proprietary models. Results show rapid progress but substantial remaining limits: frontier models perform well on several foundational tasks, yet difficult tasks, especially those requiring global combinatorial reasoning or low-level proof synthesis, remain far from solved. We also identify epistemic instability, where models generate flawed proofs yet correctly reject those local inferences in isolation, and formalize this with an Epistemic Stability Index. Finally, we complement accuracy with 2PL IRT analyses, Wright maps, and a normalized task-discrimination measure based on Fisher information.

URL PDF HTML ☆

赞 0 踩 0

math/9901049 2026-05-14 math.GR

Rigidity of Right-Angled Coxeter Groups

David G. Radcliffe

AI总结本文研究了右角Coxeter群的刚性性质，探讨了在不同生成集下该群的Coxeter系统是否等价。作者证明了若两个有限生成集生成同一个右角Coxeter群，则对应的Coxeter系统是等价的。这一结果揭示了该类群结构的稳定性，为理解其代数与几何性质提供了重要依据。

Comments 6 pages. Improved exposition and formatting

2605.13844 2026-05-14 math.NT

Fields where torsion forms decompose

M. Archita, Karim Johannes Becher

AI总结本文研究了在特定实数域上挠二次型的分解问题，证明了在满足一定条件的实数域上，每个挠二次型都可以分解为若干个二维挠二次型的正交和。研究基于对赋值域和一变量函数域上弱各向同性形式的更一般性分析，为理解二次型的结构提供了新的视角和结果。

Comments 10 pages

2605.13843 2026-05-14 astro-ph.GA astro-ph.CO

The Galaxy Luminosity Functions in ASTRID: Predictions for LSST

Fatemeh Hafezianzadeh, Tianqing Zhang, Paul Rogozenski, Patrick Lachance, Yihao Zhou, Tiziana Di Matteo, Rupert A. C. Croft, Simeon Bird, Rachel Mandelbaum

AI总结本文利用ASTRID宇宙学流体动力学模拟，为Vera C. Rubin天文台的Legacy Survey of Space and Time（LSST）项目生成了验证过的星系光度函数和光度预测。研究结合恒星群体合成模型与物理驱动的尘埃消光模型，准确再现了不同红移和波段下的观测星系统计特性，并据此构建了包含约3.78亿个星系的LSST模拟光度目录。研究还提供了LSST各波段的光度函数预测，推导了最佳拟合的Schechter参数，并计算了从第一年到第十年不同观测深度的星系数目分布。

Comments 17 pages, 13 figures

2605.13842 2026-05-14 astro-ph.GA astro-ph.IM

From DES to KiDS: Domain adaptation for cross-survey detection of low-surface-brightness galaxies

Hareesh Thuruthipilly, Krzysztof Lisiecki, Junais, Katarzyna Małek, Agnieszka Pollo, William J. Pearson, Antonio Vanzanella, Saptarshi Pal, Miguel Figueira, Pratik Dabhade, Anna Durkalec, Aidan P. Cotter, Unnikrishnan Sureshkumar, Nandini Hazra, Patryk Matera, Subhrata Dey, Michal Vrábel, Anirban Dutta, Henry Willems, Nicola Principi Cavaterra, Natalia Dobrowolska, Wojciech Knop

AI总结该研究旨在解决跨巡天观测中低表面亮度星系（LSBG）的检测问题，利用域适应技术将基于暗能量巡天（DES）训练的深度学习模型应用于千里度巡天（KiDS）数据，实现了对KiDS DR5数据中LSBGs和超弥散星系（UDGs）的自动识别。研究共发现了20,180个LSBG和434个UDG，并揭示了它们的结构参数、颜色分布及与环境相关的演化特征，为未来大型巡天如LSST和Euclid提供了可扩展的LSBG目录构建方法。

Comments Accepted to Astronomy & Astrophysics

AI 大模型

视觉与机器人

科学与医疗

Rescaled Asynchronous SGD: Optimal Distributed Optimization under Data and System Heterogeneity

Fast and Compact Graph Cuts for the Boykov-Kolmogorov Algorithm

TAFA-GSGC: Group-wise Scalable Point Cloud Geometry Compression with Progressive Residual Refinement

Teaching Language Models How to Code Like Learners: Conversational Serialization for Student Simulation

NTIRE 2026 The Second Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

Understanding Catastrophic Forgetting In LoRA via Mean-Field Attention Dynamics

INSANE: Cross-Domain UAV Data Sets with Increased Number of Sensors for developing Advanced and Novel Estimators

A Formal Framework for Robot Construction Problems: A Hybrid Planning Approach

Discrete MeanFlow: One-Step Generation via Conditional Transition Kernels

Identifying the nonlinear string dynamics with port-Hamiltonian neural networks

CHAL: Council of Hierarchical Agentic Language

Do Fair Models Reason Fairly? Counterfactual Explanation Consistency for Procedural Fairness in Credit Decisions

Multistep Belief Space Dynamics Learning For Risk-Aware Control

Fashion130K: An E-commerce Fashion Dataset for Outfit Generation with Unified Multi-modal Condition

Only Train Once: Uncertainty-Aware One-Class Learning for Face Authenticity Detection

Evidence-based Decision Modeling for Synthetic Face Detection with Uncertainty-driven Active Learning

expo: Exploration-prioritized policy optimization via adaptive kl regulation and gaussian curriculum sampling

CoFlow: Coordinated Few-Step Flow for Offline Multi-Agent Decision Making

Large Language Model as Token Compressor and Decompressor

Targum -- A Multilingual New Testament Translation Corpus

Sharpness-Guided Group Relative Policy Optimization via Probability Shaping

Centralized Adaptive Sampling for Reliable Co-Training of Independent Multi-Agent Policies

R-C-P Method: An Autonomous Volume Calculation Method Using Image Processing and Machine Vision

SSDA: Bridging Spectral and Structural Gaps via Dual Adaptation for Vision-Based Time Series Forecasting

Posterior Bayesian Neural Networks with Dependent Weights

Stress-Testing the Reasoning Competence of LLMs With Proofs Under Minimal Formalism

Rigidity of Right-Angled Coxeter Groups

Fields where torsion forms decompose

The Galaxy Luminosity Functions in ASTRID: Predictions for LSST

From DES to KiDS: Domain adaptation for cross-survey detection of low-surface-brightness galaxies