arXivDaily arXiv每日学术速递 周一至周五更新
2606.20550 2026-06-19 cs.DL cs.HC cs.IR 新提交

Easy Reads: A Python program for making Scientific Papers on arXiv more Reader Friendly and Accessible

Easy Reads: 一个使arXiv上的科学论文更易读和更易访问的Python程序

Vishal Verma

AI总结 针对科学论文排版紧凑、可读性差的问题,提出Easy Reads——一个自动化、端到端的开源Python程序,通过自定义字体大小和列数等格式,从arXiv获取论文并重新排版,提升可读性和可访问性。

Comments 9 pages. Open-source software project available at: https://github.com/Curious-flow/Easy-Reads

详情
AI中文摘要

科学论文通常排版紧凑,具有小字体、小行距、双栏文本和紧密排列的图表等特点。虽然这些特性使论文更紧凑,但会妨碍可读性,降低可访问性,并可能使读者感到疲劳。arXiv是一个跨学科的科学论文开放获取库,被包括物理学和天体物理学社区在内的研究人员广泛使用。Easy Reads是一个自动化、端到端的开源Python程序,通过使arXiv上的论文更易读和更易访问来帮助解决上述挑战。Easy Reads可以通过URL自动从arXiv获取论文,并处理源TeX文件,允许自定义论文的格式特性,主要是字体大小和使用的栏数。Easy Reads的主要目标是促进科学论文的易读性。

英文摘要

Scientific papers are frequently dense and characterized by features such as small fonts and line spacing, double columns of text, and tightly arranged figures. While these features make papers more compact, they can hinder readability, make them less accessible, and can strain the reader. arXiv is a premier open-access repository for scientific papers across different fields and is used extensively by researchers, including those in the physics and astrophysics communities. Easy Reads is an automated, end-to-end, open-source Python program that helps address the stated challenge by making papers from arXiv more reader-friendly and accessible. Easy Reads can automatically fetch a paper from arXiv via its URL and work with the source TeX file to allow custom formatting of the paper features, primarily the font size, and the number of columns used. The main goal of Easy Reads is to facilitate ease of reading of scientific papers.

2606.19630 2026-06-19 cs.AI cs.DL cs.SY eess.SY 交叉投稿

AI4SE and SE4AI Exploration: A Decade Looking Back and Forward

AI4SE 与 SE4AI 探索:回顾与展望的十年

H. Sinan Bank, Daniel R. Herber, Thomas Bradley

发表机构 * Colorado State University(科罗拉多州立大学)

AI总结 本文回顾了人工智能与系统工程在三个阶段的进展,通过人机一致性文献综述识别出五个关键研究空白,并提供了AI采纳、保障和劳动力转型的指导。

Comments 10 pages, 5 figure

详情
AI中文摘要

2020年3月INCOSE INSIGHT关于人工智能与系统工程的特刊成为该刊历史上下载量最高的一期,并催生了一个研究社区,其年度研讨会现吸引超过250名注册者。在本文中,我们基于作者对该领域核心论文的解读,追溯了人工智能与系统工程在三个阶段(标记为基础、应用和LLM转折点)的进展,并描述了我们对社区已达成共识以及仍存在关键空白的看法。此外,我们进行了一项人机一致性文献综述,利用人类专家和六个人工智能模型评估了1,712篇INCOSE INSIGHT文章和889篇SERC出版物的相关性。结果识别出五个关键研究空白,并为从业者在系统工程中应对AI采纳、保障和劳动力转型提供了指导。我们共享一致性数据以及AI4SE/SE4AI Explorer网络应用程序,以便读者将自己的相关性判断与人类和AI评分者进行比较。

英文摘要

The March 2020 INCOSE INSIGHT special issue on AI and Systems Engineering (SE) became the most downloaded issue in the publication's history and launched a research community that now draws over 250 registrants to its annual workshop. In this article, we trace the progress in AI and SE across three phases (labeled here foundational, applied, and LLM inflection) based on the authors' reading of the field's core papers, and describe our opinions of where the community has converged and where critical gaps remain. Separately, a human-AI agreement literature review leveraging both human expertise and six AI models was performed to assess the relevance of 1,712 INCOSE INSIGHT articles and 889 SERC publications. The results identify five critical research gaps and offer guidance for practitioners navigating AI adoption, assurance, and workforce transformation in SE. We share the agreement data and the AI4SE/SE4AI Explorer web application so readers can compare their own relevance judgments with the human and AI raters.

2603.27698 2026-06-19 cs.CV cs.DL 版本更新

Ink Detection from Surface Topography of the Herculaneum Papyri

Giorgio Angelotti, Federica Nicolardi, Paul Henderson, W. Brent Seales

发表机构 * Vesuvius Challenge, USA(维苏威挑战赛,美国) Università degli Studi di Napoli Federico II, Italy(那不勒斯费德里科二世大学,意大利) University of Glasgow, Scotland, UK(格拉斯哥大学,苏格兰,英国) EduceLab, University of Kentucky, USA(EduceLab,肯塔基大学,美国)

Comments 9 pages, 3 figures, 2 tables. Currently under review

Journal ref Scientific Reports (2026)

详情
英文摘要

Reading the Herculaneum papyri is challenging because both the scrolls and the ink, which is carbon-based, are carbonized. In X-ray radiography and tomography, ink detection typically relies on density- or composition-driven contrast, but carbon ink on carbonized papyrus provides little attenuation contrast. Building on the morphological hypothesis, we show that the surface morphology of written regions contains enough signal to distinguish ink from papyrus. To this end, we train machine learning models on three-dimensional optical profilometry from mechanically opened Herculaneum papyri to separate inked and uninked areas. We further quantify how lateral sampling governs learnability and how a native-resolution model behaves on coarsened inputs. We show that high-resolution topography alone contains a usable signal for ink detection. Diminishing segmentation performance with decreasing lateral resolution provides insight into the characteristic spatial scales that must be resolved on our dataset to exploit the morphological signal. These findings inform spatial resolution targets for morphology-based reading of closed scrolls through X-ray tomography.

2509.03391 2026-06-19 cs.DL cs.CY 版本更新

More Parameters Than Populations: A Systematic Literature Review of Large Language Models within Survey Research

参数多于总体:调查研究中的大语言模型系统文献综述

Trent D. Buskirk, Florian Keusch, Leah von der Heyde, Adam Eck

AI总结 通过系统文献综述,评估大语言模型在调查研究三个阶段(数据收集前、中、后)的应用,讨论其潜力与陷阱,并展望调查研究对LLM发展的贡献。

Comments This working paper is outdated as of June 2026 - please refer to the full version with substantive changes here: https://doi.org/10.31235/osf.io/eubj4_v1 This work was presented at NLPOR 2025 (non-archival): https://openreview.net/forum?id=0Hxhwa56Yg

详情
AI中文摘要

[工作论文]调查研究长期以来一直是人力驱动的领域,但也接纳了多种技术来收集、处理和分析各种行为、政治和社会结果。与此同时,大语言模型(LLM)带来了新的技术挑战和前提条件,以充分利用其潜力。在本文中,我们报告了一项基于多个大规模数据库关键词搜索和引文网络的系统文献综述的进展,评估LLM目前在调查研究过程中的应用情况。我们根据调查研究过程综合并组织我们的发现,包括LLM在三个广泛阶段的使用示例:数据收集前、数据收集和数据收集后。我们基于现有文献中的示例,讨论了LLM潜在用例的选定示例及其陷阱。考虑到调查研究在数据质量方面拥有丰富的经验和历史,我们讨论了一些机会,并描述了调查研究为LLM的持续发展和改进做出贡献的未来展望。

英文摘要

[Working Paper] Survey research has a long-standing history of being a human-powered field, but one that embraces various technologies for the collection, processing, and analysis of various behavioral, political, and social outcomes of interest, among others. At the same time, Large Language Models (LLMs) bring new technological challenges and prerequisites in order to fully harness their potential. In this paper, we report work-in-progress on a systematic literature review based on keyword searches from multiple large-scale databases as well as citation networks that assesses how LLMs are currently being applied within the survey research process. We synthesize and organize our findings according to the survey research process to include examples of LLM usage across three broad phases: pre-data collection, data collection, and post-data collection. We discuss selected examples of potential use cases for LLMs as well as its pitfalls based on examples from existing literature. Considering survey research has rich experience and history regarding data quality, we discuss some opportunities and describe future outlooks for survey research to contribute to the continued development and refinement of LLMs.

2509.02581 2026-06-19 cs.DL cs.AI 版本更新

Charting the Future of Scholarly Knowledge with AI: A Community Perspective

用AI绘制学术知识的未来:社区视角

Azanzi Jiomekong, Hande Küçük McGinty, Keith G. Mills, Allard Oelen, Enayat Rajabi, Harry McElroy, Antrea Christou, Anmol Saini, Janice Anta Zebaze, Hannah Kim, Anna M. Jacyszyn, Gollam Rabby, Dirk Betz, Claudia Biniossek, Sanju Tiwari, Sören Auer

发表机构 * TIB Leibniz Information Centre for Science and Technology(蒂宾根莱比锡科学与技术信息中心) Department of Computer Science, University of Yaounde 1(亚奥内1大学计算机科学系) Department of Computer Science, Kansas State University(堪萨斯州立大学计算机科学系) School of EECS, Louisiana State University(路易斯安那州立大学电子工程与计算机科学学院) Management Science Department, Cape Breton University(cape breton 大学管理科学系) Department of Development and Research, Performigence(Performigence 发展与研究部) Department of Engineering and Computer Science, Wright State University(怀特州立大学工程与计算机科学系) Department of Physics, University of Yaounde 1(亚奥内1大学物理系) FIZ Karlsruhe, Leibniz Institute for Information Infrastructure(卡尔斯鲁厄莱比锡信息基础设施研究所) Sharda University, Delhi-NCR, India(德里-纳尔默德印度大学) L3S Research Center, Leibniz University of Han(汉莱比锡大学L3S研究中心)

AI总结 本文从社区视角出发,识别促进跨学科对话、共享挑战、分类新合作并塑造学术知识组织未来研究方向的方法。

Comments 39 pages, 3 figures

详情
AI中文摘要

尽管支持学术知识提取和组织的工具日益普及,许多研究人员仍依赖手动方法,有时是因为对现有技术不熟悉或缺乏领域适应性解决方案。同时,跨学科学术出版物的快速增长使得跟上最新进展越来越困难,进一步凸显了对可扩展的、基于AI的方法来结构化和综合学术知识的需求。各个研究社区已开始独立应对这一挑战,开发旨在构建可靠、动态且可查询的学术知识库的工具和框架。然而,这些社区之间的有限互动阻碍了方法、模型和最佳实践的交流,减缓了向更集成解决方案的进展。本文确定了促进跨学科对话、识别共同挑战、分类新合作并塑造学术知识组织未来研究方向的方法。

英文摘要

Despite the growing availability of tools designed to support scholarly knowledge extraction and organization, many researchers still rely on manual methods, sometimes due to unfamiliarity with existing technologies or limited access to domain-adapted solutions. Meanwhile, the rapid increase in scholarly publications across disciplines has made it increasingly difficult to stay current, further underscoring the need for scalable, AI-enabled approaches to structuring and synthesizing scholarly knowledge. Various research communities have begun addressing this challenge independently, developing tools and frameworks aimed at building reliable, dynamic, and queryable scholarly knowledge bases. However, limited interaction across these communities has hindered the exchange of methods, models, and best practices, slowing progress toward more integrated solutions. This manuscript identifies ways to foster cross-disciplinary dialogue, identify shared challenges, categorize new collaboration and shape future research directions in scholarly knowledge and organization.