arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.12073 2026-06-11 cs.SI cs.AI 新提交

"That's AI Slop, You Bot!" Studying Accusations, Evidence, and Credibility in Online Discourse Towards LLM-Generated Comments

“那就是AI垃圾，你这个机器人！”：研究针对LLM生成评论的指责、证据与可信度

Jason Miklian, John E. Katsos

AI总结分析2023-2026年Hacker News和Reddit上2500万条评论，发现对AI生成文本的指责增长超十倍，但被指责的文本并非真正由AI生成，而是基于感知真实性的社会把关行为。

详情

AI中文摘要

生成式AI使得流畅的散文变得廉价易得，打破了“好文章意味着真思考”的旧承诺。读者如何回应？这能告诉我们关于反AI态度变化的什么信息？我们分析了来自Hacker News和Reddit（2023-2026年）的2500万条评论，结合了对7500个抽样AI使用指责的LLM判断、情感轨迹、300个确认AI使用指责的言语行为编码，以及被指责与未被指责的父评论的匹配对照测试。我们发现，两个平台上指责中贬义标签的份额增长了十倍以上，而2022年前的不真实性词汇（如shill、astroturf）的安慰剂词汇则没有。这一转变反映了一个快速增长的趋势：将任何可疑或看似不真实的散文标记为“AI垃圾”。AI垃圾框架现在占贬义提及的94%，主导评论的语气从嘲笑转向把关和结构性抗议。关键惊喜来自匹配对照测试，该测试发现，统计上区分AI与人类文本的散文特征并不能预测哪些人类文本会被指责为AI。新的指责作为感知真实性的社会把关，实际上并不筛查AI。这项研究扩展了信号理论，表明当底层检测问题无法在非专家层面解决时，即使不准确，社会使用的替代信号也会增长。它表明，AI对写作的影响从读者侧来看与生产（作者）侧不同。检测技术无法解决这种动态，因为指责的社会功能日益表现为社会把关和群体内信号传递，而非识别AI生成的写作。

英文摘要

Generative AI has made fluent prose cheap to produce, breaking the old promise to readers that good writing meant real thinking. How have readers responded, and what can this tell us about changing anti-AI attitudes? We analyzed 25 million comments from Hacker News and Reddit (2023-2026), combining LLM judgment on 7,500 sampled accusations of AI use, sentiment trajectories, speech-act coding of 300 confirmed accusations of AI use, and a matched-control test of accused versus non-accused parent comments. We found that the pejorative-label share of accusations rose more than tenfold on both platforms while a placebo vocabulary of pre-2022 inauthenticity terms (shill, astroturf) did not. This shift reflected a fast-growing trend of branding any suspicious or seemingly inauthentic prose as "AI slop". The slop frame now constitutes 94 percent of pejorative mentions, with the dominant comments shifting in tone from mockery toward gatekeeping and structural protest. The key surprise comes from a matched-control test which found that prose features that statistically distinguish AI from human text do not predict which human text gets accused as AI. The new accusations work as social gatekeeping of perceived authenticity without actually screening for AI. This research extends signaling theory by showing that substitute signals used socially can grow even when inaccurate if the underlying detection problem cannot be solved at the non-expert level. It shows that AI's effects on writing from the reader side are distinct from those on the production (writer) side. Detection technology cannot resolve this dynamic because the social function of accusations is increasingly to perform social gatekeeping and in-group signaling as opposed to identifying AI-generated writing.

URL PDF HTML ☆

赞 0 踩 0

2606.12026 2026-06-11 math.SP cs.SI math-ph physics.data-an 新提交

Generalizing Perron--Frobenius theory and eigenvector-based centralities to networks with complex edge weights

将Perron-Frobenius理论和基于特征向量的中心性推广到具有复数边权重的网络

Yu Tian, Mason A. Porter, Lucas Böttcher

AI总结本文将Perron-Frobenius定理推广到复数权重矩阵，建立不同推广之间的联系，并提出基于特征向量的中心性度量以分析复数边权重网络中的节点重要性。

详情

Comments: 34 pages, 9 figures, 1 table

AI中文摘要

线性代数及其在网络分析应用中的一个基本概念是Perron-Frobenius (PF)定理，它支撑着基于特征向量的中心性度量，如特征向量中心性、PageRank以及枢纽和权威中心性。通过引用PF定理，我们知道对于具有正边权重的强连通网络，权重矩阵最大特征值对应的特征向量产生一个明确定义的中心性度量（即特征向量中心性）。PF定理及其相关中心性度量的传统表述假设网络具有实数值权重。然而，量子信息、量子化学、电动力学和机器学习等领域的许多网络具有复数值边权重。在本文中，我们研究PF定理到复数值矩阵的推广，建立这些推广之间的联系，并提出基于特征向量的中心性度量以分析具有复数边权重的网络中的节点重要性。我们还证明了满足广义PF性质的复数权重网络的存在性结果，并计算了几个示例的相关中心性度量，这些示例来自电子传输、电路分析、数学化学和通信网络等应用领域。

英文摘要

A fundamental concept in linear algebra and its applications to network analysis is the Perron--Frobenius (PF) theorem, which underpins eigenvector-based centrality measures such as eigenvector centrality, PageRank, and hubs and authorities. By invoking the PF theorem, we know for strongly connected networks with positive edge weights that the eigenvector corresponding to the largest eigenvalue of the weight matrix yields a well-defined centrality measure (namely, eigenvector centrality). Traditional formulations of the PF theorem and associated centrality measures assume that networks have real-valued weights. However, many networks in areas such as quantum information, quantum chemistry, electrodynamics, and machine learning have complex-valued edge weights. In this paper, we study generalizations of the PF theorem to complex-valued matrices, establish connections between these generalizations, and propose generalized eigenvector-based centrality measures to analyzing node importances in networks with complex edge weights. We also prove results about the existence of complex-weighted networks that satisfy generalized PF properties and calculate associated centrality measures for several examples, which we draw from application areas such as electron transport, circuit analysis, mathematical chemistry, and communication networks.

URL PDF HTML ☆

赞 0 踩 0

2606.11692 2026-06-11 cs.CY cs.MA cs.SI 新提交

Evaluation of Alternative-Based Information Systems for Deliberative Polling using an Agentic Simulator

基于智能体模拟器的审议式投票中替代性信息系统评估

Rwaida Alssadi, Khulud Alawaji, Balaji Kasula, Muntaser Syed, Badria Alfurhood, Markus Zanker, Marius Silaghi

AI总结提出基于LLM的智能体双极论证模拟器（ABAS），通过覆盖率和语料多样性评估审议式投票中推荐机制的有效性，并测试了对抗性投票攻击下的鲁棒性。

详情

AI中文摘要

审议式投票旨在通过让股东在投票前接触广泛论点来改善集体决策。然而，确保每个选民遇到理由空间的代表性样本（覆盖问题）仍然是一个开放的挑战，特别是在大规模和对抗性或策略性动机的选民群体中。本文介绍了一种使用基于LLM的智能体双极论证模拟器（ABAS）评估解决方案的方法，该模拟器基于一个将投票形式化为六元组<Jend, Jopp, Ratt, Renh, VA, VR>（包含支持与反对理由、攻击与增强关系、股东权重和关系权重）的框架。ABAS模拟N个自主股东智能体，每个智能体根据[-1,1]内的期望分布分配潜在意见，依次投票、选择或撰写理由，并可选择提交论证图链接。该模拟器实现推荐机制，根据可观察的支持质量对现有理由进行排序。它通过覆盖率（即每个股东收到的K条推荐中代表语料库理由标签集的比例）来评估机制的成功，作为NP难子集理由问题的一个解决方案。报告的实验描述了创造力率（pown）、推荐大小（K）、论证密度（plinks）和人口规模（N）如何影响覆盖率和语料库多样性。在一个经过身份验证的选民群体中（Sybil攻击不可能，只有关系图可被操纵），我们通过协调策略性投票攻击对评分进行压力测试：标签洪泛攻击导致覆盖率崩溃，而通过反向PageRank规则的作者计数关系加权比均匀权重显著更好地抵抗了洪泛攻击。

英文摘要

Deliberative polling promises to improve collective decision-making by exposing shareholders to a broad range of arguments before they vote. Yet ensuring that every voter encounters a representative sample of the reason space, the coverage problem, remains an open challenge, particularly at scale and in adversarial or strategically motivated electorates. This paper introduces a way of evaluating solutions using the LLM-based Agentic Bipolar Argumentation Simulator, grounded in a framework which formalises a poll as a six-tuple <Jend, Jopp, Ratt, Renh, VA, VR> of endorsing and opposing justifications, attack and enhance relations, and shareholder- and relation-weights. ABAS simulates N autonomous shareholder agents, each assigned a latent opinion according to desired distributions in [-1, 1], who sequentially vote, choose or author justifications, and optionally submit argumentation-graph links. The simulator implements recommendations that rank existing justifications by their observable endorsement mass. It evaluates the mechanism's success by coverage, namely the fraction of the corpus reason-tag set represented in the K recommendations presented to each shareholder, as a solution to the NP-hard Subsuming Justification Problem. Reported experiments characterise how creativity rate (pown), recommendation size (K), argumentation density (plinks), and population size (N) affect coverage and corpus diversity. In an authenticated electorate where Sybil attacks are impossible and only the relation graph is gameable, we stress-test the scoring with coordinated strategic voting attacks: a tag-flood attack collapses coverage, while author-count relation weighting through a reversed-PageRank rule resists the flood markedly better than uniform weights.

URL PDF HTML ☆

赞 0 踩 0

2606.11663 2026-06-11 cs.SI cs.LG 新提交

Probabilistic Salary Prediction with Graph Attention Networks and a Mixture Density Network

基于图注意力网络和混合密度网络的概率薪资预测

Zhipei Qin, Mohammad Shokri, N. van Weeren, F.W. Takes

AI总结提出GAT-MDN框架，通过构建属性关系图并使用图注意力网络学习节点表示，结合混合密度网络输出薪资分布，在百万级荷兰招聘数据集上优于基线模型。

详情

Comments: 5 pages, 3 figures

AI中文摘要

准确的薪资预测对于弥合现代劳动力市场中雇主与求职者之间的信息差距至关重要。现有方法主要产生单点估计，并将工作属性（如地点、职业和行业）视为独立的分类特征，忽略了真实世界薪酬数据固有的不确定性和多模态性，以及支配薪资规范的丰富层次结构和语义相似性关系。在本文中，我们提出了GAT-MDN，一个同时解决这两个限制的统一框架。对于三个属性域中的每一个，我们构建了一个特定领域的图，其边编码了（i）层次化的父子包含关系和（ii）从预训练的Sentence-Transformer导出的加权相似性链接。具有边缘特征感知注意力的并行图注意力网络（GAT）从这些多关系图中学习丰富的、上下文感知的节点表示。然后，一个基于优先级的层次选择模块组装一个复合特征向量，优雅地处理缺失或粗略的属性，而混合密度网络（MDN）头将该向量映射到高斯混合模型（GMM）的参数，产生完整的条件薪资分布。在超过100万条记录的真实世界荷兰招聘数据集上的大量实验表明，GAT-MDN在负对数似然（NLL）和均方误差（MSE）方面均显著优于非图MLP-MDN基线。

英文摘要

Accurate salary prediction is critical for bridging the information gap between employers and job seekers in modern labor markets. Existing approaches predominantly yield a single point estimate and treat job attributes such as location, occupation, and industry as independent categorical features, ignoring both the inherent uncertainty and multi-modality of real-world compensation data and the rich hierarchical and semantic-similarity relationships that govern pay norms. In this paper we propose GAT-MDN, a unified framework that addresses both limitations simultaneously. For each of the three attribute domains we construct a domain-specific graph whose edges encode (i) hierarchical parent-child containment and (ii) weighted similarity links derived from a pre-trained Sentence-Transformer. Parallel Graph Attention Networks (GATs) with edge-feature-aware attention learn rich, context-sensitive node representations from these multi-relational graphs. A priority-based hierarchical selection module then assembles a composite feature vector that gracefully handles missing or coarse attributes, and a Mixture Density Network (MDN) head maps this vector to the parameters of a Gaussian Mixture Model (GMM), yielding a full conditional salary distribution. Extensive experiments on a real-world Dutch job-posting dataset of over 1 million records demonstrate that GAT-MDN significantly outperforms a non-graph MLP-MDN baseline in both Negative Log-Likelihood (NLL) and Mean Squared Error (MSE).

URL PDF HTML ☆

赞 0 踩 0

2606.11654 2026-06-11 cs.IR cs.CL cs.HC cs.SI 新提交

The Long Tail, Not the Front Page: Cold-Start Prediction of Crowd Highlight Salience

长尾而非首页：众包高亮显著性的冷启动预测

Kazuki Nakayashiki, Keisuke Watanabe

AI总结本文研究在无读者标记时，如何从文本预测文档的众包高亮显著性，提出基于句子嵌入和位置/上下文特征的对数排序模型，在平均精度上比位置基线提升0.044，并证明该优势源于真实读者标记的学习。

详情

Comments: 10 pages, 3 figures, 4 tables

AI中文摘要

社交高亮工具最有用的信号——一群读者标记的段落——仅存在于人们已经阅读过的文档中。能否在标记积累之前，从文本预测文档的聚合众包显著性？先前关于此数据的研究发现，零样本语言模型恢复高亮位置的效果不如简单的基线（位置），因此我们询问，在高亮语料上训练的模型能否击败该基线。使用预注册的模型阶梯和按文档的聚类自助法，我们发现一个微小但稳健的优势：基于句子嵌入和位置/上下文特征的对数排序器比位置基线平均精度高出+0.044（95%置信区间[+0.029, +0.058]；在97%的重采样中超过预注册的边界delta=0.03，且在流水线重复运行中稳定）。两种无监督抽取式基线（质心、LexRank风格中心性）均输给位置基线，而训练模型比它们高出+0.108，因此该优势并非由通用无监督代理恢复——它反映了从真实读者标记中学习。在产品术语中，precision@3从0.25上升到0.39（相对提升55%），模型在69%的文档上击败位置基线。消融实验将优势归因于原始嵌入（+0.014）和训练增强（+0.010），每个都有正的置信区间。该优势并非时间泛化失败，我们也没有发现内容漂移或近似重复泄露可以解释它的证据。标准化回归显示，优势主要由文档流行度（流行度越低，优势越大）和标签可靠性决定。它仅在流行度最高的内容上几乎消失；在那里，是位置基线变强，而非模型变弱。由于我们的评估条件设定在最终积累了读者的文档上，这些结果是回顾性的冷启动模拟。

英文摘要

A social highlighter's most useful signal -- which passages a crowd of readers marks -- exists only for documents people have already read. Can the aggregate crowd salience of a document be predicted from its text before its marks accumulate? Prior work on this data found that zero-shot language models recover highlight locations worse than a trivial lead (position) baseline, so we ask whether a model trained on the highlight corpus can beat that baseline. Using a pre-registered ladder of models and a by-document cluster bootstrap, we find a small but robust edge: a logistic ranker over sentence embeddings and positional/contextual features beats the lead baseline by +0.044 average precision (95% CI [+0.029, +0.058]; clears a pre-registered margin delta=0.03 in 97% of resamples, and stable across pipeline re-runs). Two unsupervised extractive baselines (centroid, LexRank-style centrality) lose to lead, and the trained model beats them by +0.108, so the edge is not recovered by generic unsupervised proxies -- it reflects learning from real reader marks. In product terms, precision@3 rises from 0.25 to 0.39 (+55% relative) and the model beats lead on 69% of documents. An ablation attributes the edge to the raw embedding (+0.014) and training augmentation (+0.010), each with a positive CI. The edge is not a temporal-generalization failure, and we find no evidence that content drift or near-duplicate leakage explains it. A standardized regression shows the advantage is governed mainly by document popularity (lower popularity, larger edge) and by label reliability. It nearly vanishes only on the most popular content; there it is the lead baseline that strengthens, not the model that weakens. Because our evaluation conditions on documents that eventually accumulated readers, these results are a retrospective cold-start simulation.

URL PDF HTML ☆

赞 0 踩 0

2606.11613 2026-06-11 cs.IR cs.CL cs.HC cs.SI 新提交

Factions Within, Uncertain Across: Within-Document Reader Sub-Groups in Social Highlighting

内部派系，跨文档不确定：社交高亮中的文档内读者子群体

Kazuki Nakayashiki, Keisuke Watanabe

AI总结通过保留边界的曲线球零模型，发现文档内读者形成强子群体，其一致性远超共享显著性预测，且大部分源于细粒度读者特定共识；跨文档稳定性未解决。

详情

Comments: 11 pages, 3 figures, 3 tables

AI中文摘要

当许多人高亮同一文档时，人群是单一共识，还是内部结构化为标记不同内容的读者子群体？这种结构是读者的稳定属性还是文档的属性？基于先前工作表明个体文档内高亮信号是低语而个体性存在于选择中，我们在一个共读平台上使用保留边界的曲线球零模型提出群体层面问题。实验1：在文档内，读者形成强子群体——配对一致性远超共享显著性、标记密度和句子流行度所预测的（最近邻一致性z=+6.3，在88%的文档中显著）。在八块区域保留零模型下，与文档相同粗略区域的共享参与解释了约40%的额外一致性；大部分以更细粒度的读者特定一致性存在（z=+3.6，77%显著）。因此，文档内人群在描述意义上是派系化的。实验2：这种分组是稳定的读者特质吗？这里我们诚实地面对统计功效。配对一致性的跨文档分半可重复性在合并后接近零（两个独立抽取样本中分别为+0.078和0.000），功效校准表明该检验仅对共读许多文档的配对有信息。在唯一有信息的高重叠子集（k>=4）中，点估计为正但小样本，在独立抽取样本间不精确，从未显著，并在区域保留零模型下衰减。因此，我们未解决跨文档稳定性：数据与从情境分组到弱至中等稳定读者特质的一切一致。人群在文档内是派系化的；这些派系是否随读者跨文档迁移，诚实地讲，超出了我们的能力范围。

英文摘要

When many people highlight the same document, is the crowd a single consensus, or is it internally structured into reader sub-groups that mark different things -- and is that structure a stable property of a reader or of the document? Building on prior work showing an individual's within-document highlighting signal is a whisper while individuality lives in selection, we ask the group-level question on a co-readership platform using a margin-preserving curveball null. Experiment 1: within a document, readers form strong sub-groups -- pairs agree far beyond what shared salience, mark density, and sentence popularity predict (nearest-neighbour agreement z=+6.3, significant in 88% of documents). Under an eight-block region-preserving null, shared engagement with the same coarse regions of the document accounts for about 40% of this excess; the majority survives as finer reader-specific agreement (z=+3.6, 77% significant). So the within-document crowd is, in a descriptive sense, factional. Experiment 2: is that grouping a stable reader trait? Here we are honest about power. The cross-document split-half reproducibility of a pair's agreement is near zero pooled (+0.078 and 0.000 in two separately drawn samples), and a power calibration shows the test is informative only for pairs that co-read many documents. In the only informative high-overlap subset (k>=4), point estimates are positive but small-sample, imprecise across the separately drawn samples, never significant, and attenuate under the region-preserving null. We therefore leave cross-document stability unresolved: the data is consistent with anything from situational grouping to a weak-to-moderate stable reader trait. The crowd is factional within a document; whether its factions follow the reader across documents is, honestly, beyond our reach.

URL PDF HTML ☆

赞 0 踩 0

2606.11582 2026-06-11 cs.DB cs.SI 新提交

Querying Cohesive Subgraph regarding Span-Constrained Triangles on Temporal Graphs with Dynamic Index Maintenance

关于时间图上跨度约束三角形的凝聚子图查询与动态索引维护

Chuhan Hu, Ming Zhong, Lei Li

AI总结提出时间图上的(k,δ)-truss概念，要求三角形在短时间窗口内存在，并设计基于索引的方法实现高效查询与动态维护，压缩比达10^{-4}，查询速度提升2~4个数量级。

详情

AI中文摘要

时间图研究的最新进展重新定义了传统的静态图概念，如三角形、模体和$k$-核。受此启发，我们为时间图引入了一种新颖的$(k,\delta)$-truss，要求三角形在足够短的时间窗口内存在。$(k,\delta)$-truss确保了静态和时间上的内聚性，而原始的$k$-truss是$\delta = \infty$时的特例。为了处理$(k,\delta)$-truss查询，我们提出了无索引和基于索引的方法。利用$(k,\delta)$-truss的双重包含关系，我们的索引将所有的$(k,\delta)$-truss无损压缩成映射或树结构，显著减少了空间，同时实现了最优时间检索。为了扩展到大规模时间图，我们分别基于truss分解和truss维护开发了两种索引构建算法，大大减少了冗余计算。此外，我们提出了所提索引的动态维护技术。实验结果表明，基于索引的方法以交互时间处理查询，比无索引方法快2~4个数量级，同时索引实现了高达$10^{-4}$的压缩比，并且可以在不从头重建的情况下高效更新。

英文摘要

Recent advances in temporal graph research have redefined traditional static graph concepts such as triangles, motifs, and $k$-cores. Inspired by this, we introduce a novel $(k,\delta)$-truss for temporal graphs, requiring triangles to exist within sufficiently short time windows. The $(k,\delta)$-truss ensures both static and temporal cohesion, while the original $k$-truss is a special case when $\delta = \infty$. To address $(k,\delta)$-truss queries, we propose index-free and index-based approaches. Utilizing the dual containment relation of $(k,\delta)$-trusses, our indexes losslessly compress all $(k,\delta)$-trusses into map or tree structures, significantly reducing space while enabling optimal-time retrieval. To scale to large temporal graphs, we develop two index construction algorithms based on truss decomposition and truss maintenance, respectively, which substantially reduce redundant computations. Moreover, we present techniques for the dynamic maintenance of the proposed indexes. The experimental results demonstrate that index-based approaches process queries in interactive time and outperform the index-free approach by 2$\sim$4 orders of magnitude, while the indexes achieve compression ratios of up to $10^{-4}$ and can be updated efficiently without rebuilding from scratch.

URL PDF HTML ☆

赞 0 踩 0

2606.11482 2026-06-11 cs.SI cs.CL 新提交

Building Social World Models with Large Language Models

用大型语言模型构建社会世界模型

Haofei Yu, Yining Zhao, Guanyu Lin, Jiaxuan You

AI总结提出社会世界模型（SWM）框架，利用LLM从社会数据中挖掘时间模式，学习社会信念的状态转移函数，无需人工标注或普查数据，在预测市场基准上超越时序基础模型。

详情

Comments: 9 pages. ICML 2026

AI中文摘要

理解和预测社会信念如何因事件（从政策变化到科学突破）而演变仍然是社会科学中的一个基本挑战。鉴于LLM的常识知识和社会智能，我们提出：LLM能否模拟社会事件后社会信念的动态？在这项工作中，我们引入了社会世界模型（SWM）的概念，这是一个通用框架，旨在捕捉社会信念如何因重大事件而演变。SWM通过挖掘社会数据中的时间模式并优化证据下界来学习社会信念的状态转移函数，无需将事件与信念转变联系起来的人工标注，也无需昂贵的普查数据。为了评估SWM，我们引入了一个基准SWM-bench，该基准源自真实世界的预测市场，特别是Kalshi和Polymarket。SWM-bench包含超过12k个数据点，用于跨政治、金融和加密货币等不同领域的社会信念预测任务。我们的实验结果表明，SWM显著优于时序基础模型，在Kalshi数据上取得了最先进的结果，并在Polymarket数据上展示了竞争性能，同时为社会信念动态的潜在机制提供了可解释的见解。

英文摘要

Understanding and predicting how social beliefs evolve in response to events -- from policy changes to scientific breakthroughs -- remains a fundamental challenge in social science. Given LLMs' commonsense knowledge and social intelligence, we ask: Can LLMs model the dynamics of social beliefs following social events? In this work, we introduce the concept of the Social World Model (SWM), a general framework designed to capture how social beliefs evolve in response to major events. SWM learns state-transition functions for social beliefs by mining temporal patterns in social data and optimizing the evidence lower bound, without the need for explicit human annotations linking events to belief shifts, or for expensive census data. To evaluate SWM, we introduce a benchmark, SWM-bench, derived from real-world prediction markets, specifically Kalshi and Polymarket. SWM-bench includes over 12k data points for social belief prediction tasks spanning diverse domains such as politics, finance, and cryptocurrency. Our experimental results show that SWM significantly outperforms time-series foundation models, achieving state-of-the-art results on Kalshi data and demonstrating competitive performance on Polymarket data, while offering interpretable insights into the underlying mechanisms of social belief dynamics.

URL PDF HTML ☆

赞 0 踩 0

2606.11420 2026-06-11 cs.CL cs.SI 新提交

Context-Aware Multimodal Claim Verification in Spoken Dialogues

口语对话中的上下文感知多模态声明验证

Chaewan Chun, Delvin Ce Zhang, Dongwon Lee

发表机构 * The Pennsylvania State University（宾夕法尼亚州立大学）； University of Sheffield（谢菲尔德大学）

AI总结提出MAD2基准和上下文感知多模态融合方法，验证对话音频中的声明，发现对话结构比虚假信息框架对验证更重要。

详情

AI中文摘要

每天，数百万人从播客和流媒体中吸收声明，而这些声明从未被事实核查员看到。口语错误信息是通过对话构建的，其中可信度不仅来自事实本身，还来自声明如何在对话轮次中被构建、强化或未被质疑。然而，事实核查一直专注于孤立的文本，对话音频研究不足。我们引入了MAD2，一个新的用于口语声明验证的多轮音频对话基准，包含1,000个双说话者对话，3,368个值得核查的声明和约10小时的音频，并提出了上下文感知音频编码器和对话感知文本模型的校准多模态融合。在各种设置下，添加对话上下文改善了验证，但收益取决于场景类型。仅使用前文上下文通常与离线性能相当，支持实时审核设置，而当基于转录的模型被额外上下文 destabilized 时，音频贡献最大。总体而言，对话结构对验证的影响比错误信息框架更大。

英文摘要

Every day, millions absorb claims from podcasts and streams that no fact-checker ever sees. Spoken misinformation is built through conversation, where credibility comes not from facts alone but from how claims are framed, reinforced, or left unchallenged across turns. Yet fact-checking has focused on isolated text, leaving dialogue audio under-studied. We introduce MAD2, a new Multi-turn Audio Dialogues benchmark for spoken claim verification, containing 1,000 two-speaker dialogues with 3,368 check-worthy claims and approximately 10 hours of audio, and propose calibrated multimodal fusion of a context-aware audio encoder and a dialogue-aware text model. Across settings, adding dialogue context improves verification, but the gains depend on scenario type. Using only preceding context often matches offline performance, supporting live-moderation settings, and audio contributes most when transcript-based models are destabilized by additional context. Overall, conversational structure matters more for verification than misinformation framing.

URL PDF HTML ☆

赞 0 踩 0

2606.11259 2026-06-11 nlin.AO cond-mat.stat-mech cs.SI math.DS q-bio.PE 新提交

Stabilizing Role of Uninformed Participants in Collective Decision Making

无信息参与者在集体决策中的稳定作用

Leonardo Colombo, Marıa Emma Eyrea Irazu, Laura P. Schaposnik, James Unwin

AI总结通过耗散哈密顿量建模，发现无信息参与者通过方向无关的耗散延迟极化转变，稳定集体决策。

详情

Comments: 23 pages, 6 images

AI中文摘要

对于没有严格等级制度的群体，集体决策通常通过妥协产生。我们使用耗散哈密顿量公式开发了一个集体决策的二阶网络模型，其中知情代理引入偏好方向，而无信息参与者仅贡献方向无关的耗散。我们表明，在低冲突下，该模型允许一个局部唯一、指数稳定的妥协状态。使用结构化模块网络，我们进一步表明，随着冲突增加，局部妥协分支通过鞍节点折叠终止，而不是通过平滑的平均场对称破缺转变。模块化极化状态在局部与妥协分支分离的分支上持续存在。方向无关的耗散不会改变静态结构阈值，但会延迟从鞍节点幽灵的逃逸，并将极化的可观察起始点推向更大的冲突。我们的工作确定了一种耗散介导的机制，与基于连通性的解释互补，通过该机制，无信息参与者稳定了生物和工程群体中的集体行为。

英文摘要

For groups without strict hierarchy, collective decisions often emerge through compromise. We develop a second-order network model of collective decision-making using a dissipative Hamiltonian formulation, in which informed agents introduce preferred directions while uninformed participants contribute only direction-free dissipation. We show that under low conflict, the model admits a locally unique, exponentially stable compromise state. Using a structured modular network we further show that as conflict increases the local compromise branch terminates through a saddle-node fold rather than through a smooth mean-field symmetry-breaking transition. Modular polarized states persist on branches that are locally separated from the compromise branch. Direction-free dissipation does not shift the static structural threshold, but it delays escape from the saddle-node ghost and pushes the observable onset of polarization to larger conflicts. Our work identifies a dissipation-mediated mechanism, complementary to connectivity-based accounts, through which uninformed participants stabilize collective behavior in biological and engineered swarms.

URL PDF HTML ☆

赞 0 踩 0

2606.11216 2026-06-11 cs.CY cs.SI 新提交

Great Disappearance Acts Generative Search and Shadow Banning

消失的行为：生成式搜索与影子封禁

Danny Friedmann

AI总结本文研究生成式搜索和影子封禁对互联网开放生态的破坏，分析其法律与监管问题，并提出增强透明度与公平性的解决方案。

详情

AI中文摘要

互联网曾被誉为去中心化的公共领域，但如今日益受到生成式搜索和影子封禁等做法的破坏，这些做法转移流量并压制可见性。由检索增强生成（RAG）驱动的生成式搜索将内容综合成直接答案，绕过网站并剥夺其流量和收入，威胁独立内容创作者、小企业和开放网络生态的可持续性。影子封禁通过算法审核故意降低社交媒体帖子的可见性，通过压制言论自由、限制透明度和问责制加剧了这些问题。本文从法律和监管角度探讨这些不透明做法。第一部分考察生成式搜索的兴起，分析其技术和法律影响，包括版权侵权、不正当竞争和不当得利，并评估许可协议和代理型AI等潜在解决方案。第二部分聚焦影子封禁：算法劝阻、降级和流量减少，特别关注中国的《算法推荐管理规定》（RAR）和欧盟的《人工智能法案》（AIA）。这两个框架都提供了部分解决方案，但在确保公平、透明和救济机制方面仍显不足。最终，主导平台向集中控制的转变优先考虑利润和风险管理，而非在线表达中的创新、公平和多样性。为应对这些趋势，监管干预、算法透明度和公平框架至关重要。若无此类措施，互联网将面临失去其作为自由表达和创新的民主化公共领域特征的风险。

英文摘要

The internet, once celebrated as a decentralized public sphere, is increasingly undermined by practices such as generative search and shadow banning, which divert traffic and suppress visibility. Generative search, powered by Retrieval Augmented Generation RAG, synthesizes content into direct answers, bypassing websites and depriving them of traffic and revenue. This threatens the sustainability of independent content creators, small enterprises, and the open web ecosystem. Shadow banning, a practice that intentionally reduces the visibility of social media posts through algorithmic moderation, exacerbates these issues by chilling free expression and limiting transparency and accountability. This article explores these opaque practices through a legal and regulatory lens. The first part examines the rise of generative search, analyzing its technological and legal implications, including copyright infringement, unfair competition, and unjust enrichment. It also evaluates potential solutions such as licensing agreements and agentic AI. The second part focuses on shadow banning: algorithmic dissuasion, de-ranking, and the reduction of traffic, with specific attention to Chinas Regulation on Algorithmic Recommendations RAR and the EUs Artificial Intelligence Act AIA. Both frameworks offer partial solutions but fall short of ensuring fairness, transparency, and redress mechanisms. Ultimately, the shift toward centralized control by dominant platforms prioritizes profit and risk management over innovation, fairness, and diversity in online expression. To counteract these trends, regulatory interventions, algorithmic transparency, and equitable frameworks are essential. Without such measures, the internet risks losing its character as a democratized public sphere for free expression and innovation.

URL PDF HTML ☆

赞 0 踩 0

2605.22346 2026-06-11 stat.ML cs.LG cs.SI 版本更新

The ASE-LSE Disagreement Landscape: An End-to-End Characterisation of Extremes and Structural Drivers

偏离正则性：度异质性和特征间隙作为ASE-LSE潜在子空间分歧的结构驱动因素

Minh Triet Pham, Ian Gallagher

AI总结本文研究了图数据分析中邻接谱嵌入和拉普拉斯谱嵌入方法在相同网络上产生不同结果的结构原因，揭示了度异质性和社区结构强度对潜在子空间分歧的影响。

详情

Comments: This paper is being withdrawn as it was submitted without the consent of all listed authors, and contains work that is currently under academic assessment. It will be resubmitted at an appropriate time once evaluation is complete

AI中文摘要

图数据分析中，邻接谱嵌入和拉普拉斯谱嵌入两种最常用方法在相同网络上常产生不同结果。本文提供了结构上的解释。我们证明正则性是完美一致的充分条件：当每个节点具有相同数量的连接时，两种方法产生相同的潜在子空间。任何偏离正则性都会引入分歧，我们证明了一个显式的界限，其两个术语表明控制分歧的结构因素：度异质性推动方法分离，社区结构强度则拉近它们。我们通过成千上万个模拟网络验证了这两种驱动因素，确认异质性推动分歧增加，社区强度抑制它，其比值提供了两种嵌入可以互换或不可互换的强预测。

英文摘要

Two of the most widely used methods for analysing graph data, Adjacency Spectral Embedding and Laplacian Spectral Embedding, often produce different results when applied to the same graph. Yet the structural reasons behind this disagreement remain incompletely understood. This paper provides an end-to-end account of ASE-LSE latent subspace disagreement. We first prove that the two methods produce identical latent subspaces for every embedding dimension whenever the Laplacian is a scalar multiple of the adjacency matrix, and show that this scalar relationship holds if and only if the graph is either regular or bipartite biregular. This anchor result identifies a sufficient condition for perfect agreement that pins down the floor of the disagreement spectrum and supplies the baseline for the perturbation analysis. We then prove that no maximal-disagreement graph or family of graphs exists: the disagreement is always strictly below its theoretical ceiling, and we exhibit a witness family demonstrating that no finite maximum is attainable, so the disagreement landscape has no maximiser. With both endpoints established, we derive a Regularity Departure Bound whose two terms isolate degree heterogeneity and eigengap as the primary structural factors influencing disagreement in the middle regime. Empirical validation across thousands of simulated graphs confirms the mechanisms predicted by the bound: heterogeneity pushes disagreement up, eigengap suppresses it, and their joint ratio emerges as a unified predictor of ASE-LSE disagreement, suggesting when the two embeddings can be treated as interchangeable and when they cannot.

URL PDF HTML ☆

赞 0 踩 0

2601.11128 2026-06-11 cs.SI cs.HC cs.IR 版本更新

The Big Ban Theory: A Pre- and Post-Intervention Dataset of Online Content Moderation Actions

大封禁理论：在线内容审核行为的前后干预数据集

Aldo Cerulli, Lorenzo Cima, Benedetta Tessa, Serena Tardelli, Stefano Cresci

AI总结针对在线平台审核干预研究缺乏综合数据集的问题，构建了包含Reddit和Voat上25种干预措施、超33.9万用户和近3900万条消息的数据集，提供标准化元数据和匿名化用户活动数据，支持干预效果的可比分析。

详情

Comments: Article published in ICWSM'26 - 20th AAAI Conference on Web and Social Media. Please, cite the published version

AI中文摘要

在线平台依赖审核干预来遏制仇恨言论、毒性以及错误和虚假信息的传播等有害行为。然而，关于此类干预的效果和潜在偏见的研究面临多重限制。例如，由于缺乏全面的数据集，现有工作通常只关注单一或少数干预措施。因此，研究人员通常需要为每项新研究收集必要的数据，这限制了系统比较的机会。为了克服这些挑战，我们引入了大封禁理论（TBBT）——一个大型的审核干预数据集。TBBT涵盖了25种不同类型、严重程度和范围的干预措施，总计包括Reddit和Voat上的超过33.9万用户和近3900万条发布的消息。对于每次干预，我们提供标准化的元数据和在干预实施前后三个月收集的匿名化用户活动数据，从而能够对干预效果进行一致且可比较的分析。此外，我们提供了数据集的描述性探索性分析，以及几个用例说明它如何支持内容审核研究。通过这个数据集，我们旨在支持研究审核干预效果的研究人员，并促进更系统、可重复和可比较的研究。

英文摘要

Online platforms rely on moderation interventions to curb harmful behavior such as hate speech, toxicity, and the spread of mis- and disinformation. Yet research on the effects and possible biases of such interventions faces multiple limitations. For example, existing works frequently focus on single or a few interventions, due to the absence of comprehensive datasets. As a result, researchers must typically collect the necessary data for each new study, which limits opportunities for systematic comparisons. To overcome these challenges, we introduce The Big Ban Theory (TBBT) -- a large dataset of moderation interventions. TBBT covers 25 interventions of varying type, severity, and scope, comprising in total over 339K users and nearly 39M posted messages on Reddit and Voat. For each intervention, we provide standardized metadata and pseudonymized user activity collected three months before and after its enforcement, enabling consistent and comparable analyses of intervention effects. In addition, we provide a descriptive exploratory analysis of the dataset, along with several use cases of how it can support research on content moderation. With this dataset, we aim to support researchers studying the effects of moderation interventions and to promote more systematic, reproducible, and comparable research.

URL PDF HTML ☆

赞 0 踩 0

2502.05255 2026-06-11 cs.SI cs.CY physics.soc-ph 版本更新

Incivility in Public Health Policy Discussions Spills Over to Public Engagement with Climate Issues

公共卫生政策讨论中的不文明行为溢出至公众对气候问题的参与

Hasti Narimanzadeh, Arash Badie-Modiri, Iuliia Smirnova, Ted Hsuan Yun Chen

AI总结本研究利用COVID-19时期作为案例，通过分析Twitter和Reddit上的数据，发现围绕COVID-19的情感极化显著溢出到气候变化讨论中，表现为不文明行为增加，且这种溢出沿袭了疫情前的政治分歧。

详情

Comments: 33 pages, 5 figures

AI中文摘要

情感极化和政治分类加剧了公众在科学-政策交汇点上对气候变化及其他问题的对抗。我们以COVID-19时期为案例，研究了Twitter和Reddit上公众参与气候变化和公共卫生时的不文明行为的跨领域溢出。我们发现强烈证据表明，围绕COVID-19的情感极化特征溢出到了气候变化领域。在不同的社交媒体系统中，COVID-19内容与气候讨论中的不文明行为相关。这些对抗加剧的模式对使科学与公共政策联系更突出的大流行事件反应敏感。观察到的溢出沿袭了疫情前的政治分歧，特别是反国际主义的民粹主义信念，这些信念将气候政策反对与疫苗犹豫联系起来。我们的发现显示了公众参与科学时的情感极化如何在政策领域间变得根深蒂固，这对公众如何参与和沟通气候变化及公共卫生等问题具有影响。

英文摘要

Affective polarization and political sorting drive public antagonism around climate change and other issues at the science-policy nexus. We study cross-domain spillover of incivility in public engagements with climate change and public health on Twitter and Reddit using the COVID-19 period as a case study. We find strong evidence of the signatures of affective polarization surrounding COVID-19 spilling into the climate change domain. Across different social media systems, COVID-19 content is associated with incivility in climate discussions. These patterns of increased antagonism were responsive to pandemic events that made the link between science and public policy more salient. The observed spillover activated along pre-pandemic political cleavages, specifically anti-internationalist populist beliefs, that linked climate policy opposition to vaccine hesitancy. Our findings show how affective polarization in public engagement with science becomes entrenched across policy domains, which has implications for how the public engages with and communicates about issues such as climate change and public health.

URL PDF HTML ☆

赞 0 踩 0

2504.06967 2026-06-11 math.OC cs.SI 版本更新

Optimal promotions of new products on networks

新产品在网络上的最优推广

Gadi Fibich, Amit Golan

AI总结提出一种分析Bass模型中新产品在网络上的最优推广的新方法，发现最优广告策略随时间递减，而最优增强同伴效应先增后减，且网络度高低影响广告与同伴效应的优先级。

详情

Comments: 47 pages, 14 figures

AI中文摘要

我们提出了一种新颖的方法，用于分析新产品在网络传播的Bass模型中的最优推广。对于具有$M$个节点的通用网络，最优推广是$2^M-1$个非线性耦合边值问题的解。然而，在结构化网络上，方程的数量可以缩减到易于模拟和分析的规模。这使我们能够深入了解网络结构对最优推广的影响。我们发现，最优广告策略随时间递减，而最优增强同伴效应从零开始增加然后递减。在低度网络中，优先考虑广告而非增强同伴效应是最优的，但在高度网络中这种关系颠倒。当规划期有限时，最优推广持续到最后一刻，与无限规划期下最优推广衰减到零的情况相反。最后，与长规划期相比，短规划期的推广可以带来数量级更高的利润增长。

英文摘要

We present a novel methodology for analyzing the optimal promotion in the Bass model for the spreading of new products on networks. For general networks with $M$ nodes, the optimal promotion is the solution of $2^M-1$ nonlinearly-coupled boundary-value problems. On structured networks, however, the number of equations can be reduced to a manageable size which is amendable to simulations and analysis. This enables us to gain insight into the effect of the network structure on optimal promotions. We find that the optimal advertising strategy decreases with time, whereas the optimal boosting of peer effects increases from zero and then decreases. In low-degree networks, it is optimal to prioritize advertising over boosting peer effects, but this relation is flipped in high-degree networks. When the planning horizon is finite, the optimal promotion continues until the last minute, as opposed to an infinite planning horizon where the optimal promotion decays to zero. Finally, promotions with short planning horizons can yield an order of magnitude higher increase of profits, compared to those with long planning horizons.

URL PDF HTML ☆

赞 0 踩 0