arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.19843 2026-03-23 cs.CY cs.CL cs.HC

Overreliance on AI in Information-seeking from Video Content

Anders Giovanni Møller, Elisa Bassignana, Francesco Pierri, Luca Maria Aiello

详情

英文摘要

The ubiquity of multimedia content is reshaping online information spaces, particularly in social media environments. At the same time, search is being rapidly transformed by generative AI, with large language models (LLMs) routinely deployed as intermediaries between users and multimedia content to retrieve and summarize information. Despite their growing influence, the impact of LLM inaccuracies and potential vulnerabilities on multimedia information-seeking tasks remains largely unexplored. We investigate how generative AI affects accuracy, efficiency, and confidence in information retrieval from videos. We conduct an experiment with around 900 participants on 8,000+ video-based information-seeking tasks, comparing behavior across three conditions: (1) access to videos only, (2) access to videos with LLM-based AI assistance, and (3) access to videos with a deceiving AI assistant designed to provide false answers. We find that AI assistance increases accuracy by 3-7% when participants viewed the relevant video segment, and by 27-35% when they did not. Efficiency increases by 10% for short videos and 25% for longer ones. However, participants tend to over-rely on AI outputs, resulting in accuracy drops of up to 32% when interacting with the deceiving AI. Alarmingly, self-reported confidence in answers remains stable across all three conditions. Our findings expose fundamental safety risks in AI-mediated video information retrieval.

URL PDF HTML ☆

赞 0 踩 0

2603.19841 2026-03-23 physics.flu-dyn cs.LG

Modeling subgrid scale production rates on complex meshes using graph neural networks

Priyabrat Dash, Mathis Bode, Konduri Aditya

2603.19840 2026-03-23 stat.ML cs.LG

Explainable cluster analysis: a bagging approach

Federico Maria Quetti, Elena Ballante, Silvia Figini, Paolo Giudici

2603.19831 2026-03-23 eess.AS cs.AI cs.MM

Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech?

Lokesh Kumar, Nirmesh Shah, Ashishkumar P. Gudmalwar, Pankaj Wasnik

Comments Accepted at The 2nd International Workshop on Bodily Expressed Emotion Understanding (BEEU) at AAAI 2026 [non-archival]

2603.19736 2026-03-23 stat.ML cs.LG

A two-step sequential approach for hyperparameter selection in finite context models

José Contente, Ana Martins, Armando J. Pinho, Sónia Gouveia

2603.19710 2026-03-23 cs.IR cs.AI

AIGQ: An End-to-End Hybrid Generative Architecture for E-commerce Query Recommendation

Jingcao Xu, Jianyun Zou, Renkai Yang, Zili Geng, Qiang Liu, Haihong Tang

2603.19703 2026-03-23 math.ST cs.LG stat.TH

Minimax and Adaptive Covariance Matrix Estimation under Differential Privacy

T. Tony Cai, Yicheng Li

2603.19687 2026-03-23 cs.LO cs.LG

Diminishing Returns in Expanding Generative Models and Godel-Tarski-Lob Limits

Angshul Majumdar

详情

英文摘要

Modern generative modelling systems are increasingly improved by expanding model capacity, training data, and computational resources. While empirical studies have documented such scaling behaviour across architectures including generative adversarial networks, variational autoencoders, transformer-based models, and diffusion models, the theoretical limits of capability growth in expanding generative systems remain poorly understood. In this paper we develop a general task-space framework for analysing expanding generative reasoning systems. Each system induces a subset of a global task space representing the tasks it can successfully solve, and system capability is measured by the probability mass of this solved-task set under a fixed task distribution. Within this framework we prove a structural result showing that, under mild assumptions, the marginal improvement in solved tasks must converge to zero as system capacity increases. Thus expanding generative systems may continue to gain capability, but the probability mass of newly solvable tasks necessarily diminishes asymptotically. We further provide a prediction-theoretic refinement based on complexity-weighted hypothesis classes inspired by algorithmic probability, yielding quantitative bounds on marginal improvement in prediction settings. Finally, we examine logical reasoning tasks and show that classical results from mathematical logic -- including Rosser incompleteness, Tarski's undefinability theorem, and Löb's theorem -- imply the persistence of unresolved logical tasks within sufficiently expressive reasoning systems. Together these results provide a mathematical perspective on the asymptotic behaviour of expanding generative systems, showing that long-run capability growth is constrained both by diminishing marginal improvements in task coverage and by fundamental logical limitations on internal reasoning.

URL PDF HTML ☆

赞 0 踩 0

2603.19657 2026-03-23 stat.ML cs.LG

Model Selection and Parameter Estimation of Multi-dimensional Gaussian Mixture Model

Xinyu Liu, Hai Zhang

2603.19649 2026-03-23 cs.SI cs.AI

PolicySim: An LLM-Based Agent Social Simulation Sandbox for Proactive Policy Optimization

Renhong Huang, Ning Tang, Jiarong Xu, Yuxuan Cao, Qingqian Tu, Sheng Guo, Bo Zheng, Huiyuan Liu, Yang Yang

2603.19634 2026-03-23 cs.HC cs.AI cs.CY cs.IR

MetaCues: Enabling Critical Engagement with Generative AI for Information Seeking and Sensemaking

Anjali Singh, Karan Taneja, Zhitong Guan, Soo Young Rieh

2603.19629 2026-03-23 stat.ML cs.LG physics.geo-ph

On the role of memorization in learned priors for geophysical inverse problems

Ali Siahkoohi, Davide Sabeddu

2603.19599 2026-03-23 cs.SI cs.AI

Physics-Informed Neural Network with Adaptive Clustering Learning Mechanism for Information Popularity Prediction

Guangyin Jin, Xiaohan Ni, Yanjie Song, Kun Wei, Jie Zhao, Leiming Jia, Witold Pedrycz

2603.19591 2026-03-23 physics.ao-ph cs.AI

Data-driven ensemble prediction of the global ocean

Qiusheng Huang, Xiaohui Zhong, Anboyu Guo, Ziyi Peng, Lei Chen, Hao Li

2603.19588 2026-03-23 cs.HC cs.CV

HiFiGaze: Improving Eye Tracking Accuracy Using Screen Content Knowledge

Taejun Kim, Vimal Mollyn, Riku Arakawa, Chris Harrison

Comments ACM CHI 2026

2603.19583 2026-03-23 cs.SE cs.AI

Skilled AI Agents for Embedded and IoT Systems Development

Yiming Li, Yuhan Cheng, Mingchen Ma, Yihang Zou, Ningyuan Yang, Wei Cheng, Hai "Helen" Li, Yiran Chen, Tingjun Chen

2603.18389 2026-03-23 physics.chem-ph cs.AI

An SO(3)-equivariant reciprocal-space neural potential for long-range interactions

Lingfeng Zhang, Taoyong Cui, Dongzhan Zhou, Lei Bai, Sufei Zhang, Luca Rossi, Mao Su, Wanli Ouyang, Pheng-Ann Heng

2603.18377 2026-03-23 cs.CR cs.AI cs.ET

PlanTwin: Privacy-Preserving Planning Abstractions for Cloud-Assisted LLM Agents

Guangsheng Yu, Qin Wang, Rui Lang, Shuai Su, Xu Wang

2603.18196 2026-03-23 cs.CR cs.AI

Retrieval-Augmented LLMs for Security Incident Analysis

Xavier Cadet, Aditya Vikram Singh, Harsh Mamania, Edward Koh, Alex Fitts, Dirk Van Bruggen, Simona Boboila, Peter Chin, Alina Oprea

2603.18168 2026-03-23 stat.ML cs.LG math.PR

ResNets of All Shapes and Sizes: Convergence of Training Dynamics in the Large-scale Limit

Louis-Pierre Chaintron, Lénaïc Chizat, Javier Maass

2603.15781 2026-03-23 stat.ML cs.LG

Learnability with Partial Labels and Adaptive Nearest Neighbors

Nicolas A. Errandonea, Santiago Mazuelas, Jose A. Lozano, Sanjoy Dasgupta

2603.15727 2026-03-23 cs.CR cs.AI cs.LG cs.MA cs.SE

ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems

Yihao Zhang, Zeming Wei, Xiaokun Luan, Chengcan Wu, Zhixin Zhang, Jiangrong Wu, Haolin Wu, Huanran Chen, Jun Sun, Meng Sun

2603.14049 2026-03-23 math.OC cs.LG cs.SY eess.SY math.PR

Schrödinger Bridge Over A Compact Connected Lie Group

Hamza Mahmood, Abhishek Halder, Adeel Akhtar

2601.18921 2026-03-23 cs.DB cs.CE cs.LG q-bio.QM

Accelerating Large-Scale Cheminformatics Using a Byte-Offset Indexing Architecture for Terabyte-Scale Data Integration

Malikussaid, Septian Caesar Floresko, Sutiyo

Comments 6 pages, 3 figures, 5 equations, 3 algorithms, 4 tables, to be published in ICoICT 2026, unabridged version exists as arXiv:2512.24643v1

2511.21448 2026-03-23 cs.CR cs.AI cs.DB

The Phish, The Spam, and The Valid: Generating Feature-Rich Emails for Benchmarking LLMs

Rebeka Toth, Tamas Bisztray, Nils Gruschka

2509.24773 2026-03-23 eess.AS cs.AI cs.CL cs.CV cs.SD

VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning

Xin Cheng, Yuyue Wang, Xihua Wang, Yihan Wu, Kaisi Guan, Yijing Chen, Peng Zhang, Xiaojiang Liu, Meng Cao, Ruihua Song

Comments Paper Under Review

2508.10515 2026-03-23 physics.comp-ph cs.CE cs.LG cs.SY eess.SY

Virtual Sensing for Solder Layer Degradation and Temperature Monitoring in IGBT Modules

Andrea Urgolo, Monika Stipsitz, Hèlios Sanchis-Alepuz

Comments Andrea Urgolo and Monika Stipsitz contributed equally to this work

2507.21543 2026-03-23 math.OC cs.LG cs.SY eess.SY

On Policy Stochasticity in Mutual Information Optimal Control of Linear Systems

Shoju Enami, Kenji Kashima

Comments 18 pages. Revised potentially misleading phrasing from v1. The main arguments and discussions remain unchanged

2506.20703 2026-03-23 cs.GR cs.CV

Generative Blocks World: Moving Things Around in Pictures

Vaibhav Vavilala, Seemandhar Jain, Rahul Vasanth, D. A. Forsyth, Anand Bhattad

Comments ICLR 2026 34 pages, 25 figures, 4 tables

2506.15047 2026-03-23 cs.HC cs.AI cs.CY

Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer's and Dementia Caregivers

Jiayue Melissa Shi, Dong Whi Yoo, Keran Wang, Violeta J. Rodriguez, Ravi Karkar, Koustuv Saha