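arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and related fields.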

2603.11806 2026-04-30 math.GR cs.CV

A Diffeomorphism Groupoid and Algebroid Framework for Discontinuous Image Registration

Lili Bao, Bin Xiao, Shihui Ying, Stefan Sommer

Abstract

In this paper, we propose a novel mathematical framework for piecewise diffeomorphic image registration that involves discontinuous sliding motion using a diffeomorphism groupoid and algebroid approach. The traditional Large Deformation Diffeomorphic Metric Mapping (LDDMM) registration method builds on Lie groups, which assume continuity and smoothness in velocity fields, limiting its applicability in handling discontinuous sliding motion. To overcome this limitation, we extend the diffeomorphism Lie groups to a framework of discontinuous diffeomorphism Lie groupoids, allowing for discontinuities along sliding boundaries while maintaining diffeomorphism within homogeneous regions. We provide a rigorous analysis of the associated mathematical structures, including Lie algebroids and their duals, and derive specific Euler-Arnold equations to govern optimal flows for discontinuous deformations. Numerical tests are performed to validate the efficiency of the proposed approach.

2603.10992 2026-04-30 stat.ML cs.LG physics.chem-ph physics.comp-ph

A Tutorial Review of Bayesian Optimization with Gaussian Processes to Accelerate Stationary Point Searches

Rohit Goswami

Comments 66 pages, 24 figures (main). Accepted article for ACS Physical Chemistry Au

2603.07955 2026-04-30 math.GT cs.LG stat.ML

RL unknotter, hard unknots and unknotting number

Anne Dranowski, Yura Kabkov, Daniel Tubbenhauer

Comments 19 pages, many figures, comments welcome

2603.03664 2026-04-30 eess.SY cs.LG cs.MA cs.SY math.OC

Principled Learning-to-Communicate with Quasi-Classical Information Structures

Xiangyu Liu, Haoyi You, Kaiqing Zhang

Comments Preliminary version appeared at IEEE CDC 2025

2602.23390 2026-04-30 cs.SI cs.LG

PACIFIER: Pacing Opinion Depolarization via a Unified Graph Learning Framework

Mingkai Liao

Comments 45 pages, 32 figures. Final version

2602.21480 2026-04-30 cs.DB cs.CL cs.IR

Both Ends Count! Just How Good are LLM Agents at "Text-to-Big SQL"?

Germán T. Eizaguirre, Lars Tissen, Marc Sánchez-Artigas

Comments 14 pages, 8 figures

2602.12924 2026-04-30 cs.HC cs.AI

Never say never: Exploring the effects of available knowledge on agent persuasiveness in controlled physiotherapy motivation dialogues

Stephan Vonschallen, Rahel Häusler, Theresa Schmiedel, Friederike Eyssel

DOI: 10.3389/frai.2026.1810725

Abstract

Generative Social Agents (GSAs) are increasingly impacting human users through persuasive means. On the one hand, they might motivate users to pursue personal goals, such as healthier lifestyles. On the other hand, they are associated with potential risks like manipulation and deception, which are induced by limited control over probabilistic agent outputs. However, as GSAs manifest communicative patterns based on available knowledge, their behavior may be regulated through their access to such knowledge. Following this approach, we explored persuasive ChatGPT-generated messages in the context of human-robot physiotherapy motivation. We did so by comparing ChatGPT-generated responses to predefined inputs from a hypothetical physiotherapy patient. In Study 1, we qualitatively analyzed 13 ChatGPT-generated dialogue scripts with varying knowledge configurations regarding persuasive message characteristics. In Study 2, third-party observers (N = 27) rated a selection of these dialogues in terms of the agent's expressiveness, assertiveness, and persuasiveness. Our findings indicate that LLM-based GSAs can adapt assertive and expressive personality traits - significantly enhancing perceived persuasiveness. Moreover, persuasiveness significantly benefited from the availability of information about the patients' age and past profession, mediated by perceived assertiveness and expressiveness. Contextual knowledge about physiotherapy benefits did not significantly impact persuasiveness, possibly because the LLM had inherent knowledge about such benefits even without explicit prompting. Overall, the study highlights the importance of empirically studying behavioral patterns of GSAs, specifically in terms of what information generative AI systems require for consistent and responsible communication.

2602.03169 2026-04-30 stat.ML cs.LG

NeuralFLoC: Neural Flow-Based Joint Registration and Clustering of Functional Data

Xinyang Xiong, Siyuan Jiang, Pengcheng Zeng

2601.19216 2026-04-30 cs.NI cs.AI cs.CV cs.LG

Bridging Visual and Wireless Sensing via a Unified Radiation Field for 3D Radio Map Construction

Chaozheng Wen, Jingwen Tong, Zehong Lin, Chenghong Bian, Jun Zhang

Comments The code for this work will be publicly available at: https://github.com/wenchaozheng/URF-GS

2601.17617 2026-04-30 cs.IR cs.CL

Agentic Search in the Wild: Intents and Trajectory Dynamics from 14M+ Real Search Requests

Jingjie Ning, João Coelho, Yibo Kong, Yunfan Long, Bruno Martins, João Magalhães, Jamie Callan, Chenyan Xiong

Comments Accepted at SIGIR 2026. DOI: 10.1145/3805712.3809627

DOI: 10.1145/3805712.3809627

Abstract

LLM-powered search agents are increasingly being used for multi-step information seeking tasks, yet the IR community lacks empirical understanding of how agentic search sessions unfold and how retrieved evidence is reflected in later queries. This paper presents a large-scale log analysis of agentic search based on 14.44M search requests (3.97M sessions) collected from DeepResearchGym, i.e., an open-source search API accessed by external agentic clients. We sessionize the logs, assign session-level intents and step-wise query-reformulation labels using LLM-based annotation, and propose Context-driven Term Adoption Rate (CTAR) to quantify whether newly introduced query terms are lexically traceable to previously retrieved evidence. Our analyses reveal distinctive behavioral patterns. First, over 90% of multi-turn sessions contain at most ten steps, and 89% of inter-step intervals fall under one minute. Second, behavior varies by intent. Fact-seeking sessions exhibit high repetition that increases over time, while sessions requiring reasoning sustain broader exploration. Third, query reformulations are often traceable to retrieved evidence across steps. On average, 54% of newly introduced query terms appear in the accumulated evidence context, with additional traceability to earlier steps beyond the most recent retrieval. These findings provide candidate signals for repetition-aware stopping, intent-adaptive retrieval budgeting, and explicit cross-step context tracking. We released the anonymized logs, making them available at a public HuggingFace repository (https://huggingface.co/datasets/cx-cmu/deepresearchgym-agentic-search-logs).
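
The abstract above describes CTAR only in words. As a rough illustration, the following Python sketch computes, for each step of a session, the share of newly introduced query terms that also appear in evidence retrieved at earlier steps. The tokenizer, the function names, and the toy session are assumptions made for this example; the paper's actual definition (term normalization, stop-word handling, context window) may differ.

```python
import re


def tokenize(text):
    """Lowercase a string and return its set of alphanumeric terms (assumed tokenizer)."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def term_adoption_rates(steps):
    """Return one rate per step after the first.

    `steps` is a list of (query, retrieved_text) pairs in session order.
    Each rate is the fraction of terms new to the query history that occur
    in the evidence accumulated from earlier steps. Steps that introduce
    no new terms yield None.
    """
    seen_query_terms = set()
    evidence_terms = set()
    rates = []
    for i, (query, retrieved) in enumerate(steps):
        q_terms = tokenize(query)
        if i > 0:
            new_terms = q_terms - seen_query_terms
            if new_terms:
                rates.append(len(new_terms & evidence_terms) / len(new_terms))
            else:
                rates.append(None)
        # Update the history only after scoring the current step.
        seen_query_terms |= q_terms
        evidence_terms |= tokenize(retrieved)
    return rates


# Hypothetical two-step session, invented for illustration.
session = [
    ("solar panel efficiency", "Perovskite cells reach record efficiency in lab tests."),
    ("perovskite solar cost", "Manufacturing cost of perovskite modules keeps falling."),
]
rates = term_adoption_rates(session)
print(rates)
```

In this toy session the second query introduces two new terms, "perovskite" and "cost", and only "perovskite" appears in the earlier evidence, so the rate for that step is 0.5.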

2512.23726 2026-04-30 physics.med-ph cs.AI cs.CV

q3-MuPa: Quick, Quiet, Quantitative Multi-Parametric MRI using Physics-Informed Diffusion Models

Shishuai Wang, Florian Wiesinger, Noemi Sgambelluri, Carolin Pirkl, Stefan Klein, Juan A. Hernandez-Tamames, Dirk H. J. Poot

2512.22113 2026-04-30 cs.DC cs.AI cs.SE

PRAXIS: Integrating Program Analysis with Observability for Root-Cause Analysis

Shengkun Cui, Rahul Krishna, Saurabh Jha, Ravishankar K. Iyer

Comments 15 pages. Accepted to appear in The 56th Annual IEEE/IFIP International Conference on Dependable Systems and Networks

2512.10998 2026-04-30 cs.CR cs.CL

SCOUT: A Defense Against Data Poisoning Attacks in Fine-Tuned Language Models

Mohamed Afane, Abhishek Satyam, Ke Chen, Tao Li, Junaid Farooq, Juntao Chen

Comments 9 pages, 3 figures

DOI: 10.1109/BigData66926.2025.11402507
Journal ref: 2025 IEEE International Conference on Big Data (BigData), 2025

Abstract

Backdoor attacks create significant security threats to language models by embedding hidden triggers that manipulate model behavior during inference, presenting critical risks for AI systems deployed in healthcare and other sensitive domains. While existing defenses effectively counter obvious threats such as out-of-context trigger words and safety alignment violations, they fail against sophisticated attacks using contextually-appropriate triggers that blend seamlessly into natural language. This paper introduces three novel contextually-aware attack scenarios that exploit domain-specific knowledge and semantic plausibility: the ViralApp attack targeting social media addiction classification, the Fever attack manipulating medical diagnosis toward hypertension, and the Referral attack steering clinical recommendations. These attacks represent realistic threats where malicious actors exploit domain-specific vocabulary while maintaining semantic coherence, demonstrating how adversaries can weaponize contextual appropriateness to evade conventional detection methods. To counter both traditional and these sophisticated attacks, we present SCOUT (Saliency-based Classification Of Untrusted Tokens), a novel defense framework that identifies backdoor triggers through token-level saliency analysis rather than traditional context-based detection methods. SCOUT constructs a saliency map by measuring how the removal of individual tokens affects the model's output logits for the target label, enabling detection of both conspicuous and subtle manipulation attempts. We evaluate SCOUT on established benchmark datasets (SST-2, IMDB, AG News) against conventional attacks (BadNet, AddSent, SynBkd, StyleBkd) and our novel attacks, demonstrating that SCOUT successfully detects these sophisticated threats while preserving accuracy on clean inputs.
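
The removal-based saliency map mentioned in the abstract above can be illustrated with a generic leave-one-out sketch in Python. This is not the authors' implementation: `toy_target_logit` is a hypothetical stand-in for a real model's target-label logit, the trigger token "cf" and its weight are invented for the example, and the step that classifies tokens as untrusted is not shown.

```python
def leave_one_out_saliency(tokens, target_logit):
    """Score each token by how much the target-label logit drops
    when that single token is removed from the input."""
    base = target_logit(tokens)
    return [
        base - target_logit(tokens[:i] + tokens[i + 1:])
        for i in range(len(tokens))
    ]


def toy_target_logit(tokens):
    """Hypothetical scorer: a bag-of-words sum standing in for a model's
    logit on the attacker's target label (assumption, not a real model)."""
    weights = {"cf": 4.0, "great": 1.0, "boring": -1.0}
    return sum(weights.get(t, 0.0) for t in tokens)


tokens = ["the", "movie", "was", "boring", "cf"]
saliency = leave_one_out_saliency(tokens, toy_target_logit)

# The token whose removal lowers the target logit the most.
most_salient = tokens[max(range(len(tokens)), key=saliency.__getitem__)]
print(list(zip(tokens, saliency)), most_salient)
```

With a real classifier, `target_logit` would run a forward pass on the reduced token sequence, so the map costs one extra pass per token.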

2511.18239 2026-04-30 cs.CY cs.AI

Can LLMs Help Allocate Public Health Resources? A Case Study on Childhood Lead Testing

Mohamed Afane, Ying Wang, Juntao Chen

DOI: 10.1109/BigData66926.2025.11402390
Journal ref: 2025 IEEE International Conference on Big Data (BigData), 2025

Abstract

Public health agencies face critical challenges in identifying high-risk neighborhoods for childhood lead exposure with limited resources for outreach and intervention programs. To address this, we develop a Priority Score integrating untested children proportions, elevated blood lead prevalence, and public health coverage patterns to support optimized resource allocation decisions across 136 neighborhoods in Chicago, New York City, and Washington, D.C. We leverage these allocation tasks, which require integrating multiple vulnerability indicators and interpreting empirical evidence, to evaluate whether large language models (LLMs) with agentic reasoning and deep research capabilities can effectively allocate public health resources when presented with structured allocation scenarios. LLMs were tasked with distributing 1,000 test kits within each city based on neighborhood vulnerability indicators. Results reveal significant limitations: LLMs frequently overlooked neighborhoods with highest lead prevalence and largest proportions of untested children, such as West Englewood in Chicago, while allocating disproportionate resources to lower-priority areas like Hunts Point in New York City. Overall accuracy averaged 0.46, reaching a maximum of 0.66 with ChatGPT 5 Deep Research. Despite their marketed deep research capabilities, LLMs struggled with fundamental limitations in information retrieval and evidence-based reasoning, frequently citing outdated data and allowing non-empirical narratives about neighborhood conditions to override quantitative vulnerability indicators.

2510.05174 2026-04-30 cs.MA cs.AI

Emergent Coordination in Multi-Agent Language Models

Christoph Riedl

2509.18391 2026-04-30 cs.HC cs.CV

Does Embodiment Matter to Biomechanics and Function? A Comparative Analysis of Head-Mounted and Hand-Held Assistive Devices for Individuals with Blindness and Low Vision

Gaurav Seth, Hoa Pham, Giles Hamilton-Fletcher, Charles Leclercq, John-Ross Rizzo

Comments 30 pages, 7 figures, 5 tables. Pre-print submitted to International Journal of Human-Computer Interaction. Also to appear as a late-breaking poster at ACRM. Limited AI (ChatGPT-4/5) used for language refinement and figure schematics under author supervision. One author (CL) is CEO of ARx Vision; others report no conflicts

2509.13387 2026-04-30 cs.CY cs.AI

Uncovering AI Governance Themes in EU Policies using BERTopic and Thematic Analysis

Delaram Golpayegani, Marta Lasek-Markey, Arjumand Younus, Aphra Kerr, Dave Lewis

2509.09870 2026-04-30 cs.HC cs.AI cs.CL

Vibe Check: Understanding the Effects of LLM-Based Conversational Agents' Personality and Alignment on User Perceptions in Goal-Oriented Tasks

Hasibur Rahman, Smit Desai

Comments 30 pages, CHI 2026 conference paper (article no. 371)

2508.16131 2026-04-30 cs.SE cs.AI

The Fools are Certain; the Wise are Doubtful: Exploring LLM Confidence in Code Completion

Zoe Kotti, Konstantina Dritsa, Diomidis Spinellis, Panos Louridas

Comments 32 pages, 10 figures, 1 table

2508.07852 2026-04-30 cs.GR cs.AI

Vertex Features for Neural Global Illumination

Rui Su, Honghao Dong, Haojie Jin, Yisong Chen, Guoping Wang, Sheng Li

Comments Accepted by ACM SIGGRAPH Asia 2025

2508.06550 2026-04-30 cs.GT cs.LG

Generative Bid Shading in Real-Time Bidding Advertising

Yinqiu Huang, Hao Ma, Wenshuai Chen, Zongwei Wang, Shuli Wang, Yongqiang Zhang, Xue Wei, Yinhua Zhu, Haitao Wang, Xingxing Wang

Comments SIGIR 2026

2507.22558 2026-04-30 cond-mat.mtrl-sci cs.AI

aLLoyM: A large language model for alloy phase diagram prediction

Yuna Oikawa, Guillaume Deffrennes, Taichi Abe, Ryo Tamura, Koji Tsuda

Comments 24 pages, 6 figures

2507.19067 2026-04-30 cs.IR cs.AI cs.NE

PBiLoss: Popularity-Aware Regularization to Improve Fairness in Graph-Based Recommender Systems

Mohammad Naeimi, Mostafa Haghir Chehreghani

2507.17544 2026-04-30 stat.ML cs.LG stat.ME

Optimal differentially private kernel learning with random projection

Bonwoo Lee, Cheolwoo Park, Jeongyoun Ahn

Comments 139 pages, 3 figures

2507.01110 2026-04-30 cs.GR cs.LG

A LoD of Gaussians: Unified Training and Rendering for Ultra-Large Scale Reconstruction with External Memory

Felix Windisch, Thomas Köhler, Lukas Radl, Mattia D'Urso, Michael Steiner, Dieter Schmalstieg, Markus Steinberger

2506.23040 2026-04-30 stat.OT cs.AI

Treatment, evidence, imitation, and chat

Samuel J. Weisenthal

Comments 12 pages

2505.13518 2026-04-30 stat.ML cs.AI cs.LG

Data Balancing Strategies: A Systematic Survey of Resampling and Augmentation Methods

Behnam Yousefimehr, Mehdi Ghatee, Javad Fazli, Shervin Ghaffari, Zahra Rafei, Mohammad Amin Seifi, Sajed Tavakoli, Abolfazl Nikahd, Mahdi Razi Gandomani, Alireza Orouji, Ramtin Mahmoudi Kashani, Sarina Heshmati, Negin Sadat Mousavi

2505.02077 2026-04-30 cs.CR cs.AI cs.MA

Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents

Christian Schroeder de Witt, Klaudia Krawiecka, Igor Krawczuk, Ben Hagag, William L. Anderson, Peter Belcak, Ben Bucknall, Xiaohong Cai, Ayush Chopra, Doron Cohen, Ron F. Del Rosario, Andis Draguns, Annie Gray, Keren Katz, Vasilios Mavroudis, Jaron Mink, Sumeet Ramesh Motwani, Jonathan Petit, Leif-Sebastian Rembeck, Chandler Smith, John Sotiropoulos, Steven Young, Sarah Scheffler, Mary Llewellyn

2503.17897 2026-04-30 cs.GR cs.CV

Real-time Global Illumination for Dynamic 3D Gaussian Scenes

Chenxiao Hu, Meng Gai, Guoping Wang, Sheng Li

Comments Accepted by IEEE Transactions on Visualization and Computer Graphics

2503.02332 2026-04-30 eess.IV cs.CV

COMMA: Coordinate-aware Modulated Mamba Network for 3D Dispersed Vessel Segmentation

Gen Shi, Hui Zhang, Jie Tian

Comments Accepted by IEEE TIP