arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.05495 2026-05-08 cs.LG

Shortcut Solutions Learned by Transformers Impair Continual Compositional Reasoning

William T. Redman, Erik C. Johnson, Brian Robinson

Comments 17 pages, 6 figures

详情

英文摘要

Identifying and exploiting common features across domains is at the heart of the human ability to make analogies, and is believed to be crucial for the ability to continually learn. To do this successfully, general and flexible computational strategies must be developed. While the extent to which Transformer neural network models can perform compositional reasoning has been the subject of intensive recent investigation, little work has been done to systematically understand how well these models can leverage their representations to learn new, related experiences. To address this gap, we expand the previously developed Learning Equality and Group Operations (LEGO) framework to a continual learning (CL) setting ("continual LEGO"). Using this continual LEGO experimental paradigm, we study the capability of feedforward and recurrent Transformer models to perform CL. We find that BERT, a canonical feedforward Transformer model, learns shortcut solutions that limits its ability to generalize and prevents strong forward transfer to new experiences. In contrast, we find evidence supporting the hypothesis that ALBERT, a recurrent version of BERT, learns a For loop-esque solution, which leads to better CL performance. When applying BERT and ALBERT models to a CL setting that requires composition across experiences, we find that both model families fail. Our investigation suggests that ALBERT models can have their performance drop rescued by use of training strategies that combine data across experiences, but this is not true for BERT models, where a detrimental shortcut solution becomes entrenched with initial training. Our results demonstrate that the recurrent ALBERT model may have an inductive bias better suited for CL and motivate future investigation of the interplay between Transformer architecture and computational solutions that emerge in modern models and tasks.

URL PDF HTML ☆

赞 0 踩 0

2605.05492 2026-05-08 cs.LG

MEMOA: Massive Mixtures of Online Agents via Mean-Field Decentralized Nash Equilibria

Xuwei Yang, David B. Emerson, Fatemeh Tavakoli, Anastasis Kratsios

Comments 43 pages, 11 tables, 1 figure

2605.05488 2026-05-08 cs.LG

A Robust Foundation Model for Conservation Laws: Injecting Context into Flux Neural Operators via Recurrent Vision Transformers

Taeyoung Kim, Joon-Hyuk Ko

Comments 14 pages, 3 figures

2605.05485 2026-05-08 cs.CL cs.AI

ReaComp: Compiling LLM Reasoning into Symbolic Solvers for Efficient Program Synthesis

Atharva Naik, Yash Mathur, Prakam, Carolyn Rose, David Mortensen

2605.05483 2026-05-08 cs.RO

Robust $\mathcal{H}_\infty$ Controller Design For INDI-Controlled Quadrotor Using Online Parameter Identification

Tom Aantjes, Till M. Blaha, Spilios Theodoulis, Ewoud J. J. Smeur

Comments 8 pages, 11 figures, Accepted to the ICUAS 2026 conference

2605.05482 2026-05-08 cs.AI cs.CL cs.MA

FinRAG-12B: A Production-Validated Recipe for Grounded Question Answering in Banking

Denys Katerenchuk, Pablo Duboue, Keelan Evanini, David Gondek, Nithin Govindugari, Olivier Allauzen, Joshua Baptiste, David J More, Joshua Schechter

Comments 7 pages, ACL 2026 conference

2605.05481 2026-05-08 cs.LG

Approximate Next Policy Sampling: Replacing Conservative Target Policy Updates in Deep RL

Dillon Sandhu, Ronald Parr

2605.05478 2026-05-08 cs.AI

LANTERN: LLM-Augmented Neurosymbolic Transfer with Experience-Gated Reasoning Networks

Mahyar Alinejad, Yue Wang, Amrit Singh Bedi, George Atia

2605.05476 2026-05-08 cs.LG cs.AI cs.CL

A Unified Benchmark for Evaluating Knowledge Graph Construction Methods and Graph Neural Networks

Othmane Kabal, Mounira Harzallah, Fabrice Guillet, Hideaki Takeda, Ryutaro Ichise

2605.05475 2026-05-08 cs.AI

Intentionality is a Design Decision: Measuring Functional Intentionality for Accountable AI Systems

Allessia Chiappetta, Robert Mahari

2605.05463 2026-05-08 cs.LG cs.AI

Robustness of Graph Self-Supervised Learning to Real-World Noise: A Case Study on Text-Driven Biomedical Graphs

Othmane Kabal, Mounira Harzallah, Fabrice Guillet, Hideaki Takeda, Ryutaro Ichise

详情

英文摘要

Graph Self-Supervised Learning (GSSL) offers a powerful paradigm for learning graph representations without labeled data. However, existing work assumes clean, manually curated graphs. Recent advances in NLP enable the large-scale automatic extraction of knowledge graphs from text, opening new opportunities for GSSL while introducing substantial real-world noise. This type of noise remains largely unexplored, as prior robustness studies typically rely on synthetic perturbations. To address this gap, we present the first comprehensive evaluation of GSSL methods on text-driven graphs for unsupervised term typing. We introduce Noise-Aware Text-Driven Graph GSSL (NATD-GSSL), a unified framework that combines automatic graph construction, graph refinement, and GSSL. Our evaluation follows a dual-graph protocol that contrasts a noisy graph derived from MedMentions with a clean Unified Medical Language System (UMLS) reference graph, aligned through a shared gold standard. Our results reveal variability in robustness across both pretext tasks and Graph Neural Network (GNN) architectures. Relation reconstruction is highly sensitive to noise and benefits from well-defined schemas, whereas feature reconstruction is considerably more robust, achieving performance comparable to clean-graph settings. Contrastive objectives are generally less affected by noise but depend strongly on alignment with downstream tasks. GNN architecture also plays a critical role: bidirectional relational message-passing designs are better suited to noisy, text-driven graphs, while unidirectional relational ones perform best on clean graphs. Overall, NATD-GSSL provides practical guidance for applying GSSL to real-world, noisy graphs and achieves up to a 7\% improvement over pretrained language model baselines. All code and benchmarks are publicly available at https://github.com/OthmaneKabal/MC2GAE.

URL PDF HTML ☆

赞 0 踩 0

2605.05461 2026-05-08 cs.RO

Contact-Free Grasp Stability Prediction with In-Hand Time-of-Flight Sensors

Kyle DuFrene, Cindy Grimm

2605.05460 2026-05-08 cs.AI physics.chem-ph

Agentic Discovery of Exchange-Correlation Density Functionals

Titouan Duston, Jiashu Liang, Yuanheng Wang, Weihao Gao, Xuelan Wen, Nan Sheng, Weiluo Ren, Yang Sun, Yixiao Chen

Comments 20 pages, 2 figues, 4 tables

2605.05447 2026-05-08 cs.CV

EchoXFlow: A Beamspace Echocardiography Dataset for Cardiac Motion, Flow, and Function

Elias Stenhede, Joanna Sulkowska, Eivind Bjørkan Orstad, Henrik Schirmer, Arian Ranjbar

2605.05440 2026-05-08 cs.AI

Authorization Propagation in Multi-Agent AI Systems: Identity Governance as Infrastructure

Krti Tallam

Comments Security and systems paper, 20 pages

2605.05439 2026-05-08 cs.CV

Safety-Critical Camera Reliability Monitoring for ADAS via Degradation-Aware Uncertainty Pattern Analysis

Shiva Aher

2605.05438 2026-05-08 cs.LG cs.AI

On Semantic Loss Fine-Tuning Approach for Preventing Model Collapse in Causal Reasoning

Pratik Deshmukh, Atirek Gupta

Comments 14 pages, 6 figures

2605.05435 2026-05-08 cs.LG cs.NA math.NA

Active Learning for Conditional Generative Compressed Sensing

Alexander DeLise, Nick Dexter

Comments 33 pages, 11 figures

2605.05415 2026-05-08 cs.LG cs.AI cs.CR

Information Theoretic Adversarial Training of Large Language Models

Yiwei Zhang, Jeremiah Birrell, Reza Ebrahimi, Rouzbeh Behnia, Jason Pacheco, Elisa Bertino

2605.05413 2026-05-08 cs.AI

From History to State: Constant-Context Skill Learning for LLM Agents

Haoyang Xie, Xinyuan Wang, Yancheng Wang, Puda Zhao, Feng Ju

2605.05411 2026-05-08 cs.RO cs.AI

Creative Robot Tool Use by Counterfactual Reasoning

M. Tuluhan Akbulut, Varun Satheesh, Ahmed Jaafar, Alper Ahmetoglu, Shane Parr, Aditya Ganeshan, Shivam Vats, George Konidaris

Comments Under review

2605.05410 2026-05-08 cs.AI cs.HC physics.ed-ph

LaTA: A Drop-in, FERPA-Compliant Local-LLM Autograder for Upper-Division STEM Coursework

Jesse A. Rodríguez

Comments Submitted to Computers & Education

2605.05409 2026-05-08 cs.AI cs.CL

Agentic Retrieval-Augmented Generation for Financial Document Question Answering

Yang Shu, Yingmin Liu, Zequn Xie

Comments 22 pages, 11 figures, 13 tables, submitted to Expert Systems with Applications

2605.05403 2026-05-08 cs.AI

When Helpfulness Becomes Sycophancy: Sycophancy is a Boundary Failure Between Social Alignment and Epistemic Integrity in Large Language Models

Jiechen Li, Catherine A. Barry, Rishika Randev, Janet Chen, Ella Jorgensen, Brinnae Bent

Comments Currently under review

2605.05402 2026-05-08 cs.AI cs.CV eess.IV

Intelligent CCTV for Urban Design: AI-Based Analysis of Soft Infrastructure at Intersections

Vinit Katariya, Seungjin Kim, Curtis Craig, Nichole Morris, Hamed Tabkhi

Comments 16 pages, 6 figures, 7 tables, Submitted/Under Review at the International Journal of Transportation Research (Submitted on 12 Jan 2026)

2605.05395 2026-05-08 cs.LG cs.MS

Differentiable Parameter Optimization for DAEs with State-Dependent Events

Ion Matei, Maksym Zhenirovskyy, Anthony Wong

2605.05392 2026-05-08 cs.CL cs.AI

Generating Query-Focused Summarization Datasets from Query-Free Summarization Datasets

Yllias Chali, Deen Abdullah

Comments 7 pages, 1 figure

2605.05390 2026-05-08 cs.CV

LAMP: Localization Aware Multi-camera People Tracking in Metric 3D World

Nan Yang, Julian Straub, Fan Zhang, Richard Newcombe, Jakob Engel, Lingni Ma

Comments CVPR 2026. Project page: https://facebookresearch.github.io/LAMP

2605.05389 2026-05-08 cs.LG cs.AI

Two-Stage Learned Decomposition for Scalable Routing on Multigraphs

Filip Rydin, Morteza Haghir Chehreghani, Balázs Kulcsár

Comments 20 pages, 3 figures

2605.05387 2026-05-08 cs.LG cs.IT math.IT

Conditional Diffusion Under Linear Constraints: Langevin Mixing and Information-Theoretic Guarantees

Ahmad Aghapour, Erhan Bayraktar, Asaf Cohen