arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2508.09954 2026-03-23 cs.CL

Disambiguation of Emotion Annotations by Contextualizing Events in Plausible Narratives

Johannes Schäfer, Roman Klinger

Comments accepted to LREC 2026

详情

英文摘要

Ambiguity in emotion analysis stems both from potentially missing information and the subjectivity of interpreting a text. The latter did receive substantial attention, but can we fill missing information to resolve ambiguity? We address this question by developing a method to automatically generate reasonable contexts for an otherwise ambiguous classification instance. These generated contexts may act as illustrations of potential interpretations by different readers, as they can fill missing information with their individual world knowledge. This task to generate plausible narratives is a challenging one: We combine techniques from short story generation to achieve coherent narratives. The resulting English dataset of Emotional BackStories, EBS, allows for the first comprehensive and systematic examination of contextualized emotion analysis. We conduct automatic and human annotation and find that the generated contextual narratives do indeed clarify the interpretation of specific emotions. Particularly relief and sadness benefit from our approach, while joy does not require the additional context we provide.

URL PDF HTML ☆

赞 0 踩 0

2508.07901 2026-03-23 cs.CV

Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation

Bowen Xue, Zheng-Peng Duan, Qixin Yan, Wenjing Wang, Hao Liu, Chun-Le Guo, Chongyi Li, Chen Li, Jing Lyu

2507.23508 2026-03-23 cs.CV

Hyperbolic Cycle Alignment for Infrared-Visible Image Fusion

Timing Li, Bing Cao, Jiahe Feng, Haifang Cao, Qinghau Hu, Pengfei Zhu

2507.21802 2026-03-23 cs.AI cs.CV

MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Junzhe Li, Yutao Cui, Tao Huang, Yinping Ma, Chun Fan, Yiming Cheng, Miles Yang, Zhao Zhong, Liefeng Bo

2507.17343 2026-03-23 cs.CV cs.LG cs.MM

Principled Multimodal Representation Learning

Xiaohao Liu, Xiaobo Xia, See-Kiong Ng, Tat-Seng Chua

Comments Accepted by IEEE TPAMI 2026

2507.02245 2026-03-23 cs.RO

CoInfra: A Large-Scale Cooperative Infrastructure Perception System and Dataset for Vehicle-Infrastructure Cooperation in Adverse Weather

Minghao Ning, Yufeng Yang, Keqi Shu, Shucheng Huang, Jiaming Zhong, Maryam Salehi, Mahdi Rahmani, Jiaming Guo, Yukun Lu, Chen Sun, Aladdin Saleh, Ehsan Hashemi, Amir Khajepour

Comments This paper has been submitted to the Transportation Research Part C: Emerging Technologies for review

2506.21349 2026-03-23 cs.CV eess.IV

Electromagnetic Inverse Scattering from a Single Transmitter

Yizhe Cheng, Chunxun Tian, Haoru Wang, Wentao Zhu, Xiaoxuan Ma, Yizhou Wang

2506.16931 2026-03-23 cs.AI cs.RO

Multimodal Fused Learning for Solving the Generalized Traveling Salesman Problem in Robotic Task Planning

Jiaqi Cheng, Mingfeng Fan, Xuefeng Zhang, Jingsong Liang, Yuhong Cao, Guohua Wu, Guillaume Adrien Sartoretti

Comments 14 pages, 6 figures, Proceedings of the Conference on Robot Learning (CoRL 2025)

2506.15872 2026-03-23 cs.LG

Hidden Breakthroughs in Language Model Training

Sara Kangaslahti, Elan Rosenfeld, Naomi Saphra

Comments ICLR 2026 Camera-ready

2506.14608 2026-03-23 cs.RO

Latent Action Diffusion for Cross-Embodiment Manipulation

Erik Bauer, Elvis Nava, Robert K. Katzschmann

Comments 8 pages, 5 figures. Accepted to the 2026 IEEE International Conference on Robotics & Automation (ICRA). Website: https://mimicrobotics.github.io/lad/

2506.09814 2026-03-23 cs.CV

DreamCS: Geometry-Aware Text-to-3D Generation with Unpaired 3D Reward Supervision

Xiandong Zou, Ruihao Xia, Hongsong Wang, Pan Zhou

2506.08898 2026-03-23 cs.AI

Preference-Driven Multi-Objective Combinatorial Optimization with Conditional Computation

Mingfeng Fan, Jianan Zhou, Yifeng Zhang, Yaoxin Wu, Jinbiao Chen, Guillaume Adrien Sartoretti

Comments 22 pages, 6 figures, 39th Conference on Neural Information Processing Systems (NeurIPS 2025)

2506.04218 2026-03-23 cs.RO cs.AI cs.CV cs.LG

Pseudo-Simulation for Autonomous Driving

Wei Cao, Marcel Hallgarten, Tianyu Li, Daniel Dauner, Xunjiang Gu, Caojun Wang, Yakov Miron, Marco Aiello, Hongyang Li, Igor Gilitschenski, Boris Ivanovic, Marco Pavone, Andreas Geiger, Kashyap Chitta

Comments CoRL 2025, updated with leaderboard snapshot from March 2026

2505.19361 2026-03-23 cs.AI cs.CV cs.LG cs.LO

Consistency-based Abductive Reasoning over Perceptual Errors of Multiple Pre-trained Models in Novel Environments

Mario Leiva, Noel Ngu, Joshua Shay Kricheli, Aditya Taparia, Ransalu Senanayake, Paulo Shakarian, Nathaniel Bastian, John Corcoran, Gerardo Simari

Comments Accepted to AAAI 2026. Code available at https://github.com/lab-v2/EDCR_PyReason_AirSim

2505.17393 2026-03-23 cs.LG math.SP

CatBOX: A Categorical-Continuous Bayesian Optimization with Spectral Mixture Kernels for Accelerated Catalysis Experiments

Changquan Zhao, Yi Zhang, Zhuo Li, Li Jin, Cheng Hua, Yulian He

2505.03424 2026-03-23 cs.LG cs.AI

Framework GNN-AID: Graph Neural Network Analysis Interpretation and Defense

Kirill Lukyanov, Mikhail Drobyshevskiy, Georgii Sazonov, Mikhail Soloviov, Ilya Makarov

详情

DOI: 10.1609/aaai.v40i48.42364
Journal ref: 2026 Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41634-41636

英文摘要

The growing need for Trusted AI (TAI) highlights the importance of interpretability and robustness in machine learning models. However, many existing tools overlook graph data and rarely combine these two aspects into a single solution. Graph Neural Networks (GNNs) have become a popular approach, achieving top results across various tasks. We introduce GNN-AID (Graph Neural Network Analysis, Interpretation, and Defense), an open-source framework designed for graph data to address this gap. Built as a Python library, GNN-AID supports advanced trust methods and architectural layers, allowing users to analyze graph datasets and GNN behavior using attacks, defenses, and interpretability methods. GNN-AID is built on PyTorch-Geometric, offering preloaded datasets, models, and support for any GNNs through customizable interfaces. It also includes a web interface with tools for graph visualization and no-code features like an interactive model builder, simplifying the exploration and analysis of GNNs. The framework also supports MLOps techniques, ensuring reproducibility and result versioning to track and revisit analyses efficiently. GNN-AID is a flexible tool for developers and researchers. It helps developers create, analyze, and customize graph models, while also providing access to prebuilt datasets and models for quick experimentation. Researchers can use the framework to explore advanced topics on the relationship between interpretability and robustness, test defense strategies, and combine methods to protect against different types of attacks. We also show how defenses against evasion and poisoning attacks can conflict when applied to graph data, highlighting the complex connections between defense strategies. GNN-AID is available at \href{https://github.com/ispras/GNN-AID}{github.com/ispras/GNN-AID}

URL PDF HTML ☆

赞 0 踩 0

2504.08246 2026-03-23 cs.RO cs.LG cs.SY eess.SY

Spectral Normalization for Lipschitz-Constrained Policies on Learning Humanoid Locomotion

Jaeyong Shin, Woohyun Cha, Donghyeon Kim, Junhyeok Cha, Jaeheung Park

Comments This work has been submitted to the IEEE for possible publication

2504.08114 2026-03-23 cs.RO cs.LG cs.SY eess.SY

RL-based Control of UAS Subject to Significant Disturbance

Kousheek Chakraborty, Thijs Hof, Ayham Alharbat, Abeje Mersha

Comments Accepted at ICUAS 2025

2504.05786 2026-03-23 cs.CV cs.AI

How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM

Jirong Zha, Yuxuan Fan, Xiao Yang, Chen Gao, Xinlei Chen

Comments 9 pages, 5 figures

2503.19486 2026-03-23 cs.CV

Exploring Disentangled and Controllable Human Image Synthesis: From End-to-End to Stage-by-Stage

Zhengwentai Sun, Chenghong Li, Hongjie Liao, Xihe Yang, Keru Zheng, Heyuan Li, Yihao Zhi, Shuliang Ning, Shuguang Cui, Xiaoguang Han

Comments A new version with additional experiments

2502.20795 2026-03-23 cs.CL

Test-Time Alignment for Large Language Models via Textual Model Predictive Control

Kuang-Da Wang, Teng-Ruei Chen, Yu Heng Hung, Guo-Xun Ko, Shuoyang Ding, Yueh-Hua Wu, Yu-Chiang Frank Wang, Chao-Han Huck Yang, Wen-Chih Peng, Ping-Chun Hsieh

Comments Accepted for ICLR 2026. Project page: https://rl-bandits-lab.github.io/TMPC/

2502.15851 2026-03-23 cs.CL cs.AI

Control Illusion: The Failure of Instruction Hierarchies in Large Language Models

Yilin Geng, Haonan Li, Honglin Mu, Xudong Han, Timothy Baldwin, Omri Abend, Eduard Hovy, Lea Frermann

Comments Accepted to AAAI-26 Main Technical Track Proceedings

2502.14400 2026-03-23 cs.AI

HPS: Hard Preference Sampling for Human Preference Alignment

Xiandong Zou, Wanyu Lin, Yuchen Li, Pan Zhou

2502.10647 2026-03-23 cs.LG math.ST stat.ML stat.TH

A Power Transform

Jonathan T. Barron

2501.13516 2026-03-23 cs.LG cs.SY eess.SY math.OC

Communication-Efficient Stochastic Distributed Learning

Xiaoxing Ren, Nicola Bastianello, Karl H. Johansson, Thomas Parisini

2501.11421 2026-03-23 cs.LG cs.IT math.IT math.ST stat.TH

Online Clustering of Data Sequences with Bandit Information

G Dhinesh Chandran, Srinivas Reddy Kota, Srikrishna Bhashyam

详情

英文摘要

We study the problem of online clustering of data sequences in the multi-armed bandit (MAB) framework under the fixed-confidence setting. There are $M$ arms, each providing i.i.d. samples from a parametric distribution whose parameters are unknown. The $M$ arms form $K$ clusters based on the distance between the true parameters. In the MAB setting, one arm can be sampled at each time. The objective is to estimate the clusters of the arms using as few samples as possible from the arms, subject to an upper bound on the error probability. Our setting allows for: arms within a cluster to have non-identical distributions, vector parameter arms, vector observations, and $K \le M$ clusters. We propose and analyze the Average Tracking Bandit Online Clustering (ATBOC) algorithm. ATBOC is asymptotically order-optimal for multivariate Gaussian arms, with expected sample complexity grows at most twice as fast as the lower bound as $δ\rightarrow 0$, and this guarantee extends to multivariate sub-Gaussian arms. For single-parameter exponential family arms, ATBOC is asymptotically optimal, matching the lower bound. We also propose a computationally more efficient alternatives Lower and Upper Confidence Bound based Bandit Online Clustering Algorithm (LUCBBOC), and Bandit Online Clustering-Elimination (BOC-ELIM). We derive the computational complexity of the proposed algorithms and compare their per-sample runtime through simulations. LUCBBOC and BOC-ELIM require lower per-sample runtime than ATBOC while achieving comparable performance. All the proposed algorithms are $δ$-Probably correct, i.e., the error probability of cluster estimate at the stopping time is atmost $δ$. We validate the asymptotic optimality guarantees through simulations, and present the comparison of our proposed algorithms with other related work through simulations on both synthetic and real-world datasets.

URL PDF HTML ☆

赞 0 踩 0

2412.17861 2026-03-23 cs.RO

From Vocal Instructions to Household Tasks: The Inria TIAGo++ in the euROBIN Service Robots Coopetition

Fabio Amadio, Clemente Donoso, Dionis Totsila, Raphael Lorenzo, Quentin Rouxel, Olivier Rochel, Enrico Mingo Hoffman, Jean-Baptiste Mouret, Serena Ivaldi

2410.22862 2026-03-23 cs.CV cs.LG

AtGCN: A Graph Convolutional Network For Ataxic Gait Detection

Karan Bania, Tanmay Verlekar

Comments Accepted as a Long Oral (top-5%) at AIME 2025

2409.19435 2026-03-23 cs.LG stat.CO stat.ML

Simulation-based Inference with the Python Package sbijax

Simon Dirmeier, Antonietta Mira, Carlo Albert

2406.15189 2026-03-23 cs.LG

Causal Learning in Biomedical Applications: Krebs Cycle as a Benchmark

Xiaoyu He, Petr Ryšavý, Jakub Mareček