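arXivDaily daily academic digest: synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and related fields.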

2602.16379 2026-02-19 cs.CL

Label-Consistent Data Generation for Aspect-Based Sentiment Analysis Using LLM Agents

Mohammad H. A. Monfared, Lucie Flek, Akbar Karimi

Comments Accepted to WASSA Workshop at EACL 2026

Abstract

We propose an agentic data augmentation method for Aspect-Based Sentiment Analysis (ABSA) that uses iterative generation and verification to produce high-quality synthetic training examples. To isolate the effect of agentic structure, we also develop a closely matched prompting-based baseline using the same model and instructions. Both methods are evaluated across three ABSA subtasks (Aspect Term Extraction (ATE), Aspect Sentiment Classification (ATSC), and Aspect Sentiment Pair Extraction (ASPE)), four SemEval datasets, and two encoder-decoder models: T5-Base and Tk-Instruct. Our results show that the agentic augmentation outperforms raw prompting in label preservation of the augmented data, especially when the tasks require aspect term generation. In addition, when combined with real data, agentic augmentation provides higher gains, consistently outperforming prompting-based generation. These benefits are most pronounced for T5-Base, while the more heavily pretrained Tk-Instruct exhibits smaller improvements. As a result, augmented data helps T5-Base achieve comparable performance with its counterpart.

2602.16371 2026-02-19 cs.RO

Dynamic Modeling and MPC for Locomotion of Tendon-Driven Soft Quadruped

Saumya Karan, Neerav Maram, Suraj Borate, Madhu Vadali

2602.16365 2026-02-19 cs.RO cs.CV

Markerless 6D Pose Estimation and Position-Based Visual Servoing for Endoscopic Continuum Manipulators

Junhyun Park, Chunggil An, Myeongbo Park, Ihsan Ullah, Sihyeong Park, Minho Hwang

Comments 20 pages, 13 figures, 7 tables

2602.16358 2026-02-19 cs.RO

System Identification under Constraints and Disturbance: A Bayesian Estimation Approach

Sergi Martinez, Steve Tonneau, Carlos Mastalli

2602.15689 2026-02-19 cs.CL cs.AI cs.CR

A Content-Based Framework for Cybersecurity Refusal Decisions in Large Language Models

Noa Linder, Meirav Segal, Omer Antverg, Gil Gekker, Tomer Fichman, Omri Bodenheimer, Edan Maor, Omer Nevo

2602.15238 2026-02-19 cs.LG cs.AI cs.CR

Closing the Distribution Gap in Adversarial Training for LLMs

Chengzhi Hu, Jonas Dornbusch, David Lüdke, Stephan Günnemann, Leo Schwinn

2602.15001 2026-02-19 cs.LG

Boundary Point Jailbreaking of Black-Box LLMs

Xander Davies, Giorgi Giglemiani, Edmund Lau, Eric Winsor, Geoffrey Irving, Yarin Gal

2602.13194 2026-02-19 cs.CL cond-mat.dis-nn cond-mat.stat-mech cs.AI

Semantic Chunking and the Entropy of Natural Language

Weishun Zhong, Doron Sivan, Tankut Can, Mikhail Katkov, Misha Tsodyks

Comments 29 pages, 9 figures; typos fixed

2602.11358 2026-02-19 cs.CL cs.AI cs.LG

When Models Examine Themselves: Vocabulary-Activation Correspondence in Self-Referential Processing

Zachary Pedram Dadfar

Comments Code and data: https://doi.org/10.5281/zenodo.18567446 Repro: https://github.com/patternmatcher/TRACE-REPRO

2602.11047 2026-02-19 cs.CL

Embedding Inversion via Conditional Masked Diffusion Language Models

Han Xiao

Comments 8 pages, 3 figures, 4 tables. Code and demo: https://github.com/jina-ai/embedding-inversion-demo

2602.09238 2026-02-19 cs.LG

Feature salience - not task-informativeness - drives machine learning model explanations

Benedict Clark, Marta Oliveira, Rick Wilming, Stefan Haufe

Abstract

Explainable AI (XAI) promises to provide insight into machine learning models' decision processes, where one goal is to identify failures such as shortcut learning. This promise relies on the field's assumption that input features marked as important by an XAI must contain information about the target variable. However, it is unclear whether informativeness is indeed the main driver of importance attribution in practice, or if other data properties such as statistical suppression, novelty at test time, or high feature salience substantially contribute. To clarify this, we trained deep learning models on three variants of a binary image classification task, in which translucent watermarks are either absent, act as class-dependent confounds, or represent class-independent noise. Results for five popular attribution methods show substantially elevated relative importance in watermarked areas (RIW) for all models regardless of the training setting ($R^2 \geq .45$). By contrast, whether the presence of watermarks is class-dependent or not only has a marginal effect on RIW ($R^2 \leq .03$), despite a clear impact on model performance and generalisation ability. XAI methods show similar behaviour to model-agnostic edge detection filters and attribute substantially less importance to watermarks when bright image intensities are encoded by smaller instead of larger feature values. These results indicate that importance attribution is most strongly driven by the salience of image structures at test time rather than statistical associations learned by machine learning models. Previous studies demonstrating successful XAI application should be reevaluated with respect to a possibly spurious concurrency of feature salience and informativeness, and workflows using feature attribution methods as building blocks should be scrutinised.

2602.09203 2026-02-19 cs.RO cs.HC

Elements of Robot Morphology: Supporting Designers in Robot Form Exploration

Amy Koike, Serena Ge Guo, Xinning He, Callie Y. Kim, Dakota Sullivan, Bilge Mutlu

Comments 10 pages, 5 figures, Proceedings of the 21st ACM/IEEE International Conference on Human-Robot Interaction (HRI '26)

2602.08755 2026-02-19 cs.LG

Align and Adapt: Multimodal Multiview Human Activity Recognition under Arbitrary View Combinations

Duc-Anh Nguyen, Nhien-An Le-Khac

2602.07680 2026-02-19 cs.CV cs.AI cs.LG cs.RO

Vision and Language: Novel Representations and Artificial intelligence for Driving Scene Safety Assessment and Autonomous Vehicle Planning

Ross Greer, Maitrayee Keskar, Angel Martinez-Sanchez, Parthib Roy, Shashank Shriram, Mohan Trivedi

Abstract

Vision-language models (VLMs) have recently emerged as powerful representation learning systems that align visual observations with natural language concepts, offering new opportunities for semantic reasoning in safety-critical autonomous driving. This paper investigates how vision-language representations support driving scene safety assessment and decision-making when integrated into perception, prediction, and planning pipelines. We study three complementary system-level use cases. First, we introduce a lightweight, category-agnostic hazard screening approach leveraging CLIP-based image-text similarity to produce a low-latency semantic hazard signal. This enables robust detection of diverse and out-of-distribution road hazards without explicit object detection or visual question answering. Second, we examine the integration of scene-level vision-language embeddings into a transformer-based trajectory planning framework using the Waymo Open Dataset. Our results show that naively conditioning planners on global embeddings does not improve trajectory accuracy, highlighting the importance of representation-task alignment and motivating the development of task-informed extraction methods for safety-critical planning. Third, we investigate natural language as an explicit behavioral constraint on motion planning using the doScenes dataset. In this setting, passenger-style instructions grounded in visual scene elements suppress rare but severe planning failures and improve safety-aligned behavior in ambiguous scenarios. Taken together, these findings demonstrate that vision-language representations hold significant promise for autonomous driving safety when used to express semantic risk, intent, and behavioral constraints. Realizing this potential is fundamentally an engineering problem requiring careful system design and structured grounding rather than direct feature injection.

2602.07568 2026-02-19 cs.CV

Visualizing the Invisible: Enhancing Radiologist Performance in Breast Mammography via Task-Driven Chromatic Encoding

Hui Ye, Shilong Yang, Chulong Zhang, Yexuan Xing, Juan Yu, Yaoqin Xie, Wei Zhang

2602.06051 2026-02-19 cs.CL

CAST: Character-and-Scene Episodic Memory for Agents

Kexin Ma, Bojun Li, Yuhua Tang, Liting Sun, Ruochun Jin

2602.00663 2026-02-19 cs.AI cs.LG q-bio.BM

SEISMO: Increasing Sample Efficiency in Molecular Optimization with a Trajectory-Aware LLM Agent

Fabian P. Krüger, Andrea Hunklinger, Adrian Wolny, Tim J. Adler, Igor Tetko, Santiago David Villalba

Comments Fabian P. Krüger and Andrea Hunklinger contributed equally to this work

2601.11616 2026-02-19 cs.LG cs.AI cs.CL

Mixture-of-Experts as Soft Clustering: A Dual Jacobian-PCA Spectral Geometry Perspective

Feilong Liu

2601.07611 2026-02-19 cs.AI

DIAGPaper: Diagnosing Valid and Specific Weaknesses in Scientific Papers via Multi-Agent Reasoning

Zhuoyang Zou, Abolfazl Ansari, Delvin Ce Zhang, Dongwon Lee, Wenpeng Yin

2601.05378 2026-02-19 cs.LG cs.RO

Inverting Non-Injective Functions with Twin Neural Network Regression

Sebastian J. Wetzel

2512.20773 2026-02-19 cs.CL

DIAL: Direct Iterative Adversarial Learning for Realistic Multi-Turn Dialogue Simulation

Ziyi Zhu, Olivier Tieleman, Caitlin A. Stamatis, Luka Smyth, Thomas D. Hull, Daniel R. Cahn, Matteo Malgaroli

2512.07680 2026-02-19 cs.RO

AMBER: A tether-deployable gripping crawler with compliant microspines for canopy manipulation

P. A. Wigner, L. Romanello, A. Hammad, P. H. Nguyen, T. Lan, S. F. Armanini, B. B. Kocer, M. Kovac

2511.17178 2026-02-19 cs.RO

Efficient Robot Design with Multi-Objective Black-Box Optimization and Large Language Models

Kento Kawaharazuka, Yoshiki Obinata, Naoaki Kanazawa, Haoyu Jia, Kei Okada

Comments Accepted to IEEE Access, website: https://haraduka.github.io/urdf-llm-opt/ , video: https://www.youtube.com/watch?v=N9iMjx7of1w

2511.14406 2026-02-19 cs.LG cs.CR

Watch Out for the Lifespan: Evaluating Backdoor Attacks Against Federated Model Adaptation

Bastien Vuillod, Pierre-Alain Moellic, Jean-Max Dutertre

Comments Accepted at FPS 2025

2511.10515 2026-02-19 cs.CL cs.AI physics.ed-ph

Mastering Olympiad-Level Physics with Artificial Intelligence

Dong-Shan Jian, Xiang Li, Chen-Xu Yan, Hui-Wen Zheng, Zhi-Zhang Bian, You-Le Fang, Ren-Xi He, Jing-Tian Zhang, Ce Meng, Ling-Shi Meng, Bing-Rui Gong, Sheng-Qi Zhang, Yan-Qing Ma

Comments 8 pages, 3 figures, Content from the previous article 2510.01249 is included

2511.04485 2026-02-19 cs.LG cs.AI math.OC

Q3R: Quadratic Reweighted Rank Regularizer for Effective Low-Rank Training

Ipsita Ghosh, Ethan Nguyen, Christian Kümmerle

2511.03710 2026-02-19 cs.LG

Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards

Guanning Zeng, Zhaoyi Zhou, Daman Arora, Andrea Zanette

Comments Preprint. Under Review

2510.18478 2026-02-19 cs.LG

Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation

Daniel Bethell, Simos Gerasimou, Radu Calinescu, Calum Imrie

Comments Accepted into AAMAS '26

2510.16161 2026-02-19 cs.LG stat.ML

Still Competitive: Revisiting Recurrent Models for Irregular Time Series Prediction

Ankitkumar Joshi, Milos Hauskrecht

Comments Published in Transactions on Machine Learning Research, 2026

2510.08102 2026-02-19 cs.CL cs.AI cs.LG stat.ML

Lossless Vocabulary Reduction for Auto-Regressive Language Models

Daiki Chijiwa, Taku Hasegawa, Kyosuke Nishida, Shin'ya Yamaguchi, Tomoya Ohba, Tamao Sakao, Susumu Takeuchi

Comments The Fourteenth International Conference on Learning Representations (ICLR 2026)