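arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.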

Jialiang Zhu, Gongrui Zhang, Xiaolong Ma, Lin Xu, Miaosen Zhang, Ruiqi Yang, Song Wang, Kai Qiu, Zhirong Wu, Qi Dai, Ruichun Ma, Bei Liu, Yifan Yang, Chong Luo, Zhengyuan Yang, Linjie Li, Lijuan Wang, Weizhu Chen, Xin Geng, Baining Guo

2602.02481 2026-02-03 cs.RO cs.AI

Flow Policy Gradients for Robot Control

Brent Yi, Hongsuk Choi, Himanshu Gaurav Singh, Xiaoyu Huang, Takara E. Truong, Carmelo Sferrazza, Yi Ma, Rocky Duan, Pieter Abbeel, Guanya Shi, Karen Liu, Angjoo Kanazawa

Comments Project webpage: https://hongsukchoi.github.io/fpo-control

2602.02477 2026-02-03 cs.CL

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Xiao Liang, Zhong-Zhi Li, Zhenghao Lin, Eric Hancheng Jiang, Hengyuan Zhang, Yelong Shen, Kai-Wei Chang, Ying Nian Wu, Yeyun Gong, Weizhu Chen

2602.02475 2026-02-03 cs.AI

AgentRx: Diagnosing AI Agent Failures from Execution Trajectories

Shraddha Barke, Arnav Goyal, Alind Khare, Avaljot Singh, Suman Nath, Chetan Bansal

2508.12726 2026-02-03 cs.CL

DESIGNER: Design-Logic-Guided Multidisciplinary Data Synthesis for LLM Reasoning

Weize Liu, Yongchi Zhao, Yijia Luo, Mingyu Xu, Jiaheng Liu, Yanan Li, Xiguo Hu, Zhiqi Bai, Yuchi Xu, Wenbo Su, Bo Zheng

Comments Accepted to ICLR 2026. Project page: https://attention-is-all-i-need.github.io/Design-Logic-Reasoning

2602.02473 2026-02-03 cs.RO cs.LG

HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos

Yinhuai Wang, Qihan Zhao, Yuen Fui Lau, Runyi Yu, Hok Wai Tsui, Qifeng Chen, Jingbo Wang, Jiangmiao Pang, Ping Tan

2602.02472 2026-02-03 cs.LG cs.CL

SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning

Qifan Yu, Xinyu Ma, Zhijian Zhuo, Minrui Wang, Deyi Liu, Shiyi Zhan, Yiyuan Ma, Liang Xiang, Xingyan Bin, Di He

2602.02468 2026-02-03 cs.AI cs.CL

Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts

Aiden Yiliu Li, Xinyue Hao, Shilong Liu, Mengdi Wang

2602.02467 2026-02-03 cs.CL

Indications of Belief-Guided Agency and Meta-Cognitive Monitoring in Large Language Models

Noam Steinmetz Yalon, Ariel Goldstein, Liad Mudrik, Mor Geva

2602.02464 2026-02-03 cs.CL

From Directions to Regions: Decomposing Activations in Language Models via Local Geometry

Or Shafran, Shaked Ronen, Omri Fahn, Shauli Ravfogel, Atticus Geiger, Mor Geva

2602.02462 2026-02-03 cs.CL cs.AI

Abstract Activation Spaces for Content-Invariant Reasoning in Large Language Models

Gabriele Maraia, Marco Valentino, Fabio Massimo Zanzotto, Leonardo Ranaldi

2602.02458 2026-02-03 cs.LG cs.NI

Conflict-Aware Client Selection for Multi-Server Federated Learning

Mingwei Hong, Zheng Lin, Zehang Lin, Lin Li, Miao Yang, Xia Du, Zihan Fang, Zhaolu Kang, Dianxin Luan, Shunzhi Zhu

Comments 6 pages, 4 figures

2602.02457 2026-02-03 cs.CY

MetaCLASS: Metacognitive Coaching for Learning with Adaptive Self-regulation Support

Naiming Liu, Richard Baraniuk, Shashank Sonkar

2602.02456 2026-02-03 cs.RO

Relationship-Aware Hierarchical 3D Scene Graph for Task Reasoning

Albert Gassol Puigjaner, Angelos Zacharia, Kostas Alexis

Comments ICRA 2026, 8 pages

2602.02455 2026-02-03 cs.AI cs.CL cs.SE

Drift-Bench: Diagnosing Cooperative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction

Han Bao, Zheyuan Zhang, Pengcheng Jing, Zhengqing Yuan, Kaiwen Shi, Yanfang Ye

Comments 65 pages, 40 figures

2602.02454 2026-02-03 cs.RO cs.AI

World-Gymnast: Training Robots with Reinforcement Learning in a World Model

Ansh Kumar Sharma, Yixiang Sun, Ninghao Lu, Yunzhe Zhang, Jiarao Liu, Sherry Yang

Comments https://world-gymnast.github.io/

2602.02452 2026-02-03 eess.SY cs.SY

Robust Safety-Critical Control of Networked SIR Dynamics

Saba Samadi, Brooks A. Butler, Philip E. Paré

Comments 8 pages, 7 figures, accepted to the 2026 American Control Conference (ACC)

2602.02451 2026-02-03 cs.LG cs.AI

Active Causal Experimentalist (ACE): Learning Intervention Strategies via Direct Preference Optimization

Patrick Cooper, Alvaro Velasquez

Comments 9 pages, 5 figures

2602.02447 2026-02-03 cs.FL cs.DS

Deciding Reachability and the Covering Problem with Diagnostics for Sound Acyclic Free-Choice Workflow Nets

Thomas M. Prinz, Christopher T. Schwanen, Wil M. P. van der Aalst

Comments 38 pages, 18 figures

2602.02445 2026-02-03 cs.LG math.ST stat.TH

Finite-Sample Wasserstein Error Bounds and Concentration Inequalities for Nonlinear Stochastic Approximation

Seo Taek Kong, R. Srikant

2602.02440 2026-02-03 cs.CL

Large Language Models for Mental Health: A Multilingual Evaluation

Nishat Raihan, Sadiya Sayara Chowdhury Puspo, Ana-Maria Bucur, Stevie Chancellor, Marcos Zampieri

2602.02439 2026-02-03 cs.NE cs.ET cs.LG

Energy-Efficient Neuromorphic Computing for Edge AI: A Framework with Adaptive Spiking Neural Networks and Hardware-Aware Optimization

Olaf Yunus Laitinen Imanov, Derya Umut Kulali, Taner Yilmaz, Duygu Erisken, Rana Irem Turhan

Comments 8 pages, 4 figures, 4 tables. Submitted to IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

2602.02438 2026-02-03 cs.DC

sVIRGO: A Scalable Virtual Tree Hierarchical Framework for Distributed Systems

Lican Huang

Comments 10 pages

2602.02435 2026-02-03 cs.IT cs.NI cs.SY eess.SY math.IT

Preemptive Scheduling for Age of Job Minimization in Task-Specific Machine Networks

Subhankar Banerjee, Sennur Ulukus

2602.02432 2026-02-03 cs.LG math.OC stat.ML

Maximizing Reliability with Bayesian Optimization

Jack M. Buckingham, Ivo Couckuyt, Juergen Branke

Comments 25 pages, 9 figures

2602.02430 2026-02-03 cs.RO

3D Foundation Model-Based Loop Closing for Decentralized Collaborative SLAM

Pierre-Yves Lajoie, Benjamin Ramtoula, Daniele De Martini, Giovanni Beltrame

2602.02426 2026-02-03 cs.CV

SelvaMask: Segmenting Trees in Tropical Forests and Beyond

Simon-Olivier Duguay, Hugo Baudchon, Etienne Laliberté, Helene Muller-Landau, Gonzalo Rivas-Torres, Arthur Ouaknine

Comments 22 pages, 8 figures

2602.02425 2026-02-03 cs.LG q-bio.QM

Repurposing Protein Language Models for Latent Flow-Based Fitness Optimization

Amaru Caceres Arroyo, Lea Bogensperger, Ahmed Allam, Michael Krauthammer, Konrad Schindler, Dominik Narnhofer

2602.02422 2026-02-03 cs.LG cs.AI

Poly-attention: a general scheme for higher-order self-attention

Sayak Chakrabarti, Toniann Pitassi, Josh Alman

Abstract

The self-attention mechanism, at the heart of the Transformer model, is able to effectively model pairwise interactions between tokens. However, numerous recent works have shown that it is unable to perform basic tasks involving detecting triples of correlated tokens, or compositional tasks where multiple input tokens need to be referenced to generate a result. Some higher-dimensional alternatives to self-attention have been proposed to address this, including higher-order attention and Strassen attention, which can perform some of these polyadic tasks in exchange for slower, superquadratic running times. In this work, we define a vast class of generalizations of self-attention, which we call poly-attention mechanisms. Our mechanisms can incorporate arbitrary higher-order (tensor) computations as well as arbitrary relationship structures between the input tokens, and they include the aforementioned alternatives as special cases. We then systematically study their computational complexity and representational strength, including giving new algorithms and matching complexity-theoretic lower bounds on the time complexity of computing the attention matrix exactly as well as approximately, and tightly determining which polyadic tasks they can each perform. Our results give interesting trade-offs between different desiderata for these mechanisms, including a tight relationship between how expressive a mechanism is, and how large the coefficients in the model may be so that the mechanism can be approximated in almost-linear time. Notably, we give a new attention mechanism which can be computed exactly in quadratic time, and which can perform function composition for any fixed number of functions. Prior mechanisms, even for just composing two functions, could only be computed in superquadratic time, and our new lower bounds show that faster algorithms for them are not possible.
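The contrast the abstract draws between pairwise self-attention and higher-order (tensor) alternatives can be made concrete with a small sketch. The code below is an illustration only, written under stated assumptions: `pairwise_attention` is ordinary scaled dot-product attention, and `triple_attention` is one common third-order variant that scores every pair of tokens against each query. It is not the paper's poly-attention definition, and all function and parameter names are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def pairwise_attention(Q, K, V):
    """Standard self-attention: query i scores each single token j,
    so n tokens produce n * n scores."""
    d = len(Q[0])
    out = []
    for q in Q:
        w = softmax([dot(q, k) / math.sqrt(d) for k in K])
        out.append([sum(w[j] * V[j][c] for j in range(len(V)))
                    for c in range(len(V[0]))])
    return out


def triple_attention(Q, K1, K2, V1, V2):
    """Illustrative third-order attention (an assumption, not the paper's
    construction): query i scores each token PAIR (j, k) with a trilinear
    form, so n tokens produce n * n * n scores."""
    n, d = len(Q), len(Q[0])
    out = []
    for q in Q:
        scores = [sum(q[l] * K1[j][l] * K2[k][l] for l in range(d))
                  for j in range(n) for k in range(n)]
        w = softmax(scores)
        row = [0.0] * len(V1[0])
        for idx, weight in enumerate(w):
            j, k = divmod(idx, n)  # recover the pair from the flat index
            for c in range(len(row)):
                # Value for a pair: elementwise product of the two value rows.
                row[c] += weight * V1[j][c] * V2[k][c]
        out.append(row)
    return out


if __name__ == "__main__":
    X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    print(pairwise_attention(X, X, X))
    print(triple_attention(X, X, X, X, X))
```

The nested loop over pairs in `triple_attention` is where the extra cost comes from: the naive computation touches on the order of n³ scores rather than n², which is the kind of superquadratic running time the abstract attributes to earlier higher-order mechanisms.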


2602.02415 2026-02-03 cs.LG

Active Transfer Bagging: A New Approach for Accelerated Active Learning Acquisition of Data by Combined Transfer Learning and Bagging Based Models

Vivienne Pelletier, Daniel J. Rivera, Obinna Nwokonkwo, Steven A. Wilson, Christopher L. Muhich