arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2509.25140 2026-03-18 cs.AI cs.CL

ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory

Siru Ouyang, Jun Yan, I-Hung Hsu, Yanfei Chen, Ke Jiang, Zifeng Wang, Rujun Han, Long T. Le, Samira Daruki, Xiangru Tang, Vishy Tirumalashetty, George Lee, Mahsan Rofouei, Hangfei Lin, Jiawei Han, Chen-Yu Lee, Tomas Pfister

Comments Accepted to ICLR 2026; Code: https://github.com/google-research/reasoning-bank

详情

英文摘要

With the growing adoption of large language model agents in persistent real-world roles, they naturally encounter continuous streams of tasks. A key limitation, however, is their failure to learn from the accumulated interaction history, forcing them to discard valuable insights and repeat past errors. We propose ReasoningBank, a novel memory framework that distills generalizable reasoning strategies from an agent's self-judged successful and failed experiences. At test time, an agent retrieves relevant memories from ReasoningBank to inform its interaction and then integrates new learnings back, enabling it to become more capable over time. Building on this powerful experience learner, we further introduce memory-aware test-time scaling (MaTTS), which accelerates and diversifies this learning process by scaling up the agent's interaction experience. By allocating more compute to each task, the agent generates abundant, diverse experiences that provide rich contrastive signals for synthesizing higher-quality memory. The better memory in turn guides more effective scaling, establishing a powerful synergy between memory and test-time scaling. Across web browsing and software engineering benchmarks, ReasoningBank consistently outperforms existing memory mechanisms that store raw trajectories or only successful task routines, improving both effectiveness and efficiency; MaTTS further amplifies these gains. These findings establish memory-driven experience scaling as a new scaling dimension, enabling agents to self-evolve with emergent behaviors naturally arise. Our code can be found at https://github.com/google-research/reasoning-bank.

URL PDF HTML ☆

赞 0 踩 0

2509.23252 2026-03-18 cs.LG

NanoFlux: Adversarial Dual-LLM Evaluation and Distillation For Multi-Domain Reasoning

Raviteja Anantha, Soheil Hor, Teodor Nicola Antoniu, Layne C. Price

Comments Preprint version 3; Updated References

2509.22819 2026-03-18 cs.AI cs.FL cs.LG

Hilbert: Recursively Building Formal Proofs with Informal Reasoning

Sumanth Varambally, Thomas Voice, Yanchao Sun, Zhifeng Chen, Rose Yu, Ke Ye

2509.22493 2026-03-18 cs.RO cs.AI cs.IR cs.LO

Ontological foundations for contrastive explanatory narration of robot plans

Alberto Olivares-Alarcos, Sergi Foix, Júlia Borràs, Gerard Canal, Guillem Alenyà

2509.21991 2026-03-18 cs.CV cs.AI cs.CL cs.LG

ERGO: Efficient High-Resolution Visual Understanding for Vision-Language Models

Jewon Lee, Wooksu Shin, Seungmin Yang, Ki-Ung Song, DongUk Lim, Jaeyeon Kim, Tae-Ho Kim, Bo-Kyeong Kim

2509.20557 2026-03-18 cs.CL

SiniticMTError: A Machine Translation Dataset with Error Annotations for Sinitic Languages

Hannah Liu, Junghyun Min, En-Shiun Annie Lee, Ethan Yue Heng Cheung, Shou-Yi Hung, Elsie Chan, Shiyao Qian, Runtong Liang, Kimlan Huynh, Wing Yu Yip, York Hay Ng, TSZ Fung Yau, Ka Ieng Charlotte Lo, You-Wei Wu, Richard Tzong-Han Tsai

Comments LREC 2026 camera-ready. 23 pages, 2 figures, 11 tables

2509.19142 2026-03-18 cs.RO

BiGraspFormer: End-to-End Bimanual Grasp Transformer

Kangmin Kim, Seunghyeok Back, Geonhyup Lee, Sangbeom Lee, Sangjun Noh, Kyoobin Lee

Comments 8 pages, 5 figures

2509.17930 2026-03-18 cs.CL cs.AI

Transformer-Encoder Trees for Efficient Multilingual Machine Translation and Speech Translation

Yiwen Guan, Jacob Whitehill

2509.17325 2026-03-18 cs.LG cs.AI cs.CL

Generalizable End-to-End Tool-Use RL with Synthetic CodeGym

Weihua Du, Hailei Gong, Zhan Ling, Kang Liu, Lingfeng Shen, Xuesong Yao, Yufei Xu, Dingyuan Shi, Yiming Yang, Jiecao Chen

Comments 24 pages. Accepted to ICLR 2026. Project repository: https://github.com/StigLidu/CodeGym

2509.05469 2026-03-18 cs.AI cs.CV cs.CY cs.HC

From Image Generation to Infrastructure Design: a Multi-agent Pipeline for Street Design Generation

Chenguang Wang, Xiang Yan, Yilong Dai, Ziyi Wang, Susu Xu

Comments 25 pages, 8 figures

2509.03951 2026-03-18 cs.CV

ANTS: Adaptive Negative Textual Space Shaping for OOD Detection via Test-Time MLLM Understanding and Reasoning

Wenjie Zhu, Yabin Zhang, Xin Jin, Wenjun Zeng, Lei Zhang

Comments Accepted by CVPR2026, Project Page: https://zhuwenjie98.github.io/ANTS-project-page/

2509.01167 2026-03-18 cs.CV cs.CL cs.LG

TempCore: Are Video QA Benchmarks Temporally Grounded? A Frame Selection Sensitivity Analysis and Benchmark

Hyunjong Ok, Jaeho Lee

Comments preprint

2508.13911 2026-03-18 cs.CV

PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis

Chunji Lv, Zequn Chen, Donglin Di, Weinan Zhang, Hao Li, Wei Chen, Yinjie Lei, Changsheng Li

Comments CVPR 2026

2508.13697 2026-03-18 cs.AI

The DeepLog Neurosymbolic Machine

Vincent Derkinderen, Robin Manhaeve, Rik Adriaensen, Lucas Van Praet, Lennert De Smet, Giuseppe Marra, Luc De Raedt

2508.13000 2026-03-18 cs.CV

Omni Survey for Multimodality Analysis in Visual Object Tracking

Zhangyong Tang, Tianyang Xu, Xuefeng Zhu, Hui Li, Shaochuan Zhao, Tao Zhou, Chunyang Cheng, Xiaojun Wu, Josef Kittler

Comments The first comprehensive survey for multi-modal visual object tracking; 6 multi-modal tasks; 338 references

2508.11929 2026-03-18 cs.RO cs.AI

No More Blind Spots: Learning Vision-Based Omnidirectional Bipedal Locomotion for Challenging Terrain

Mohitvishnu S. Gadde, Pranay Dugar, Ashish Malik, Alan Fern

2508.11408 2026-03-18 cs.LG cs.AI

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Wenhao Zhang, Yuexiang Xie, Yuchang Sun, Yanxi Chen, Guoyin Wang, Yaliang Li, Bolin Ding, Jingren Zhou

2508.08139 2026-03-18 cs.CL cs.AI

Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models

Tianyi Zhou, Johanne Medina, Sanjay Chawla

Comments Published at AAAI'26

2508.05190 2026-03-18 cs.LG

Physics-Informed Time-Integrated DeepONet: Temporal Tangent Space Operator Learning for High-Accuracy Inference

Luis Mandl, Dibyajyoti Nayak, Tim Ricken, Somdatta Goswami

Comments 22 pages, 21 figures, 4 tables

详情

DOI: 10.1016/j.cma.2026.118917

英文摘要

Accurately modeling and inferring solutions to time-dependent partial differential equations (PDEs) over extended horizons remains a core challenge in scientific machine learning. Traditional full rollout (FR) methods, which predict entire trajectories in one pass, often fail to capture the causal dependencies and generalize poorly outside the training time horizon. Autoregressive (AR) approaches, evolving the system step by step, suffer from error accumulation, limiting long-term accuracy. These shortcomings limit the long-term accuracy and reliability of both strategies. To address these issues, we introduce the Physics-Informed Time-Integrated Deep Operator Network (PITI-DeepONet), a dual-output architecture trained via physics-informed or hybrid physics- and data-driven objectives to ensure stable, accurate long-term evolution well beyond the training horizon. Instead of forecasting future states, the network learns the time-derivative operator from the current state, integrating it using classical time-stepping schemes to advance the solution in time. Additionally, the framework can leverage residual monitoring during inference to estimate prediction quality and detect when the system transitions outside the training domain. Applied to benchmark problems, PITI-DeepONet demonstrates enhanced accuracy and stability over extended inference time horizons when compared to traditional methods. Mean relative $\mathcal{L}_2$ errors reduced by 84\% (versus FR) and 79\% (versus AR) for 1D heat equation; by 87\% (versus FR) and 98\% (versus AR) for the 1D Burgers equation; by 42\% (versus FR) and 89\% (versus AR) for the 2D Allen-Cahn equation; and by 58\% (vs. FR) and 61\% (vs. AR) for the 1D Kuramoto-Sivashinsky equation. By moving beyond classic FR and AR schemes, PITI-DeepONet paves the way for more reliable, long-term integration of complex, time-dependent PDEs.

URL PDF HTML ☆

赞 0 踩 0

2508.02192 2026-03-18 cs.CV

Content-Aware Mamba for Learned Image Compression

Yunuo Chen, Zezheng Lyu, Bing He, Hongwei Hu, Qi Wang, Yuan Tian, Li Song, Wenjun Zhang, Guo Lu

Comments ICLR2026 poster

2508.00635 2026-03-18 cs.LG

KFS: KAN based adaptive Frequency Selection learning architecture for long term time series forecasting

Changning Wu, Gao Wu, Rongyao Cai, Yong Liu, Kexin Zhang

Comments arXiv admin note: text overlap with arXiv:2406.03751 by other authors

2507.21524 2026-03-18 cs.AI cs.IT math.IT

Large Language Models for Wireless Communications: From Adaptation to Autonomy

Le Liang, Hao Ye, Yucheng Sheng, Ouya Wang, Jiacheng Wang, Shi Jin, Geoffrey Ye Li

2507.13353 2026-03-18 cs.CV cs.AI

VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding

Shihao Wang, Guo Chen, De-an Huang, Zhiqi Li, Minghan Li, Guilin Liu, Jose M. Alvarez, Lei Zhang, Zhiding Yu

Comments Accepted by CVPR

2507.03637 2026-03-18 cs.AI

Large Language Models for Combinatorial Optimization: A Systematic Review

Francesca Da Ros, Michael Soprano, Luca Di Gaspero, Kevin Roitero

2507.02438 2026-03-18 cs.RO cs.HC cs.SY eess.SY

Minimal Intervention Shared Control with Guaranteed Safety under Non-Convex Constraints

Shivam Chaubey, Francesco Verdoja, Shankar Deka, Ville Kyrki

Comments Accepted for publication at the 2026 IEEE International Conference on Robotics and Automation (ICRA)

2507.00965 2026-03-18 cs.LG

Scalable Feature Learning on Huge Knowledge Graphs for Downstream Machine Learning

Félix Lefebvre, Gaël Varoquaux

Comments Code available at https://github.com/flefebv/sepal.git

2506.23205 2026-03-18 cs.CV

BridgeShape: Latent Diffusion Schrödinger Bridge for 3D Shape Completion

Dequan Kong, Honghua Chen, Zhe Zhu, Mingqiang Wei

Comments Accepted by AAAI2026

2506.14837 2026-03-18 cs.CV cs.AI

Improved Iterative Refinement for Chart-to-Code Generation via Structured Instruction

Chengzhi Xu, Yuyang Wang, Lai Wei, Lichao Sun, Weiran Huang

2506.03674 2026-03-18 cs.LG

Out-of-Distribution Graph Models Merging

Yidi Wang, Ziyue Qiao, Jiawei Gu, Xubin Zheng, Pengyang Wang, Xiaobing Pei, Xiao Luo

2506.01989 2026-03-18 cs.LG cs.AI cs.CR

Coded Robust Aggregation for Distributed Learning under Byzantine Attacks

Chengxi Li, Ming Xiao, Mikael Skoglund

Comments C. Li, M. Xiao and M. Skoglund, "Coded Robust Aggregation for Distributed Learning Under Byzantine Attacks," in IEEE Transactions on Information Forensics and Security, vol. 20, pp. 11636-11651, 2025, doi: 10.1109/TIFS.2025.3624620