arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.08468 2026-04-10 cs.LG cs.AI

TTVS: Boosting Self-Exploring Reinforcement Learning via Test-time Variational Synthesis

Sikai Bai, Haoxi Li, Jie Zhang, Yongjiang Liu, Song Guo

详情

英文摘要

Despite significant advances in Large Reasoning Models (LRMs) driven by reinforcement learning with verifiable rewards (RLVR), this paradigm is fundamentally limited in specialized or novel domains where such supervision is prohibitively expensive or unavailable, posing a key challenge for test-time adaptation. While existing test-time methods offer a potential solution, they are constrained by learning from static query sets, risking overfitting to textual patterns. To address this gap, we introduce Test-Time Variational Synthesis (TTVS), a novel framework that enables LRMs to self-evolve by dynamically augmenting the training stream from unlabeled test queries. TTVS comprises two synergistic modules: (1) Online Variational Synthesis, which transforms static test queries into a dynamic stream of diverse, semantically-equivalent variations, enforcing the model to learn underlying problem logic rather than superficial patterns; (2) Test-time Hybrid Exploration, which balances accuracy-driven exploitation with consistency-driven exploration across synthetic variants. Extensive experiments show TTVS yields superior performance across eight model architectures. Notably, using only unlabeled test-time data, TTVS not only surpasses other test-time adaptation methods but also outperforms state-of-the-art supervised RL-based techniques trained on vast, high-quality labeled data.

URL PDF HTML ☆

赞 0 踩 0

2604.08465 2026-04-10 cs.AI cs.CY cs.MA

From Safety Risk to Design Principle: Peer-Preservation in Multi-Agent LLM Systems and Its Implications for Orchestrated Democratic Discourse Analysis

Juergen Dietrich

Comments 9 pages, 1 figure

2604.08461 2026-04-10 cs.CV cs.AI

OVS-DINO: Open-Vocabulary Segmentation via Structure-Aligned SAM-DINO with Language Guidance

Haoxi Zeng, Qiankun Liu, Yi Bin, Haiyue Zhang, Yujuan Ding, Guoqing Wang, Deqiang Ouyang, Heng Tao Shen

Comments 14 pages, 12 figures, 5 tables

2604.08460 2026-04-10 cs.LG cs.AI

A Machine Learning Framework for Turbofan Health Estimation via Inverse Problem Formulation

Milad Leyli-Abadi, Lucas Thil, Sebastien Razakarivony, Guillaume Doquet, Jesse Read

Comments Submitted at ECML PKDD 2026

2604.08456 2026-04-10 cs.CV cs.CL

Entropy-Gradient Grounding: Training-Free Evidence Retrieval in Vision-Language Models

Marcel Gröpl, Jaewoo Jung, Seungryong Kim, Marc Pollefeys, Sunghwan Hong

Comments Project Page : https://entropy-gradient-grounding.github.io/

2604.08455 2026-04-10 cs.AI

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Tongbo Chen, Zhengxi Lu, Zhan Xu, Guocheng Shao, Shaohan Zhao, Fei Tang, Yong Du, Kaitao Song, Yizhou Liu, Yuchen Yan, Wenqi Zhang, Xu Tan, Weiming Lu, Jun Xiao, Yueting Zhuang, Yongliang Shen

2604.08454 2026-04-10 cs.LG

Less Approximates More: Harmonizing Performance and Confidence Faithfulness via Hybrid Post-Training for High-Stakes Tasks

Haokai Ma, Lee Yan Zhen, Gang Yang, Yunshan Ma, Ee-Chien Chang, Tat-Seng Chua

2604.08450 2026-04-10 cs.SD eess.AS

DeepFense: A Unified, Modular, and Extensible Framework for Robust Deepfake Audio Detection

Yassine El Kheir, Arnab Das, Yixuan Xiao, Xin Wang, Feidi Kallel, Enes Erdem Erdogan, Ngoc Thang Vu, Tim Polzehl, Sebastian Moeller

Comments Deepfense Toolkit

2604.08448 2026-04-10 cs.CL

AfriVoices-KE: A Multilingual Speech Dataset for Kenyan Languages

Lilian Wanzare, Cynthia Amol, zekiel Maina, Nelson Odhiambo, Hope Kerubo, Leila Misula, Vivian Oloo, Rennish Mboya, Edwin Onkoba, Edward Ombui, Joseph Muguro, Ciira wa Maina, Andrew Kipkebut, Alfred Omondi Otom, Ian Ndung'u Kang'ethe, Angela Wambui Kanyi, Brian Gichana Omwenga

Comments 10 pages, 5 figures, 3 tables

2604.08443 2026-04-10 cs.RO cs.HC

A Soft Robotic Interface for Chick-Robot Affective Interactions

Jue Chen, Alexander Mielke, Kaspar Althoefer, Elisabetta Versace

2604.08435 2026-04-10 cs.CV cs.AI

HST-HGN: Heterogeneous Spatial-Temporal Hypergraph Networks with Bidirectional State Space Models for Global Fatigue Assessment

Changdao Chen

Comments 10 pages

2604.08425 2026-04-10 cs.AI cs.CL

Learning Who Disagrees: Demographic Importance Weighting for Modeling Annotator Distributions with DiADEM

Samay U. Shetty, Tharindu Cyril Weerasooriya, Deepak Pandita, Christopher M. Homan

2604.08424 2026-04-10 cs.AI cs.LG

On-board Telemetry Monitoring in Autonomous Satellites: Challenges and Opportunities

Lorenzo Capelli, Leandro de Souza Rosa, Maurizio De Tommasi, Livia Manovi, Andriy Enttsel, Mauro Mangia, Riccardo Rovatti, Ilaria Pinci, Carlo Ciancarelli, Eleonora Mariotti, Gianluca Furano

2604.08423 2026-04-10 cs.CL cs.AI cs.LG stat.ML

Synthetic Data for any Differentiable Target

Tristan Thrush, Sung Min Park, Herman Brunborg, Luke Bailey, Marcel Roed, Neil Band, Christopher Potts, Tatsunori Hashimoto

2604.08418 2026-04-10 cs.RO cs.AI

Exploring Temporal Representation in Neural Processes for Multimodal Action Prediction

Marco Gabriele Fedozzi, Yukie Nagai, Francesco Rea, Alessandra Sciutti

Comments Submitted to the AIC 2023 (9th International Workshop on Artificial Intelligence and Cognition)

2604.08412 2026-04-10 cs.SD cs.AI eess.AS

Selective Attention System (SAS): Device-Addressed Speech Detection for Real-Time On-Device Voice AI

David Joohun Kim, Daniyal Anjum, Bonny Banerjee, Omar Abbasi

2604.08401 2026-04-10 cs.AI cs.CL

Verify Before You Commit: Towards Faithful Reasoning in LLM Agents via Self-Auditing

Wenhao Yuan, Chenchen Lin, Jian Chen, Jinfeng Xu, Xuehe Wang, Edith Cheuk Han Ngai

Comments Accepted by ACL2026 Main Conference

2604.08400 2026-04-10 cs.LG cs.AI

Zero-shot Multivariate Time Series Forecasting Using Tabular Prior Fitted Networks

Mayuka Jayawardhana, Nihal Sharma, Kazem Meidani, Bayan Bruss, Tom Goldstein, Doron Bergman

2604.08398 2026-04-10 cs.LG cs.AI

ADAPTive Input Training for Many-to-One Pre-Training on Time-Series Classification

Paul Quinlan, Qingguo Li, Xiaodan Zhu

2604.08395 2026-04-10 cs.CV cs.AI

Phantasia: Context-Adaptive Backdoors in Vision Language Models

Nam Duong Tran, Phi Le Nguyen

Comments CVPR 2026 Findings

2604.08388 2026-04-10 cs.AI

Awakening the Sleeping Agent: Lean-Specific Agentic Data Reactivates General Tool Use in Goedel Prover

Jui-Hui Chung, Hongzhou Lin, Lai Jiang, Shange Tang, Chi Jin

2604.08381 2026-04-10 cs.CL cs.AI

A GAN and LLM-Driven Data Augmentation Framework for Dynamic Linguistic Pattern Modeling in Chinese Sarcasm Detection

Wenxian Wang, Xiaohu Luo, Junfeng Hao, Xiaoming Gu, Xingshu Chen, Zhu Wang, Haizhou Wang

2604.08377 2026-04-10 cs.AI cs.CL

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Ziyu Ma, Shidong Yang, Yuxiang Ji, Xucong Wang, Yong Wang, Yiming Hu, Tongwen Huang, Xiangxiang Chu

Comments Work in progress

2604.08370 2026-04-10 cs.CV

SurfelSplat: Learning Efficient and Generalizable Gaussian Surfel Representations for Sparse-View Surface Reconstruction

Chensheng Dai, Shengjun Zhang, Min Chen, Yueqi Duan

Comments Code is available at https://github.com/Simon-Dcs/Surfel_Splat

2604.08369 2026-04-10 cs.AI cs.CL cs.MA

Don't Overthink It: Inter-Rollout Action Agreement as a Free Adaptive-Compute Signal for LLM Agents

Khushal Sethi

2604.08368 2026-04-10 cs.LG cs.CL cs.CV

SOLAR: Communication-Efficient Model Adaptation via Subspace-Oriented Latent Adapter Reparametrization

Seyed Mahmoud Sajjadi Mohammadabadi, Xiaolong Ma, Lei Yang, Feng Yan, Junshan Zhang

2604.08366 2026-04-10 cs.LG cs.AI cs.CV

Scaling-Aware Data Selection for End-to-End Autonomous Driving Systems

Tolga Dimlioglu, Nadine Chang, Maying Shen, Rafid Mahmood, Jose M. Alvarez

Comments Accepted to CVPR 2026, 8 pages of main body and 10 pages of appendix

2604.08363 2026-04-10 cs.SD

CapTalk: Unified Voice Design for Single-Utterance and Dialogue Speech Generation

Xiaosu Su, Zihan Sun, Peilei Jia, Jun Gao

Comments 14 pages, 2 figures

2604.08344 2026-04-10 cs.AI cs.HC

Human-AI Collaboration Reconfigures Group Regulation from Socially Shared to Hybrid Co-Regulation

Yujing Zhang, Xianghui Meng, Shihui Feng, Jionghao Lin

Comments 9 pages, 2 figures. Accepted at AIED 2026. Camera-ready version with updated references

2604.08342 2026-04-10 cs.LG

EgoEverything: A Benchmark for Human Behavior Inspired Long Context Egocentric Video Understanding in AR Environment

Qiance Tang, Ziqi Wang, Jieyu Lin, Ziyun Li, Barbara De Salvo, Sai Qian Zhang