arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.11924 2026-04-15 cs.AI cs.CL

GoodPoint: Learning Constructive Scientific Paper Feedback from Author Responses

Jimin Mun, Chani Jung, Xuhui Zhou, Hyunwoo Kim, Maarten Sap

Comments 22 pages, 6 figures

详情

英文摘要

While LLMs hold significant potential to transform scientific research, we advocate for their use to augment and empower researchers rather than to automate research without human oversight. To this end, we study constructive feedback generation, the task of producing targeted, actionable feedback that helps authors improve both their research and its presentation. In this work, we operationalize the effectiveness of feedback along two author-centric axes-validity and author action. We first curate GoodPoint-ICLR, a dataset of 19K ICLR papers with reviewer feedback annotated along both dimensions using author responses. Building on this, we introduce GoodPoint, a training recipe that leverages success signals from author responses through fine-tuning on valid and actionable feedback, together with preference optimization on both real and synthetic preference pairs. Our evaluation on a benchmark of 1.2K ICLR papers shows that a GoodPoint-trained Qwen3-8B improves the predicted success rate by 83.7% over the base model and sets a new state-of-the-art among LLMs of similar size in feedback matching on a golden human feedback set, even surpassing Gemini-3-flash in precision. We further validate these findings through an expert human study, demonstrating that GoodPoint consistently delivers higher practical value as perceived by authors.

URL PDF HTML ☆

赞 0 踩 0

2604.11915 2026-04-15 cs.LG cs.AI cs.NE q-bio.PE

Can AI Detect Life? Lessons from Artificial Life

Ankit Gupta, Christoph Adami

Comments 6 pages, 7 figures. Alife 2026

2604.11914 2026-04-15 cs.AI

Self-Monitoring Benefits from Structural Integration: Lessons from Metacognition in Continuous-Time Multi-Timescale Agents

Ying Xie

详情

英文摘要

Self-monitoring capabilities -- metacognition, self-prediction, and subjective duration -- are often proposed as useful additions to reinforcement learning agents. But do they actually help? We investigate this question in a continuous-time multi-timescale agent operating in predator-prey survival environments of varying complexity, including a 2D partially observable variant. We first show that three self-monitoring modules, implemented as auxiliary-loss add-ons to a multi-timescale cortical hierarchy, provide no statistically significant benefit across 20 random seeds, 1D and 2D predator-prey environments with standard and non-stationary variants, and training horizons up to 50,000 steps. Diagnosing the failure, we find the modules collapse to near-constant outputs (confidence std < 0.006, attention allocation std < 0.011) and the subjective duration mechanism shifts the discount factor by less than 0.03%. Policy sensitivity analysis confirms the agent's decisions are unaffected by module outputs in this design. We then show that structurally integrating the module outputs -- using confidence to gate exploration, surprise to trigger workspace broadcasts, and self-model predictions as policy input -- produces a medium-large improvement over the add-on approach (Cohen's d = 0.62, p = 0.06, paired) in a non-stationary environment. Component-wise ablations reveal that the TSM-to-policy pathway contributes most of this gain. However, structural integration does not significantly outperform a baseline with no self-monitoring (d = 0.15, p = 0.67), and a parameter-matched control without modules performs comparably, so the benefit may lie in recovering from the trend-level harm of ignored modules rather than in self-monitoring content. The architectural implication is that self-monitoring should sit on the decision pathway, not beside it.

URL PDF HTML ☆

赞 0 踩 0

2604.11913 2026-04-15 cs.CV

V-Nutri: Dish-Level Nutrition Estimation from Egocentric Cooking Videos

Chengkun Yue, Chuanzhi Xu, Jiangpeng He

Comments Accepted to the 3rd MetaFood Workshop at CVPR 2026

2604.11912 2026-04-15 cs.LG cs.AI

How Transformers Learn to Plan via Multi-Token Prediction

Jianhao Huang, Zhanpeng Zhou, Renqiu Xia, Baharan Mirzasoleiman, Weijie Su, Wei Huang

2604.11868 2026-04-15 cs.CV

MedConcept: Unsupervised Concept Discovery for Interpretability in Medical VLMs

Md Rakibul Haque, KM Arefeen Sultan, Tushar Kataria, Shireen Elhabian

2604.11867 2026-04-15 cs.LG cs.AI

Disposition Distillation at Small Scale: A Three-Arc Negative Result

Hari Sadasivan

Comments 16 pages, 4 figures

2604.11861 2026-04-15 cs.RO cs.MA

BIND-USBL: Bounding IMU Navigation Drift using USBL in Heterogeneous ASV-AUV Teams

Pranav Kedia, Rajini Makam, Heiko Hamann, Suresh Sundaram

Comments Accepted at OCEANS 2026, Sanya, China

2604.11854 2026-04-15 cs.RO cs.AI

MVAdapt: Zero-Shot Multi-Vehicle Adaptation for End-to-End Autonomous Driving

Haesung Oh, Jaeheung Park

2604.11843 2026-04-15 cs.CV

UniMark: Unified Adaptive Multi-bit Watermarking for Autoregressive Image Generators

Yigit Yilmaz, Elena Petrova, Mehmet Kaya, Lucia Rossi, Amir Rahman

Comments work in progress

2604.11842 2026-04-15 cs.LG cs.AI

DBGL: Decay-aware Bipartite Graph Learning for Irregular Medical Time Series Classification

Jian Chen, Yuzhu Hu, Xiaoyan Yuan, Yuxuan Hu, Jinfeng Xu, Yipeng Du, Wenhao Yuan, Wei Wang, Edith C. H. Ngai

2604.11841 2026-04-15 cs.LG cs.AI

Polynomial Expansion Rank Adaptation: Enhancing Low-Rank Fine-Tuning with High-Order Interactions

Wenhao Zhang, Lin Mu, Li Ni, Peiquan Jin, Yiwen Zhang

Comments Accepted by ACL 2026 findings

2604.11838 2026-04-15 cs.LG cs.AI

A Layer-wise Analysis of Supervised Fine-Tuning

Qinghua Zhao, Xueling Gong, Xinyu Chen, Zhongfeng Kang, Xinlu Li

Comments Accepted by ACL 2026 main conference

2604.11835 2026-04-15 cs.LG cs.AI

Schema-Adaptive Tabular Representation Learning with LLMs for Generalizable Multimodal Clinical Reasoning

Hongxi Mao, Wei Zhou, Mengting Jia, Tao Fang, Huan Gao, Bin Zhang, Shangyang Li

Comments 11 pages, 4 figures

2604.11833 2026-04-15 cs.LG

Uncertainty Quantification in CNN Through the Bootstrap of Convex Neural Networks

Hongfei Du, Emre Barut, Fang Jin

Comments 9 pages, 1 figure. Accepted at AAAI 2021

2604.11628 2026-04-15 cs.CL cs.AI

Back to Basics: Let Conversational Agents Remember with Just Retrieval and Generation

Yuqian Wu, Wei Chen, Zhengjun Huang, Junle Chen, Qingxiang Liu, Kai Wang, Xiaofang Zhou, Yuxuan Liang

Comments 23 pages, 12 figures

2604.11626 2026-04-15 cs.AI cs.LG

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Haozhe Wang, Cong Wei, Weiming Ren, Jiaming Liu, Fangzhen Lin, Wenhu Chen

Comments Project Page: https://tiger-ai-lab.github.io/RationalRewards/ ; Code, Dataset, Models are released

2604.11554 2026-04-15 cs.CL

Relax: An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Liujie Zhang, Benzhe Ning, Rui Yang, Xiaoyan Yu, Jiaxing Li, Lumeng Wu, Jia Liu, Minghao Li, Weihang Chen, Weiqi Hu, Lei Zhang

Comments 17 pages, 22 figures

2604.11479 2026-04-15 cs.LG econ.GN physics.soc-ph q-fin.EC

Structural Consequences of Policy-Based Interventions on the Global Supply Chain Network

Lea Karbevska, Liming Xu, Zehui Dai, Sara AlMahri, Alexandra Brintrup

2604.11390 2026-04-15 cs.CV

Beyond Reconstruction: Reconstruction-to-Vector Diffusion for Hyperspectral Anomaly Detection

Jijun Xiang, Tao Wang, Jiayi Wang, Pengxiang Wang, Cheng Chen, Nian Wang

2604.11246 2026-04-15 cs.CL

Judge Like Human Examiners: A Weighted Importance Multi-Point Evaluation Framework for Generative Tasks with Long-form Answers

Guoxin Yu, Chulun Zhou, Lemao Liu, Qi Wang, Mo Yu, Jialong Tang, Baosong Yang, Xiang Ao, Wai Lam, Yue Yu

Comments 21 pages

2604.11201 2026-04-15 cs.CL cs.AI

CocoaBench: Evaluating Unified Digital Agents in the Wild

CocoaBench Team, Shibo Hao, Zhining Zhang, Zhiqi Liang, Tianyang Liu, Yuheng Zha, Qiyue Gao, Jixuan Chen, Zilong Wang, Zhoujun Cheng, Haoxiang Zhang, Junli Wang, Hexi Jin, Boyuan Zheng, Kun Zhou, Yu Wang, Feng Yao, Licheng Liu, Yijiang Li, Zhifei Li, Zhengtao Han, Pracha Promthaw, Tommaso Cerruti, Xiaohan Fu, Ziqiao Ma, Jingbo Shang, Lianhui Qin, Julian McAuley, Eric P. Xing, Zhengzhong Liu, Rupesh Kumar Srivastava, Zhiting Hu

Comments Project page: https://cocoabench.github.io/

2604.11146 2026-04-15 cs.LG cs.DC

A Full Compression Pipeline for Green Federated Learning in Communication-Constrained Environments

Elouan Colybes, Shirin Salehi, Anke Schmeink

Comments This work was accepted at IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN), 2026

2604.11120 2026-04-15 cs.AI

Persona Non Grata: Single-Method Safety Evaluation Is Incomplete for Persona-Imbued LLMs

Wenkai Li, Fan Yang, Shaunak A. Mehta, Koichi Onoue

2604.10950 2026-04-15 cs.CV

Bootstrapping Video Semantic Segmentation Model via Distillation-assisted Test-Time Adaptation

Jihun Kim, Hoyong Kwon, Hyeokjun Kweon, Kuk-Jin Yoon

Comments accepted at CVPR 2026

2604.10911 2026-04-15 cs.AI cs.LG

EvoNash-MARL: A Closed-Loop Multi-Agent Reinforcement Learning Framework for Medium-Horizon Equity Allocation

Chongliu Jia, Yi Luo, Sipeng Han, Pengwei Li, Jie Ding, Youshuang Hu, Yimiao Qian, Qiya Wang

2604.10910 2026-04-15 cs.CV

STGV: Spatio-Temporal Hash Encoding for Gaussian-based Video Representation

Jierun Lin, Jiacong Chen, Qingyu Mao, Shuai Liu, Xiandong Meng, Fanyang Meng, Yongsheng Liang

2604.10815 2026-04-15 cs.SD cs.AI cs.MA

MeloTune: On-Device Arousal Learning and Peer-to-Peer Mood Coupling for Proactive Music Curation

Hongwei Xu

Comments 31 pages, 1 figures, 3 tables

2604.10695 2026-04-15 cs.CV

Retrieving to Recover: Towards Incomplete Audio-Visual Question Answering via Semantic-consistent Purification

Jiayu Zhang, Shuo Ye, Qilang Ye, Zihan Song, Jiajian Huang, Zitong Yu

Comments Accepted by ACL 2026 Main Conference

2604.10584 2026-04-15 cs.CV

CoFusion: Multispectral and Hyperspectral Image Fusion via Spectral Coordinate Attention

Baisong Li