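arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.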

2511.13018 2026-04-02 cs.LG

The Final-Stage Bottleneck: A Systematic Dissection of the R-Learner for Network Causal Inference

S Sairam, Sara Girdhar, Shivam Soni

Comments Published in TMLR; 15 pages, 4 figures

Details

English abstract

The R-Learner is a powerful, theoretically-grounded framework for estimating heterogeneous treatment effects, prized for its robustness to nuisance model errors. However, its application to network data, where causal heterogeneity is often graph-dependent, presents a critical challenge to its core assumption of a well-specified final-stage model. In this paper, we conduct a large-scale empirical study to systematically dissect the R-Learner framework on graphs. We provide the first rigorous evidence that the primary driver of performance is the inductive bias of the final-stage CATE estimator, an effect that dominates the choice of nuisance models. Our central finding is the quantification of a catastrophic "representation bottleneck": we prove with overwhelming statistical significance (p < 0.001) that R-Learners with a graph-blind final stage fail completely (MSE > 4.0), even when paired with powerful GNN nuisance models. Conversely, our proposed end-to-end Graph R-Learner succeeds and significantly outperforms a strong, non-DML GNN T-Learner baseline. Furthermore, we identify and provide a mechanistic explanation for a subtle, topology-dependent "nuisance bottleneck," linking it to GNN over-squashing via a targeted "Hub-Periphery Trade-off" analysis. Our findings are validated across diverse synthetic and semi-synthetic benchmarks. We release our code as a reproducible benchmark to facilitate future research on this critical "final-stage bottleneck."

2511.11132 2026-04-02 cs.CV

From Hindsight to Foresight: Self-Encouraged Hindsight Distillation for Knowledge-based Visual Question Answering

Yu Zhao, Ying Zhang, Xuhui Sui, Baohang Zhou, Li Shen, Dacheng Tao

2511.09388 2026-04-02 cs.CV

Learning by Neighbor-Aware Semantics, Deciding by Open-form Flows: Towards Robust Zero-Shot Skeleton Action Recognition

Yang Chen, Miaoge Li, Zhijie Rao, Deze Zeng, Song Guo, Jingcai Guo

Comments Accepted by CVPR 2026 Findings; Project Code: https://github.com/cseeyangchen/Flora

2511.08522 2026-04-02 cs.CL

AlphaResearch: Accelerating New Algorithm Discovery with Language Models

Zhaojian Yu, Kaiyue Feng, Yilun Zhao, Shilin He, Xiao-Ping Zhang, Arman Cohan

2511.08225 2026-04-02 cs.CL cs.AI cs.CY cs.HC

Benchmarking Educational LLMs with Analytics: A Case Study on Gender Bias in Feedback

Yishan Du, Conrad Borchers, Mutlu Cukurova

Comments 21 pages, 7 figures

2511.08206 2026-04-02 cs.AI

EHRStruct: A Comprehensive Benchmark Framework for Evaluating Large Language Models on Structured Electronic Health Record Tasks

Xiao Yang, Xuejiao Zhao, Zhiqi Shen

Comments 28 pages, 6 figures, 6 tables

2511.06328 2026-04-02 cs.CV

Improving Multimodal Sentiment Analysis via Modality Optimization and Dynamic Primary Modality Selection

Dingkang Yang, Mingcheng Li, Xuecheng Wu, Zhaoyu Chen, Kaixun Jiang, Keliang Liu, Peng Zhai, Lihua Zhang

2511.04921 2026-04-02 cs.CL

AgentExpt: Automating AI Experiment Design with LLM-based Resource Retrieval Agent

Yu Li, Lehui Li, Lin Chen, Qingmin Liao, Fengli Xu, Yong Li

Comments 10 pages

Details

English abstract

Large language model agents are becoming increasingly capable at web-centric tasks such as information retrieval and complex reasoning. These emerging capabilities have given rise to a surge of research interest in developing LLM agents to facilitate scientific inquiry. One key application in AI research is to automate experiment design through agentic dataset and baseline retrieval. However, prior efforts suffer from limited data coverage, as recommendation datasets primarily harvest candidates from public portals and omit many datasets actually used in published papers, and from an overreliance on content similarity that biases models toward superficial similarity and overlooks experimental suitability. Harnessing the collective perception embedded in the baseline and dataset citation network, we present a comprehensive framework for baseline and dataset recommendation. First, we design an automated data-collection pipeline that links roughly one hundred thousand accepted papers to the baselines and datasets they actually used. Second, we propose a collective-perception-enhanced retriever. To represent the position of each dataset or baseline within the scholarly network, it concatenates self-descriptions with aggregated citation contexts. To achieve efficient candidate recall, we finetune an embedding model on these representations. Finally, we develop a reasoning-augmented reranker that extracts interaction chains to construct explicit reasoning chains and finetunes a large language model to produce interpretable justifications and refined rankings. The dataset we curated covers 85% of the datasets and baselines used at top AI conferences over the past five years. On our dataset, the proposed method outperforms the strongest prior baseline with average gains of +5.85% in Recall@20 and +8.30% in HitRate@5. Taken together, our results advance reliable, interpretable automation of experimental design.

2511.02272 2026-04-02 cs.LG cs.DS stat.ML

Beyond Spectral Clustering: Probabilistic Cuts for Differentiable Graph Partitioning

Ayoub Ghriss

Comments AISTATS 2026, https://openreview.net/forum?id=FN6QAT5Tmc

2510.23286 2026-04-02 cs.RO

Precise Time Delay Measurement and Compensation for Tightly Coupled Underwater SINS/piUSBL Navigation

Jin Huang, Yingqiang Wang, Haoda Li, Zichen Liu, Zhikun Wang, Ying Chen

Comments Published in IEEE Transactions on Instrumentation and Measurement. This is the author's accepted manuscript

Details

DOI: 10.1109/TIM.2026.3676179
Journal ref: IEEE Trans. Instrum. Meas., vol. 75, 2026, Art. no. 3676179

English abstract

In multisensor systems, time synchronization is particularly challenging for underwater integrated navigation systems (INSs) incorporating acoustic positioning, where time delays can significantly degrade accuracy when measurement and fusion epochs are misaligned. This article introduces a tightly coupled navigation framework that integrates a passive inverted ultrashort baseline (piUSBL) acoustic positioning system, a strapdown inertial navigation system (SINS), and a depth gauge under precise time synchronization. The framework fuses piUSBL azimuth and slant range with depth measurements, avoiding poor vertical-angle observability in planar arrays. By combining synchronized timing with acoustic signal processing, the proposed method transforms delay from an unobservable error into a measurable parameter, enabling explicit quantification of both acoustic propagation and system processing delays. Field experiments demonstrate that the proposed approach reduces position RMSE by 44.02% and maximum error (MAXERR) by 40.79% compared to the uncompensated baseline while achieving further RMSE reductions of 37.66% and 35.82% in horizontal directions relative to filter-based delay compensation. The results confirm that explicit delay measurement outperforms filter-based estimation though instantaneous performance remains sensitive to acoustic signal quality, emphasizing the need for robust signal processing alongside accurate time synchronization in latency-sensitive multisensor systems.

2510.18739 2026-04-02 cs.CV

Moving Light Adaptive Colonoscopy Reconstruction via Illumination-Attenuation-Aware 3D Gaussian Splatting

Hao Wang, Ying Zhou, Haoyu Zhao, Rui Wang, Qiang Hu, Xing Zhang, Qiang Li, Zhiwei Wang

Comments Accepted by ICME 2026

2510.18314 2026-04-02 cs.AI

Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming

Zheng Zhang, Jiarui He, Yuchen Cai, Deheng Ye, Peilin Zhao, Ruili Feng, Hao Wang

Comments Accepted by ICME 2026

2510.14377 2026-04-02 cs.CL cs.IR cs.LG

PluriHopRAG: Exhaustive, Recall-Sensitive QA Through Corpus-Specific Document Structure Learning

Mykolas Sveistrys, Richard Kunert

2510.12463 2026-04-02 cs.CL

Community size rather than grammatical complexity better predicts Large Language Model accuracy in a novel Wug Test

Nikoleta Pantelidou, Evelina Leivada, Raquel Montero, Paolo Morosi

2510.06545 2026-04-02 cs.LG cs.AI

Incoherence in Goal-Conditioned Autoregressive Models

Jacek Karwowski, Raymond Douglas

Comments To appear in the Proceedings of the 29th International Conference on Artificial Intelligence and Statistics (AISTATS) 2026

2510.02226 2026-04-02 cs.CV cs.AI cs.LG

TempoControl: Temporal Attention Guidance for Text-to-Video Models

Shira Schiber, Ofir Lindenbaum, Idan Schwartz

Comments Accepted to CVPR'26

2510.00766 2026-04-02 cs.CV cs.AI

Are Large Vision-Language Models Ready to Guide Blind and Low-Vision Individuals?

Eunki Kim, Na Min An, Wan Ju Kang, Sangryul Kim, James Thorne, Hyunjung Shim

Comments 42 pages, 14 figures, 28 tables

2510.00293 2026-04-02 cs.CV cs.CR cs.LG

MOLM: Mixture of LoRA Markers

Samar Fares, Nurbek Tastan, Noor Hussein, Karthik Nandakumar

Comments ICLR 2026

2509.25302 2026-04-02 cs.AI cs.CL cs.LG cs.MA

Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents

Boxuan Zhang, Yi Yu, Jiaxuan Guo, Jing Shao

Comments 26 pages, 6 figures

2509.17180 2026-04-02 cs.LG econ.EM stat.ME

Regularizing Extrapolation in Causal Inference

David Arbour, Harsh Parikh, Bijan Niknam, Elizabeth Stuart, Kara Rudolph, Avi Feller

2509.14078 2026-04-02 cs.LG

Exploring the Relationship between Brain Hemisphere States and Frequency Bands through Classical Machine Learning and Deep Learning Optimization Techniques with Neurofeedback

Robiul Islam, Dmitry I. Ignatov, Karl Kaberg, Roman Nabatchikov

2509.11536 2026-04-02 cs.CL cs.AI

HARP: Hallucination Detection via Reasoning Subspace Projection

Junjie Hu, Gang Tu, ShengYu Cheng, Jinxin Li, Jinting Wang, Rui Chen, Zhilong Zhou, Dongbo Shan

2508.13749 2026-04-02 cs.LG cs.IT math.IT

Order Optimal Regret Bounds for Sharpe Ratio Optimization under Thompson Sampling

Mohammad Taha Shah, Sabrina Khurshid, Gourab Ghatak

2508.12094 2026-04-02 cs.CV

Error Propagation Mechanisms and Compensation Strategies for Quantized Diffusion

Songwei Liu, Chao Zeng, Chenqian Yan, Xurui Peng, Xing Wang, Fangmin Chen, Xing Mei

2508.10637 2026-04-02 cs.CV

Processing and acquisition traces in visual encoders: What does CLIP know about your camera?

Ryan Ramos, Vladan Stojnić, Giorgos Kordopatis-Zilos, Yuta Nakashima, Giorgos Tolias, Noa Garcia

Comments 8 main pages, supplementary attached, ICCV 2025 highlight

2508.07629 2026-04-02 cs.LG cs.AI cs.CL

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Zhenpeng Su, Leiyu Pan, Xue Bai, Dening Liu, Guanting Dong, Jiaming Huang, Minxuan Lv, Wenping Hu, Fuzheng Zhang, Kun Gai, Guorui Zhou

2508.01184 2026-04-02 cs.CV

Object Affordance Recognition and Grounding via Multi-scale Cross-modal Representation Learning

Xinhang Wan, Dongqiang Gou, Xinwang Liu, En Zhu, Xuming He

2507.18551 2026-04-02 cs.CV

A 3D Cross-modal Keypoint Descriptor for MR-US Matching and Registration

Daniil Morozov, Reuben Dorent, Nazim Haouchine

Comments Accepted in IEEE Transactions on Medical Imaging

2507.17851 2026-04-02 cs.SD eess.AS

Speaker Disentanglement of Speech Pre-trained Model Based on Interpretability

Xiaoxu Zhu, Junhua Li, Aaron J. Li, Guangchao Yao, Xiaojie Yu

Comments 5 pages, 4 figures

2507.14570 2026-04-02 cs.LG cs.AI

LPS-GNN: Deploying Graph Neural Networks on Graphs with 100-Billion Edges

Xu Cheng, Liang Yao, Feng He, Yukuo Cen, Yufei He, Chenhui Zhang, Wenzheng Feng, Hongyun Cai, Jie Tang