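arXivDaily daily academic digest: synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.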

2604.06736 2026-04-09 cs.CL cs.DB

SQLStructEval: Structural Evaluation of LLM Text-to-SQL Generation

Yixi Zhou, Fan Zhang, Zhiqiao Guo, Yu Chen, Haipeng Zhang, Preslav Nakov, Zhuohan Xie

Comments 17 pages, including figures and tables

English abstract

Despite strong performance on Text-to-SQL benchmarks, it remains unclear whether LLM-generated SQL programs are structurally reliable. In this work, we investigate the structural behavior of LLM-generated SQL queries and introduce SQLStructEval, a framework for analyzing program structures through canonical abstract syntax tree (AST) representations. Our experiments on the Spider benchmark show that modern LLMs often produce structurally diverse queries for the same input, even when execution results are correct, and that such variance is frequently triggered by surface-level input changes such as paraphrases or schema presentation. We further show that generating queries in a structured space via a compile-style pipeline can improve both execution accuracy and structural consistency. These findings suggest that structural reliability is a critical yet overlooked dimension for evaluating LLM-based program generation systems. Our code is available at https://anonymous.4open.science/r/StructEval-2435.

2604.06732 2026-04-09 cs.LG

Extraction of linearized models from pre-trained networks via knowledge distillation

Fumito Kimura, Jun Ohkubo

Comments 9 pages, 5 figures

2604.06728 2026-04-09 cs.CV cs.AI cs.MM

URMF: Uncertainty-aware Robust Multimodal Fusion for Multimodal Sarcasm Detection

Zhenyu Wang, Weichen Cheng, Weijia Li, Junjie Mou, Zongyou Zhao, Guoying Zhang

2604.06727 2026-04-09 cs.LG

Bi-level Heterogeneous Learning for Time Series Foundation Models: A Federated Learning Approach

Shengchao Chen, Guodong Long, Dikai Liu, Jing Jiang

Comments 31 pages

2604.06725 2026-04-09 cs.CV

Enhancing MLLM Spatial Understanding via Active 3D Scene Exploration for Multi-Perspective Reasoning

Jiahua Chen, Qihong Tang, Weinong Wang, Qi Fan

2604.06714 2026-04-09 cs.AI cs.CL cs.CV cs.LG

Steering the Verifiability of Multimodal AI Hallucinations

Jianhong Pang, Ruoxi Cheng, Ziyi Ye, Xingjun Ma, Zuxuan Wu, Xuanjing Huang, Yu-Gang Jiang

2604.06713 2026-04-09 cs.CV

Improving Local Feature Matching by Entropy-inspired Scale Adaptability and Flow-endowed Local Consistency

Ke Jin, Jiming Chen, Qi Ye

2604.06711 2026-04-09 cs.CV cs.CL

Specializing Large Models for Oracle Bone Script Interpretation via Component-Grounded Multimodal Knowledge Augmentation

Jianing Zhang, Runan Li, Honglin Pang, Ding Xia, Zhou Zhu, Qian Zhang, Chuntao Li, Xi Yang

2604.06701 2026-04-09 cs.LG stat.ML

Bi-Lipschitz Autoencoder With Injectivity Guarantee

Qipeng Zhan, Zhuoping Zhou, Zexuan Wang, Qi Long, Li Shen

Comments Accepted for publication at ICLR 2026, 27 Pages, 15 Figures

2604.06699 2026-04-09 cs.CL cs.LG

Adaptive Prompt Structure Factorization: A Framework for Self-Discovering and Optimizing Compositional Prompt Programs

Haoyue Liu, Zhichao Wang, Yongxin Guo, Haoran Shou, Xiaoying Tang

2604.06696 2026-04-09 cs.AI

AgentGate: A Lightweight Structured Routing Engine for the Internet of Agents

Yujun Cheng, Enfang Cui, Hao Qin, Zhiyuan Liang, Qi Xu

2604.06695 2026-04-09 cs.AI

Reasoning Fails Where Step Flow Breaks

Xiaoyu Xu, Yulan Pan, Xiaosong Yuan, Zhihong Shen, Minghao Su, Yuanhao Su, Xiaofeng Zhang

Comments Accepted at ACL 2026

2604.06694 2026-04-09 cs.SD

AudioKV: KV Cache Eviction in Efficient Large Audio Language Models

Yuxuan Wang, Peize He, Xiyan Gui, Xiaoqian Liu, Junhao He, Xuyang Liu, Zichen Wen, Xuming Hu, Linfeng Zhang

2604.06691 2026-04-09 cs.AI

KD-MARL: Resource-Aware Knowledge Distillation in Multi-Agent Reinforcement Learning

Monirul Islam Pavel, Siyi Hu, Muhammad Anwar Masum, Mahardhika Pratama, Ryszard Kowalczyk, Zehong Jimmy Cao

Comments Accepted in IJCNN 2026

2604.06687 2026-04-09 cs.CV

RASR: Retrieval-Augmented Semantic Reasoning for Fake News Video Detection

Hui Li, Peien Ding, Jun Li, Guoqi Ma, Zhanyu Liu, Ge Xu, Junfeng Yao, Jinsong Su

Comments 10 pages, 5 figures

2604.06685 2026-04-09 cs.CL cs.AI

ChemVLR: Prioritizing Reasoning in Perception for Chemical Vision-Language Understanding

Xuanle Zhao, Xinyuan Cai, Xiang Cheng, Xiuyi Chen, Bo Xu

Comments Accepted by ACL 2026 Findings, Preprint Version

2604.06674 2026-04-09 cs.CL cs.AI

Between Century and Poet: Graph-Based Lexical Semantic Change in Persian Poetry

Kourosh Shahnazari, Seyed Moein Ayyoubzadeh, Mohammadali Keshtparvar

2604.06666 2026-04-09 cs.CL cs.AI

A Graph-Enhanced Defense Framework for Explainable Fake News Detection with LLM

Bo Wang, Jing Ma, Hongzhan Lin, Zhiwei Yang, Ruichao Yang, Yuan Tian, Yi Chang

Comments Accepted by TOIS

2604.06662 2026-04-09 cs.CV cs.LG

Towards Robust Content Watermarking Against Removal and Forgery Attacks

Yifan Zhu, Yihan Wang, Xiao-Shan Gao

Comments 14 pages, 5 figures, CVPR 2026 Findings

2604.06658 2026-04-09 cs.CV

GPAFormer: Graph-guided Patch Aggregation Transformer for Efficient 3D Medical Image Segmentation

Chung-Ming Lo, I-Yun Liu, Wei-Yang Lin

2604.06655 2026-04-09 cs.CV

Controllable Generative Video Compression

Ding Ding, Daowen Li, Ying Chen, Yixin Gao, Ruixiao Dong, Kai Li, Li Li

2604.06652 2026-04-09 cs.LG

FlowAdam: Implicit Regularization via Geometry-Aware Soft Momentum Injection

Devender Singh, Tarun Sheel

Comments Accepted at IJCNN 2026 (IEEE WCCI). 8 pages, 4 figures

2604.06650 2026-04-09 cs.CL cs.AI

A Parameter-Efficient Transfer Learning Approach through Multitask Prompt Distillation and Decomposition for Clinical NLP

Cheng Peng, Mengxian Lyu, Ziyi Chen, Yonghui Wu

2604.06644 2026-04-09 cs.CV cs.LG

Variational Feature Compression for Model-Specific Representations

Zinan Guo, Zihan Wang, Chuan Yan, Liuhuo Wan, Ethan Ma, Guangdong Bai

2604.06636 2026-04-09 cs.LG cs.AI cs.CL

SHAPE: Stage-aware Hierarchical Advantage via Potential Estimation for LLM Reasoning

Zhengyang Ai, Zikang Shan, Xiaodong Ai, Jingxian Tang, Hangkai Hu, Pinyan Lu

Comments ACL 2026 Main

2604.06631 2026-04-09 cs.LG cs.AI cs.CV

SubFLOT: Submodel Extraction for Efficient and Personalized Federated Learning via Optimal Transport

Zheng Jiang, Nan He, Yiming Chen, Lifeng Sun

Comments Accepted by CVPR 2026

2604.06628 2026-04-09 cs.AI

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Qihan Ren, Peng Wang, Ruikun Cai, Shuai Shao, Dadi Guo, Yuejin Xie, Yafu Li, Quanshi Zhang, Xia Hu, Jing Shao, Dongrui Liu

Comments Preprint. Under review

2604.06627 2026-04-09 cs.CL

DiffuMask: Diffusion Language Model for Token-level Prompt Pruning

Caleb Zheng, Jyotika Singh, Fang Tu, Weiyi Sun, Sujeeth Bharadwaj, Yassine Benajiba, Sujith Ravi, Eli Shlizerman, Dan Roth

2604.06623 2026-04-09 cs.CV

WeatherRemover: All-in-one Adverse Weather Removal with Multi-scale Feature Map Compression

Weikai Qu, Sijun Liang, Cheng Pan, Zikuan Yang, Guanchi Zhou, Xianjun Fu, Bo Liu, Changmiao Wang, Ahmed Elazab

Comments Accepted by IEEE Transactions on Artificial Intelligence

DOI: 10.1109/TAI.2025.3633206

English abstract

Photographs taken in adverse weather conditions often suffer from blurriness, occlusion, and low brightness due to interference from rain, snow, and fog. These issues can significantly hinder the performance of subsequent computer vision tasks, making the removal of weather effects a crucial step in image enhancement. Existing methods primarily target specific weather conditions, with only a few capable of handling multiple weather scenarios. However, mainstream approaches often overlook performance considerations, resulting in large parameter sizes, long inference times, and high memory costs. In this study, we introduce the WeatherRemover model, designed to enhance the restoration of images affected by various weather conditions while balancing performance. Our model adopts a UNet-like structure with a gating mechanism and a multi-scale pyramid vision Transformer. It employs channel-wise attention derived from convolutional neural networks to optimize feature extraction, while linear spatial reduction helps curtail the computational demands of attention. The gating mechanisms, strategically placed within the feed-forward and downsampling phases, refine the processing of information by selectively addressing redundancy and mitigating its influence on learning. This approach facilitates the adaptive selection of essential data, ensuring superior restoration and maximizing efficiency. Additionally, our lightweight model achieves an optimal balance between restoration quality, parameter efficiency, computational overhead, and memory usage, distinguishing it from other multi-weather models, thereby meeting practical application demands effectively. The source code is available at https://github.com/RICKand-MORTY/WeatherRemover.

2604.06622 2026-04-09 cs.CV

Balancing Efficiency and Restoration: Lightweight Mamba-Based Model for CT Metal Artifact Reduction

Weikai Qu, Sijun Liang, Xianfeng Li, Cheng Pan, An Yan, Ahmed Elazab, Shanzhou Niu, Dong Zeng, Xiang Wan, Changmiao Wang

Comments Accepted by IEEE Transactions on Radiation and Plasma Medical Sciences