arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2112.07874 2026-04-06 cs.CL cs.AI

Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling

Jakob Prange, Nathan Schneider, Lingpeng Kong

Comments Accepted to NAACL 2022 (slight typesetting divergences to NAACL camera-ready due to TexLive 2020/2021 mismatches)

2604.03224 2026-04-06 eess.IV cs.CV

HyperCT: Low-Rank Hypernet for Unified Chest CT Analysis

Fengbei Liu, Sunwoo Kwak, Hao Phung, Nusrat Binta Nizam, Ilan Richter, Nir Uriel, Hadar Averbuch-Elor, Daborah Estrin, Mert R. Sabuncu

Comments MIDL 2026

2604.03219 2026-04-06 eess.AS cs.SD

Unmixing the Crowd: Learning Mixture-to-Set Speaker Embeddings for Enrollment-Free Target Speech Extraction

FNU Sidharth, Meysam Asgari, Hao-Wen Dong, Dhruv Jain

Comments Submitted to ISCA Interspeech 2026

2604.03205 2026-04-06 cs.CR cs.LG

A Tsetlin Machine-driven Intrusion Detection System for Next-Generation IoMT Security

Rahul Jaiswal, Per-Arne Andersen, Linga Reddy Cenkeramaddi, Lei Jiao, Ole-Christoffer Granmo

Comments 8 pages, 15 figures, 9 tables. Accepted at the 7th Silicon Valley Cybersecurity Conference (SVCC 2026), California, USA

2604.03159 2026-04-06 cs.DL cs.CL

BibTeX Citation Hallucinations in Scientific Publishing Agents: Evaluation and Mitigation

Delip Rao, Chris Callison-Burch

Comments 37 pages

详情

英文摘要

Large language models with web search are increasingly used in scientific publishing agents, yet they still produce BibTeX entries with pervasive field-level errors. Prior evaluations tested base models without search, which does not reflect current practice. We construct a benchmark of 931 papers across four scientific domains and three citation tiers -- popular, low-citation, and recent post-cutoff -- designed to disentangle parametric memory from search dependence, with version-aware ground truth accounting for multiple citable versions of the same paper. Three search-enabled frontier models (GPT-5, Claude Sonnet-4.6, Gemini-3 Flash) generate BibTeX entries scored on nine fields and a six-way error taxonomy, producing ~23,000 field-level observations. Overall accuracy is 83.6%, but only 50.9% of entries are fully correct; accuracy drops 27.7pp from popular to recent papers, revealing heavy reliance on parametric memory even when search is available. Field-error co-occurrence analysis identifies two failure modes: wholesale entry substitution (identity fields fail together) and isolated field error. We evaluate clibib, an open-source tool for deterministic BibTeX retrieval from the Zotero Translation Server with CrossRef fallback, as a mitigation mechanism. In a two-stage integration where baseline entries are revised against authoritative records, accuracy rises +8.0pp to 91.5%, fully correct entries rise from 50.9% to 78.3%, and regression rate is only 0.8%. An ablation comparing single-stage and two-stage integration shows that separating search from revision yields larger gains and lower regression (0.8% vs. 4.8%), demonstrating that integration architecture matters independently of model capability. We release the benchmark, error taxonomy, and clibib tool to support evaluation and mitigation of citation hallucinations in LLM-based scientific writing.

URL PDF HTML ☆

赞 0 踩 0

2604.03144 2026-04-06 cs.AR cs.AI cs.CL

InCoder-32B-Thinking: Industrial Code World Model for Thinking

Jian Yang, Wei Zhang, Jiajun Wu, Junhang Cheng, Tuney Zheng, Fanglin Xu, Weicheng Gu, Lin Jing, Yaxin Du, Joseph Li, Yizhi Li, Yan Xing, Chuan Hao, Ran Tao, Ruihao Gong, Aishan Liu, Zhoujun Li, Mingjie Tang, Chenghua Lin, Siheng Chen, Wayne Xin Zhao, Xianglong Liu, Ming Zhou, Bryan Dai, Weifeng Lv

2604.03135 2026-04-06 cs.SE cs.AI

AI-Assisted Unit Test Writing and Test-Driven Code Refactoring: A Case Study

Ema Smolic, Mario Brcic, Luka Hobor, Mihael Kovac

Comments 6 pages, 3 figures, 2 tables

2604.03132 2026-04-06 eess.SY cs.RO cs.SY

Minimal Information Control Invariance via Vector Quantization

Ege Yuceel, Teodor Tchalakov, Sayan Mitra

2604.03131 2026-04-06 cs.CR cs.AI

A Systematic Security Evaluation of OpenClaw and Its Variants

Yuhang Wang, Haichang Gao, Zhenxing Niu, Zhaoxiang Liu, Wenjing Zhang, Xiang Wang, Shiguo Lian

Comments 39 pages, 14 figures

2604.03121 2026-04-06 cs.CR cs.AI cs.CL

An Independent Safety Evaluation of Kimi K2.5

Zheng-Xin Yong, Parv Mahajan, Andy Wang, Ida Caspary, Yernat Yestekov, Zora Che, Mosh Levy, Elle Najt, Dennis Murphy, Prashant Kulkarni, Lev McKinney, Kei Nishimura-Gasparian, Ram Potham, Aengus Lynch, Michael L. Chen

2604.03112 2026-04-06 eess.IV cs.CV cs.MM

ARIQA-3DS: A Stereoscopic Image Quality Assessment Dataset for Realistic Augmented Reality

Aymen Sekhri, Seyed Ali Amirshahi, Mohamed-Chaker Larabi

2604.03104 2026-04-06 cs.CR cs.AI

AlertStar: Path-Aware Alert Prediction on Hyper-Relational Knowledge Graphs

Zahra Makki Nayeri, Mohsen Rezvani

详情

英文摘要

Cyber-attacks continue to grow in scale and sophistication, yet existing network intrusion detection approaches lack the semantic depth required for path reasoning over attacker-victim interactions. We address this by first modelling network alerts as a knowledge graph, then formulating hyper-relational alert prediction as a hyper-relational knowledge graph completion (HR-KGC) problem, representing each network alert as a qualified statement (h, r, t, Q), where h and t are source and destination IPs, r denotes the attack type, and Q encodes flow-level metadata such as timestamps, ports, protocols, and attack intensity, going beyond standard KGC binary triples (h, r, t) that would discard this contextual richness. We introduce five models across three contributions: first, Hyper-relational Neural Bellman-Ford (HR-NBFNet) extends Neural Bellman-Ford Networks to the hyper-relational setting with qualifier-aware multi-hop path reasoning, while its multi-task variant MT-HR-NBFNet jointly predicts tail, relation, and qualifier-value within a single traversal pass; second, AlertStar fuses qualifier context and structural path information entirely in embedding space via cross-attention and learned path composition, and its multi-task extension MT-AlertStar eliminates the overhead of full knowledge graph propagation; third, HR-NBFNet-CQ extends qualifier-aware representations to answer complex first-order logic queries, including one-hop, two-hop chain, two-anchor intersection, and union, enabling multi-condition threat reasoning over the alert knowledge graph. Evaluated inductively on the Warden and UNSW-NB15 benchmarks across three qualifier-density regimes, AlertStar and MT-AlertStar achieve superior MR, MRR, and Hits@k, demonstrating that local qualifier fusion is both sufficient and more efficient than global path propagation for hyper-relational alert prediction.

URL PDF HTML ☆

赞 0 踩 0

2604.03086 2026-04-06 eess.SY cs.LG cs.SY math.DS

On Data-Driven Koopman Representations of Nonlinear Delay Differential Equations

Santosh Mohan Rajkumar, Dibyasri Barman, Kumar Vikram Singh, Debdipta Goswami

Comments Github: https://github.com/santoshrajkumar/koopman-dde-kEDMD

2604.03081 2026-04-06 cs.CR cs.AI cs.CL

Supply-Chain Poisoning Attacks Against LLM Coding Agent Skill Ecosystems

Yubin Qu, Yi Liu, Tongcheng Geng, Gelei Deng, Yuekang Li, Leo Yu Zhang, Ying Zhang, Lei Ma

2604.03074 2026-04-06 eess.AS cs.CL cs.SD

Speaker-Reasoner: Scaling Interaction Turns and Reasoning Patterns for Timestamped Speaker-Attributed ASR

Zhennan Lin, Shuai Wang, Zhaokai Sun, Pengyuan Xie, Chuan Xie, Jie Liu, Qiang Zhang, Lei Xie

2604.03070 2026-04-06 cs.CR cs.AI

Credential Leakage in LLM Agent Skills: A Large-Scale Empirical Study

Zhihao Chen, Ying Zhang, Yi Liu, Gelei Deng, Yuekang Li, Yanjun Zhang, Jianting Ning, Leo Yu Zhang, Lei Ma, Zhiqiang Li

2604.03050 2026-04-06 cs.HC cs.AI

MECO: A Multimodal Dataset for Emotion and Cognitive Understanding in Older Adults

Hongbin Chen, Jie Li, Wei Wang, Siyang Song, Xiao Gu, Jianqing Li, Wentao Xiang

Comments 8 pages, 3 figures

2604.03043 2026-04-06 cs.CR cs.AI

Analyzing Healthcare Interoperability Vulnerabilities: Formal Modeling and Graph-Theoretic Approach

Jawad Mohammed, Gahangir Hossain

2604.03035 2026-04-06 cs.SE cs.AI

Beyond Isolated Tasks: A Framework for Evaluating Coding Agents on Sequential Software Evolution

KN Ajay Shastry, Ganesh Senrayan, Shrey Satapara, Pranoy Panda, Chaitanya Devaguptapu

2604.03034 2026-04-06 math.NA cs.LG cs.NA

Learning Contractive Integral Operators with Fredholm Integral Neural Operators

Kyriakos C. Georgiou, Constantinos Siettos, Athanasios N. Yannacopoulos

2604.03022 2026-04-06 cs.SI cs.AI cs.HC

Comparing the Impact of Pedagogy-Informed Custom and General-Purpose GAI Chatbots on Students' Science Problem-Solving Processes and Performance Using Heterogeneous Interaction Network Analysis

Hanyu Su, Huilin Zhang, Shihui Feng

Comments Full paper accepted to the 27th International Conference on AI in Education (AIED 2026)

详情

英文摘要

Problem solving plays an essential role in science education, and generative AI (GAI) chatbots have emerged as a promising tool for supporting students' science problem solving. However, general-purpose chatbots (e.g., ChatGPT), which often provide direct, ready-made answers, may lead to students' cognitive offloading. Prior research has rarely focused on custom chatbots for facilitating students' science problem solving, nor has it examined how they differently influence problem-solving processes and performance compared to general-purpose chatbots. To address this gap, we developed a pedagogy-informed custom GAI chatbot grounded in the Socratic questioning method, which supports students by prompting them with guiding questions. This study employed a within-subjects counterbalanced design in which 48 secondary school students used both custom and general-purpose chatbot to complete two science problem-solving tasks. 3297 student-chatbot dialogues were collected and analyzed using Heterogeneous Interaction Network Analysis (HINA). The results showed that: (1) students demonstrated significantly higher interaction intensity and cognitive interaction diversity when using custom chatbot than using general-purpose chatbot; (2) students were more likely to follow custom chatbot's guidance to think and reflect, whereas they tended to request general-purpose chatbot to execute specific commands; and (3) no statistically significant difference was observed in students' problem-solving performance evaluated by solution quality between two chatbot conditions. This study provides novel theoretical insights and empirical evidence that custom chatbots are less likely to induce cognitive offloading and instead foster greater cognitive engagement compared to general-purpose chatbots. This study also offers insights into the design and integration of GAI chatbots in science education.

URL PDF HTML ☆

赞 0 踩 0

2604.03014 2026-04-06 cs.IR cs.AI

User-Aware Conditional Generative Total Correlation Learning for Multi-Modal Recommendation

Jing Du, Zesheng Ye, Congbo Ma, Feng Liu, Flora. D. Salim

Comments 11 pages, 7 figures, 3 tables

2604.02995 2026-04-06 math.AG cs.LG math.CO

A semicontinuous relaxation of Saito's criterion and freeness as angular minimization

Tomás S. R. Silva

Comments This manuscript is a working paper, and an updated version will be posted later. 26 pages

2604.02988 2026-04-06 cs.IR cs.AI

Self-Optimizing Multi-Agent Systems for Deep Research

Arthur Câmara, Vincent Slot, Jakub Zavrel

Comments Accepted at the Workshop on Conversational Search for Complex Information Needs at ECIR 2026

2604.02985 2026-04-06 cs.IR cs.AI cs.CL

Prompt Compression in the Wild: Measuring Latency, Rate Adherence, and Quality for Faster LLM Inference

Cornelius Kummer, Lena Jurkschat, Michael Färber, Sahar Vahdati

Comments Accepted at ECIR 2026 (Full Paper)

2604.02917 2026-04-06 math.OC cs.CE cs.DC cs.LG cs.NA math.NA

Scalable Mean-Variance Portfolio Optimization via Subspace Embeddings and GPU-Friendly Nesterov-Accelerated Projected Gradient

Yi-Shuai Niu, Yajuan Wang

Comments 28 pages, 7 figures

2604.02912 2026-04-06 cs.CY cs.AI

Corporations Constitute Intelligence

Gilad Abiri

2604.02887 2026-04-06 stat.ML cs.LG

Lipschitz bounds for integral kernels

Justin Reverdi, Sixin Zhang, Fabrice Gamboa, Serge Gratton

2604.02868 2026-04-06 eess.IV cs.CV

Few-Shot Distribution-Aligned Flow Matching for Data Synthesis in Medical Image Segmentation

Jie Yang, Ziqi Ye, Aihua Ke, Jian Luo, Bo Cai, Xiaosong Wang

2604.02850 2026-04-06 physics.ao-ph cs.AI math.DS nlin.CD

High-resolution probabilistic estimation of three-dimensional regional ocean dynamics from sparse surface observations

Niloofar Asefi, Tianning Wu, Ruoying He, Ashesh Chattopadhyay

Comments Supplementary information: https://drive.google.com/file/d/12FPQujokmSOUktTftfYjPFVNnSYHfszv/view?usp=sharing