arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.20913 2026-04-24 cs.LG

FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels

Fei Zuo, Xiaoyan Xi, Quanyi Zeng, Feiyu Wang, Ho Fai Leung

Comments 16 pages, 10 figures, 4 tables

详情

英文摘要

Large language models are increasingly deployed on CPU-only platforms where memory bandwidth is the primary bottleneck for autoregressive generation. Weight quantization to four bits or below reduces memory pressure, yet existing systems still dequantize weights and perform floating-point multiplications, limiting the achievable gains. Ternary weights in {-1, 0, +1} provide a more efficient alternative, replacing multiplications with conditional additions, subtractions, or no-ops. While Fairy2i shows that ternary LLMs can match FP16 quality, its runtime does not exploit this structure. We present FairyFuse, an inference system that enables multiplication-free execution on commodity CPUs by fusing the eight real-valued sub-GEMVs of each widely-linear layer into a single AVX-512 loop using masked additions and subtractions, with zero floating-point multiplications. Roofline analysis shows that 16x weight compression shifts memory-bound GEMV toward the compute regime on bandwidth-limited CPUs, yielding a 29.6x kernel speedup while offering little benefit on GPUs. End-to-end, FairyFuse achieves 32.4 tokens per second on a single Intel Xeon 8558P, outperforming llama.cpp Q4_K_M by 1.24x with near-lossless quality (WikiText-2 perplexity 5.52 vs. 5.47 FP16; downstream accuracy 66.0%).

URL PDF HTML ☆

赞 0 踩 0

2604.20909 2026-04-24 cs.LG

Do Masked Autoencoders Improve Downhole Prediction? An Empirical Study on Real Well Drilling Data

Aleksander Berezowski, Hassan Hassanzadeh, Gouri Ginde

2604.20904 2026-04-24 cs.LG cs.AI

Reinforcing privacy reasoning in LLMs via normative simulacra from fiction

Matt Franchi, Madiha Zahrah Choksi, Harold Triedman, Helen Nissenbaum

详情

英文摘要

Information handling practices of LLM agents are broadly misaligned with the contextual privacy expectations of their users. Contextual Integrity (CI) provides a principled framework, defining privacy as the appropriate flow of information within context-relative norms. However, existing approaches either double inference cost via supervisor-assistant architectures, or fine-tune on narrow task-specific data. We propose extracting normative simulacra (structured representations of norms and information flows) from fiction novels and using them to fine-tune LLMs via supervised learning followed by GRPO reinforcement learning. Our composite reward function combines programmatic signals, including task clarity (subsuming schema validity, construct discrimination, and extraction confidence), structural completeness, internal consistency, and context identification, with an LLM judge that evaluates whether the model's privacy reasoning is grounded in the held-out normative universe of the source text. To mitigate overfitting, we introduce per-completion contrastive scoring: each completion is evaluated against both the correct normative universe and a randomly selected wrong one, teaching the model to condition on context rather than memorize source-specific norms. We evaluate on five CI-aligned benchmarks spanning distinct societal contexts and ablate the contributions of RL and normative grounding. Across seven models, SFT introduces a conservative prior toward restricting information flow, improving recognition of privacy-relevant situations but not the correctness of privacy judgments. GRPO with normative grounding achieves the highest score on a law compliance benchmark and strongest correlation with crowdsourced human privacy expectations, demonstrating that fiction-derived normative simulacra can teach contextual privacy reasoning that transfers to real-world domains.

URL PDF HTML ☆

赞 0 踩 0

2604.20902 2026-04-24 cs.LG cs.AI

Frequency-Forcing: From Scaling-as-Time to Soft Frequency Guidance

Weitao Du

Comments ongoing project

2604.20898 2026-04-24 cs.RO cs.SY eess.SY

A Tendon-Driven Wrist Abduction-Adduction Joint Improves Performance of a 5 DoF Upper Limb Exoskeleton -- Implementation and Experimental Evaluation

Juwairiya S. Khan, Mostafa Mohammadi, Alexander L. Ammitzbøll, Ellen-Merete Hagen, Jakob Blicher, Izabella Obál, Ana S. S. Cardoso, Oguzhan Kirtas, Rasmus L. Kæseler, John Rasmussen, Lotte N. S. Andreasen Struijk

Comments 9 pages, 5 figures and 1 table. Submitted to IEEE Transactions on Biomedical Engineering as invited IEEE EMBC special issue paper. Under review after first revision

2604.20878 2026-04-24 cs.CL cs.CV cs.LG eess.IV

AITP: Traffic Accident Responsibility Allocation via Multimodal Large Language Models

Zijin Zhou, Songan Zhang

2604.20862 2026-04-24 cs.AI cs.MA

Architecture of an AI-Based Automated Course of Action Generation System for Military Operations

Ji-il Park, Inwook Shim, Chong Hui Kim

Comments 15 figures, 2 tables

2604.20789 2026-04-24 cs.CL cs.AI cs.LG

Working Memory Constraints Scaffold Learning in Transformers under Data Scarcity

Pranava Madhyastha, Dagmar Adamcova

Comments Published in ACL 2026 Findings track

2604.20730 2026-04-24 cs.CV

Render-in-the-Loop: Vector Graphics Generation via Visual Self-Feedback

Guotao Liang, Zhangcheng Wang, Juncheng Hu, Haitao Zhou, Ziteng Xue, Jing Zhang, Dong Xu, Qian Yu

详情

英文摘要

Multimodal Large Language Models (MLLMs) have shown promising capabilities in generating Scalable Vector Graphics (SVG) via direct code synthesis. However, existing paradigms typically adopt an open-loop "blind drawing" approach, where models generate symbolic code sequences without perceiving intermediate visual outcomes. This methodology severely underutilizes the powerful visual priors embedded in MLLMs vision encoders, treating SVG generation as a disjointed textual sequence modeling task rather than an integrated visuo-spatial one. Consequently, models struggle to reason about partial canvas states and implicit occlusion relationships, which are visually explicit but textually ambiguous. To bridge this gap, we propose Render-in-the-Loop, a novel generation paradigm that reformulates SVG synthesis as a step-wise, visual-context-aware process. By rendering intermediate code states into a cumulative canvas, the model explicitly observes the evolving visual context at each step, leveraging on-the-fly feedback to guide subsequent generation. However, we demonstrate that applying this visual loop naively to off-the-shelf models is suboptimal due to their inability to leverage incremental visual-code mappings. To address this, we first utilize fine-grained path decomposition to construct dense multi-step visual trajectories, and then introduce a Visual Self-Feedback (VSF) training strategy to condition the next primitive generation on intermediate visual states. Furthermore, a Render-and-Verify (RaV) inference mechanism is proposed to effectively filter degenerate and redundant primitives. Our framework, instantiated on a multimodal foundation model, outperforms strong open-weight baselines on the standard MMSVGBench. This result highlights the remarkable data efficiency and generalization capability of our Render-in-the-Loop paradigm for both Text-to-SVG and Image-to-SVG tasks.

URL PDF HTML ☆

赞 0 踩 0

2604.20726 2026-04-24 cs.CL cs.AI

Exploiting LLM-as-a-Judge Disposition on Free Text Legal QA via Prompt Optimization

Mohamed Hesham Elganayni, Runsheng Chen, Sebastian Nagl, Matthias Grabmair

Comments Accepted at the 21st International Conference on Artificial Intelligence and Law (ICAIL 2026), Singapore, June 8-12, 2026. 10 pages, 14 figures, 2 tables

2604.20677 2026-04-24 cs.CL

Intersectional Fairness in Large Language Models

Chaima Boufaied, Ronnie De Souza Santos, Ann Barcomb

2604.20543 2026-04-24 cs.CV

RefAerial: A Benchmark and Approach for Referring Detection in Aerial Images

Guyue Hu, Hao Song, Yuxing Tong, Duzhi Yuan, Dengdi Sun, Aihua Zheng, Chenglong Li, Jin Tang

2604.20487 2026-04-24 cs.CL cs.AI

Knowledge Capsules: Structured Nonparametric Memory Units for LLMs

Bin Ju, Shenfeng Weng, Danying Zhou, Rongkai Xu, Kunkai Su

2604.20468 2026-04-24 cs.RO cs.AI cs.CL cs.HC cs.LG

MOMO: A framework for seamless physical, verbal, and graphical robot skill learning and adaptation

Markus Knauer, Edoardo Fiorini, Maximilian Mühlbauer, Stefan Schneyer, Promwat Angsuratanawech, Florian Samuel Lay, Timo Bachmann, Samuel Bustamante, Korbinian Nottensteiner, Freek Stulp, Alin Albu-Schäffer, João Silvério, Thomas Eiband

Comments 15 pages, 13 figures, 3 tables

2604.20331 2026-04-24 cs.CL cs.AI cs.LG

Surrogate modeling for interpreting black-box LLMs in medical predictions

Changho Han, Songsoo Kim, Dong Won Kim, Leo Anthony Celi, Jaewoong Kim, SungA Bae, Dukyong Yoon

2604.20300 2026-04-24 cs.AI

FSFM: A Biologically-Inspired Framework for Selective Forgetting of Agent Memory

Yingjie Gu, Wenjian Xiong, Liqiang Wang, Pengcheng Ren, Chao Li, Xiaojing Zhang, Yijuan Guo, Qi Sun, Jingyao Ma, Shidang Shi

Comments 28 pages, 5 figures, 3 tables

2604.20293 2026-04-24 cs.LG

Synthetic Flight Data Generation Using Generative Models

Karim Aly, Alexei Sharpanskykh

Comments 10 pages

详情

DOI: 10.1109/ICNS65417.2025.10976960
Journal ref: 2025 Integrated Communications, Navigation and Surveillance Conference (ICNS)

英文摘要

The increasing adoption of synthetic data in aviation research offers a promising solution to data scarcity and confidentiality challenges. This study investigates the potential of generative models to produce realistic synthetic flight data and evaluates their quality through a comprehensive four-stage assessment framework. The need for synthetic flight data arises from their potential to serve as an alternative to confidential real-world records and to augment rare events in historical datasets. These enhanced datasets can then be used to train machine learning models that predict critical events, such as flight delays, cancellations, diversions, and turnaround times. Two generative models, Tabular Variational Autoencoder (TVAE) and Gaussian Copula (GC), are adapted to generate synthetic flight information and compared based on their ability to preserve statistical similarity, fidelity, diversity, and predictive utility. Results indicate that while GC achieves higher statistical similarity and fidelity, its computational cost hinders its applicability to large datasets. In contrast, TVAE efficiently handles large datasets and enables scalable synthetic data generation. The findings demonstrate that synthetic data can support flight delay prediction models with accuracy comparable to those trained on real data. These results pave the way for leveraging synthetic flight data to enhance predictive modeling in air transportation.

URL PDF HTML ☆

赞 0 踩 0

2604.20281 2026-04-24 cs.CV

Fourier Series Coder: A Novel Perspective on Angle Boundary Discontinuity Problem for Oriented Object Detection

Minghong Wei, Pu Cao, Zhihao Chen, Zhiyuan Zang, Lu Yang, Qing Song

Comments This work has been submitted to the IEEE for possible publication

2604.20169 2026-04-24 cs.CV

Semantic-Fast-SAM: Efficient Semantic Segmenter

Byunghyun Kim

Comments APSIPA ASC 2025

2604.20100 2026-04-24 cs.RO

JoyAI-RA 0.1: A Foundation Model for Robotic Autonomy

Tianle Zhang, Zhihao Yuan, Dafeng Chi, Peidong Liu, Dongwei Li, Kejun Hu, Likui Zhang, Junnan Nie, Ziming Wei, Zengjue Chen, Yili Tang, Jiayi Li, Zhiyuan Xiang, Mingyang Li, Tianci Luo, Hanwen Wan, Ao Li, Linbo Zhai, Zhihao Zhan, Xiaodong Bai, Jiakun Cai, Peng Cao, Kangliang Chen, Siang Chen, Yixiang Dai, Shuai Di, Yicheng Gong, Chenguang Gui, Yucheng Guo, Peng Hao, Qingrong He, Haoyang Huang, Kunrui Huang, Zhixuan Huang, Shibo Jin, Yixiang Jin, Anson Li, Dongjiang Li, Jiawei Li, Ruodai Li, Yihang Li, Yuzhen Li, Jiaming Liang, Fangsheng Liu, Jing Long, Mingxi Luo, Xing Pan, Hui Shen, Xiaomeng Tian, Daming Wang, Song Wang, Junwu Xiong, Hang Xu, Wanting Xu, Zhengcheng Yu, He Zhang, Jiyao Zhang, Lin Zhao, Chen Zhou, Nan Duan, Yuzheng Zhuang, Liang Lin

2604.19934 2026-04-24 cs.CL

Tracing Relational Knowledge Recall in Large Language Models

Nicholas Popovič, Michael Färber

Comments ACL 2026 (findings)

2604.19794 2026-04-24 cs.AI cs.CE cs.LG

Handbook of Rough Set Extensions and Uncertainty Models

Takaaki Fujita, Florentin Smarandache

Comments 159 pages. Peer-Reviewed Book. ISBN: 978-1-59973-867-3. Publisher: Neutrosophic Science International Association (NSIA) Publishing House

2604.19598 2026-04-24 cs.CL cs.AI

Cross-Model Consistency of AI-Generated Exercise Prescriptions: A Repeated Generation Study Across Three Large Language Models

Kihyuk Lee

Comments 24 Pages, 2 Figures, 6 Tables and 2 Supplementary Materials. v2: Removed personal contact information

2604.18779 2026-04-24 cs.CL cs.AI

Mango: Multi-Agent Web Navigation via Global-View Optimization

Weixi Tong, Yifeng Di, Tianyi Zhang

2604.18724 2026-04-24 cs.AI

Beyond One Output: Visualizing and Comparing Distributions of Language Model Generations

Emily Reif, Claire Yang, Jared Hwang, Deniz Nazar, Noah A. Smith, Jeff Heer

2604.18438 2026-04-24 cs.LG cs.SY eess.SY nlin.AO

Scalable Physics-Informed Neural Differential Equations and Data-Driven Algorithms for HVAC Systems

Hanfeng Zhai, Hongtao Qiao, Hassan Mansour, Christopher Laughman

Comments 50 pages, 26 figures

2604.17969 2026-04-24 cs.CV

E3VS-Bench: A Benchmark for Viewpoint-Dependent Active Perception in 3D Gaussian Splatting Scenes

Koya Sakamoto, Taiki Miyanishi, Daichi Azuma, Shuhei Kurita, Shu Morikuni, Naoya Chiba, Motoaki Kawanabe, Yusuke Iwasawa, Yutaka Matsuo

Comments Project page: https://k0uya.github.io/e3vs-proj/

2604.17656 2026-04-24 cs.SD cs.AI cs.CL cs.CV cs.LG

Video-Robin: Autoregressive Diffusion Planning for Intent-Grounded Video-to-Music Generation

Vaibhavi Lokegaonkar, Aryan Vijay Bhosale, Vishnu Raj, Gouthaman KV, Ramani Duraiswami, Lie Lu, Sreyan Ghosh, Dinesh Manocha

2604.17628 2026-04-24 cs.CL

Does Welsh media need a review? Detecting bias in Nation.Cymru's political reporting

Cai Parry-Jones

2604.15770 2026-04-24 cs.CV cs.RO

PLAF: Pixel-wise Language-Aligned Feature Extraction for Efficient 3D Scene Understanding

Junjie Wen, Junlin He, Fei Ma, Jinqiang Cui

Comments Accepted by ICCA 2026