arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.18637 2026-05-05 q-bio.NC cs.AI cs.CY

NeuroAI and Beyond: Bridging Between Advances in Neuroscience and ArtificialIntelligence

Anthony Zador, Jean-Marc Fellous, Terrence Sejnowski, Gina Adam, James B Aimone, Akwasi Akwaboah, Yiannis Aloimonos, Carmen Amo Alonso, Chiara Bartolozzi, Michael J. Bennington, Michael Berry, Bing W. Brunton, Gert Cauwenberghs, Hillel J. Chiel, Tobi Delbruck, John Doyle, Jason Eshraghian, Ralph Etienne-Cummings, Cornelia Fermuller, Matthew Jacobsen, Ali A. Minai, Barbara Oakley, Alexander G. Ororbia, Joe Paton, Blake Richards, Yulia Sandamirskaya, Abhronil Sengupta, Shihab Shamma, Michael P. Stryker, Seong Jong Yoo, Steven W. Zucker

2604.17815 2026-05-05 cs.HC cs.CL cs.CY

Navigating the Conceptual Multiverse

Andre Ye, Jenny Y. Huang, Alicia Guo, Rose Novick, Tamara Broderick, Mitchell L. Gordon

2604.16323 2026-05-05 cs.SE cs.AI

Beyond the 'Diff': Addressing Agentic Entropy in Agentic Software Development

Matteo Casserini, Alessandro Facchini, Andrea Ferrario

Comments Camera-ready version of the position paper accepted to the Human-Centered Explainable AI (HCXAI) Workshop at CHI 2026

2604.09111 2026-05-05 eess.AS cs.AI

PS-TTS: Phonetic Synchronization in Text-to-Speech for Achieving Natural Automated Dubbing

Changi Hong, Yoonah Song, Hwayoung Park, Chaewoon Bang, Dayeon Ku, Do Hyun Lee, Hong Kook Kim

Comments Accepted to ICPR 2026

2604.03401 2026-05-05 cs.HC cs.AI cs.CV

Can LLMs Reason About Attention? Towards Zero-Shot Analysis of Multimodal Classroom Behavior

Nolan Platt, Sehrish Nizamani, Alp Tural, Elif Tural, Saad Nizamani, Andrew Katz, Yoonje Lee, Nada Basit

Comments 8 pages, 2 figures. Preprint

2604.00187 2026-05-05 cs.HC cs.AI cs.ET

Explainable AI for Blind and Low-Vision Users: Navigating Trust, Modality, and Interpretability in the Agentic Era

Abu Noman Md Sakib, Protik Dey, Zijie Zhang, Taslima Akter

Comments Proceedings of the CHI 2026 Workshop on Human-Centered Explainable AI (HCXAI), April 13-17, 2026, Barcelona, Spain

2603.27833 2026-05-05 math.OC cs.IT cs.MA cs.RO cs.SY eess.SY math.IT

Separation is Optimal for LQR under Intermittent Feedback

Abdullah Y. Etcibasi, C. Emre Koksal, Eylem Ekici

2603.20999 2026-05-05 cs.NI cs.CV cs.MM cs.RO eess.IV

Training-Free Adaptive 360-degree Video Streaming via Semantic Potential Fields

Aizierjiang Aiersilan, Zhangfei Yang

Comments We are pleased to announce that this paper has been accepted by the 35th International Conference on Computer Communications and Networks (ICCCN 2026). We appreciate the valuable feedback from the reviewers and look forward to sharing our findings with the community

2603.18066 2026-05-05 cs.NE cs.AI cs.AR cs.LG

A Synthesizable RTL Implementation of Predictive Coding Networks

Timothy Oh

2602.14012 2026-05-05 cs.CR cs.AI cs.SE

From SFT to RL: Demystifying the Post-Training Pipeline for LLM-based Vulnerability Detection

Youpeng Li, Fuxun Yu, Xinda Wang

详情

英文摘要

The integration of LLMs into vulnerability detection (VD) has shifted the field toward more interpretable and context-aware analysis. While post-training techniques have shown promise in general coding tasks, their systematic application to VD remains underexplored. In this paper, we present the first comprehensive investigation into the post-training pipeline for LLM-based VD, demonstrating that on-policy RL with GRPO consistently outperforms SFT, off-policy preference optimization methods, and specialized VD LLMs. Our study further reveals VD-specific post-training guidelines and insights beyond common practices: (1) For data curation, contrary to the widespread use of rationalization-based supervision in prior VD work, SFT based on rejection sampling proves more effective, as rationalization can introduce hallucinations; in RL training, the inherently skewed difficulty distribution of vulnerabilities leads difficulty-aware data filtering to drastically reduce data coverage, causing non-negligible performance loss, and undermines curriculum learning, while pair-based data scheduling can partially mitigate this. (2) For stage interactions, unlike preference optimization typically applied to lightly trained SFT models, increasing SFT epochs consistently benefits off-policy preference optimization in VD tasks; however, excessive SFT suppresses self-exploration in on-policy RL, limiting its gains. (3) For reward mechanisms, naively treating vulnerability classification correctness as reward signals leads to reward hacking, whereas fine-grained root-cause judgments provide more reliable credit assignment; specification-based rewards further improve efficiency at the cost of additional design and generation effort. (4) For evaluation protocols, LLM-as-a-Judge based on root-cause analysis offers a more robust alternative, albeit with variability across judge models.

URL PDF HTML ☆

赞 0 踩 0

2601.07885 2026-05-05 cs.CR cs.AI cs.SE

False Friends in the Shell: Unveiling the Emoticon Semantic Confusion in Large Language Models

Weipeng Jiang, Xiaoyu Zhang, Juan Zhai, Shiqing Ma, Chao Shen, Yang Liu

2601.06035 2026-05-05 cs.GR cs.CV

Investigating Anthropometric Fidelity in SAM 3D Body

Aizierjiang Aiersilan, Ruting Cheng, James Hahn

2601.05254 2026-05-05 cs.IR cs.CL

TagRAG: Tag-guided Hierarchical Knowledge Graph Retrieval-Augmented Generation

Wenbiao Tao, Xinyuan Li, Yunshi Lan, Weining Qian

Comments Accepted by ACL 2026 Findings

2512.12109 2026-05-05 cs.CY cs.AI cs.LO

A Neuro-Symbolic Framework for Accountability in Public-Sector AI

Allen Daniel Sunny, Ido Sivan-Sevilla

Comments Accepted at FAccT 2026 (The 2026 ACM Conference on Fairness, Accountability, and Transparency), June 25-28, Montreal, Canada

2512.11415 2026-05-05 cond-mat.stat-mech cs.LG

Emergence of Nonequilibrium Latent Cycles in Unsupervised Generative Modeling

Marco Baiesi, Alberto Rosso

Comments v2: 11 pages, 7 figures. Accepted in PRE

2511.20657 2026-05-05 cs.HC cs.AI

Intelligent Agents with Emotional Intelligence: Current Trends, Challenges, and Future Prospects

Raziyeh Zall, Alireza Kheyrkhah, Erik Cambria, Zahra Naseri, M. Reza Kangavari

Comments Enhanced the quality of figures, incorporated additional and recent references, and improved the manuscript for better clarity and writing quality

2511.06838 2026-05-05 cs.AR cs.LG

P3-LLM: An Integrated NPU-PIM Accelerator for Edge LLM Inference Using Hybrid Numerical Formats

Yuzong Chen, Chao Fang, Xilai Dai, Yuheng Wu, Thierry Tambe, Marian Verhelst, Mohamed S. Abdelfattah

Comments Accepted to the 53rd IEEE/ACM International Symposium on Computer Architecture (ISCA), 2026

2510.20103 2026-05-05 physics.chem-ph cs.LG

Extending machine learning model for implicit solvation to free energy calculations

Rishabh Dey, Michael Brocidiacono, Kushal Koirala, Alexander Tropsha, Konstantin I. Popov

2510.08599 2026-05-05 eess.AS cs.AI cs.CL cs.SD

BaldWhisper: Faster Whisper with Head Shearing and Layer Merging

Yaya Sy, Christophe Cerisara, Irina Illina

2510.05109 2026-05-05 cs.DC cs.AI cs.CL eess.SP

Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on Battery-Powered Small Devices

Yilong Li, Shuai Zhang, Yijing Zeng, Hao Zhang, Xinmiao Xiong, Jingyu Liu, Pan Hu, Suman Banerjee

2509.17661 2026-05-05 eess.AS cs.SD

Comparator Loss: An Ordinal Contrastive Loss to Derive a Severity Score for Speech-based Health Monitoring

Jacob J Webber, Oliver Watts, Lovisa Wihlborg, Johnny Tam, Christine Weaver, Suvankar Pal, Siddharthan Chandran, Cassia Valentini-Botinhao

Comments Accepted to Odyssey 2026

2509.10577 2026-05-05 cs.CR cs.AI

The Coding Limits of Robust Watermarking for Generative Models

Danilo Francati, Yevin Nikhel Goonatilake, Shubham Pawar, Daniele Venturi, Giuseppe Ateniese

Comments Accepted at IEEE EuroS&P 2026

2509.10468 2026-05-05 cs.IR cs.AI cs.CL

Learning Decomposed Contextual Token Representations from Pretrained and Collaborative Signals for Generative Recommendation

Yifan Liu, Yaokun Liu, Zelin Li, Zhenrui Yue, Gyuseok Lee, Ruichen Yao, Yang Zhang, Dong Wang

Comments Accepted to SIGIR 2026. Full author version

2508.12674 2026-05-05 stat.ML cs.LG cs.SI

Unfolded Laplacian Spectral Embedding: A Theoretically Grounded Approach to Dynamic Network Representation

Haruka Ezoe, Hiroki Matsumoto, Ryohei Hisano

2508.09179 2026-05-05 eess.IV cs.CV

HiFi-Mamba: Dual-Stream W-Laplacian Enhanced Mamba for High-Fidelity MRI Reconstruction

Hongli Chen, Pengcheng Fang, Yuxia Chen, Yingxuan Ren, Jing Hao, Fangfang Tang, Xiaohao Cai, Shanshan Shan, Feng Liu

2506.24056 2026-05-05 cs.CR cs.CL cs.LG

Logit-Gap Steering: A Forward-Pass Diagnostic for Alignment Robustness

Tung-Ling Li, Hongliang Liu

2503.09559 2026-05-05 eess.IV cs.CV cs.LG eess.SP

Interlaced R2D2 DNN Series for Scalable Non-Cartesian MRI with Sensitivity Self-calibration

Shijie Chen, Yiwei Chen, Amir Aghabiglou, Motahare Torki, Chao Tang, Ruud B. van Heeswijk, Yves Wiaux

Comments 13 pages, 8 figures

2501.10859 2026-05-05 eess.SY cs.LG cs.SY math.OC

What price to pay? Auto-tuning a building MPC controller for optimal economic cost

Jiarui Yu, Jicheng Shi, Wenjie Xu, Colin N. Jones

Comments 11 pages, 5 figures

2412.02408 2026-05-05 cs.SI cs.LG q-fin.GN

Leveraging Ensemble-Based Semi-Supervised Learning for Illicit Account Detection in Ethereum DeFi Transactions

Shabnam Fazliani, Mohammad Mowlavi Sorond, Arsalan Masoudifard

Comments 23 pages, 12 figures

2408.01914 2026-05-05 math.NA cs.AI cs.NA

Partial-differential-algebraic equations of nonlinear dynamics by Physics-Informed Neural-Network: (I) Operator splitting and framework assessment

Loc Vu-Quoc, Alexander Humer

Comments 70 pages, 52 figures

详情

DOI: 10.1002/nme.7586
Journal ref: International Journal for Numerical Methods in Engineering, 2024;e7586

英文摘要

Several forms for constructing novel physics-informed neural-networks (PINN) for the solution of partial-differential-algebraic equations based on derivative operator splitting are proposed, using the nonlinear Kirchhoff rod as a prototype for demonstration. The open-source DeepXDE is likely the most well documented framework with many examples. Yet, we encountered some pathological problems and proposed novel methods to resolve them. Among these novel methods are the PDE forms, which evolve from the lower-level form with fewer unknown dependent variables to higher-level form with more dependent variables, in addition to those from lower-level forms. Traditionally, the highest-level form, the balance-of-momenta form, is the starting point for (hand) deriving the lowest-level form through a tedious (and error prone) process of successive substitutions. The next step in a finite element method is to discretize the lowest-level form upon forming a weak form and linearization with appropriate interpolation functions, followed by their implementation in a code and testing. The time-consuming tedium in all of these steps could be bypassed by applying the proposed novel PINN directly to the highest-level form. We developed a script based on JAX. While our JAX script did not show the pathological problems of DDE-T (DDE with TensorFlow backend), it is slower than DDE-T. That DDE-T itself being more efficient in higher-level form than in lower-level form makes working directly with higher-level form even more attractive in addition to the advantages mentioned further above. Since coming up with an appropriate learning-rate schedule for a good solution is more art than science, we systematically codified in detail our experience running optimization through a normalization/standardization of the network-training process so readers can reproduce our results.

URL PDF HTML ☆

赞 0 踩 0