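arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.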

2604.21611 2026-04-24 cs.CL cs.AI

Process Supervision via Verbal Critique Improves Reasoning in Large Language Models

Hao-Yuan Chen

Abstract

Inference-time scaling for LLM reasoning has focused on three axes: chain depth, sample breadth, and learned step-scorers (PRMs). We introduce a fourth axis, granularity of external verbal supervision, via Verbal Process Supervision (VPS), a training-free framework that uses structured natural-language critique from a stronger supervisor to guide an iterative generate-critique-refine loop up to a round budget R. Across GPQA Diamond, AIME 2025, and LiveCodeBench V6 (covering both closed and open models), VPS yields three key results. First, on GPQA Diamond, GPT-5.4 (High) | GPT-5.4 (Low) reaches 94.9% at R=4, surpassing the 94.1% state of the art without gradient updates. Second, on AIME 2025, VPS enables strong weak-actor rescue, boosting scores from 11.7-26.7% to 63.3-90.0% (up to +63.3 points). Third, at matched compute, VPS outperforms Reflexion by +8.5 to +12.1 points and Self-Consistency@5 by +5.0 pp (GPQA) and +8.3 pp (LiveCodeBench), isolating critique granularity as the key driver. Performance scales with the supervisor-actor capability gap (Pearson r=0.90) and degrades when errors are not linguistically expressible (e.g., code synthesis), motivating hybrid verbal-executable methods. These results establish critique granularity as a new axis of inference-time scaling.
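The generate-critique-refine loop described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the function name, the `actor` and `supervisor` callables and their signatures, and the early stop when the supervisor accepts an answer are all assumptions made here for clarity.

```python
from typing import Callable, Optional, Tuple

# Hypothetical signatures (assumptions, not taken from the paper):
#   actor(problem, previous_answer, critique) -> new answer
#   supervisor(problem, answer) -> (accepted, critique_text)
Actor = Callable[[str, Optional[str], Optional[str]], str]
Supervisor = Callable[[str, str], Tuple[bool, str]]


def verbal_process_supervision(
    problem: str, actor: Actor, supervisor: Supervisor, max_rounds: int
) -> Tuple[str, int]:
    """Run generate-critique-refine for at most max_rounds critique rounds.

    Returns the final answer and the number of critique rounds used.
    """
    answer = actor(problem, None, None)  # initial generation, no critique yet
    for round_idx in range(1, max_rounds + 1):
        accepted, critique = supervisor(problem, answer)
        if accepted:  # assumed stopping rule: supervisor finds no fault
            return answer, round_idx
        # Refine: the actor sees its previous answer plus the verbal critique.
        answer = actor(problem, answer, critique)
    return answer, max_rounds


def toy_actor(problem, previous, critique):
    # The first attempt is deliberately wrong; any critique triggers the fix.
    return "5" if critique is None else "4"


def toy_supervisor(problem, answer):
    ok = answer == "4"
    return ok, "" if ok else "Step 1 adds 2 + 2 incorrectly."


final, rounds = verbal_process_supervision(
    "2 + 2 = ?", toy_actor, toy_supervisor, max_rounds=4
)
print(final, rounds)  # prints: 4 2
```

In a real setting the two callables would wrap calls to a weaker actor model and a stronger supervisor model, and `max_rounds` would play the role of the round budget R.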

2604.21608 2026-04-24 eess.SY cs.SY

ADMM-Based Distributed Kalman-like Observer with Applications to Cooperative Localization

Nicola De Carli, Nicola Bastianello, Dimos V. Dimarogonas

2604.21606 2026-04-24 cs.CR

Process-Mining of Hypertraces: Enabling Scalable Formal Security Verification of (Automotive) Network Architectures

Julius Figge, David Knuplesch, Andreas Maletti, Dragan Zuvic

Comments Full version prior to submission for publication

2604.21604 2026-04-24 cs.CR cs.CY econ.GN q-fin.EC

Mitigate or Fail: How Risk Management Shapes Cybersecurity Competency

Jeffrey T. Gardiner

Comments Doctor of Business Administration (DBA) Dissertation

DOI: 10.31237/osf.io/rf8xj_v1

Abstract

Contemporary cybersecurity governance assumes that professionals apply risk reasoning. Yet major organisational failures persist despite investment in tools, staffing, and credentials. This study investigates the structural source of that paradox. Cybersecurity speaks the language of risk, but its training architecture has shaped the profession to think in terms of threats. A sequential mixed-methods design integrated four analyses: NLP of the NIST NICE Framework v2.0.0 (2,111 TKS statements), SEM (n = 126 cybersecurity professionals), a control-group comparison (n = 133 general professionals), and thematic coding of seven leadership interviews. Four convergent findings emerged. First, "likelihood" and "probability" appear zero times across all TKS statements. Risk management content accounts for 4.5% of high-confidence semantic classifications, ranking 18th of 29 competency domains. NICE codifies threat-management activity while invoking risk mainly at the category level. Second, SEM showed that training exposure significantly predicts risk management competence directly and indirectly through conceptual salience, for a total effect of Beta = .629. However, the theoretically four-dimensional competence construct collapsed into a single factor, indicating epistemic compression. Third, cybersecurity professionals showed no measurable advantage over the general professional population in foundational risk reasoning; only 11.9% showed high differentiation. Fourth, all seven leaders expected Likelihood × Impact reasoning, yet five did not articulate the formula themselves. These findings support a structural conclusion: cybersecurity has taken professional form as a threat-management discipline that has borrowed risk vocabulary. Remediation requires redesign of professional formation, not marginal curriculum reform.

2604.21603 2026-04-24 cs.LO cs.AI cs.DB

Using ASP(Q) to Handle Inconsistent Prioritized Data

Meghyn Bienvenu, Camille Bourgaux, Robin Jean, Giuseppe Mazzotta

Comments This is an extended version of a paper appearing at the 23rd International Conference on Principles of Knowledge Representation and Reasoning (KR 2026). 21 pages

2604.21602 2026-04-24 cs.NE cs.AI cs.AR cs.ET cs.LG

On the Role of Preprocessing and Memristor Dynamics in Reservoir Computing for Image Classification

Rishona Daniels, Duna Wattad, Ronny Ronen, David Saad, Shahar Kvatinsky

Comments Accepted for publication in Advanced Electronic Materials. Main text: pages 1-32, 11 figures. Supporting information: pages 24-32, 11 figures

2604.21600 2026-04-24 math.NA cs.NA

Positivity-Preserving and Entropy-Stable Oscillation-Eliminating DGSEM for the Compressible Euler Equations on Curvilinear Meshes with Adaptive Mesh Refinement

Jieling Yang, Guosheng Fu

2604.21599 2026-04-24 cs.SE cs.LG

Verifying Machine Learning Interpretability Requirements through Provenance

Lynn Vonderhaar, Juan Couder, Daryela Cisneros, Omar Ochoa

2604.21595 2026-04-24 stat.ML cs.LG

A Kernel Nonconformity Score for Multivariate Conformal Prediction

Louis Meyer, Wenkai Xu

2604.21593 2026-04-24 cs.CL

Language as a Latent Variable for Reasoning Optimization

Linjuan Wu, Haoran Wei, Jialong Tang, Shuang Luo, Baosong Yang, Yongliang Shen, Weiming Lu

Comments 17 pages, 7 figures, under review

2604.21592 2026-04-24 cs.CV

Sculpt4D: Generating 4D Shapes via Sparse-Attention Diffusion Transformers

Minghao Yin, Wenbo Hu, Jiale Xu, Ying Shan, Kai Han

2604.21590 2026-04-24 cs.CL

AgenticQwen: Training Small Agentic Language Models with Dual Data Flywheels for Industrial-Scale Tool Use

Yuanjie Lyu, Chengyu Wang, Haonan Zheng, Yuanhao Yue, Junbing Yan, Ming Wang, Jun Huang

2604.21587 2026-04-24 cs.IT math.IT

Generative Learning Enhanced Intelligent Resource Management for Cell-Free Delay Deterministic Communications

Shuangbo Xiong, Cheng Zhang, Wen Wang, Wenwu Yu, Yongming Huang

Comments The paper has been submitted to IEEE Transactions on Wireless Communications

2604.21584 2026-04-24 cs.AI cs.CE cs.LG

CoFEE: Reasoning Control for LLM-Based Feature Discovery

Maximilian Westermann, Ben Griffin, Aaron Ontoyin Yin, Zakari Salifu, Yagiz Ihlamur, Kelvin Amoaba, Joseph Ternasky, Fuat Alican, Yigit Ihlamur

2604.21580 2026-04-24 cs.IT math.IT

Robust Beamforming for MIMO Radar with Imperfect Prior Distribution Information

Yizhuo Wang, Shuowen Zhang

Comments Accepted to appear in IEEE International Symposium on Information Theory (ISIT), 2026

Abstract

This paper studies a multiple-input multiple-output (MIMO) radar system for sensing the unknown and random angular location (angle) of a point target, based on the target-reflected echo signals and known prior distribution information about the target's angle specified by a probability density function (PDF). We consider a challenging yet practical scenario where the knowledge of this PDF is imperfect, due to inaccuracy in PDF acquisition or unpredicted changes in the target appearance pattern, while the real (actual) PDF is modeled as an unknown perturbed version of the imperfect known PDF bounded by a given uncertainty radius. Such PDF imperfection motivates us to study the robust transmit beamforming design to optimize the worst-case sensing performance among all possible real PDFs. Since the sensing mean-squared error (MSE) is difficult to characterize explicitly, we adopt the worst-case posterior Cramér-Rao bound (PCRB) as the performance metric. We formulate the beamforming optimization problem to minimize the maximum PCRB among all possible real PDFs, which is highly non-trivial since the PCRB has a complex intractable expression over the real PDF, and there are infinitely many constraints corresponding to the continuous set of real PDFs bounded by the uncertainty radius. To address these challenges, we derive a tractable quadratic approximation of the PCRB via second-order Taylor expansion, and leverage the S-procedure to equivalently transform the infinitely many constraints into a linear matrix inequality, based on which the problem is reformulated into a convex optimization problem solvable with polynomial time complexity. The obtained solution approaches the globally optimal robust beamforming solution as the uncertainty radius decreases. Numerical results validate the effectiveness of our proposed robust beamforming design.
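The worst-case design described in the abstract can be written schematically as a min-max problem. The notation below is assumed here for illustration only (W for the transmit beamforming variable, \hat{p} for the known imperfect PDF, \mathcal{U}_\epsilon for the uncertainty set of radius \epsilon, d for an unspecified distance between PDFs); the paper's actual formulation may differ. The second display is the standard S-procedure for two quadratic functions, the general tool that turns an implication over infinitely many points into a single linear matrix inequality.

```latex
% Assumed notation, not taken from the paper.
\min_{W} \; \max_{p \in \mathcal{U}_\epsilon} \; \mathrm{PCRB}(W, p),
\qquad
\mathcal{U}_\epsilon = \{\, p : d(p, \hat{p}) \le \epsilon \,\}

% Standard S-procedure: let f_i(x) = x^\top A_i x + 2 b_i^\top x + c_i with A_i symmetric,
% and assume some \bar{x} satisfies f_1(\bar{x}) < 0. Then
f_1(x) \le 0 \;\Rightarrow\; f_0(x) \le 0 \quad \forall x
\quad\Longleftrightarrow\quad
\exists\, \lambda \ge 0 :\;
\lambda \begin{bmatrix} A_1 & b_1 \\ b_1^\top & c_1 \end{bmatrix}
- \begin{bmatrix} A_0 & b_0 \\ b_0^\top & c_0 \end{bmatrix} \succeq 0
```

Under this reading, f_1 would encode the uncertainty-radius constraint and f_0 the quadratic approximation of the PCRB bound, though that mapping is an interpretation of the abstract rather than a statement from the paper.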

2604.21579 2026-04-24 cs.SE cs.AI

A Metamorphic Testing Approach to Diagnosing Memorization in LLM-Based Program Repair

Milan De Koning, Ali Asgari, Pouria Derakhshanfar, Annibale Panichella

Comments 12 pages

2604.21575 2026-04-24 cs.CV cs.GR

OmniFit: Multi-modal 3D Body Fitting via Scale-agnostic Dense Landmark Prediction

Zeyu Cai, Yuliang Xiu, Renke Wang, Zhijing Shao, Xiaoben Li, Siyuan Yu, Chao Xu, Yang Liu, Baigui Sun, Jian Yang, Zhenyu Zhang

Comments Project Page: https://zcai0612.github.io/OmniFit/

2604.21573 2026-04-24 cs.CV q-bio.QM

CHRep: Cross-modal Histology Representation and Post-hoc Calibration for Spatial Gene Expression Prediction

Changfan Wang, Xinran Wang, Donghai Liu, Fei Su, Lulu Sun, Zhicheng Zhao, Zhu Meng

2604.21572 2026-04-24 cs.CV

Deep kernel video approximation for unsupervised action segmentation

Silvia L. Pintea, Jouke Dijkstra

Comments Accepted at ICPR 2026

2604.21571 2026-04-24 cs.AI cs.LG

Separable Expert Architecture: Toward Privacy-Preserving LLM Personalization via Composable Adapters and Deletable User Proxies

Chris Schneider, Philipp Schoenegger, Ben Bariach

2604.21570 2026-04-24 cs.SE

SpecSyn: LLM-based Synthesis and Refinement of Formal Specifications for Real-world Program Verification

Lezhi Ma, Shangqing Liu, Yi Li, Qiong Wu, Han Wang, Lei Bu

2604.21568 2026-04-24 cs.RO

A Bayesian Reasoning Framework for Robotic Systems in Autonomous Casualty Triage

Szymon Rusiecki, Cecilia Morales, Pia Störy, Kimberly Elenberg, Leonard Weiss, Artur Dubrawski

Comments Accepted to the 2026 IEEE International Conference on Robotics and Automation (ICRA)

2604.21567 2026-04-24 cs.LG cs.AI

Hybrid Deep Learning Approach for Coupled Demand Forecasting and Supply Chain Optimization

Nusrat Yasmin Nadia, Md Habibul Arif, Habibor Rahman Rabby, Md Iftekhar Monzur Tanvir, Md. Jakir Hossen, M. F. Mridha

Comments The paper is accepted in the Computers, Materials & Continua journal

2604.21558 2026-04-24 math.NA cs.NA

A nonconforming method for a generalized Darcy-Forchheimer model

Michele Botti, Lorenzo Mascotto, Marialetizia Mosconi

2604.21556 2026-04-24 cs.AI cs.SE

Probabilistic Verification of Neural Networks via Efficient Probabilistic Hull Generation

Jingyang Li, Xin Chen, Hongfei Fu, Guoqiang Li

Comments 22 pages, 5 figures

2604.21555 2026-04-24 cs.CL

Finding Meaning in Embeddings: Concept Separation Curves

Paul Keuren, Marc Ponsen, Robert Ayoub Bagheri

Comments The code is open source and available on GitHub at https://github.com/pkun-cbs/ConceptSeparationCurves. Original conference paper

2604.21554 2026-04-24 cs.AI

Engaged AI Governance: Addressing the Last Mile Challenge Through Internal Expert Collaboration

Simon Jarvers, Orestis Papakyriakopoulos

2604.21549 2026-04-24 cs.AI stat.ME

Unbiased Prevalence Estimation with Multicalibrated LLMs

Fridolin Linder, Thomas Leeper, Daniel Haimovich, Niek Tax, Lorenzo Perini, Milan Vojnovic

2604.21546 2026-04-24 cs.CV

Component-Based Out-of-Distribution Detection

Wenrui Liu, Hong Chang, Ruibing Hou, Shiguang Shan, Xilin Chen

2604.21544 2026-04-24 cs.IT math.IT

Design of MDP Convolutional Codes and Maximally Recoverable Codes Through the Lens of Matrix Completion

Sakshi Dang, Julia Lieb, Pedro Soto, Alex Sprintson