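arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems science, and more.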

2603.13227 2026-03-16 cs.LG cs.CV

Representation Learning for Spatiotemporal Physical Systems

Helen Qu, Rudy Morel, Michael McCabe, Alberto Bietti, François Lanusse, Shirley Ho, Yann LeCun

Comments: Published at ICLR 2026 Workshop on AI & PDE

Abstract

Machine learning approaches to spatiotemporal physical systems have primarily focused on next-frame prediction, with the goal of learning an accurate emulator for the system's evolution in time. However, these emulators are computationally expensive to train and are subject to performance pitfalls, such as compounding errors during autoregressive rollout. In this work, we take a different perspective and look at scientific tasks further downstream of predicting the next frame, such as estimation of a system's governing physical parameters. Accuracy on these tasks offers a uniquely quantifiable glimpse into the physical relevance of the representations of these models. We evaluate the effectiveness of general-purpose self-supervised methods in learning physics-grounded representations that are useful for downstream scientific tasks. Surprisingly, we find that not all methods designed for physical modeling outperform generic self-supervised learning methods on these tasks, and methods that learn in the latent space (e.g., joint embedding predictive architectures, or JEPAs) outperform those optimizing pixel-level prediction objectives. Code is available at https://github.com/helenqu/physical-representation-learning.

2603.13220 2026-03-16 cs.MA

A Generative Model of Conspicuous Consumption and Status Signaling

Logan Cross, Jordi Grau-Moya, William A. Cunningham, Alexander Sasha Vezhnevets, Joel Z. Leibo

Comments: 29 pages, 13 figures

2603.13215 2026-03-16 cs.CV

Out of Sight, Out of Mind? Evaluating State Evolution in Video World Models

Ziqi Ma, Mengzhan Liufu, Georgia Gkioxari

Comments: https://glab-caltech.github.io/STEVOBench/

2603.13214 2026-03-16 math.OC cs.DM

Investigating mixed-integer programming approaches for the $p$-$α$-closest-center problem

Elisabeth Gaar, Sara Joosten, Markus Sinnl

Abstract

In this work, we introduce and study the $p$-$α$-closest-center problem ($pα$CCP), which generalizes the $p$-second-center problem, a recently emerged variant of the classical $p$-center problem. In the $pα$CCP, we are given sets of customers and potential facility locations, distances between each customer and potential facility location as well as two integers $p$ and $α$. The goal is to open facilities at $p$ of the potential facility locations, such that the maximum $α$-distance between each customer and the open facilities is minimized. The $α$-distance of a customer is defined as the sum of distances from the customer to its $α$ closest open facilities. If $α$ is one, the $pα$CCP is the $p$-center problem, and for $α$ being two, the $p$-second-center problem is obtained, for which the only existing algorithm in literature is a variable neighborhood search (VNS). We present four mixed-integer programming (MIP) formulations for the $pα$CCP, strengthen them by adding valid and optimality-preserving inequalities and conduct a polyhedral study to prove relationships between their linear programming relaxations. Moreover, we present iterative procedures for lifting some valid inequalities to improve initial lower bounds on the optimal objective function value of the $pα$CCP and characterize the best lower bounds obtainable by this iterative lifting approach. Based on our theoretical findings, we develop a branch-and-cut algorithm (B&C) to solve the $pα$CCP exactly. We improve its performance by a starting and a primal heuristic, variable fixings and separating inequalities. In our computational study, we investigate the effect of the various ingredients of our B&C on benchmark instances from related literature. Our B&C is able to prove optimality for 17 of the 40 instances from the work on the VNS heuristic.
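
The objective defined in this abstract can be made concrete with a small sketch. The code below computes the $α$-distance of each customer and finds an optimal facility set by exhaustive enumeration; it assumes $α \le p$. It illustrates the problem definition only and is not the authors' MIP formulations or branch-and-cut algorithm. The function names and the toy distance matrix are hypothetical.

```python
from itertools import combinations

def alpha_distance(dist_row, open_facilities, alpha):
    """Sum of distances from one customer to its alpha closest open facilities."""
    nearest = sorted(dist_row[j] for j in open_facilities)[:alpha]
    return sum(nearest)

def objective(dist, open_facilities, alpha):
    """Maximum alpha-distance over all customers (the value to be minimized)."""
    return max(alpha_distance(row, open_facilities, alpha) for row in dist)

def brute_force(dist, p, alpha):
    """Enumerate every set of p facilities; only viable for tiny instances."""
    n_facilities = len(dist[0])
    return min(
        (objective(dist, subset, alpha), subset)
        for subset in combinations(range(n_facilities), p)
    )

# Hypothetical toy instance: 3 customers (rows) x 4 potential facilities (columns).
dist = [
    [1, 4, 6, 9],
    [5, 2, 3, 8],
    [7, 6, 2, 1],
]
best_value, best_set = brute_force(dist, p=2, alpha=2)
print(best_value, best_set)  # 9 (0, 2)
```

Setting `alpha=1` in the same call reduces the objective to the classical $p$-center problem, as the abstract notes.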

2603.13213 2026-03-16 cs.SE

MoEKD: Mixture-of-Experts Knowledge Distillation for Robust and High-Performing Compressed Code Models

Md. Abdul Awal, Mrigank Rochan, Chanchal K. Roy

Comments: Accepted to the Research Track of the Evaluation and Assessment in Software Engineering (EASE) 2026

Abstract

Large language models for code have achieved strong performance across diverse software analytics tasks, yet their real-world adoption remains limited by high computational demands, slow inference speeds, significant energy consumption, and environmental impact. Knowledge distillation (KD) offers a practical solution by transferring knowledge from a large model to a smaller and more efficient model. Despite its effectiveness, recent studies show that models distilled from a single source often exhibit degraded adversarial robustness, even when robustness-aware distillation techniques are employed. These observations suggest a fundamental limitation of single-source distillation in simultaneously transferring high-quality and robust knowledge. To overcome this limitation, we propose Mixture of Experts Knowledge Distillation (MoEKD), a KD framework that leverages a Mixture of Experts (MoE) architecture to enable more effective and robust knowledge transfer from multiple specialized experts into a compact model. MoEKD decomposes the distillation process into expert and router training, aggregation of expert knowledge through a learned routing mechanism, and distillation from the aggregated knowledge. We evaluate MoEKD on the vulnerability detection task using CodeBERT and GraphCodeBERT models. Experimental results show that MoEKD not only improves adversarial robustness by up to 35.8%, but also enhances predictive performance by up to 13%, compared to state-of-the-art KD baselines, including Compressor and AVATAR. Furthermore, an ablation study demonstrates that aggregating expert knowledge enables ultra-compact models to maintain competitive performance even when their size is reduced by approximately half. Overall, these results highlight the effectiveness of multi-expert knowledge aggregation in addressing key limitations of existing single-source KD approaches.
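
As a rough illustration of distilling from router-aggregated expert knowledge, the sketch below combines expert logits with softmax router weights and computes a standard temperature-scaled distillation loss. The abstract does not specify MoEKD's aggregation rule or loss, so both are assumptions here: a generic weighted average and the usual KL-based distillation loss. All function names and numeric values are hypothetical.

```python
import math

def softmax(logits, temperature=1.0):
    """Numerically stable softmax with an optional temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def aggregate_experts(expert_logits, router_scores):
    """Router-weighted average of expert logits (weights = softmax of router scores)."""
    weights = softmax(router_scores)
    n_classes = len(expert_logits[0])
    return [
        sum(w * logits[c] for w, logits in zip(weights, expert_logits))
        for c in range(n_classes)
    ]

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions, scaled by T^2."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl

# Hypothetical values: two experts, binary (vulnerable / not vulnerable) logits.
expert_logits = [[2.0, -1.0], [0.5, 0.5]]
router_scores = [1.0, 0.0]  # the router favours the first expert for this input
teacher = aggregate_experts(expert_logits, router_scores)
loss = distillation_loss([0.0, 0.0], teacher)
```

In this sketch the student is trained against a single aggregated target, so a change in the router scores shifts which expert dominates the target for a given input.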

2603.13201 2026-03-16 cs.CL

Neuron-Aware Data Selection In Instruction Tuning For Large Language Models

Xin Chen, Junchao Wu, Shu Yang, Runzhe Zhan, Zeyu Wu, Min Yang, Shujian Huang, Lidia S. Chao, Derek F. Wong

2603.13191 2026-03-16 physics.comp-ph cond-mat.mtrl-sci cs.AI

From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Research

Haonan Huang

2603.13190 2026-03-16 cs.CE cond-mat.mtrl-sci

Lattice Discrete Particle Model (LDPM): Comparison of Various Time Integration Solvers and Implementations

Erol Lale, Jan Eliáš, Ke Yu, Matthew Troemner, Monika Středulová, Julien Khoury, Tianju Xue, Ioannis Koutromanos, Alessandro Fascetti, Bahar Ayhan, Baixi Chen, Giovanni Di Luzio, Yuhui Lyu, Madura Pathirage, Gilles Pijaudier-Cabot, Lei Shen, Alessandro Tasora, Lifu Yang, Jiawei Zhong, Gianluca Cusatis

Comments: 28 pages, 15 tables, 8 figures

DOI: 10.1002/nag.70286
Journal ref: International Journal for Numerical and Analytical Methods in Geomechanics, 2026

Abstract

This article presents a comparison of various implementations of the Lattice Discrete Particle Model (LDPM) for the numerical simulation of concrete and other heterogeneous quasibrittle materials. The comparison involves the use of transient implicit and explicit solvers and steady-state (static) solvers and implementations for Central Processing Unit (CPU) as well as Graphics Processing Unit (GPU). The various implementations are compared on the basis of a set of benchmark tests describing behaviors of increasing computational complexity. They include elastic vibrations, confined strain-hardening compressive response, tensile fracture, and unconfined strain-softening compressive response. Metrics of interest extracted from the simulations include macroscopic stress versus strain responses, computational times, number of iterations, and energy balance error. Pairwise comparison of final crack patterns is provided through the correlation coefficient and normalized root mean square error of the crack opening vectors. Moreover, for the most numerically challenging case of unconfined compression with sliding boundary conditions, the stability of the strain-softening response is tested by perturbing the solutions as well as changing the convergence criteria and time step size. Attached to this paper is the complete input data of the benchmark tests; this will allow researchers to run the examples and compare them with their own implementations. In addition, most of the reported implementations are publicly available in open source packages.
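
The two pairwise crack-pattern metrics named in this abstract can be sketched as follows. The code assumes the crack openings from two implementations are given as flat lists of equal length, uses the Pearson correlation coefficient, and normalizes the RMSE by the range of the reference values. The abstract does not state the normalization or how the vectors are assembled, so both are assumptions; the sample values are hypothetical.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)

def nrmse(reference, other):
    """RMSE divided by the range of the reference values (assumed convention)."""
    n = len(reference)
    rmse = math.sqrt(sum((a - b) ** 2 for a, b in zip(reference, other)) / n)
    return rmse / (max(reference) - min(reference))

# Hypothetical crack openings from two solver implementations.
openings_a = [0.0, 0.1, 0.4, 0.2]
openings_b = [0.0, 0.2, 0.8, 0.4]
correlation = pearson(openings_a, openings_b)
error = nrmse(openings_a, openings_b)
```

In this example the second pattern is an exact scaling of the first, so the correlation is 1 while the NRMSE is clearly nonzero, which shows why the two metrics are complementary.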

2603.13189 2026-03-16 cs.MA cs.AI

LLM Constitutional Multi-Agent Governance

J. de Curtò, I. de Zarzà

Comments: Accepted for publication in 20th International Conference on Agents and Multi-Agent Systems: Technologies and Applications (AMSTA 2026), to appear in Springer Nature proceedings (KES Smart Innovation Systems and Technologies). The final authenticated version will be available online at Springer

Abstract

Large Language Models (LLMs) can generate persuasive influence strategies that shift cooperative behavior in multi-agent populations, but a critical question remains: does the resulting cooperation reflect genuine prosocial alignment, or does it mask erosion of agent autonomy, epistemic integrity, and distributional fairness? We introduce Constitutional Multi-Agent Governance (CMAG), a two-stage framework that interposes between an LLM policy compiler and a networked agent population, combining hard constraint filtering with soft penalized-utility optimization that balances cooperation potential against manipulation risk and autonomy pressure. We propose the Ethical Cooperation Score (ECS), a multiplicative composite of cooperation, autonomy, integrity, and fairness that penalizes cooperation achieved through manipulative means. In experiments on scale-free networks of 80 agents under adversarial conditions (70% violating candidates), we benchmark three regimes: full CMAG, naive filtering, and unconstrained optimization. While unconstrained optimization achieves the highest raw cooperation (0.873), it yields the lowest ECS (0.645) due to severe autonomy erosion (0.867) and fairness degradation (0.888). CMAG attains an ECS of 0.741, a 14.9% improvement, while preserving autonomy at 0.985 and integrity at 0.995, with only modest cooperation reduction to 0.770. The naive ablation (ECS = 0.733) confirms that hard constraints alone are insufficient. Pareto analysis shows CMAG dominates the cooperation-autonomy trade-off space, and governance reduces hub-periphery exposure disparities by over 60%. These findings establish that cooperation is not inherently desirable without governance: constitutional constraints are necessary to ensure that LLM-mediated influence produces ethically stable outcomes rather than manipulative equilibria.
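
The reported 14.9% improvement can be checked directly from the ECS values in this abstract, and a multiplicative composite can be illustrated with a toy product. The abstract says only that ECS is multiplicative; the plain, unweighted product below is an assumption rather than the paper's exact formula, and the component scores passed to it are hypothetical.

```python
def relative_improvement(new, baseline):
    """Relative change of `new` with respect to `baseline`."""
    return (new - baseline) / baseline

def ecs_product(cooperation, autonomy, integrity, fairness):
    """Plain product of the four components (an assumed form of the composite)."""
    return cooperation * autonomy * integrity * fairness

# ECS values reported in the abstract.
ecs_cmag, ecs_unconstrained = 0.741, 0.645
gain = relative_improvement(ecs_cmag, ecs_unconstrained)
print(round(gain * 100, 1))  # 14.9

# Hypothetical scores: high cooperation bought with eroded autonomy
# scores lower than moderate cooperation with autonomy intact.
manipulative = ecs_product(0.9, 0.5, 1.0, 1.0)
governed = ecs_product(0.7, 1.0, 1.0, 1.0)
```

Because the components multiply, a low score on any one of them caps the composite, which is how such a score penalizes cooperation achieved through manipulative means.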

2603.13186 2026-03-16 cs.LG cs.AI cs.CR

Learnability and Privacy Vulnerability are Entangled in a Few Critical Weights

Xingli Fang, Jung-Eun Kim

Comments: ICLR 2026

2603.13185 2026-03-16 cs.CV

Towards Spatio-Temporal World Scene Graph Generation from Monocular Videos

Rohith Peddi, Saurabh, Shravan Shanmugam, Likhitha Pallapothula, Yu Xiang, Parag Singla, Vibhav Gogate

Comments: https://github.com/rohithpeddi/WorldSGG

2603.13181 2026-03-16 cs.CR cs.LO

Verification of Robust Properties for Access Control Policies

Alexander V. Gheorghiu

2603.13180 2026-03-16 cs.LG cs.AI cs.NE

MXNorm: Reusing MXFP block scales for efficient tensor normalisation

Callum McLean, Luke Y. Prince, Alexandre Payot, Paul Balança, Carlo Luschi

Comments: Preprint, Under Review. 15 pages, 12 figures

2603.13177 2026-03-16 astro-ph.EP astro-ph.IM cs.AI

Clustering Astronomical Orbital Synthetic Data Using Advanced Feature Extraction and Dimensionality Reduction Techniques

Eraldo Pereira Marinho, Nelson Callegari Junior, Fabricio Aparecido Breve, Caetano Mazzoni Ranieri

Comments: This paper has been accepted for publication in Neural Computing and Applications (Springer Nature)

2603.13176 2026-03-16 cs.CV

Perceive What Matters: Relevance-Driven Scheduling for Multimodal Streaming Perception

Dingcheng Huang, Xiaotong Zhang, Kamal Youcef-Toumi

Comments: Accepted to ICRA 2026

2603.13168 2026-03-16 cs.AI cs.CL cs.IR

Developing and evaluating a chatbot to support maternal health care

Smriti Jha, Vidhi Jain, Jianyu Xu, Grace Liu, Sowmya Ramesh, Jitender Nagpal, Gretchen Chapman, Benjamin Bellows, Siddhartha Goyal, Aarti Singh, Bryan Wilder

Comments: 17 pages; submitted to IJCAI 2026 AI and Social Good Track

2603.13163 2026-03-16 cs.CV cs.LG

Towards Faithful Multimodal Concept Bottleneck Models

Pierre Moreau, Emeline Pineau Ferrand, Yann Choho, Benjamin Wong, Annabelle Blangero, Milan Bhan

2603.13162 2026-03-16 eess.IV cs.CV

DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression

Junqi Shi, Ming Lu, Xingchen Li, Anle Ke, Ruiqi Zhang, Zhan Ma

2603.13158 2026-03-16 math.NA cs.NA math.CV

PhaseJumps: fast computation of zeros from planar grid samples

Antti Haimi, Günther Koliander, José Luis Romero

Comments: 39 pages, 8 figures

2603.13154 2026-03-16 cs.CL cs.AI

ESG-Bench: Benchmarking Long-Context ESG Reports for Hallucination Mitigation

Siqi Sun, Ben Peng Wu, Mali Jin, Peizhen Bai, Hanpei Zhang, Xingyi Song

Comments: To be published in the AAAI 2026 proceedings

2603.13151 2026-03-16 cs.CR

Defensible Design for OpenClaw: Securing Autonomous Tool-Invoking Agents

Zongwei Li, Wenkai Li, Xiaoqi Li

2603.13147 2026-03-16 cs.DC

A common parallel framework for LLP combinatorial problems

David Ribeiro Alves, Vijay K. Garg

2603.13136 2026-03-16 eess.SY cs.SY math.OC

Unifying Decision Making and Trajectory Planning in Automated Driving through Time-Varying Potential Fields

David Costa, Francesco Cerrito, Massimo Canale, Carlo Novara

2603.13135 2026-03-16 cs.IT math.IT math.PR

Reweighted information inequalities

Jonathan Niles-Weed

2603.13134 2026-03-16 cs.AI

When Right Meets Wrong: Bilateral Context Conditioning with Reward-Confidence Correction for GRPO

Yu Li, Tian Lan, Zhengling Qi

2603.13126 2026-03-16 q-bio.NC cs.AI

Developing the PsyCogMetrics AI Lab to Evaluate Large Language Models and Advance Cognitive Science -- A Three-Cycle Action Design Science Study

Zhiye Jin, Yibai Li, K. D. Joshi, Xuefei Deng, Xiaobing Li

Comments: 10 pages. Prepared: April 2025; submitted: June 15, 2025; accepted: August 2025. In: Proceedings of the 59th Hawaii International Conference on System Sciences (HICSS 2026), January 2026

2603.13121 2026-03-16 cs.CV

FDeID-Toolbox: Face De-Identification Toolbox

Hui Wei, Hao Yu, Guoying Zhao

Comments: Technical Report. Codebase: https://github.com/infraface/FDeID-Toolbox

2603.13118 2026-03-16 cs.CV

NOIR: Neural Operator mapping for Implicit Representations

Sidaty El Hadramy, Nazim Haouchine, Michael Wehrli, Philippe C. Cattin

2603.13116 2026-03-16 cs.HC

Memory Printer: Exploring Everyday Reminiscing by Combining Slow Design with Generative AI-based Image Creation

Zhou Fang, Janet Yi-Ching Huang

Comments: Accepted to CHI 2026

2603.13115 2026-03-16 cs.LG

ZO-SAM: Zero-Order Sharpness-Aware Minimization for Efficient Sparse Training

Jie Ji, Gen Li, Kaiyuan Deng, Fatemeh Afghah, Xiaolong Ma