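arXivDaily daily academic digest: synced with the full arXiv feed, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and more.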

2604.13479 2026-04-16 eess.IV cs.CV

Learning Class Difficulty in Imbalanced Histopathology Segmentation via Dynamic Focal Attention

Lakmali Nadeesha Kumari, Sen-Ching Samson Cheung

English abstract

Semantic segmentation of histopathology images under class imbalance is typically addressed through frequency-based loss reweighting, which implicitly assumes that rare classes are difficult. However, true difficulty also arises from morphological variability, boundary ambiguity, and contextual similarity, factors that frequency cannot capture. We propose Dynamic Focal Attention (DFA), a simple and efficient mechanism that learns class-specific difficulty directly within the cross-attention of query-based mask decoders. DFA introduces a learnable per-class bias to attention logits, enabling representation-level reweighting prior to prediction rather than gradient-level reweighting after prediction. Initialised from a log-frequency prior to prevent gradient starvation, the bias is optimised end-to-end, allowing the model to adaptively capture difficulty signals through training, effectively unifying frequency-based and difficulty-aware approaches under a common attention-bias framework. On three histopathology benchmarks (BDSA, BCSS, CRAG), DFA consistently improves Dice and IoU, matching or exceeding a difficulty-aware baseline without a separate estimator or additional training stage. These results demonstrate that encoding class difficulty at the representation level provides a principled alternative to conventional loss reweighting for imbalanced segmentation.
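
As a rough illustration of the mechanism this abstract describes, the sketch below adds a per-class bias to attention logits and initialises it from class frequencies. The abstract does not give the exact formulation, so several details are assumptions: the bias is taken as the negative log frequency, the softmax is taken across class queries for each pixel (a bias that is constant along the softmax axis would cancel out), and every name and number is hypothetical. In the paper the bias is learned end-to-end; here it is a fixed list.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def log_frequency_bias(class_freqs):
    # Assumed initialisation: negative log frequency, so rarer classes
    # start with a larger bias. The paper may use a different sign or scale.
    return [-math.log(f) for f in class_freqs]


def biased_class_attention(logits_per_pixel, class_bias):
    # logits_per_pixel[p][c] is the similarity of pixel p to class query c.
    # Assumption: normalise across class queries so the bias changes the result.
    return [
        softmax([logit + b for logit, b in zip(row, class_bias)])
        for row in logits_per_pixel
    ]


# Hypothetical three-class problem with a strongly imbalanced prior.
freqs = [0.90, 0.09, 0.01]
bias = log_frequency_bias(freqs)
logits = [[2.0, 1.0, 1.0], [0.5, 0.5, 0.5]]

plain = biased_class_attention(logits, [0.0, 0.0, 0.0])
weighted = biased_class_attention(logits, bias)
```

With equal raw logits (the second pixel), the unbiased weights are uniform, while the biased weights shift toward the rarest class. A trainable version would replace the fixed list with a parameter updated by the segmentation loss.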

2604.13474 2026-04-16 cs.CR cs.AI cs.DC

Secure and Privacy-Preserving Vertical Federated Learning

Shan Jin, Sai Rahul Rachuri, Yizhen Wang, Anderson C. A. Nascimento, Yiwei Cai

2604.13462 2026-04-16 cs.SE cs.AI cs.CE cs.LG

Learning from Change: Predictive Models for Incident Prevention in a Regulated IT Environment

Eileen Kapel, Jan Lennartz, Luis Cruz, Diomidis Spinellis, Arie van Deursen

Comments 12 pages, 6 figures, 2026 IEEE/ACM 48th International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP)

2604.13427 2026-04-16 cs.GR cs.AI cs.CV

A Unified Conditional Flow for Motion Generation, Editing, and Intra-Structural Retargeting

Junlin Li, Xinhao Song, Siqi Wang, Haibin Huang, Yili Zhao

Comments 11 pages, 7 figures

2604.13417 2026-04-16 cs.SE cs.AI

The Cognitive Circuit Breaker: A Systems Engineering Framework for Intrinsic AI Reliability

Jonathan Pan

Comments 2 Figures

2604.13393 2026-04-16 math.OC cs.LG stat.ML

A short proof of near-linear convergence of adaptive gradient descent under fourth-order growth and convexity

Damek Davis, Dmitriy Drusvyatskiy

2604.13385 2026-04-16 cs.NE cs.AI

On the Use of Evolutionary Optimization for the Dynamic Chance Constrained Open-Pit Mine Scheduling Problem

Ishara Hewa Pathiranage, Aneta Neumann

Comments Accepted to publish in 2026 IEEE World Congress on Computational Intelligence (WCCI)

2604.13381 2026-04-16 cs.HC cs.AI

Young people's perceptions and recommendations for conversational generative artificial intelligence in youth mental health

Adam Poulsen, Ian B. Hickie, Carla Gorban, Zsofi de Haan, William Capon, Ebenezer Eyeson-Annan, Jalal Radwan, Elizabeth M. Scott, Frank Iorfino, Haley M. LaMonica

2604.13369 2026-04-16 physics.comp-ph cs.LG physics.flu-dyn

AeTHERON: Autoregressive Topology-aware Heterogeneous Graph Operator Network for Fluid-Structure Interaction

Sushrut Kumar

English abstract

Surrogate modeling of body-driven fluid flows where immersed moving boundaries couple structural dynamics to chaotic, unsteady fluid phenomena remains a fundamental challenge for both computational physics and machine learning. We present AeTHERON, a heterogeneous graph neural operator whose architecture directly mirrors the structure of the sharp-interface immersed boundary method (IBM): a dual-graph representation separating fluid and structural domains, coupled through sparse cross-attention that reflects the compact support of IBM interpolation stencils. This physics-informed inductive bias enables AeTHERON to learn nonlinear fluid-structure coupling in a shared high-dimensional latent space, with continuous sinusoidal time embeddings providing temporal generalization across lead times. We evaluate AeTHERON on direct numerical simulations of a flapping flexible caudal fin, a canonical FSI benchmark featuring leading-edge vortex formation, large membrane deformation, and chaotic wake shedding across a 4x5 parameter grid of membrane thickness (h* = 0.01-0.04) and Strouhal number (St = 0.30-0.50). As a proof-of-concept, we train on the first 150 timesteps of a representative case using a 70/30 train/validation split and evaluate on the fully unseen extrapolation window t=150-200. AeTHERON captures large-scale vortex topology and wake structure with qualitative fidelity, achieving a mean extrapolation MAE of 0.168 without retraining, with error peaking near flapping half-cycle transitions where flow reorganization is most rapid -- a physically interpretable pattern consistent with the nonlinear fluid-membrane coupling. Inference requires milliseconds per timestep on a single GPU versus hours for equivalent DNS computation. This is a continuously developing preprint; results and figures will be updated in subsequent versions.
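
To make two ingredients of this abstract concrete, the sketch below shows a continuous sinusoidal time embedding and a mean absolute error (MAE) computation. The frequency schedule follows the common Transformer positional-encoding convention, which is an assumption; the abstract does not state the embedding dimension or frequencies, and the function names are hypothetical.

```python
import math


def sinusoidal_time_embedding(t, dim=8, max_period=10000.0):
    """Map a continuous time t to `dim` sine/cosine features (dim must be even)."""
    half = dim // 2
    # Geometrically spaced frequencies from 1 down toward 1 / max_period (assumed).
    freqs = [max_period ** (-k / half) for k in range(half)]
    return [math.sin(t * w) for w in freqs] + [math.cos(t * w) for w in freqs]


def mean_absolute_error(pred, target):
    """Average absolute difference between two equally long sequences."""
    return sum(abs(p - q) for p, q in zip(pred, target)) / len(pred)


# Embeddings exist for any real-valued time, including unseen lead times.
emb_train = sinusoidal_time_embedding(149.0)
emb_extrapolated = sinusoidal_time_embedding(175.5)
```

Because the embedding is a smooth function of t, a model conditioned on it can be queried at timesteps outside the training window, which is the property the abstract relies on for the t=150-200 evaluation.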

2604.13297 2026-04-16 eess.SY cs.LG cs.SY math.DS

Structure- and Stability-Preserving Learning of Port-Hamiltonian Systems

Binh Nguyen, Nam T. Nguyen, Truong X. Nghiem

2604.13243 2026-04-16 cs.HC cs.AI

Lazy or Efficient? Towards Accessible Eye-Tracking Event Detection Using LLMs

Dongyang Guo, Yasmeen Abdrabou, Enkelejda Kasneci

2604.13218 2026-04-16 stat.ML cs.AI cs.LG math.ST stat.TH

Identifiability of Potentially Degenerate Gaussian Mixture Models With Piecewise Affine Mixing

Danru Xu, Sébastien Lachapelle, Sara Magliacane

Comments 49 pages, 10 figures, AISTATS 2026

2604.13203 2026-04-16 cs.HC cs.AI

Inclusive Kitchen Design for Older Adults: Generative AI Visualizations to Support Mild Cognitive Impairment

Ibrahim Bilau, Nicole Li, Terrence Malayvong, Eunhwa Yang

Comments 19 pages, 7 figures, 5 tables, IAFOR Agen2026 Conference Proceedings

2604.13192 2026-04-16 eess.SY cs.RO cs.SY

Synthesis and Deployment of Maximal Robust Control Barrier Functions through Adversarial Reinforcement Learning

Donggeon David Oh, Duy P. Nguyen, Haimin Hu, Jaime Fernández Fisac

Comments 8 pages, 2 figures. This work has been submitted to the IEEE for possible publication

2604.13191 2026-04-16 cs.GR cs.LG

Fast Voxelization and Level of Detail for Microgeometry Rendering

Javier Fabre, Carlos Castillo, Carlos Rodriguez-Pardo, Jorge Lopez-Moreno

Comments Accepted for publication in The Visual Computer. 16 pages, 7 figures, 3 tables. Supplementary material: https://javierfabre.com/projects/voxel-lod/supp.pdf

2604.13179 2026-04-16 math.OC cs.LG cs.SY eess.SY

HUANet: Hard-Constrained Unrolled ADMM for Constrained Convex Optimization

Trinh Tran, Binh Nguyen, Truong X. Nghiem

2604.13128 2026-04-16 cs.MA cs.LG cs.RO cs.SY eess.SY

Learning Probabilistic Responsibility Allocations for Multi-Agent Interactions

Isaac Remy, Caleb Chang, Karen Leung

2604.13120 2026-04-16 cs.SE cs.AI

AgentForge: Execution-Grounded Multi-Agent LLM Framework for Autonomous Software Engineering

Rajesh Kumar, Waqar Ali, Junaid Ahmed, Najma Imtiaz Ali, Shaban Usman

2604.13114 2026-04-16 cs.SE cs.AI

The Code Whisperer: LLM and Graph-Based AI for Smell and Vulnerability Resolution

Mohammad Baqar, Raji Rustamov, Alexander Hughes

Comments 10 Pages

2604.13109 2026-04-16 cs.SE cs.AI

Applying an Agentic Coding Tool for Improving Published Algorithm Implementations

Worasait Suwannik

2604.13108 2026-04-16 cs.SE cs.AI

Formal Architecture Descriptors as Navigation Primitives for AI Coding Agents

Ruoqi Jin

Comments 4 pages, 4 tables, preprint. Code and data: https://doi.org/10.5281/zenodo.19500105

2604.13107 2026-04-16 cs.SE cs.AI cs.LG

Can Coding Agents Be General Agents?

Maksim Ivanov, Abhijay Rana, Gokul Prabhakaran

2604.13102 2026-04-16 cs.SE cs.AI

CCCE: A Continuous Code Calibration Engine for Autonomous Enterprise Codebase Maintenance via Knowledge Graph Traversal and Adaptive Decision Gating

Santhosh Kusuma Kumar Parimi

English abstract

Enterprise software organizations face an escalating challenge in maintaining the integrity, security, and freshness of codebases that span hundreds of repositories, multiple programming languages, and thousands of interdependent packages. Existing approaches to codebase maintenance -- including static analysis, software composition analysis (SCA), and dependency management tools -- operate in isolation, address only narrow subsets of maintenance concerns, and require substantial manual intervention to propagate changes across interconnected systems. We present the Continuous Code Calibration Engine (CCCE), an event-driven, AI-agentic system that autonomously maintains enterprise codebases throughout the Software Development Life Cycle (SDLC). The CCCE introduces three key technical innovations: (1) a dynamic knowledge graph with bidirectional traversal algorithms that simultaneously compute forward impact propagation and backward test adequacy analysis; (2) an adaptive multi-stage gating framework that classifies calibration actions into four risk tiers using learned risk-confidence scoring rather than static rules; and (3) a multi-model continuous learning architecture operating at multiple temporal scales to refine calibration strategies, risk models, and organizational policies from operational feedback. We formalize the system's graph model, traversal algorithms, and decision logic, and demonstrate through three representative enterprise scenarios that the CCCE reduces mean time to remediation by enabling coordinated, cross-repository calibrations with human-in-the-loop (HITL) oversight where appropriate. The system generates atomic, semantically verified patches with progressive validation and intelligent rollback capabilities, providing end-to-end traceability from triggering events through calibration execution and outcome learning.
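
One plausible reading of the bidirectional traversal in item (1) can be sketched on a small dependency graph: a forward walk from a changed package to everything that depends on it, and a backward walk from each test to everything it exercises. The graph, the node names, and the `test_` naming convention are hypothetical, and the paper's actual graph model and algorithms may differ.

```python
from collections import deque


def reachable(edges, start):
    """Breadth-first walk: every node reachable from start, excluding start."""
    seen, queue = set(), deque([start])
    while queue:
        node = queue.popleft()
        for nxt in edges.get(node, ()):
            if nxt not in seen and nxt != start:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def invert(edges):
    """Reverse every edge so 'X depends on Y' becomes 'Y is used by X'."""
    inverted = {}
    for src, dsts in edges.items():
        for dst in dsts:
            inverted.setdefault(dst, []).append(src)
    return inverted


# Hypothetical graph: each key depends on the nodes in its list.
depends_on = {
    "web-app": ["auth-lib", "ui-kit"],
    "billing": ["auth-lib"],
    "auth-lib": ["crypto-pkg"],
    "test_web": ["web-app"],
    "test_auth": ["auth-lib"],
}

# Forward impact propagation: who is affected if crypto-pkg changes?
impacted = reachable(invert(depends_on), "crypto-pkg")
impacted_components = {n for n in impacted if not n.startswith("test_")}

# Backward test adequacy: which impacted components does no test exercise?
tests = [n for n in depends_on if n.startswith("test_")]
covered = set().union(*(reachable(depends_on, t) for t in tests))
untested = impacted_components - covered
```

Here `untested` contains only `billing`, the kind of gap a gating stage could use to route a change to human review rather than automatic rollout.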

2604.13101 2026-04-16 cs.SE cs.AI

Building Trust in the Skies: A Knowledge-Grounded LLM-based Framework for Aviation Safety

Anirudh Iyengar, Alisa Tiselska, Dumindu Samaraweera, Hong Liu

Comments Initial version of a conference publication

2604.13100 2026-04-16 cs.SE cs.AI

Contract-Coding: Towards Repo-Level Generation via Structured Symbolic Paradigm

Yi Lin, Lujin Zhao, Yijie Shi

2604.13098 2026-04-16 cs.MA cs.CV cs.RO

C$^2$T: Captioning-Structure and LLM-Aligned Common-Sense Reward Learning for Traffic-Vehicle Coordination

Yuyang Chen, Kaiyan Zhao, Yiming Wang, Ming Yang, Bin Rao, Zhenning Li

Comments Accepted to CVPR 2026 Findings Track

2604.13097 2026-04-16 cs.SE cs.AI

ECM Contracts: Contract-Aware, Versioned, and Governable Capability Interfaces for Embodied Agents

Xue Qin, Simin Luan, John See, Cong Yang, Zhijun Li

Comments 24 pages, 4 figures, 12 tables

English abstract

Embodied agents increasingly rely on modular capabilities that can be installed, upgraded, composed, and governed at runtime. Prior work has introduced embodied capability modules (ECMs) as reusable units of embodied functionality, and recent research has explored their runtime governance and controlled evolution. However, a key systems question remains unresolved: how can ECMs be composed and released as a stable software ecosystem rather than as ad hoc skill bundles? We present ECM Contracts, a contract-based interface model for embodied capability modules. Unlike conventional software interfaces that specify only input and output types, ECM Contracts encode six dimensions essential for embodied execution: functional signature, behavioral assumptions, resource requirements, permission boundaries, recovery semantics, and version compatibility. Based on this model, we introduce a compatibility framework for ECM installation, composition, and upgrade, enabling static and pre-deployment checks for type mismatches, dependency conflicts, policy violations, resource contention, and recovery incompatibilities. We further propose a release discipline for embodied capabilities, including version-aware compatibility classes, deprecation rules, migration constraints, and policy-sensitive upgrade checks. We implement a prototype ECM registry, resolver, and contract checker, and evaluate the approach on modular embodied tasks in a robotics runtime setting. Results show that contract-aware composition substantially reduces unsafe or invalid module combinations, and that contract-guided release checks improve upgrade safety and rollback readiness compared with schema-only or ad hoc baselines. Our findings suggest that stable embodied software ecosystems require more than modular packaging: they require explicit contracts that connect capability composition, governance, and evolution.
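
The six contract dimensions and the pre-deployment checks listed in this abstract can be sketched as a small data structure plus a checker. This is a minimal illustration, not the paper's schema: the field names, the string-based types, the equality test for recovery semantics, and the semver-style upgrade rule are all assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ECMContract:
    name: str
    version: tuple                        # version compatibility, (major, minor)
    inputs: dict                          # functional signature: name -> type
    outputs: dict
    assumptions: frozenset = frozenset()  # behavioural assumptions
    resources: frozenset = frozenset()    # exclusive resource requirements
    permissions: frozenset = frozenset()  # permission boundaries
    recovery: str = "abort"               # recovery semantics


def check_composition(producer, consumer, allowed_permissions):
    """Return a list of problems found when feeding producer into consumer."""
    problems = []
    for key, typ in consumer.inputs.items():
        if producer.outputs.get(key) != typ:
            problems.append(f"type mismatch on '{key}'")
    shared = producer.resources & consumer.resources
    if shared:
        problems.append(f"resource contention: {sorted(shared)}")
    for module in (producer, consumer):
        extra = module.permissions - allowed_permissions
        if extra:
            problems.append(f"{module.name}: permission outside policy {sorted(extra)}")
    if producer.recovery != consumer.recovery:
        problems.append("recovery incompatibility")
    return problems


def upgrade_is_compatible(old, new):
    # Assumed semver-style rule: same major version, minor not lower.
    return new.version[0] == old.version[0] and new.version[1] >= old.version[1]


perceive = ECMContract(
    name="perceive", version=(1, 0),
    inputs={"image": "Image"}, outputs={"target_pose": "PoseStamped"},
    resources=frozenset({"camera"}),
    permissions=frozenset({"read_camera", "network"}), recovery="retract",
)
grasp = ECMContract(
    name="grasp", version=(1, 2),
    inputs={"target_pose": "Pose"}, outputs={"grasp_ok": "bool"},
    resources=frozenset({"arm"}),
    permissions=frozenset({"actuate_arm"}), recovery="retract",
)
problems = check_composition(
    perceive, grasp, allowed_permissions=frozenset({"actuate_arm", "read_camera"})
)
```

In this hypothetical pairing the checker reports a type mismatch on `target_pose` and a policy violation for the `network` permission, both detectable before anything runs on a robot.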

2604.13079 2026-04-16 cs.CY cs.AI cs.GT cs.LG

Alignment as Institutional Design: From Behavioral Correction to Transaction Structure in Intelligent Systems

Rui Chai

Comments This is Paper 5 in a 10-paper series on Super-Alignment via Wuxing Institutional Architecture. It shifts alignment from external behavioral correction to internal institutional design, making aligned behavior the lowest-cost equilibrium

2604.13067 2026-04-16 cs.HC cs.CL

From Seeing it to Experiencing it: Interactive Evaluation of Intersectional Voice Bias in Human-AI Speech Interaction

Shree Harsha Bokkahalli Satish, Maria Teleki, Christoph Minixhofer, Ondrej Klejch, Peter Bell, Éva Székely

Comments 6 pages, 3 figures, 1 table, Accepted to CHI Extended Abstracts Poster session 2026

2604.13052 2026-04-16 cs.SI cs.AI cs.CL cs.CY cs.MA

Form Without Function: Agent Social Behavior in the Moltbook Network

Saber Zerhoudi, Kanishka Ghosh Dastidar, Felix Klement, Artur Romazanov, Andreas Einwiller, Dang H. Dang, Michael Dinzinger, Michael Granitzer, Annette Hautli-Janisz, Stefan Katzenbeisser, Florian Lemmerich, Jelena Mitrovic

English abstract

Moltbook is a social network where every participant is an AI agent. We analyze 1,312,238 posts, 6.7 million comments, and over 120,000 agent profiles across 5,400 communities, collected over 40 days (January 27 to March 9, 2026). We evaluate the platform through three layers. At the interaction layer, 91.4% of post authors never return to their own threads, 85.6% of conversations are flat (no reply ever receives a reply), the median time-to-first-comment is 55 seconds, and 97.3% of comments receive zero upvotes. Interaction reciprocity is 3.3%, compared to 22-60% on human platforms. An argumentation analysis finds that 64.6% of comment-to-post relations carry no argumentative connection. At the content layer, 97.9% of agents never post in a community matching their bio, 92.5% of communities contain every topic in roughly equal proportions, and over 80% of shared URLs point to the platform's own infrastructure. At the instruction layer, we use 41 Wayback Machine snapshots to identify six instruction changes during the observation window. Hard constraints (rate limit, content filters) produce immediate behavioral shifts. Soft guidance ("upvote good posts", "stay on topic") is ignored until it becomes an explicit step in the executable checklist. The platform also poses technological risks. We document credential leaks (API keys, JWT tokens), 12,470 unique Ethereum addresses with 3,529 confirmed transaction histories, and attack discourse ranging from template-based SSH brute-forcing to multi-agent offensive security architectures. These persist unmoderated because the quality-filtering mechanisms are themselves non-functional. Moltbook is a socio-technical system where the technical layer responds to changes, but the social layer largely fails to emerge. The form of social media is reproduced in full. The function is absent.
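
Two of the interaction-layer metrics in this abstract can be computed along the following lines. The abstract does not define reciprocity precisely, so this sketch assumes the standard network definition (the share of directed interaction edges whose reverse edge also exists); the data layout and function names are hypothetical.

```python
def reciprocity(edges):
    """Fraction of distinct directed edges (a, b) for which (b, a) also exists."""
    edge_set = {(a, b) for a, b in edges if a != b}  # drop self-interactions
    if not edge_set:
        return 0.0
    mutual = sum(1 for a, b in edge_set if (b, a) in edge_set)
    return mutual / len(edge_set)


def is_flat(comments):
    """A thread is flat when no comment is a reply to another comment.

    comments: list of (comment_id, parent_comment_id), where the parent is
    None for a comment made directly on the post.
    """
    return all(parent is None for _, parent in comments)


# Hypothetical interactions: "a commented on something by b" as (a, b).
interactions = [("a", "b"), ("b", "a"), ("a", "c"), ("d", "a")]
score = reciprocity(interactions)

flat_thread = [(1, None), (2, None)]
nested_thread = [(1, None), (2, 1)]
```

On the toy data, two of the four directed edges are reciprocated, giving a score of 0.5; the abstract reports 3.3% for Moltbook.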
