arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2509.17956 2026-02-27 cs.AI cs.HC

"I think this is fair": Uncovering the Complexities of Stakeholder Decision-Making in AI Fairness Assessment

Lin Luo, Yuri Nakao, Mathieu Chollet, Hiroya Inakoshi, Simone Stumpf

详情

DOI: 10.1145/3772318.3790770

英文摘要

Assessing fairness in artificial intelligence (AI) typically involves AI experts who select protected features, fairness metrics, and set fairness thresholds to assess outcome fairness. However, little is known about how stakeholders, particularly those affected by AI outcomes but lacking AI expertise, assess fairness. To address this gap, we conducted a qualitative study with 26 stakeholders without AI expertise, representing potential decision subjects in a credit rating scenario, to examine how they assess fairness when placed in the role of deciding on features with priority, metrics, and thresholds. We reveal that stakeholders' fairness decisions are more complex than typical AI expert practices: they considered features far beyond legally protected features, tailored metrics for specific contexts, set diverse yet stricter fairness thresholds, and even preferred designing customized fairness. Our results extend the understanding of how stakeholders can meaningfully contribute to AI fairness governance and mitigation, underscoring the importance of incorporating stakeholders' nuanced fairness judgments.

URL PDF HTML ☆

赞 0 踩 0

2509.17562 2026-02-27 cs.CV

Visual Instruction Pretraining for Domain-Specific Foundation Models

Yuxuan Li, Yicheng Zhang, Wenhao Tang, Yimian Dai, Ming-Ming Cheng, Xiang Li, Jian Yang

2509.16552 2026-02-27 cs.CV cs.RO

ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting

Xiaoyang Yan, Muleilan Pei, Shaojie Shen

Comments Accepted by ICRA 2026

2509.15429 2026-02-27 cs.LG physics.bio-ph q-bio.QM

Random Matrix Theory-guided sparse PCA for single-cell RNA-seq data

Victor Chardès

Comments 16 figures

2509.09458 2026-02-27 cs.LG

AquaCast: Urban Water Dynamics Forecasting with Precipitation-Informed Multi-Input Transformer

Golnoosh Abdollahinejad, Saleh Baghersalimi, Denisa-Andreea Constantinescu, Sergey Shevchik, David Atienza

Comments This work has been submitted to Journal of Hydrology, Elsevier, and a preprint version is also available at SSRN 10.2139/ssrn.5399833

2509.07706 2026-02-27 cs.AI

FHIR-RAG-MEDS: Integrating HL7 FHIR with Retrieval-Augmented Large Language Models for Enhanced Medical Decision Support

Yildiray Kabak, Gokce B. Laleci Erturkmen, Mert Gencturk, Tuncay Namli, A. Anil Sinaci, Ruben Alcantud Corcoles, Cristina Gomez Ballesteros, Pedro Abizanda, Asuman Dogac

Comments 23 pages, submitted to Journal AI specifically to the special issue "LLMs and AI Agents in Biomedical and Health Sciences", under review

2509.04403 2026-02-27 cs.CV cs.CL cs.CR

Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios

Jingen Qu, Lijun Li, Bo Zhang, Yichen Yan, Jing Shao

Comments Accepted at EMNLP 2025 Findings

2508.20570 2026-02-27 cs.CV cs.AI

Dyslexify: A Mechanistic Defense Against Typographic Attacks in CLIP

Lorenz Hufe, Constantin Venhoff, Erblina Purelku, Maximilian Dreyer, Sebastian Lapuschkin, Wojciech Samek

2508.12764 2026-02-27 cs.LG physics.data-an

Short-Term Forecasting of Energy Production and Consumption Using Extreme Learning Machine: A Comprehensive MIMO based ELM Approach

Cyril Voyant, Milan Despotovic, Luis Garcia-Gutierrez, Mohammed Asloune, Yves-Marie Saint-Drenan, Jean-Laurent Duchaud, hjuvan Antone Faggianelli, Elena Magliaro

详情

DOI: 10.1016/j.apenergy.2026.127599
Journal ref: Applied Energy, Volume 410, 1 May 2026, 127599

英文摘要

A novel methodology for short-term energy forecasting using an Extreme Learning Machine ($\mathtt{ELM}$) is proposed. Using six years of hourly data collected in Corsica (France) from multiple energy sources (solar, wind, hydro, thermal, bioenergy, and imported electricity), our approach predicts both individual energy outputs and total production (including imports, which closely follow energy demand, modulo losses) through a Multi-Input Multi-Output ($\mathtt{MIMO}$) architecture. To address non-stationarity and seasonal variability, sliding window techniques and cyclic time encoding are incorporated, enabling dynamic adaptation to fluctuations. The $\mathtt{ELM}$ model significantly outperforms persistence-based forecasting, particularly for solar and thermal energy, achieving an $\mathtt{nRMSE}$ of $17.9\%$ and $5.1\%$, respectively, with $\mathtt{R^2} > 0.98$ (1-hour horizon). The model maintains high accuracy up to five hours ahead, beyond which renewable energy sources become increasingly volatile. While $\mathtt{MIMO}$ provides marginal gains over Single-Input Single-Output ($\mathtt{SISO}$) architectures and offers key advantages over deep learning methods such as $\mathtt{LSTM}$, it provides a closed-form solution with lower computational demands, making it well-suited for real-time applications, including online learning. Beyond predictive accuracy, the proposed methodology is adaptable to various contexts and datasets, as it can be tuned to local constraints such as resource availability, grid characteristics, and market structures.

URL PDF HTML ☆

赞 0 踩 0

2508.04228 2026-02-27 cs.CV cs.AI cs.LG cs.MM

LayerT2V: A Unified Multi-Layer Video Generation Framework

Guangzhao Li, Kangrui Cen, Baixuan Zhao, Yi Xin, Siqi Luo, Guangtao Zhai, Lei Zhang, Xiaohong Liu

Comments Project Page is https://layert2v.github.io/

2508.03587 2026-02-27 cs.LG

Zero-Variance Gradients for Variational Autoencoders

Zilei Shao, Anji Liu, Guy Van den Broeck

2507.21259 2026-02-27 cs.RO

NMPCM: Nonlinear Model Predictive Control on Resource-Constrained Microcontrollers

Van Chung Nguyen, Pratik Walunj, Chuong Le, An Duy Nguyen, Hung Manh La

2507.08491 2026-02-27 cs.CL

A Third Paradigm for LLM Evaluation: Dialogue Game-Based Evaluation using clembench

David Schlangen, Sherzod Hakimov, Chalamalasetti Kranti, Jonathan Jordan, Philipp Sadler

Comments All code required to run the benchmark, as well as extensive documentation, is available at https://github.com/clembench/clembench

2507.03772 2026-02-27 cs.LG stat.ML

Skewed Score: A statistical framework to assess autograders

Magda Dubois, Harry Coppock, Mario Giulianelli, Timo Flesch, Lennart Luettgau, Cozmin Ududec

2506.16782 2026-02-27 cs.LG cs.AI cs.CY

What Is the Point of Equality in Machine Learning Fairness? Beyond Equality of Opportunity

Youjin Kong

Comments Presented at ACM FAccT 2025; Forthcoming in ACM Journal on Responsible Computing

详情

DOI: 10.1145/3766539
Journal ref: ACM J. Responsib. Comput. 3, 1, Article 4 (March 2026)

英文摘要

Fairness in machine learning (ML) has become a rapidly growing area of research. But why, in the first place, is unfairness in ML wrong? And why should we care about improving fairness? Most fair-ML research implicitly appeals to distributive equality: the idea that desirable benefits and goods, such as opportunities (e.g., Barocas et al., 2023), should be equally distributed across society. Unfair ML models, then, are seen as wrong because they unequally distribute such benefits. This paper argues that this exclusive focus on distributive equality offers an incomplete and potentially misleading ethical foundation. Grounding ML fairness in egalitarianism--the view that equality is a fundamental moral and social ideal--requires challenging structural inequality: systematic, institutional, and durable arrangements that privilege some groups while disadvantaging others. Structural inequality manifests through ML systems in two primary forms: allocative harms (e.g., economic loss) and representational harms (e.g., stereotypes, erasure). While distributive equality helps address allocative harms, it fails to explain why representational harms are wrong--why it is wrong for ML systems to reinforce social hierarchies that stratify people into superior and inferior groups--and why ML systems should aim to foster a society where people relate as equals (i.e., relational equality). To address these limitations, the paper proposes a multifaceted egalitarian framework for ML fairness that integrates both distributive and relational equality. Drawing on critical social and political philosophy, this framework offers a more comprehensive ethical foundation for tackling the full spectrum of harms perpetuated by ML systems. The paper also outlines practical pathways for implementing the framework across the entire ML pipeline.

URL PDF HTML ☆

赞 0 踩 0

2506.15339 2026-02-27 cs.CL

DeVisE: Behavioral Testing of Medical Large Language Models

Camila Zurdo Tagliabue, Heloisa Oss Boll, Aykut Erdem, Erkut Erdem, Iacer Calixto

Comments Camera-ready version published at Findings of the EACL 2026

2506.15190 2026-02-27 cs.LG q-bio.NC

Learning Task-Agnostic Motifs to Capture the Continuous Nature of Animal Behavior

Jiyi Wang, Jingyang Ke, Bo Dai, Anqi Wu

Comments 8 pages and 4 figures for the main text

2506.01392 2026-02-27 cs.RO cs.AI cs.CV

Sparse Imagination for Efficient Visual World Model Planning

Junha Chun, Youngjoon Jeong, Taesup Kim

Comments Accepted to ICLR 2026; Project Page: https://nikriz1.github.io/sparse_imagination/

2505.19792 2026-02-27 cs.AI

Types of Relations: Defining Analogies with Category Theory

Claire Ott, Frank Jäkel

Comments 27 pages, 15 figures

2505.17517 2026-02-27 cs.LG

The Spacetime of Diffusion Models: An Information Geometry Perspective

Rafał Karczewski, Markus Heinonen, Alison Pouplin, Søren Hauberg, Vikas Garg

Comments ICLR 2026 (Oral)

2505.17442 2026-02-27 cs.CV

Reflectance Prediction-based Knowledge Distillation for Robust 3D Object Detection in Compressed Point Clouds

Hao Jing, Anhong Wang, Yifan Zhang, Donghan Bu, Junhui Hou

详情

DOI: 10.1109/TIP.2025.3648203
Journal ref: IEEE Transactions on Image Processing, Vol. 35, pp. 85-97, 2026

英文摘要

Regarding intelligent transportation systems, low-bitrate transmission via lossy point cloud compression is vital for facilitating real-time collaborative perception among connected agents, such as vehicles and infrastructures, under restricted bandwidth. In existing compression transmission systems, the sender lossily compresses point coordinates and reflectance to generate a transmission code stream, which faces transmission burdens from reflectance encoding and limited detection robustness due to information loss. To address these issues, this paper proposes a 3D object detection framework with reflectance prediction-based knowledge distillation (RPKD). We compress point coordinates while discarding reflectance during low-bitrate transmission, and feed the decoded non-reflectance compressed point clouds into a student detector. The discarded reflectance is then reconstructed by a geometry-based reflectance prediction (RP) module within the student detector for precise detection. A teacher detector with the same structure as the student detector is designed for performing reflectance knowledge distillation (RKD) and detection knowledge distillation (DKD) from raw to compressed point clouds. Our cross-source distillation training strategy (CDTS) equips the student detector with robustness to low-quality compressed data while preserving the accuracy benefits of raw data through transferred distillation knowledge. Experimental results on the KITTI and DAIR-V2X-V datasets demonstrate that our method can boost detection accuracy for compressed point clouds across multiple code rates. We will release the code publicly at https://github.com/HaoJing-SX/RPKD.

URL PDF HTML ☆

赞 0 踩 0

2505.16164 2026-02-27 cs.CL

Can LLMs Simulate Human Behavioral Variability? A Case Study in the Phonemic Fluency Task

Mengyang Qiu, Zoe Brisebois, Siena Sun

2505.08371 2026-02-27 cs.LG stat.ML

Density Ratio-based Causal Discovery from Bivariate Continuous-Discrete Data

Takashi Nicholas Maeda, Shohei Shimizu, Hidetoshi Matsui

2505.07734 2026-02-27 cs.CV

LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention

Jiangling Zhang, Weijie Zhu, Jirui Huang, Yaxiong Chen

Comments Accepted to ECAI 2025

2505.04733 2026-02-27 cs.LG

Conformal Prediction with Corrupted Labels: Uncertain Imputation and Robust Re-weighting

Shai Feldman, Stephen Bates, Yaniv Romano

2505.04317 2026-02-27 cs.AI

Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning

Ruize Zhang, Sirui Xiang, Zelai Xu, Feng Gao, Shilong Ji, Wenhao Tang, Wenbo Ding, Chao Yu, Yu Wang

Comments Accepted by CoRL 2025

2505.03801 2026-02-27 cs.LG cs.AI

Large Language Model Compression with Global Rank and Sparsity Optimization

Changhai Zhou, Qian Qiao, Yuhua Zhou, Yuxin Wu, Shichao Weng, Weizhong Zhang, Cheng Jin

Comments 33 pages, 5 figures

2504.12522 2026-02-27 cs.CL cs.AI

Evaluating the Diversity and Quality of LLM Generated Content

Alexander Shypula, Shuo Li, Botong Zhang, Vishakh Padmakumar, Kayo Yin, Osbert Bastani

Comments Published at COLM 2025

2504.01445 2026-02-27 cs.AI

Compositional-ARC: Assessing Systematic Generalization in Abstract Spatial Reasoning

Philipp Mondorf, Shijia Zhou, Monica Riedler, Barbara Plank

Comments ICLR 2026, 37 pages, 15 figures

详情

英文摘要

Systematic generalization refers to the capacity to understand and generate novel combinations from known components. Despite recent progress by large language models (LLMs) across various domains, these models often fail to extend their knowledge to novel compositional scenarios, revealing notable limitations in systematic generalization. There has been an ongoing debate about whether neural networks possess the capacity for systematic generalization, with recent studies suggesting that meta-learning approaches designed for compositionality can significantly enhance this ability. However, these insights have largely been confined to linguistic problems, leaving their applicability to other tasks an open question. In this study, we extend meta-learning for compositionality to the domain of abstract spatial reasoning. To this end, we introduce $\textit{Compositional-ARC}\unicode{x2014}$a dataset designed to evaluate the capacity of models to systematically generalize from known geometric transformations (e.g., translation, rotation) of abstract two-dimensional objects to novel combinations of these transformations (e.g., translation+rotation). Our results show that a small transformer-based encoder-decoder model, trained via meta-learning for compositionality, can systematically generalize to previously unseen transformation compositions. Notably, despite having only 5.7M parameters, this model significantly outperforms state-of-the-art LLMs$\unicode{x2014}$including o3-mini, GPT-4o, and Gemini 2.0 Flash, which fail to exhibit similar systematic behavior$\unicode{x2014}$and performs on par with the winning model of the ARC prize 2024, an 8B-parameter LLM trained via test-time training. Our findings highlight the effectiveness of meta-learning in promoting systematicity beyond linguistic tasks, suggesting a promising direction toward more robust and generalizable models.

URL PDF HTML ☆

赞 0 踩 0

2503.22841 2026-02-27 cs.CV

GmNet: Revisiting Gating Mechanisms From A Frequency View

Yifan Wang, Xu Ma, Yitian Zhang, Zhongruo Wang, Sung-Cheol Kim, Vahid Mirjalili, Vidya Renganathan, Yun Fu