arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.07368 2026-03-10 cs.CL cs.AI

Position: LLMs Must Use Functor-Based and RAG-Driven Bias Mitigation for Fairness

Ravi Ranjan, Utkarsh Grover, Agorista Polyzou

Comments 24 pages, 3 figures

详情

Journal ref: Review available from NeurIPS 2025 reviwers

英文摘要

Biases in large language models (LLMs) often manifest as systematic distortions in associations between demographic attributes and professional or social roles, reinforcing harmful stereotypes across gender, ethnicity, and geography. This position paper advocates for addressing demographic and gender biases in LLMs through a dual-pronged methodology, integrating category-theoretic transformations and retrieval-augmented generation (RAG). Category theory provides a rigorous, structure-preserving mathematical framework that maps biased semantic domains to unbiased canonical forms via functors, ensuring bias elimination while preserving semantic integrity. Complementing this, RAG dynamically injects diverse, up-to-date external knowledge during inference, directly countering ingrained biases within model parameters. By combining structural debiasing through functor-based mappings and contextual grounding via RAG, we outline a comprehensive framework capable of delivering equitable and fair model outputs. Our synthesis of the current literature validates the efficacy of each approach individually, while addressing potential critiques demonstrates the robustness of this integrated strategy. Ensuring fairness in LLMs, therefore, demands both the mathematical rigor of category-theoretic transformations and the adaptability of retrieval augmentation.

URL PDF HTML ☆

赞 0 踩 0

2603.07366 2026-03-10 cs.CL

RILEC: Detection and Generation of L1 Russian Interference Errors in English Learner Texts

Darya Kharlamova, Irina Proskurina

Comments 12 pages, 7 tables, 2 figures. Accepted to LREC 2026

2603.07365 2026-03-10 cs.LG cs.AI

Scaling Laws in the Tiny Regime: How Small Models Change Their Mistakes

Mohammed Alnemari, Rizwan Qureshi, Nader Begrazadah

Comments 17 pages, 6 figures, 2 tables. Submitted to MDPI Machine Learning and Knowledge Extraction (MAKE)

2603.07361 2026-03-10 cs.LG cs.CV

N-Tree Diffusion for Long-Horizon Wildfire Risk Forecasting

Yucheng Xing, Xin Wang

Comments 15 pages, 6 figures

2603.07360 2026-03-10 cs.AI

The Yerkes-Dodson Curve for AI Agents: Emergent Cooperation Under Environmental Pressure in Multi-Agent LLM Simulations

Ivan Pasichnyk

Comments 13 pages, 2 figures, 7 tables

2603.07351 2026-03-10 cs.RO cs.LG stat.ML

A Distributed Gaussian Process Model for Multi-Robot Mapping

Seth Nabarro, Mark van der Wilk, Andrew J. Davison

Comments ICRA 2026, 8 pages

2603.07348 2026-03-10 cs.LG

Learning Clinical Representations Under Systematic Distribution Shift

Yuanyun Zhang, Shi Li

2603.07346 2026-03-10 cs.CL

How Much Noise Can BERT Handle? Insights from Multilingual Sentence Difficulty Detection

Nouran Khallaf, Serge Sharoff

详情

Journal ref: Proceedings of the International Conference on Language Resources and Evaluation (LREC), 2026

英文摘要

Noisy training data can significantly degrade the performance of language-model-based classifiers, particularly in non-topical classification tasks. In this study we designed a methodological framework to assess the impact of denoising. More specifically, we explored a range of denoising strategies for sentence-level difficulty detection, using training data derived from document-level difficulty annotations obtained through noisy crowdsourcing. Beyond monolingual settings, we also address cross-lingual transfer, where a multilingual language model is trained in one language and tested in another. We evaluate several noise reduction techniques, including Gaussian Mixture Models (GMM), Co-Teaching, Noise Transition Matrices, and Label Smoothing. Our results indicate that while BERT-based models exhibit inherent robustness to noise, incorporating explicit noise detection can further enhance performance. For our smaller dataset, GMM-based noise filtering proves particularly effective in improving prediction quality by raising the Area-Under-the-Curve score from 0.52 to 0.92, or to 0.93 when de-noising methods are combined. However, for our larger dataset, the intrinsic regularisation of pre-trained language models provides a strong baseline, with denoising methods yielding only marginal gains (from 0.92 to 0.94, while a combination of two denoising methods made no contribution). Nonetheless, removing noisy sentences (about 20\% of the dataset) helps in producing a cleaner corpus with fewer infelicities. As a result we have released the largest multilingual corpus for sentence difficulty prediction: see https://github.com/Nouran-Khallaf/denoising-difficulty

URL PDF HTML ☆

赞 0 踩 0

2603.06281 2026-03-10 cs.CV

Attribute Distribution Modeling and Semantic-Visual Alignment for Generative Zero-shot Learning

Haojie Pu, Zhuoming Li, Yongbiao Gao, Yuheng Jia

Comments 17 pages, 13 figures(Under review)

2603.06034 2026-03-10 cs.CV

Occlusion-Aware SORT: Observing Occlusion for Robust Multi-Object Tracking

Chunjiang Li, Jianbo Ma, Li Shen, Yanru Chen, Liangyin Chen

Comments Accepted to CVPR 2026. [The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026 (CVPR2026)]

2603.05971 2026-03-10 cs.CV

Towards High-resolution and Disentangled Reference-based Sketch Colorization

Dingkun Yan, Xinrui Wang, Ru Wang, Zhuoru Li, Jinze Yu, Yusuke Iwasawa, Yutaka Matsuo, Jiaxian Guo

2603.05964 2026-03-10 cs.CV

CR-QAT: Curriculum Relational Quantization-Aware Training for Open-Vocabulary Object Detection

Jinyeong Park, Donghwa Kang, Brent ByungHoon Kang, Hyeongboo Baek, Jibum Kim

2603.05069 2026-03-10 cs.AI cs.HC cs.MA

Jagarin: A Three-Layer Architecture for Hibernating Personal Duty Agents on Mobile

Ravi Kiran Kadaboina

Comments 12 pages, 4 figures

2603.04989 2026-03-10 cs.CV

TAPFormer: Robust Arbitrary Point Tracking via Transient Asynchronous Fusion of Frames and Events

Jiaxiong Liu, Zhen Tan, Jinpu Zhang, Yi Zhou, Hui Shen, Xieyuanli Chen, Dewen Hu

2603.04663 2026-03-10 cs.LG cs.AI cs.CE

Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency Hallucination Detector

Pedram Agand

Comments 21 pages, 8 figures, 7 tables

2603.03596 2026-03-10 cs.RO cs.LG

MEM: Multi-Scale Embodied Memory for Vision Language Action Models

Marcel Torne, Karl Pertsch, Homer Walke, Kyle Vedder, Suraj Nair, Brian Ichter, Allen Z. Ren, Haohuan Wang, Jiaming Tang, Kyle Stachowicz, Karan Dhabalia, Michael Equi, Quan Vuong, Jost Tobias Springenberg, Sergey Levine, Chelsea Finn, Danny Driess

Comments Website: https://pi.website/research/memory

2603.03524 2026-03-10 cs.LG cs.AI

Test-Time Meta-Adaptation with Self-Synthesis

Zeyneb N. Kaya, Nick Rui

Comments 5 pages, 2 figures, 1 table. Accepted to AI with Recursive Self-Improvement (RSI) Workshop @ ICLR 2026

2603.03155 2026-03-10 cs.LG cs.AI physics.chem-ph

Information Routing in Atomistic Foundation Models: How Task Alignment and Equivariance Shape Linear Disentanglement

Joshua Steier

2603.02899 2026-03-10 cs.LG

Embedding interpretable $\ell_1$-regression into neural networks for uncovering temporal structure in cell imaging

Fabian Kabus, Maren Hackenberg, Julia Hindel, Thibault Cholvin, Antje Kilias, Thomas Brox, Abhinav Valada, Marlene Bartos, Harald Binder

2603.02767 2026-03-10 cs.CV cs.AI

ITO: Images and Texts as One via Synergizing Multiple Alignment and Training-Time Fusion

Hanpeng Liu, Yaqian Li, Zidan Wang, Shuoxi Zhang, Zonglin Zhao, Zihao Bo, Rinyoichi Takezoe, Kaiwen Long, Kun He

2603.02748 2026-03-10 cs.CV cs.AI

iGVLM: Dynamic Instruction-Guided Vision Encoding for Question-Aware Multimodal Understanding

Hanpeng Liu, Yaqian Li, Zidan Wang, Shuoxi Zhang, Zihao Bo, Rinyoichi Takezoe, Kaiwen Long, Kun He

2603.01396 2026-03-10 cs.AI cs.CE q-bio.QM

HarmonyCell: Automating Single-Cell Perturbation Modeling under Semantic and Distribution Shifts

Wenxuan Huang, Mingyu Tsoi, Yanhao Huang, Xinjie Mao, Xue Xia, Hao Wu, Jiaqi Wei, Yuejin Yang, Lang Yu, Cheng Tan, Xiang Zhang, Zhangyang Gao, Siqi Sun

Comments 18 pages total (8 pages main text + appendix), 6 figures

2603.00924 2026-03-10 cs.CL cs.AI

Conformal Prediction for Risk-Controlled Medical Entity Extraction Across Clinical Domains

Manil Shrestha, Edward Kim

2603.00907 2026-03-10 cs.CL

KVSlimmer: Theoretical Insights and Practical Optimizations for Asymmetric KV Merging

Lianjun Liu, Hongli An, Weiqi Yan, Xin Du, Shengchuan Zhang, Huazhong Liu, Yunshan Zhong

2603.00586 2026-03-10 cs.CV

WildActor: Unconstrained Identity-Preserving Video Generation

Qin Guo, Tianyu Yang, Xuanhua He, Fei Shen, Yong Zhang, Zhuoliang Kang, Xiaoming Wei, Dan Xu

Comments Project Page: https://wildactor.github.io/

2603.00312 2026-03-10 cs.AI cs.LG

How Well Do Multimodal Models Reason on ECG Signals?

Maxwell A. Xu, Harish Haresamudram, Catherine W. Liu, Patrick Langer, Jathurshan Pradeepkumar, Wanting Mao, Sunita J. Ferns, Aradhana Verma, Jimeng Sun, Paul Schmiedmayer, Xin Liu, Daniel McDuff, Emily B. Fox, James M. Rehg

2602.23615 2026-03-10 cs.CV

Annotation-Free Visual Reasoning for High-Resolution Large Multimodal Models via Reinforcement Learning

Jiacheng Yang, Anqi Chen, Yunkai Dang, Qi Fan, Cong Wang, Wenbin Li, Feng Miao, Yang Gao

2602.22758 2026-03-10 cs.AI stat.AP

Decomposing Physician Disagreement in HealthBench

Satya Borgohain, Roy Mariathas

2602.22519 2026-03-10 cs.AI cs.IT math.IT

A Mathematical Theory of Agency and Intelligence

Wael Hafez, Chenan Wei, Rodrigo Pena, Amir Nazeri, Cameron Reid

Comments 20 pages, 4 figuers

2602.22401 2026-03-10 cs.AI cs.HC

Vibe Researching as Wolf Coming: Can AI Agents with Skills Replace or Augment Social Scientists?

Yongjun Zhang

Comments Commentary