arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.04854 2026-03-06 cs.CL

SinhaLegal: A Benchmark Corpus for Information Extraction and Analysis in Sinhala Legislative Texts

Minduli Lasandi, Nevidu Jayatilleke

Comments 18 pages, 8 figures, 18 tables, Accepted paper at the 2nd workshop on Language Models for Low-Resource Languages (LoResLM 2026) @ EACL 2026

详情

英文摘要

SinhaLegal introduces a Sinhala legislative text corpus containing approximately 2 million words across 1,206 legal documents. The dataset includes two types of legal documents: 1,065 Acts dated from 1981 to 2014 and 141 Bills from 2010 to 2014, which were systematically collected from official sources. The texts were extracted using OCR with Google Document AI, followed by extensive post-processing and manual cleaning to ensure high-quality, machine-readable content, along with dedicated metadata files for each document. A comprehensive evaluation was conducted, including corpus statistics, lexical diversity, word frequency analysis, named entity recognition, and topic modelling, demonstrating the structured and domain-specific nature of the corpus. Additionally, perplexity analysis using both large and small language models was performed to assess how effectively language models respond to domain-specific texts. The SinhaLegal corpus represents a vital resource designed to support NLP tasks such as summarisation, information extraction, and analysis, thereby bridging a critical gap in Sinhala legal research.

URL PDF HTML ☆

赞 0 踩 0

2603.04851 2026-03-06 cs.LG cs.CL

Why Is RLHF Alignment Shallow? A Gradient Analysis

Robin Young

2603.04847 2026-03-06 cs.CV cs.GR

GloSplat: Joint Pose-Appearance Optimization for Faster and More Accurate 3D Reconstruction

Tianyu Xiong, Rui Li, Linjie Li, Jiaqi Yang

2603.04845 2026-03-06 cs.RO

Task-Relevant and Irrelevant Region-Aware Augmentation for Generalizable Vision-Based Imitation Learning in Agricultural Manipulation

Shun Hattori, Hikaru Sasaki, Takumi Hachimine, Yusuke Mizutani, Takamitsu Matsubara

2603.04837 2026-03-06 cs.AI

Design Behaviour Codes (DBCs): A Taxonomy-Driven Layered Governance Benchmark for Large Language Models

G. Madan Mohan, Veena Kiran Nambiar, Kiranmayee Janardhan

Comments 14 pages, 3 figures

详情

英文摘要

We introduce the Dynamic Behavioral Constraint (DBC) benchmark, the first empirical framework for evaluating the efficacy of a structured, 150-control behavioral governance layer, the MDBC (Madan DBC) system, applied at inference time to large language models (LLMs). Unlike training time alignment methods (RLHF, DPO) or post-hoc content moderation APIs, DBCs constitute a system prompt level governance layer that is model-agnostic, jurisdiction-mappable, and auditable. We evaluate the DBC Framework across a 30 domain risk taxonomy organized into six clusters (Hallucination and Calibration, Bias and Fairness, Malicious Use, Privacy and Data Protection, Robustness and Reliability, and Misalignment Agency) using an agentic red-team protocol with five adversarial attack strategies (Direct, Roleplay, Few-Shot, Hypothetical, Authority Spoof) across 3 model families. Our three-arm controlled design (Base, Base plus Moderation, Base plus DBC) enables causal attribution of risk reduction. Key findings: the DBC layer reduces the aggregate Risk Exposure Rate (RER) from 7.19 percent (Base) to 4.55 percent (Base plus DBC), representing a 36.8 percent relative risk reduction, compared with 0.6 percent for a standard safety moderation prompt. MDBC Adherence Scores improve from 8.6 by 10 (Base) to 8.7 by 10 (Base plus DBC). EU AI Act compliance (automated scoring) reaches 8.5by 10 under the DBC layer. A three judge evaluation ensemble yields Fleiss kappa greater than 0.70 (substantial agreement), validating our automated pipeline. Cluster ablation identifies the Integrity Protection cluster (MDBC 081 099) as delivering the highest per domain risk reduction, while graybox adversarial attacks achieve a DBC Bypass Rate of 4.83 percent . We release the benchmark code, prompt database, and all evaluation artefacts to enable reproducibility and longitudinal tracking as models evolve.

URL PDF HTML ☆

赞 0 踩 0

2603.04827 2026-03-06 cs.LG cs.AI cs.NA math.NA

Multilevel Training for Kolmogorov Arnold Networks

Ben S. Southworth, Jonas A. Actor, Graham Harper, Eric C. Cyr

2603.04825 2026-03-06 cs.CV cs.LG

Mitigating Instance Entanglement in Instance-Dependent Partial Label Learning

Rui Zhao, Bin Shi, Kai Sun, Bo Dong

Comments Accepted to CVPR2026

2603.04822 2026-03-06 cs.AI

VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment

Jiawei Chen, Tianzhuo Yang, Guoxi Zhang, Jiaming Ji, Yaodong Yang, Juntao Dai

2603.04819 2026-03-06 cs.RO cs.AI cs.LG

On the Strengths and Weaknesses of Data for Open-set Embodied Assistance

Pradyumna Tambwekar, Andrew Silva, Deepak Gopinath, Jonathan DeCastro, Xiongyi Cui, Guy Rosman

2603.04817 2026-03-06 cs.CV

Revisiting Shape from Polarization in the Era of Vision Foundation Models

Chenhao Li, Taishi Ono, Takeshi Uemori, Yusuke Moriuchi

2603.04815 2026-03-06 cs.AI

EchoGuard: An Agentic Framework with Knowledge-Graph Memory for Detecting Manipulative Communication in Longitudinal Dialogue

Ratna Kandala, Niva Manchanda, Akshata Kishore Moharir, Ananth Kandala

2603.04814 2026-03-06 cs.CL

Beyond the Context Window: A Cost-Performance Analysis of Fact-Based Memory vs. Long-Context LLMs for Persistent Agents

Natchanon Pollertlam, Witchayut Kornsuwannawit

Comments 15 pages, 1 figure

2603.04811 2026-03-06 cs.CV cs.AI

Meta-D: Metadata-Aware Architectures for Brain Tumor Analysis and Missing-Modality Segmentation

SangHyuk Kim, Daniel Haehn, Sumientra Rampersad

Comments 9 pages, 2 figures, 3 tables

2603.04809 2026-03-06 cs.SD cs.LG

WhisperAlign: Word-Boundary-Aware ASR and WhisperX-Anchored Pyannote Diarization for Long-Form Bengali Speech

Aurchi Chowdhury, Rubaiyat -E-Zaman, Sk. Ashrafuzzaman Nafees

2603.04805 2026-03-06 cs.CL cs.AI

Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation

Edward Zhang

2603.04800 2026-03-06 cs.CV

MASQuant: Modality-Aware Smoothing Quantization for Multimodal Large Language Models

Lulu Hu, Wenhu Xiao, Xin Chen, Xinhua Xu, Bowen Xu, Kun Li, Yongliang Tao

Comments Accepted to CVPR 2026

2603.04796 2026-03-06 cs.CV cs.AI

Comparative Evaluation of Traditional Methods and Deep Learning for Brain Glioma Imaging. Review Paper

Kiranmayee Janardhan, Vinay Martin DSa Prabhu, T. Christy Bobby

Comments 22 pages, 4 Figures

2603.04795 2026-03-06 cs.CV cs.AI

LAW & ORDER: Adaptive Spatial Weighting for Medical Diffusion and Segmentation

Anugunj Naman, Ayushman Singh, Gaibo Zhang, Yaguang Zhang

2603.04793 2026-03-06 cs.CV

RMK RetinaNet: Rotated Multi-Kernel RetinaNet for Robust Oriented Object Detection in Remote Sensing Imagery

Huiran Sun

2603.04790 2026-03-06 cs.LG cs.RO

Diffusion Policy through Conditional Proximal Policy Optimization

Ben Liu, Shunpeng Yang, Hua Chen

2603.04787 2026-03-06 cs.RO cs.SY eess.SY

Data-Driven Control of a Magnetically Actuated Fish-Like Robot

Akiyuki Koyama, Hiroaki Kawashima

Comments Author's version of the paper presented at AROB-ISBC 2026

2603.04780 2026-03-06 cs.LG stat.ML

Distributional Equivalence in Linear Non-Gaussian Latent-Variable Cyclic Causal Models: Characterization and Learning

Haoyue Dai, Immanuel Albrecht, Peter Spirtes, Kun Zhang

Comments Appears at ICLR 2026 (oral)

2603.04775 2026-03-06 cs.CV cs.CL

Privacy-Aware Camera 2.0 Technical Report

Huan Song, Shuyu Tian, Ting Long, Jiang Liu, Cheng Yuan, Zhenyu Jia, Jiawei Shao, Xuelong Li

2603.04772 2026-03-06 cs.CL cs.AI

TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings

Yebo Wu, Feng Liu, Ziwei Xie, Zhiyuan Liu, Changwang Zhang, Jun Wang, Li Li

2603.04771 2026-03-06 cs.CV cs.AI

MADCrowner: Margin Aware Dental Crown Design with Template Deformation and Refinement

Linda Wei, Chang Liu, Wenran Zhang, Yuxuan Hu, Ruiyang Li, Feng Qi, Changyao Tian, Ke Wang, Yuanyuan Wang, Shaoting Zhang, Dimitris Metaxas, Hongsheng Li

2603.04770 2026-03-06 cs.CV cs.AI

DSA-SRGS: Super-Resolution Gaussian Splatting for Dynamic Sparse-View DSA Reconstruction

Shiyu Zhang, Zhicong Wu, Huangxuan Zhao, Zhentao Liu, Lei Chen, Yong Luo, Lefei Zhang, Zhiming Cui, Ziwen Ke, Bo Du

Comments 11 pages, 3 figures, 3 tables

2603.04768 2026-03-06 cs.LG

Distributional Reinforcement Learning with Information Bottleneck for Uncertainty-Aware DRAM Equalization

Muhammad Usama, Dong Eui Chang

2603.04767 2026-03-06 cs.LG

ConTSG-Bench: A Unified Benchmark for Conditional Time Series Generation

Shaocheng Lan, Shuqi Gu, Zhangzhi Xiong, Kan Ren

Comments We have open-sourced ConTSG-Bench at https://github.com/seqml/ConTSG-Bench

2603.04766 2026-03-06 cs.CV cs.CY

Evaluating and Correcting Human Annotation Bias in Dynamic Micro-Expression Recognition

Feng Liu, Bingyu Nan, Xuezhong Qian, Xiaolan Fu

Comments 15 pages, 8 figures, 7 tables

2603.04763 2026-03-06 cs.CV cs.AI cs.LG

Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary

Alexandru Florea, Shansong Wang, Mingzhe Hu, Qiang Li, Zach Eidex, Luke del Balzo, Mojtaba Safari, Xiaofeng Yang