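arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and related fields.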

2602.21608 2026-02-26 cs.CL

MixSarc: A Bangla-English Code-Mixed Corpus for Implicit Meaning Identification

Kazi Samin Yasar Alam, Md Tanbir Chowdhury, Tamim Ahmed, Ajwad Abrar, Md Rafid Haque

Comments Under Review

Abstract

Bangla-English code-mixing is widespread across South Asian social media, yet resources for implicit meaning identification in this setting remain scarce. Existing sentiment and sarcasm models largely focus on monolingual English or high-resource languages and struggle with transliteration variation, cultural references, and intra-sentential language switching. To address this gap, we introduce MixSarc, the first publicly available Bangla-English code-mixed corpus for implicit meaning identification. The dataset contains 9,087 manually annotated sentences labeled for humor, sarcasm, offensiveness, and vulgarity. We construct the corpus through targeted social media collection, systematic filtering, and multi-annotator validation. We benchmark transformer-based models and evaluate zero-shot large language models under structured prompting. Results show strong performance on humor detection but substantial degradation on sarcasm, offense, and vulgarity due to class imbalance and pragmatic complexity. Zero-shot models achieve competitive micro-F1 scores but low exact match accuracy. Further analysis reveals that over 42% of negative sentiment instances in an external dataset exhibit sarcastic characteristics. MixSarc provides a foundational resource for culturally aware NLP and supports more reliable multi-label modeling in code-mixed environments.
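
The abstract reports competitive micro-F1 alongside low exact match accuracy. The two metrics can diverge in multi-label settings, and the following minimal sketch illustrates how. The label names follow the abstract, but every prediction below is invented for illustration and is not drawn from MixSarc; the helper names `micro_f1` and `exact_match` are hypothetical.

```python
# Toy multi-label example: columns are humor, sarcasm, offense, vulgarity.
# All values are made up for illustration; none come from the MixSarc paper.
y_true = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 1, 1, 0],
    [1, 0, 0, 1],
]
y_pred = [
    [1, 0, 0, 0],  # misses sarcasm
    [1, 0, 0, 0],  # fully correct
    [0, 1, 0, 0],  # misses offense
    [1, 0, 1, 1],  # spurious offense
]


def micro_f1(y_true, y_pred):
    """Pool true/false positives and false negatives over all labels."""
    tp = fp = fn = 0
    for t_row, p_row in zip(y_true, y_pred):
        for t, p in zip(t_row, p_row):
            tp += int(t == 1 and p == 1)
            fp += int(t == 0 and p == 1)
            fn += int(t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def exact_match(y_true, y_pred):
    """Fraction of sentences whose entire label vector is predicted correctly."""
    hits = sum(t == p for t, p in zip(y_true, y_pred))
    return hits / len(y_true)


print(f"micro-F1:    {micro_f1(y_true, y_pred):.3f}")    # 0.769
print(f"exact match: {exact_match(y_true, y_pred):.3f}")  # 0.250
```

A single wrong label leaves micro-F1 almost unchanged but invalidates the whole sentence under exact match, which is one plausible reading of the gap the abstract describes.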

2602.21601 2026-02-26 cs.LG cs.CE

Deep Clustering based Boundary-Decoder Net for Inter and Intra Layer Stress Prediction of Heterogeneous Integrated IC Chip

Kart Leong Lim, Ji Lin

2602.21597 2026-02-26 cs.LG

NGDB-Zoo: Towards Efficient and Scalable Neural Graph Databases Training

Zhongwei Xie, Jiaxin Bai, Shujie Liu, Haoyu Huang, Yufei Li, Yisen Gao, Hong Ting Tsang, Yangqiu Song

2602.21596 2026-02-26 cs.CV

A Hidden Semantic Bottleneck in Conditional Embeddings of Diffusion Transformers

Trung X. Pham, Kang Zhang, Ji Woo Hong, Chang D. Yoo

Comments Accepted to ICLR 2026

2602.21595 2026-02-26 cs.RO

SPOC: Safety-Aware Planning Under Partial Observability And Physical Constraints

Hyungmin Kim, Hobeom Jeon, Dohyung Kim, Minsu Jang, Jeahong Kim

Comments Accepted to IEEE ICASSP 2026

2602.21593 2026-02-26 cs.LG cs.CR cs.CV

Breaking Semantic-Aware Watermarks via LLM-Guided Coherence-Preserving Semantic Injection

Zheng Gao, Xiaoyu Li, Zhicheng Bao, Xiaoyan Feng, Jiaojiao Jiang

Comments Accepted by The Web Conference 2026 (Short Paper Track)

2602.21589 2026-02-26 cs.CV

SEF-MAP: Subspace-Decomposed Expert Fusion for Robust Multimodal HD Map Prediction

Haoxiang Fu, Lingfeng Zhang, Hao Li, Ruibing Hu, Zhengrong Li, Guanjing Liu, Zimu Tan, Long Chen, Hangjun Ye, Xiaoshuai Hao

2602.21588 2026-02-26 cs.LG cs.CE

ABM-UDE: Developing Surrogates for Epidemic Agent-Based Models via Scientific Machine Learning

Sharv Murgai, Utkarsh Utkarsh, Kyle C. Nguyen, Alan Edelman, Erin C. S. Acquesta, Christopher Vincent Rackauckas

Comments 25 pages, 4 figures

Abstract

Agent-based epidemic models (ABMs) encode behavioral and policy heterogeneity but are too slow for nightly hospital planning. We develop county-ready surrogates that learn directly from exascale ABM trajectories using Universal Differential Equations (UDEs): mechanistic SEIR-family ODEs with a neural-parameterized contact rate $\kappa_\phi(u,t)$ (no additive residual). Our contributions are threefold: we adapt multiple shooting and an observer-based prediction-error method (PEM) to stabilize identification of neural-augmented epidemiological dynamics across intervention-driven regime shifts; we enforce positivity and mass conservation and show the learned contact-rate parameterization yields a well-posed vector field; and we quantify accuracy, calibration, and compute against ABM ensembles and UDE baselines. On a representative ExaEpi scenario, PEM-UDE reduces mean MSE by 77% relative to single-shooting UDE (3.00 vs. 13.14) and by 20% relative to MS-UDE (3.75). Reliability improves in parallel: empirical coverage of ABM $10$-$90$% and $25$-$75$% bands rises from 0.68/0.43 (UDE) and 0.79/0.55 (MS-UDE) to 0.86/0.61 with PEM-UDE and 0.94/0.69 with MS+PEM-UDE, indicating calibrated uncertainty rather than overconfident fits. Inference runs in seconds on commodity CPUs (20-35 s per $\sim$90-day forecast), enabling nightly "what-if" sweeps on a laptop. Relative to a $\sim$100 CPU-hour ABM reference run, this yields $\sim10^{4}\times$ lower wall-clock per scenario. This closes the realism-cadence gap, supports threshold-aware decision-making (e.g., maintaining ICU occupancy $<75$%), preserves mechanistic interpretability, and enables calibrated, risk-aware scenario planning on standard institutional hardware. Beyond epidemics, the ABM$\to$UDE recipe provides a portable path to distill agent-based simulators into fast, trustworthy surrogates for other scientific domains.
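
To make the idea of a mechanistic SEIR model with a state- and time-dependent contact rate concrete, here is a minimal sketch. It is not the authors' implementation. It assumes a plain four-compartment SEIR model (the abstract says only "SEIR-family"), replaces the neural network with a hypothetical softplus-of-linear function `kappa`, uses invented rate constants, and integrates with forward Euler rather than a production ODE solver. It shows only why placing the learned term inside the infection flux (no additive residual) keeps the total population constant.

```python
import math

SIGMA = 1.0 / 5.0  # assumed incubation rate (1/days), illustrative only
GAMMA = 1.0 / 7.0  # assumed recovery rate (1/days), illustrative only


def softplus(x):
    """Numerically stable softplus; the output is never negative."""
    return math.log1p(math.exp(-abs(x))) + max(x, 0.0)


def kappa(u, t, w=(0.5, -2.0, -0.01)):
    """Hypothetical stand-in for the neural contact rate kappa_phi(u, t).

    A real UDE would use a trained network here; the weights w are made up.
    """
    s, e, i, r = u
    n = s + e + i + r
    return softplus(w[0] + w[1] * (i / n) + w[2] * t)


def seir_rhs(u, t):
    """SEIR vector field; every flow leaves one compartment and enters another."""
    s, e, i, r = u
    n = s + e + i + r
    infection = kappa(u, t) * s * i / n
    onset = SIGMA * e
    recovery = GAMMA * i
    return (-infection, infection - onset, onset - recovery, recovery)


def simulate(u0, days=90.0, dt=0.1):
    """Forward-Euler integration; returns the list of states, one per step."""
    steps = int(round(days / dt))
    traj = [tuple(u0)]
    u = tuple(u0)
    for k in range(steps):
        du = seir_rhs(u, k * dt)
        u = tuple(x + dt * dx for x, dx in zip(u, du))
        traj.append(u)
    return traj


traj = simulate((9990.0, 0.0, 10.0, 0.0))
print("population at start:", sum(traj[0]))
print("population at end:  ", round(sum(traj[-1]), 6))
print("minimum compartment value:", min(min(u) for u in traj))
```

Because the four derivatives sum to zero, the total population is conserved up to floating-point error, and the non-negative contact rate keeps each compartment non-negative for this step size. A stiff or adaptive solver would be the more typical choice in practice.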

2602.21583 2026-02-26 cs.RO

Learning Agile and Robust Omnidirectional Aerial Motion on Overactuated Tiltable-Quadrotors

Wentao Zhang, Zhaoqi Ma, Jinjie Li, Huayi Wang, Haokun Liu, Junichiro Sugihara, Chen Chen, Yicheng Chen, Moju Zhao

2602.21556 2026-02-26 cs.AI cs.GT

Power and Limitations of Aggregation in Compound AI Systems

Nivasini Ananthakrishnan, Meena Jagadeesan

2602.21552 2026-02-26 cs.CV

Generalizing Visual Geometry Priors to Sparse Gaussian Occupancy Prediction

Changqing Zhou, Yueru Luo, Changhao Chen

Comments Accepted by CVPR2026

2602.21551 2026-02-26 cs.LG cs.AI

From Basis to Basis: Gaussian Particle Representation for Interpretable PDE Operators

Zhihao Li, Yu Feng, Zhilu Lai, Wei Wang

2602.21546 2026-02-26 cs.LG

Mamba Meets Scheduling: Learning to Solve Flexible Job Shop Scheduling with Efficient Sequence Modeling

Zhi Cao, Cong Zhang, Yaoxin Wu, Yaqing Hou, Hongwei Ge

2602.21543 2026-02-26 cs.CL cs.AI cs.IR

Enhancing Multilingual Embeddings via Multi-Way Parallel Text Alignment

Barah Fazili, Koustava Goswami

2602.21539 2026-02-26 cs.CV

VasGuideNet: Vascular Topology-Guided Couinaud Liver Segmentation with Structural Contrastive Loss

Chaojie Shen, Jingjun Gu, Zihao Zhao, Ruocheng Li, Cunyuan Yang, Jiajun Bu, Lei Wu

2602.21535 2026-02-26 cs.CV

Pseudo-View Enhancement via Confidence Fusion for Unposed Sparse-View Reconstruction

Beizhen Zhao, Sicheng Yu, Guanzhi Ding, Yu Hu, Hao Wang

Comments 14 pages

2602.21531 2026-02-26 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY

LiLo-VLA: Compositional Long-Horizon Manipulation via Linked Object-Centric Policies

Yue Yang, Shuo Cheng, Yu Fang, Homanga Bharadhwaj, Mingyu Ding, Gedas Bertasius, Daniel Szafir

2602.21517 2026-02-26 cs.CV

Which Tool Response Should I Trust? Tool-Expertise-Aware Chest X-ray Agent with Multimodal Agentic Learning

Zheang Huai, Honglong Yang, Xiaomeng Li

Comments 11 pages

2602.21508 2026-02-26 cs.LG cs.CR cs.CV

WaterVIB: Learning Minimal Sufficient Watermark Representations via Variational Information Bottleneck

Haoyuan He, Yu Zheng, Jie Zhou, Jiwen Lu

Comments 22 pages, 7 figures. Preprint

2602.21503 2026-02-26 cs.CV

AHAN: Asymmetric Hierarchical Attention Network for Identical Twin Face Verification

Hoang-Nhat Nguyen

Comments Accepted to AAAI 2026

2602.21498 2026-02-26 cs.LG

Learning Recursive Multi-Scale Representations for Irregular Multivariate Time Series Forecasting

Boyuan Li, Zhen Liu, Yicheng Luo, Qianli Ma

Comments Accepted in ICLR 2026

2602.21496 2026-02-26 cs.AI

Beyond Refusal: Probing the Limits of Agentic Self-Correction for Semantic Sensitive Information

Umid Suleymanov, Zaur Rajabov, Emil Mirzazada, Murat Kantarcioglu

Comments Under Review

2602.21492 2026-02-26 cs.LG cs.AI cs.CL

GradAlign: Gradient-Aligned Data Selection for LLM Reinforcement Learning

Ningyuan Yang, Weihua Du, Weiwei Sun, Sean Welleck, Yiming Yang

Comments 14 pages. Preliminary work

2602.21485 2026-02-26 cs.CL cs.HC

Evaluating the Usage of African-American Vernacular English in Large Language Models

Deja Dunlap, R. Thomas McCoy

2602.21472 2026-02-26 cs.LG

The Design Space of Tri-Modal Masked Diffusion Models

Louis Bethune, Victor Turrisi, Bruno Kacper Mlodozeniec, Pau Rodriguez Lopez, Lokesh Boominathan, Nikhil Bhendawade, Amitis Shidani, Joris Pelemans, Theo X. Olausson, Devon Hjelm, Paul Dixon, Joao Monteiro, Pierre Ablin, Vishnu Banna, Arno Blaas, Nick Henderson, Kari Noriy, Dan Busbridge, Josh Susskind, Marco Cuturi, Irina Belousova, Luca Zappella, Russ Webb, Jason Ramapuram

Comments 41 pages, 29 figures, 10 tables

2602.21469 2026-02-26 cs.LG

D-Flow SGLD: Source-Space Posterior Sampling for Scientific Inverse Problems with Flow Matching

Meet Hemant Parikh, Yaqin Chen, Jian-Xun Wang

2602.21467 2026-02-26 cs.LG

Geometric Priors for Generalizable World Models via Vector Symbolic Architecture

William Youngwoo Chung, Calvin Yeung, Hansen Jin Lillemark, Zhuowen Zou, Xiangjian Liu, Mohsen Imani

Comments 9 pages, accepted to NeurIPS 2025 Workshop Symmetry and Geometry in Neural Representations

2602.21466 2026-02-26 cs.LG physics.comp-ph

Asymptotically Fast Clebsch-Gordan Tensor Products with Vector Spherical Harmonics

YuQing Xie, Ameya Daigavane, Mit Kotak, Tess Smidt

Comments 28 pages, 2 figures. arXiv admin note: text overlap with arXiv:2506.13523

2602.21462 2026-02-26 cs.LG q-bio.GN stat.ML

Effects of Training Data Quality on Classifier Performance

Alan F. Karr, Regina Ruane

2602.21461 2026-02-26 cs.CL

VecGlypher: Unified Vector Glyph Generation with Language Models

Xiaoke Huang, Bhavul Gauri, Kam Woh Ng, Tony Ng, Mengmeng Xu, Zhiheng Liu, Weiming Ren, Zhaochong An, Zijian Zhou, Haonan Qiu, Yuyin Zhou, Sen He, Ziheng Wang, Tao Xiang, Xiao Han

Comments Accepted to CVPR'26. Project page: https://xk-huang.github.io/VecGlypher/