arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2601.22888 2026-05-08 cs.CL cs.AI

DialectLLM: A Dialect-Aware Dialog[ue] Generation Framework Beyond Standard American English

Jio Oh, Paul Vicinanza, Thomas Butler, Steven Euijong Whang, Dezhi Hong, Amani Namboori

详情

英文摘要

More than 80% of the 1.6B English speakers do not use Standard American English (SAE), yet LLMs often fail to correctly identify non-SAE dialects and generate stereotyped responses for their speakers. We introduce DialectLLM, the first large-scale framework for generating high-quality multi-dialectal conversational data encompassing the three pillars of written dialect -- lexical (vocabulary), orthographic (spelling), and morphosyntactic (grammar) features. DialectLLM produces a dialect-parallel dialog dataset spanning nine English dialects. Partnering with native linguists, we design and validate SAE-to-dialect transformation rules, ensuring authenticity. Our approach challenges the prevailing practice of applying a single morphosyntactic feature set to both user utterances and model responses, showing that models should not reproduce up to 90% of the grammatical features of a dialect. Human evaluation confirms data quality, with annotators preferring DialectLLM over prior methods in 98.8% of pairwise comparisons for dialect naturalness. We then construct DialectLLM-Bench, a dialect-parallel benchmark with 50k+ dialogs, resulting in 97k+ QA pairs, and evaluate 17 LLMs on dialect identification and response generation tasks. Even frontier models achieve under 70% accuracy, fail to reach 50% for prominent dialects like Canadian English, and systematically misclassify non-SAE dialects as American or British. Beyond benchmarking, we show that DialectLLM data also serve as a scalable LLM post-training resource, suggesting a practical path toward dialect-aware conversational AI.

URL PDF HTML ☆

赞 0 踩 0

2601.22509 2026-05-08 cs.LG cs.AI

Keep Rehearsing and Refining: Lifelong Learning Vehicle Routing under Continually Drifting Tasks

Jiyuan Pei, Yi Mei, Jialin Liu, Mengjie Zhang, Xin Yao

2601.22382 2026-05-08 cs.LG

Purely Agent-Driven Black-Box Optimization for Biological Design

Natalie Maus, Yimeng Zeng, Haydn Thomas Jones, Yining Huang, Gaurav Ng Goel, Alden Rose, Kyurae Kim, Hyun-Su Lee, Marcelo Der Torossian Torres, Fangping Wan, Cesar de la Fuente-Nunez, Mark Yatskar, Osbert Bastani, Jacob R. Gardner

2601.22040 2026-05-08 cs.CL cs.AI cs.LG

Leviathan: Decoupling Input and Output Representations in Language Models

Reza T. Batley, Sourav Saha

2601.21800 2026-05-08 cs.AI

BioAgent Bench: An AI Agent Evaluation Suite for Bioinformatics

Dionizije Fa, Marko Culjak, Bruno Pandza, Mateo Cupic

Comments Accepted at ICML 2026

2601.21682 2026-05-08 cs.CL cs.AI cs.CR cs.LG

FIT to Forget: Robust Continual Unlearning for Large Language Models

Xiaoyu Xu, Minxin Du, Kun Fang, Yaxin Xiao, Zhicong Huang, Cheng Hong, Qingqing Ye, Haibo Hu

Comments 26 Pages

2601.21623 2026-05-08 cs.LG cs.NA math.NA

LAMP: Look-Ahead Mixed-Precision Inference of Large Language Models

Stanislav Budzinskiy, Marian Gloser, Tolunay Yilmaz, Ying Hong Tham, Yuanyi Lin, Wenyi Fang, Fan Wu, Philipp Petersen

Comments Major revision

2601.21464 2026-05-08 cs.CL cs.AI

Conversation for Non-verifiable Learning: Self-Evolving LLMs through Meta-Evaluation

Yuan Sui, Bryan Hooi

Comments Accepted by ICML'26

2601.21092 2026-05-08 cs.LG

MapPFN: Learning Causal Perturbation Maps in Context

Marvin Sextro, Weronika Kłos, Gabriel Dernbach

2601.20571 2026-05-08 cs.LG cs.AI stat.ML

Fast and Efficient Gossip Algorithms for Robust and Non-smooth Decentralized Learning

Anna van Elst, Igor Colin, Stephan Clémençon

2601.20362 2026-05-08 cs.SD cs.AI

Switchcodec: Adaptive residual-expert sparse quantization for high-fidelity neural audio coding

Xiangbo Wang, Wenbin Jiang, Jin Wang, Yubo You, Sheng Fang, Fei Wen

Comments This manuscript contains critical errors in the experimental parameter settings and partial algorithm derivation in Section 3 and Section 4, which will lead to inaccurate conclusion interpretation. We need to withdraw the paper for comprehensive revision, re-calculation and experimental verification, and will resubmit after full correction

2601.16715 2026-05-08 cs.LG cs.AI

Dynamic Expert-Guided Model Averaging for Causal Discovery

Adrick Tench, Thomas Demeester

2601.14724 2026-05-08 cs.CV cs.AI cs.CL

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Haowei Zhang, Shudong Yang, Jinlan Fu, See-Kiong Ng, Xipeng Qiu

Comments Accepted to ACL 2026 Main

2601.11789 2026-05-08 cs.LG

Suspicious Alignment of SGD: A Fine-Grained Step Size Condition Analysis

Shenyang Deng, Boyao Liao, Zhuoli Ouyang, Tianyu Pang, Minhak Song, Yaoqing Yang

Comments The 37th International Conference on Algorithmic Learning Theory

2601.04378 2026-05-08 cs.LG cs.CV stat.ML

Aligned explanations in neural networks

Corentin Lobet, Francesca Chiaromonte

2601.01746 2026-05-08 cs.CV

Point-SRA: Self-Representation Alignment for 3D Representation Learning

Lintong Wei, Jian Lu, Haozhe Cheng, Jihua Zhu, Kaibing Zhang

Comments This is an AAAI 2026 accepted paper titled "Point-SRA: Self-Representation Alignment for 3D Representation Learning", spanning 13 pages in total. The submission includes 7 figures (fig1 to fig7) that visually support the technical analysis

2512.20822 2026-05-08 cs.CL cs.AI

MediEval: A Unified Medical Benchmark for Patient-Contextual and Knowledge-Grounded Reasoning in LLMs

Zhan Qu, Michael Färber

2512.18857 2026-05-08 cs.AI cs.LG

CORE: Concept-Oriented Reinforcement for Bridging the Definition-Application Gap in Mathematical Reasoning

Zijun Gao, Zhikun Xu, Xiao Ye, Ben Zhou

2512.14397 2026-05-08 cs.LG physics.flu-dyn

SuperWing: a comprehensive transonic wing dataset for data-driven aerodynamic design

Yunjia Yang, Weishao Tang, Mengxin Liu, Nils Thuerey, Yufei Zhang, Haixin Chen

2512.11016 2026-05-08 cs.CV cs.AI

SoccerMaster: A Vision Foundation Model for Soccer Understanding

Haolin Yang, Jiayuan Rao, Haoning Wu, Weidi Xie

Comments Accepted by CVPR 2026 (Oral); Project Page: https://haolinyang-hlyang.github.io/SoccerMaster

2512.06721 2026-05-08 cs.AI cs.CL cs.HC

ProAgent: Harnessing On-Demand Sensory Contexts for Proactive LLM Agent Systems in the Wild

Bufang Yang, Lilin Xu, Liekang Zeng, Yunqi Guo, Siyang Jiang, Wenrui Lu, Kaiwei Liu, Yixuan Li, Xiaofan Jiang, Guoliang Xing, Zhenyu Yan

2511.14148 2026-05-08 cs.RO cs.AI cs.LG

AsyncVLA: Asynchronous Flow Matching for Vision-Language-Action Models

Yuhua Jiang, Shuang Cheng, Yan Ding, Feifei Gao, Biqing Qi

2511.06856 2026-05-08 cs.LG math.DG

Contact Wasserstein Geodesics for Non-Conservative Schrödinger Bridges

Andrea Testa, Søren Hauberg, Tamim Asfour, Leonel Rozo

Comments 44 pages, 21 figures, ICLR 2026

2511.04334 2026-05-08 cs.CV cs.LG

Submanifold Sparse Convolutional Networks for Automated 3D Segmentation of Kidneys and Kidney Tumours in Computed Tomography

Saúl Alonso-Monsalve, Leigh H. Whitehead, Adam Aurisano, Lorena Escudero Sanchez

Comments 15 pages, 6 figures

详情

DOI: 10.1038/s41598-026-51801-7

英文摘要

Accurate delineation of kidney tumours in Computed Tomography (CT) is essential for downstream quantitative analysis and precision oncology, but manual segmentation is a specialised task, time-consuming and difficult to scale. Automated 3D segmentation remains challenging because CT scans are large volumetric images, making high-resolution dense convolutional networks computationally expensive and often dependent on downsampling or patch-based inference. We propose a two-stage 3D segmentation methodology based on voxel sparsification and submanifold sparse convolutional networks (SSCNs). Stage 1 uses a low-resolution sparse network to identify a region of interest (ROI); Stage 2 applies a high-resolution sparse network for refined segmentation within the cropped ROI. This enables native high-resolution 3D processing while reducing memory use and inference time. We evaluate the method on the KiTS23 renal cancer CT dataset using 5-fold cross-validation. Our method achieved Dice similarity coefficients of 95.8% for kidneys + masses, 85.7% for tumours + cysts, and 80.3% for tumours alone, competitive with top KiTS23 approaches. In direct comparisons on the same cross-validation folds, the proposed sparse method achieves tumour + cyst and tumour-only Dice scores comparable to, and slightly higher than, a patch-based nnU-Net baseline, while consistently requiring less VRAM and shorter inference time across the tested hardware. Across the tested GPUs, our sparse model is markedly faster than both nnU-Net and the zero-shot zoom-out/zoom-in foundation model SegVol, which localises kidneys well but underperforms on small heterogeneous lesions. Compared to an equivalent dense implementation of the same architecture, the proposed sparse approach achieves up to a 60% reduction in inference time and up to a 75% reduction in VRAM usage across both CPU and the GPU configurations tested.

URL PDF HTML ☆

赞 0 踩 0

2511.02481 2026-05-08 cs.LG

NOWS: Neural Operator Warm Starts for Accelerating Iterative Solvers

Mohammad Sadegh Eshaghi, Cosmin Anitescu, Navid Valizadeh, Yizheng Wang, Xiaoying Zhuang, Timon Rabczuk

2510.19316 2026-05-08 cs.CL

KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Controls

Kailin Jiang, Hongbo Jiang, Ning Jiang, Zhi Gao, Jinhe Bi, Yuchen Ren, Bin Li, Yuntao Du, Lei Liu, Qing Li

Comments ICML 2026, Project Page: https://kore-lmm.github.io/

2510.16811 2026-05-08 cs.LG

Graph Learning Is Suboptimal in Causal Bandits

Mohammad Shahverdikondori, Jalal Etesami, Negar Kiyavash

Comments 32 pages, accepted at AISTATS 2026

2510.13879 2026-05-08 cs.CL cs.AI

Catch Your Breath: Adaptive Computation for Self-Paced Sequence Production

Alexandre Galashov, Matt Jones, Rosemary Ke, Yuan Cao, Vaishnavh Nagarajan, Michael C. Mozer

2510.12635 2026-05-08 cs.AI

Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks

Yuxiang Zhang, Jiangming Shu, Ye Ma, Xueyuan Lin, Shangxi Wu, Jitao Sang

2510.08539 2026-05-08 cs.LG cs.AI cs.IT math.IT math.OC stat.ML

On the optimization dynamics of RLVR: Gradient gap and step size thresholds

Joe Suk, Yaqi Duan