arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2511.03153 2026-03-06 cs.SE cs.AI

RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

Khouloud Oueslati, Maxime Lamothe, Foutse Khomh

详情

英文摘要

Large Language Models (LLMs) have substantially influenced various software engineering tasks. Indeed, in the case of software refactoring, traditional LLMs have shown the ability to reduce development time and enhance code quality. However, these LLMs often rely on static, detailed instructions for specific tasks. In contrast, LLM-based agents can dynamically adapt to evolving contexts and autonomously make decisions by interacting with software tools and executing workflows. In this paper, we explore the potential of LLM-based agents in supporting refactoring activities. Specifically, we introduce RefAgent, a multi-agent LLM-based framework for end-to-end software refactoring. RefAgent consists of specialized agents responsible for planning, executing, testing, and iteratively refining refactorings using self-reflection and tool-calling capabilities. We evaluate RefAgent on eight open-source Java projects, comparing its effectiveness against a single-agent approach, a search-based refactoring tool, and historical developer refactorings. Our assessment focuses on: (1) the impact of generated refactorings on software quality, (2) the ability to identify refactoring opportunities, and (3) the contribution of each LLM agent through an ablation study. Our results show that RefAgent achieves a median unit test pass rate of 90%, reduces code smells by a median of 52.5%, and improves key quality attributes (e.g., reusability) by a median of 8.6%. Additionally, it closely aligns with developer refactorings and the search-based tool in identifying refactoring opportunities, attaining a median F1-score of 79.15% and 72.7%, respectively. Compared to single-agent approaches, RefAgent improves the median unit test pass rate by 64.7% and the median compilation success rate by 40.1%. These findings highlight the promise of multi-agent architectures in advancing automated software refactoring.

URL PDF HTML ☆

赞 0 踩 0

2511.01870 2026-03-06 q-bio.NC cs.AI cs.LG

CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution

Christian Schiffer, Zeynep Boztoprak, Jan-Oliver Kropp, Julia Thönnißen, Katia Berr, Hannah Spitzer, Katrin Amunts, Timo Dickscheid

Comments 42 pages, 10 figures, 7 tables. Extended version with functional decoding

2510.20333 2026-03-06 cs.CR cs.AI

GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?

Chiyu Chen, Xinhao Song, Yunkai Chai, Yang Yao, Haodong Zhao, Lijun Li, Jie Li, Yan Teng, Gongshen Liu, Yingchun Wang

2508.11847 2026-03-06 stat.ML cs.LG

Dropping Just a Handful of Preferences Can Change Top Large Language Model Rankings

Jenny Y. Huang, Yunyi Shen, Dennis Wei, Tamara Broderick

2508.02338 2026-03-06 cs.SE cs.RO

Vision Language Model-based Testing of Industrial Autonomous Mobile Robots

Jiahui Wu, Chengjie Lu, Aitor Arrieta, Shaukat Ali, Thomas Peyrucain

2507.21569 2026-03-06 quant-ph cs.LG

Structured quantum learning via em algorithm for Boltzmann machines

Takeshi Kimura, Kohtaro Kato, Masahito Hayashi

Comments 14 pages, 3 figures

2506.08762 2026-03-06 q-fin.ST cs.CE cs.CL cs.LG

EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements

Issa Sugiura, Takashi Ishida, Taro Makino, Chieko Tazuke, Takanori Nakagawa, Kosuke Nakago, David Ha

Comments Accepted to ICLR 2026

2505.04007 2026-03-06 stat.ML cs.LG

Variational Formulation of Particle Flow

Yinzhuang Yi, Jorge Cortés, Nikolay Atanasov

2504.04372 2026-03-06 cs.SE cs.AI cs.LG

Assessing the Impact of Code Changes on the Fault Localizability of Large Language Models

Sabaat Haroon, Ahmad Faraz Khan, Ahmad Humayun, Waris Gill, Abdul Haddi Amjad, Ali R. Butt, Mohammad Taha Khan, Muhammad Ali Gulzar

Comments This paper is currently Under Review. It consists of 12 pages, 11 Figures, and 5 Tables

详情

英文摘要

Generative Large Language Models (LLMs) are increasingly used in non-generative software maintenance tasks, such as fault localization (FL). Success in FL depends on a models ability to reason about program semantics beyond surface-level syntactic and lexical features. However, widely used LLM benchmarks primarily evaluate code generation, which differs fundamentally from semantic program reasoning. Meanwhile, traditional FL benchmarks such as Defect4J and BugsInPy are either not scalable or obsolete, as their datasets have become part of LLM training data, leading to biased results. This paper presents the first large-scale empirical investigation into the robustness of LLMs fault localizability. Inspired by mutation testing, we develop an end-to-end evaluation framework that addresses key limitations in existing LLM evaluation, including data contamination, scalability, automation, and extensibility. Using real-world programs with specifications, we inject unseen faults and ask LLMs to localize them, filtering out underspecified programs where localization is ambiguous. For each successfully localized program, we apply semantic-preserving mutations (SPMs) and rerun localization to assess robustness and determine whether LLM reasoning relies on syntactic cues rather than semantics. We evaluate 10 state-of-the-art LLMs on 750,013 fault localization tasks from over 1,300 Java and Python programs. We find that SPMs cause LLMs to fail on previously localized faults in 78% of cases, and that reasoning is stronger when relevant code appears earlier in context. These results indicate that LLM code reasoning is often tied to features irrelevant to semantics. We also identify code patterns that are challenging for LLMs to reason about. Overall, our findings motivate fundamental advances in how LLMs represent, interpret, and prioritize code semantics to reason more deeply about program logic

URL PDF HTML ☆

赞 0 踩 0

2503.16558 2026-03-06 cs.CY cs.AI

Advancing Problem-Based Learning in Biomedical Engineering in the Era of Generative AI

Micky C. Nnamdi, J. Ben Tamo, Benoit Marteau, Wenqi Shi, May D. Wang

详情

DOI: 10.1109/TE.2026.3658007

英文摘要

Problem-Based Learning (PBL) has significantly impacted biomedical engineering (BME) education since its introduction in the early 2000s, effectively enhancing critical thinking and real-world knowledge application among students. With biomedical engineering rapidly converging with artificial intelligence (AI), integrating effective AI education into established curricula has become challenging yet increasingly necessary. Recent advancements, including AI's recognition by the 2024 Nobel Prize, have highlighted the importance of training students comprehensively in biomedical AI. However, effective biomedical AI education faces substantial obstacles, such as diverse student backgrounds, limited personalized mentoring, constrained computational resources, and difficulties in safely scaling hands-on practical experiments due to privacy and ethical concerns associated with biomedical data. To overcome these issues, we conducted a three-year (2021-2023) case study implementing an advanced PBL framework tailored specifically for biomedical AI education, involving 92 undergraduate and 156 graduate students from the joint Biomedical Engineering program of Georgia Institute of Technology and Emory University. Our approach emphasizes collaborative, interdisciplinary problem-solving through authentic biomedical AI challenges. The implementation led to measurable improvements in learning outcomes, evidenced by high research productivity (16 student-authored publications), consistently positive peer evaluations, and successful development of innovative computational methods addressing real biomedical challenges. Additionally, we examined the role of generative AI both as a teaching subject and an educational support tool within the PBL framework. Our study presents a practical and scalable roadmap for biomedical engineering departments aiming to integrate robust AI education into their curricula.

URL PDF HTML ☆

赞 0 踩 0

2503.16481 2026-03-06 cs.HC cs.RO

PeRoI: A Pedestrian-Robot Interaction Dataset for Learning Avoidance, Neutrality, and Attraction Behaviors in Social Navigation

Subham Agrawal, Nico Ostermann-Myrau, Nils Dengler, Maren Bennewitz

2503.02703 2026-03-06 cs.HC cs.AI

Heuristics for AI-driven Graphical Asset Generation Tools in Game Design and Development Pipelines: A User-Centred Approach

Kaisei Fukaya, Damon Daylamani-Zad, Harry Agius

2502.14401 2026-03-06 eess.IV cs.CV

MedFuncta: A Unified Framework for Learning Efficient Medical Neural Fields

Paul Friedrich, Florentin Bieder, Julian McGinnis, Julia Wolleb, Daniel Rueckert, Philippe C. Cattin

Comments Accepted at MIDL 2026 (Oral) Project page: https://pfriedri.github.io/medfuncta-io/ Code: https://github.com/pfriedri/medfuncta/ Dataset: https://doi.org/10.5281/zenodo.14898708

2502.07975 2026-03-06 cs.GT cs.LG

Sink equilibria and the attractors of learning in games

Oliver Biggar, Christos Papadimitriou

2502.07584 2026-03-06 stat.ML cs.LG

Generalization Bounds for Markov Algorithms through Entropy Flow Computations

Benjamin Dupuis, Maxime Haddouche, George Deligiannidis, Umut Simsekli

2501.05310 2026-03-06 eess.AS cs.SD

A Large-Scale Probing Analysis of Speaker-Specific Attributes in Self-Supervised Speech Representations

Aemon Yat Fei Chiu, Kei Ching Fung, Roger Tsz Yeung Li, Jingyu Li, Tan Lee

Comments Under review

2409.09769 2026-03-06 eess.SY cs.FL cs.RO cs.SY

Risk-Aware Autonomous Driving with Linear Temporal Logic Specifications

Shuhao Qi, Zengjie Zhang, Zhiyong Sun, Sofie Haesaert

2404.16911 2026-03-06 physics.chem-ph cs.LG q-bio.BM

HEroBM: a deep equivariant graph neural network for universal backmapping from coarse-grained to all-atom representations

Daniele Angioletti, Stefano Raniolo, Vittorio Limongelli

详情

DOI: 10.1063/5.0280330

英文摘要

Molecular simulations have assumed a paramount role in the fields of chemistry, biology, and material sciences, being able to capture the intricate dynamic properties of systems. Within this realm, coarse-grained (CG) techniques have emerged as invaluable tools to sample large-scale systems and reach extended timescales by simplifying system representation. However, CG approaches come with a trade-off: they sacrifice atomistic details that might hold significant relevance in deciphering the investigated process. Therefore, a recommended approach is to identify key CG conformations and process them using backmapping methods, which retrieve atomistic coordinates. Currently, rule-based methods yield subpar geometries and rely on energy relaxation, resulting in less-than-optimal outcomes. Conversely, machine learning techniques offer higher accuracy but are either limited in transferability between systems or tied to specific CG mappings. In this work, we introduce HEroBM, a dynamic and scalable method that employs deep equivariant graph neural networks and a hierarchical approach to achieve high-resolution backmapping. HEroBM handles any type of CG mapping, offering a versatile and efficient protocol for reconstructing atomistic structures with high accuracy. Focused on local principles, HEroBM spans the entire chemical space and is transferable to systems of varying sizes. We illustrate the versatility of our framework through diverse biological systems, including a complex real-case scenario. Here, our end-to-end backmapping approach accurately generates the atomistic coordinates of a G protein-coupled receptor bound to an organic small molecule within a cholesterol/phospholipid bilayer.

URL PDF HTML ☆

赞 0 踩 0

2402.03352 2026-03-06 math.OC cs.LG stat.ML

Zeroth-Order primal-dual Alternating Projection Gradient Algorithms for Nonconvex Minimax Problems with Coupled linear Constraints

Huiling Zhang, Zi Xu, Yuhong Dai

Comments arXiv admin note: text overlap with arXiv:2212.04672

2401.05683 2026-03-06 cs.GT cs.AI

Deep Learning Meets Mechanism Design: Key Results and Some Novel Applications

V. Udaya Sankar, Vishisht Srihari Rao, Mayank Ratan Bhardwaj, Y. Narahari

2209.11691 2026-03-06 econ.EM cs.LG stat.ME

Linear Multidimensional Regression with Interactive Fixed-Effects

Hugo Freeman

2112.13243 2026-03-06 cs.NE cs.AI cs.CV

Motion Illusions Generated Using Predictive Neural Networks Also Fool Humans

Lana Sinapayen, Eiji Watanabe

2603.04425 2026-03-06 cs.NI cs.LG

Data-Driven Optimization of Multi-Generational Cellular Networks: A Performance Classification Framework for Strategic Infrastructure Management

Maryam Sabahat, M. Umar Khan

2603.04424 2026-03-06 cs.NI cs.LG

When Scaling Fails: Network and Fabric Effects on Distributed GPU Training Performance

Dinesh Gopalan, Ratul Ali

Comments 10 pages, 5 figures, 1 table

2603.04404 2026-03-06 cs.IR cs.CL cs.CY

Signal in the Noise: Decoding the Reality of Airline Service Quality with Large Language Models

Ahmed Dawoud, Osama El-Shamy, Ahmed Habashy

2603.04403 2026-03-06 cs.IR cs.AI cs.CL

FinRetrieval: A Benchmark for Financial Data Retrieval by AI Agents

Eric Y. Kim, Jie Huang

Comments 26 pages, 2 figures, 16 tables

2603.04402 2026-03-06 cs.IR cs.CL

SearchGym: A Modular Infrastructure for Cross-Platform Benchmarking and Hybrid Search Orchestration

Jerome Tze-Hou Hsu

Comments 5 pages, 5 figures

2603.05505 2026-03-06 cond-mat.quant-gas cond-mat.other physics.atom-ph physics.flu-dyn quant-ph

Core-bound waves on a Gross-Pitaevskii vortex

Evan Papoutsis, Nathan Apfel, Nir Navon

Comments Main text: 5 pages, 5 figures. Supplemental Material: 5 pages, 5 figures

2603.05502 2026-03-06 quant-ph cond-mat.str-el

Universal quantum computation with group surface codes

Naren Manjunath, Vieri Mattei, Apoorv Tiwari, Tyler D. Ellison

2603.05501 2026-03-06 math.LO

Capturing dual team properties with inclusion atoms

Matilda Häggblom