arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2412.09333 2026-02-06 cs.CV cond-mat.mtrl-sci eess.IV

MaskTerial: A Foundation Model for Automated 2D Material Flake Detection

Jan-Lucas Uslu, Alexey Nekrasov, Alexander Hermans, Bernd Beschoten, Bastian Leibe, Lutz Waldecker, Christoph Stampfer

Comments 9 pages, 5 figures

Journal ref Digital Discovery 4, 3744 (2025)

2412.08419 2026-02-06 cs.LG

Energy Guided smoothness to improve Robustness in Graph Classification

Farooq Ahmad Wani, Maria Sofia Bucarelli, Andrea Giuseppe Di Francesco, Oleksandr Pryymak, Fabrizio Silvestri

2411.12788 2026-02-06 cs.CV

Efficient Scene Modeling via Structure-Aware and Region-Prioritized 3D Gaussians

Guangchi Fang, Bing Wang

2410.23059 2026-02-06 cs.RO

FilMBot: A High-Speed Soft Parallel Robotic Micromanipulator

Jiangkun Yu, Houari Bettahar, Hakan Kandemir, Quan Zhou

Comments 13 pages, 16 figures

Journal ref IEEE Transactions on Robotics, 2026

2410.13431 2026-02-06 cs.LG cs.AI

Solving Prior Distribution Mismatch in Diffusion Models via Optimal Transport

Zhanpeng Wang, Shenghao Li, Jiameng Che, Chen Wang, Shangling Jui, Na Lei, Zhongxuan Luo

2410.04560 2026-02-06 cs.LG stat.ML

GAMformer: Bridging Tabular Foundation Models and Interpretable Machine Learning

Andreas Mueller, Julien Siems, Harsha Nori, David Salinas, Arber Zela, Rich Caruana, Frank Hutter

Comments 22 pages, 15 figures

2410.02103 2026-02-06 cs.CV

MVGS: Multi-view Regulated Gaussian Splatting for Novel View Synthesis

Xiaobiao Du, Yida Wang, Xin Yu

Comments Project Page:https://xiaobiaodu.github.io/mvgs-project/

2410.01031 2026-02-06 cs.CV

Pediatric Wrist Fracture Detection Using Feature Context Excitation Modules in X-ray Images

Rui-Yang Ju, Chun-Tse Chien, Enkaer Xieerke, Jen-Shiun Chiang

Comments arXiv admin note: text overlap with arXiv:2407.03163

Journal ref IET Image Process. 20 (2026) e70269

2409.12636 2026-02-06 cs.CV cs.LG eess.IV

Image inpainting for corrupted images by using the semi-super resolution GAN

Mehrshad Momen-Tayefeh, Mehrdad Momen-Tayefeh, Amir Ali Ghafourian Ghahramani

2409.09098 2026-02-06 cs.SD cs.CL eess.AS

AccentBox: Towards High-Fidelity Zero-Shot Accent Generation

Jinzuomu Zhong, Korin Richmond, Zhiba Su, Siqi Sun

Comments Accepted by ICASSP 2025

2409.07253 2026-02-06 cs.LG cs.CV

Alignment of Diffusion Models: Fundamentals, Challenges, and Future

Buhua Liu, Shitong Shao, Bao Li, Lichen Bai, Zhiqiang Xu, Haoyi Xiong, James Kwok, Sumi Helal, Zeke Xie

Comments Accepted at ACM Computing Surveys. 35 pages, 5 figures, 4 tables. Paper List: github.com/xie-lab-ml/awesome-alignment-of-diffusion-models

2407.18442 2026-02-06 cs.CL

Guidance-Based Prompt Data Augmentation in Specialized Domains for Named Entity Recognition

Hyeonseok Kang, Hyein Seo, Jeesu Jung, Sangkeun Jung, Du-Seong Chang, Riwoo Chung

Journal ref Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), 2024

2406.17835 2026-02-06 cs.LG cs.AI

The Use of AI-Robotic Systems for Scientific Discovery

Alexander H. Gower, Konstantin Korovin, Daniel Brunnsåker, Filip Kronström, Gabriel K. Reder, Ievgeniia A. Tiukova, Ronald S. Reiserer, John P. Wikswo, Ross D. King

Comments 23 pages, book chapter

2406.09643 2026-02-06 cs.LG

A Policy Gradient-Based Sequence-to-Sequence Method for Time Series Prediction

Qi Sima, Xinze Zhang, Yukun Bao, Siyue Yang, Liang Shen

2406.04897 2026-02-06 cs.LG

From Link Prediction to Forecasting: Addressing Challenges in Batch-based Temporal Graph Learning

Moritz Lampert, Christopher Blöcker, Ingo Scholtes

Comments 46 pages (12 pages main text), 19 figures. Published in Transactions on Machine Learning Research (2026)

Journal ref Lampert, M., Blöcker, C. & Scholtes, I., (2026). From Link Prediction to Forecasting: Addressing Challenges in Batch-based Temporal Graph Learning. Transactions on Machine Learning Research, February 2026

2405.14982 2026-02-06 cs.LG cs.AI cs.CL stat.ML

In-context Time Series Predictor

Jiecheng Lu, Yan Sun, Shihao Yang

Comments Camera-ready version. Accepted at ICLR 2025

Journal ref Proceedings of the Thirteenth International Conference on Learning Representations (ICLR 2025)

2404.03208 2026-02-06 cs.LG

HiMAL: A Multimodal Hierarchical Multi-task Auxiliary Learning framework for predicting and explaining Alzheimer disease progression

Sayantan Kumar, Sean Yu, Andrew Michelson, Thomas Kannampallil, Philip Payne

Comments Currently under review in Journal of Medical Informatics Association (JAMIA). 6 figures, 3 tables

Journal ref JAMIA Open, Volume 7, Issue 3, October 2024, ooae087

详情

DOI: 10.1093/jamiaopen/ooae087

英文摘要

Objective: We aimed to develop and validate a novel multimodal framework HiMAL (Hierarchical, Multi-task Auxiliary Learning) framework, for predicting cognitive composite functions as auxiliary tasks that estimate the longitudinal risk of transition from Mild Cognitive Impairment (MCI) to Alzheimer Disease (AD). Methods: HiMAL utilized multimodal longitudinal visit data including imaging features, cognitive assessment scores, and clinical variables from MCI patients in the Alzheimer Disease Neuroimaging Initiative (ADNI) dataset, to predict at each visit if an MCI patient will progress to AD within the next 6 months. Performance of HiMAL was compared with state-of-the-art single-task and multi-task baselines using area under the receiver operator curve (AUROC) and precision recall curve (AUPRC) metrics. An ablation study was performed to assess the impact of each input modality on model performance. Additionally, longitudinal explanations regarding risk of disease progression were provided to interpret the predicted cognitive decline. Results: Out of 634 MCI patients (mean [IQR] age : 72.8 [67-78], 60% men), 209 (32%) progressed to AD. HiMAL showed better prediction performance compared to all single-modality singe-task baselines (AUROC = 0.923 [0.915-0.937]; AUPRC= 0.623 [0.605-0.644]; all p<0.05). Ablation analysis highlighted that imaging and cognition scores with maximum contribution towards prediction of disease progression. Discussion: Clinically informative model explanations anticipate cognitive decline 6 months in advance, aiding clinicians in future disease progression assessment. HiMAL relies on routinely collected EHR variables for proximal (6 months) prediction of AD onset, indicating its translational potential for point-of-care monitoring and managing of high-risk patients.

URL PDF HTML ☆

赞 0 踩 0

2403.19254 2026-02-06 cs.CV

Imperceptible Protection against Style Imitation from Diffusion Models

Namhyuk Ahn, Wonhyuk Ahn, KiYoon Yoo, Daesik Kim, Seung-Hun Nam

Comments IEEE Transactions on Multimedia

2402.09329 2026-02-06 cs.CV

YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture Detection

Chun-Tse Chien, Rui-Yang Ju, Kuang-Yi Chou, Enkaer Xieerke, Jen-Shiun Chiang

Journal ref IEEE Access 13 (2025) 52461-52477

详情

DOI: 10.1109/ACCESS.2025.3549839

英文摘要

Wrist trauma and even fractures occur frequently in daily life, particularly among children who account for a significant proportion of fracture cases. Before performing surgery, surgeons often request patients to undergo X-ray imaging first and prepare for it based on the analysis of the radiologist. With the development of neural networks, You Only Look Once (YOLO) series models have been widely used in fracture detection as computer-assisted diagnosis (CAD). In 2023, Ultralytics presented the latest version of the YOLO models, which has been employed for detecting fractures across various parts of the body. Attention mechanism is one of the hottest methods to improve the model performance. This research work proposes YOLOv8-AM, which incorporates the attention mechanism into the original YOLOv8 architecture. Specifically, we respectively employ four attention modules, Convolutional Block Attention Module (CBAM), Global Attention Mechanism (GAM), Efficient Channel Attention (ECA), and Shuffle Attention (SA), to design the improved models and train them on GRAZPEDWRI-DX dataset. Experimental results demonstrate that the mean Average Precision at IoU 50 (mAP 50) of the YOLOv8-AM model based on ResBlock + CBAM (ResCBAM) increased from 63.6% to 65.8%, which achieves the state-of-the-art (SOTA) performance. Conversely, YOLOv8-AM model incorporating GAM obtains the mAP 50 value of 64.2%, which is not a satisfactory enhancement. Therefore, we combine ResBlock and GAM, introducing ResGAM to design another new YOLOv8-AM model, whose mAP 50 value is increased to 65.0%. The implementation code for this study is available on GitHub at https://github.com/RuiyangJu/Fracture_Detection_Improved_YOLOv8.

URL PDF HTML ☆

赞 0 踩 0

2312.00992 2026-02-06 cs.LG

Improving Normative Modeling for Multi-modal Neuroimaging Data using mixture-of-product-of-experts variational autoencoders

Sayantan Kumar, Philip Payne, Aristeidis Sotiras

Comments IEEE Internattional Symposium in Biomedical Imaging 2024

Journal ref 2024 IEEE International Symposium on Biomedical Imaging (ISBI)

2307.01930 2026-02-06 cs.LG cs.AI cs.CV stat.AP stat.ML

Learning ECG Signal Features Without Backpropagation Using Linear Laws

Péter Pósfay, Marcell T. Kurbucz, Péter Kovács, Antal Jakovác

Comments 35 pages, 3 figures, 3 tables

Journal ref Machine Learning: Science and Technology 6, 035001 (2025)

2305.15793 2026-02-06 cs.LG cs.AI cs.CE stat.CO

Feature space reduction method for ultrahigh-dimensional, multiclass data: Random forest-based multiround screening (RFMS)

Gergely Hanczár, Marcell Stippinger, Dávid Hanák, Marcell T. Kurbucz, Olivér M. Törteli, Ágnes Chripkó, Zoltán Somogyvári

Comments 9 pages, 2 figures, 2 tables

Journal ref Machine Learning: Science and Technology 4, 045012 (2023)

2304.14211 2026-02-06 cs.LG cs.AI cs.CV cs.MS stat.ML

LLT: An R package for Linear Law-based Feature Space Transformation

Marcell T. Kurbucz, Péter Pósfay, Antal Jakovác

Comments 15 pages, 5 figures, 1 table

Journal ref SoftwareX 25, 101623 (2024)

2304.05071 2026-02-06 cs.CV

Fracture Detection in Pediatric Wrist Trauma X-ray Images Using YOLOv8 Algorithm

Rui-Yang Ju, Weiming Cai

Comments Scientific Reports

Journal ref Sci Rep 13 (2023) 20077

2303.09190 2026-02-06 cs.CV

Resolution Enhancement Processing on Low Quality Images Using Swin Transformer Based on Interval Dense Connection Strategy

Rui-Yang Ju, Chih-Chia Chen, Jen-Shiun Chiang, Yu-Shian Lin, Wei-Han Chen, Chun-Tse Chien

Journal ref Multimed Tools Appl 83 (2024) 14839-14855

2211.16098 2026-02-06 cs.CV

Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks

Rui-Yang Ju, Yu-Shian Lin, Yanlin Jin, Chih-Chia Chen, Chun-Tse Chien, Jen-Shiun Chiang

Comments Accepted by Knowledge-Based Systems

Journal ref Knowl.-Based Syst. 304 (2024) 112542

2204.00943 2026-02-06 cs.CV

Efficient Convolutional Neural Networks on Raspberry Pi for Image Classification

Rui-Yang Ju, Ting-Yu Lin, Jia-Hao Jian, Jen-Shiun Chiang

Journal ref J Real-Time Image Proc 20 (2023) 21

2202.00009 2026-02-06 cs.LG

Identifying Dementia Subtypes with Electronic Health Records

Sayantan Kumar, Zachary Abrams, Suzanne Schindler, Nupur Ghoshal, Philip Payne

Comments ACM Conference on Bioinformatics, Computational Biology, and Health Informatics 13 pages, 7 figures, 3 tables

Journal ref PLoS ONE 19(11): e0313425, 2024

2201.03013 2026-02-06 cs.CV

ThreshNet: An Efficient DenseNet Using Threshold Mechanism to Reduce Connections

Rui-Yang Ju, Ting-Yu Lin, Jia-Hao Jian, Jen-Shiun Chiang, Wei-Bin Yang

Comments IEEE Access

Journal ref IEEE Access 10 (2022) 82834-82843

2110.04598 2026-02-06 cs.LG

Self-explaining Neural Network with Concept-based Explanations for ICU Mortality Prediction

Sayantan Kumar, Sean C. Yu, Thomas Kannampallil, Zachary Abrams, Andrew Michelson, Philip R. O. Payne

Comments Workshop on Interpretable ML in Healthcare at International Conference on Machine Learning (ICML 2022)

Journal ref BCB '22: Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics Article No.: 8, Pages 1 - 9, 2022

AI 大模型

视觉与机器人

科学与医疗