arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2602.13361 2026-02-17 cs.CV

The Diffusion Duet: Harmonizing Dual Channels with Wavelet Suppression for Image Separation

Jingwei Li, Wei Pu

2602.13359 2026-02-17 cs.LG

The Speed-up Factor: A Quantitative Multi-Iteration Active Learning Performance Metric

Hannes Kath, Thiago S. Gouvêa, Daniel Sonntag

Journal ref H. Kath, T.S. Gouvêa, D. Sonntag (2026). The Speed-up Factor: A Quantitative Multi-Iteration Active Learning Performance Metric. Transactions on Machine Learning Research

2602.13352 2026-02-17 cs.CV cs.AI cs.CL

Using Deep Learning to Generate Semantically Correct Hindi Captions

Wasim Akram Khan, Anil Kumar Vuppala

Comments 34 pages, 12 figures, 3 tables. Master's thesis, Liverpool John Moores University, November 2022

2602.13350 2026-02-17 cs.CV cs.AI

Detecting Brick Kiln Infrastructure at Scale: Graph, Foundation, and Remote Sensing Models for Satellite Imagery Data

Usman Nazir, Xidong Chen, Hafiz Muhammad Abubakar, Hadia Abu Bakar, Raahim Arbaz, Fezan Rasool, Bin Chen, Sara Khalid

2602.13349 2026-02-17 cs.CV cs.AI

From Prompt to Production:Automating Brand-Safe Marketing Imagery with Text-to-Image Models

Parmida Atighehchian, Henry Wang, Andrei Kapustin, Boris Lerner, Tiancheng Jiang, Taylor Jensen, Negin Sokhandan

Comments 17 pages, 12 figures, Accepted to IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026

2602.13348 2026-02-17 cs.LG cs.AI cs.CL

Exploring the Performance of ML/DL Architectures on the MNIST-1D Dataset

Michael Beebe, GodsGift Uzor, Manasa Chepuri, Divya Sree Vemula, Angel Ayala

2602.13347 2026-02-17 cs.CV cs.AI cs.RO

Visual Foresight for Robotic Stow: A Diffusion-Based World Model from Sparse Snapshots

Lijun Zhang, Nikhil Chacko, Petter Nilsson, Ruinian Xu, Shantanu Thakar, Bai Lou, Harpreet Sawhney, Zhebin Zhang, Mudit Agrawal, Bhavana Chandrashekhar, Aaron Parness

Comments 20 pages, 16 figures

2602.13345 2026-02-17 cs.LG cs.IR cs.MA

BLUEPRINT Rebuilding a Legacy: Multimodal Retrieval for Complex Engineering Drawings and Documents

Ethan Seefried, Ran Eldegaway, Sanjay Das, Nathaniel Blanchard, Tirthankar Ghosal

Comments 20 pages 8 main + 12 appendix + references

2602.13339 2026-02-17 cs.CV cs.AI

An Integrated Causal Inference Framework for Traffic Safety Modeling with Semantic Street-View Visual Features

Lishan Sun, Yujia Cheng, Pengfei Cui, Lei Han, Mohamed Abdel-Aty, Yunhan Zheng, Xingchen Zhang

Comments 34 pages, 13 figures

2602.13335 2026-02-17 cs.CV

Meningioma Analysis and Diagnosis using Limited Labeled Samples

Jiamiao Lu, Wei Wu, Ke Gao, Ping Mao, Weichuan Zhang, Tuo Wang, Lingkun Ma, Jiapan Guo, Zanyi Wu, Yuqing Hu, Changming Sun

Comments 19 pages,7 figures

2602.13334 2026-02-17 cs.CV cs.DC cs.LG

Ask the Expert: Collaborative Inference for Vision Transformers with Near-Edge Accelerators

Hao Liu, Suhaib A. Fahmy

2602.13332 2026-02-17 cs.CV cs.AI

MedScope: Incentivizing "Think with Videos" for Clinical Reasoning via Coarse-to-Fine Tool Calling

Wenjie Li, Yujie Zhang, Haoran Sun, Xingqi He, Hongcheng Gao, Chenglong Ma, Ming Hu, Guankun Wang, Shiyi Yao, Renhao Yang, Hongliang Ren, Lei Wang, Junjun He, Yankai Jiang

2602.13330 2026-02-17 cs.CV

Zwitscherkasten -- DIY Audiovisual bird monitoring

Dominik Blum, Elias Häring, Fabian Jirges, Martin Schäffer, David Schick, Florian Schulenberg, Torsten Schön

Comments Project Report of the Applied Artificial Intelligence Degree Program at Technische Hochschule Ingolstadt

2602.13329 2026-02-17 cs.CV cs.AI cs.RO

HiST-VLA: A Hierarchical Spatio-Temporal Vision-Language-Action Model for End-to-End Autonomous Driving

Yiru Wang, Zichong Gu, Yu Gao, Anqing Jiang, Zhigang Sun, Shuo Wang, Yuwen Heng, Hao Sun

2602.13326 2026-02-17 cs.CV

MotionWeaver: Holistic 4D-Anchored Framework for Multi-Humanoid Image Animation

Xirui Hu, Yanbo Ding, Jiahao Wang, Tingting Shi, Yali Wang, Guo Zhi Zhi, Weizhan Zhang

2602.13324 2026-02-17 cs.CV cs.AI cs.RO

Synthesizing the Kill Chain: A Zero-Shot Framework for Target Verification and Tactical Reasoning on the Edge

Jesse Barkley, Abraham George, Amir Barati Farimani

Comments 8 Pages, 3 Figures

2602.13323 2026-02-17 cs.AI

Contrastive explanations of BDI agents

Michael Winikoff

Comments AAMAS 2026 paper with added supplementary material

2602.13322 2026-02-17 cs.CV cs.LG

Diagnostic Benchmarks for Invariant Learning Dynamics: Empirical Validation of the Eidos Architecture

Datorien L. Anderson

Comments 8 pages, 3 figures and extra material to help can be found: https://zenodo.org/records/18529180

2602.13321 2026-02-17 cs.AI cs.LG

Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction

Tri Nguyen, Huy Hoang Bao Le, Lohith Srikanth Pentapalli, Laurah Turner, Kelly Cohen

2602.13320 2026-02-17 cs.AI

Information Fidelity in Tool-Using LLM Agents: A Martingale Analysis of the Model Context Protocol

Flint Xiaofeng Fan, Cheston Tan, Roger Wattenhofer, Yew-Soon Ong

Comments Full working version of an extended abstract accepted at the 25th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2026)

2602.13319 2026-02-17 cs.AI cs.HC

Situation Graph Prediction: Structured Perspective Inference for User Modeling

Jisung Shin, Daniel Platnick, Marjan Alirezaie, Hossein Rahnama

Comments Preprint under review, 4 pages

2602.13315 2026-02-17 cs.CV cs.AI

IDPruner: Harmonizing Importance and Diversity in Visual Token Pruning for MLLMs

Yifan Tan, Yifu Sun, Shirui Huang, Hong Liu, Guanghua Yu, Jianchen Zhu, Yangdong Deng

2602.13313 2026-02-17 cs.CV cs.AI

Agentic Spatio-Temporal Grounding via Collaborative Reasoning

Heng Zhao, Yew-Soon Ong, Joey Tianyi Zhou

2602.13306 2026-02-17 cs.CV cs.AI cs.LG

Fine-Tuning a Large Vision-Language Model for Artwork's Scoring and Critique

Zhehan Zhang, Meihua Qian, Li Luo, Siyu Huang, Chaoyi Zhou, Ripon Saha, Xinxin Song

2602.13303 2026-02-17 cs.CV cs.AI cs.LG eess.IV

Spectral Collapse in Diffusion Inversion

Nicolas Bourriez, Alexandre Verine, Auguste Genovesio

2602.13299 2026-02-17 cs.CV cs.AI

KidMesh: Computational Mesh Reconstruction for Pediatric Congenital Hydronephrosis Using Deep Neural Networks

Haoran Sun, Zhanpeng Zhu, Anguo Zhang, Bo Liu, Zhaohua Lin, Liqin Huang, Mingjing Yang, Lei Liu, Shan Lin, Wangbin Ding

详情

英文摘要

Pediatric congenital hydronephrosis (CH) is a common urinary tract disorder, primarily caused by obstruction at the renal pelvis-ureter junction. Magnetic resonance urography (MRU) can visualize hydronephrosis, including renal pelvis and calyces, by utilizing the natural contrast provided by water. Existing voxel-based segmentation approaches can extract CH regions from MRU, facilitating disease diagnosis and prognosis. However, these segmentation methods predominantly focus on morphological features, such as size, shape, and structure. To enable functional assessments, such as urodynamic simulations, external complex post-processing steps are required to convert these results into mesh-level representations. To address this limitation, we propose an end-to-end method based on deep neural networks, namely KidMesh, which could automatically reconstruct CH meshes directly from MRU. Generally, KidMesh extracts feature maps from MRU images and converts them into feature vertices through grid sampling. It then deforms a template mesh according to these feature vertices to generate the specific CH meshes of MRU images. Meanwhile, we develop a novel schema to train KidMesh without relying on accurate mesh-level annotations, which are difficult to obtain due to the sparsely sampled MRU slices. Experimental results show that KidMesh could reconstruct CH meshes in an average of 0.4 seconds, and achieve comparable performance to conventional methods without requiring post-processing. The reconstructed meshes exhibited no self-intersections, with only 3.7% and 0.2% of the vertices having error distances exceeding 3.2mm and 6.4mm, respectively. After rasterization, these meshes achieved a Dice score of 0.86 against manually delineated CH masks. Furthermore, these meshes could be used in renal urine flow simulations, providing valuable urodynamic information for clinical practice.

URL PDF HTML ☆

赞 0 踩 0

2602.13297 2026-02-17 cs.CV cs.LG

Conditional Generative Models for High-Resolution Range Profiles: Capturing Geometry-Driven Trends in a Large-Scale Maritime Dataset

Edwyn Brient, Santiago Velasco-Forero, Rami Kassab

2602.13296 2026-02-17 cs.CV cs.LG

MFN Decomposition and Related Metrics for High-Resolution Range Profiles Generative Models

Edwyn Brient, Santiago Velasco-Forero, Rami Kassab

Journal ref 2025 IEEE Radar Conference (RadarConf25), Oct 2025, Krakow, Poland. pp.1-6

2602.13292 2026-02-17 cs.AI

Mirror: A Multi-Agent System for AI-Assisted Ethics Review

Yifan Ding, Yuhui Shi, Zhiyan Li, Zilong Wang, Yifeng Gao, Yajun Yang, Mengjie Yang, Yixiu Liang, Xipeng Qiu, Xuanjing Huang, Xingjun Ma, Yu-Gang Jiang, Guoyu Wang

Comments 4 figures, 3 tables

2602.13289 2026-02-17 cs.CV cs.AI

Evaluating the Impact of Post-Training Quantization on Reliable VQA with Multimodal LLMs

Paul Jonas Kurz, Tobias Jan Wieczorek, Mohamed A. Abdelsalam, Rahaf Aljundi, Marcus Rohrbach

Comments Accepted poster at the 1st Workshop on Epistemic Intelligence in Machine Learning (EIML) @ EURIPS 2025

AI 大模型

视觉与机器人

科学与医疗

The Diffusion Duet: Harmonizing Dual Channels with Wavelet Suppression for Image Separation

The Speed-up Factor: A Quantitative Multi-Iteration Active Learning Performance Metric

Using Deep Learning to Generate Semantically Correct Hindi Captions

Detecting Brick Kiln Infrastructure at Scale: Graph, Foundation, and Remote Sensing Models for Satellite Imagery Data

From Prompt to Production:Automating Brand-Safe Marketing Imagery with Text-to-Image Models

Exploring the Performance of ML/DL Architectures on the MNIST-1D Dataset

Visual Foresight for Robotic Stow: A Diffusion-Based World Model from Sparse Snapshots

BLUEPRINT Rebuilding a Legacy: Multimodal Retrieval for Complex Engineering Drawings and Documents

An Integrated Causal Inference Framework for Traffic Safety Modeling with Semantic Street-View Visual Features

Meningioma Analysis and Diagnosis using Limited Labeled Samples

Ask the Expert: Collaborative Inference for Vision Transformers with Near-Edge Accelerators

MedScope: Incentivizing "Think with Videos" for Clinical Reasoning via Coarse-to-Fine Tool Calling

Zwitscherkasten -- DIY Audiovisual bird monitoring

HiST-VLA: A Hierarchical Spatio-Temporal Vision-Language-Action Model for End-to-End Autonomous Driving

MotionWeaver: Holistic 4D-Anchored Framework for Multi-Humanoid Image Animation

Synthesizing the Kill Chain: A Zero-Shot Framework for Target Verification and Tactical Reasoning on the Edge

Contrastive explanations of BDI agents

Diagnostic Benchmarks for Invariant Learning Dynamics: Empirical Validation of the Eidos Architecture

Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction

Information Fidelity in Tool-Using LLM Agents: A Martingale Analysis of the Model Context Protocol

Situation Graph Prediction: Structured Perspective Inference for User Modeling

IDPruner: Harmonizing Importance and Diversity in Visual Token Pruning for MLLMs

Agentic Spatio-Temporal Grounding via Collaborative Reasoning

Fine-Tuning a Large Vision-Language Model for Artwork's Scoring and Critique

Spectral Collapse in Diffusion Inversion

KidMesh: Computational Mesh Reconstruction for Pediatric Congenital Hydronephrosis Using Deep Neural Networks

Conditional Generative Models for High-Resolution Range Profiles: Capturing Geometry-Driven Trends in a Large-Scale Maritime Dataset

MFN Decomposition and Related Metrics for High-Resolution Range Profiles Generative Models

Mirror: A Multi-Agent System for AI-Assisted Ethics Review

Evaluating the Impact of Post-Training Quantization on Reliable VQA with Multimodal LLMs