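arXivDaily: a daily academic digest synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and other fields.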

2506.08916 2026-03-25 cs.LG math.DS q-bio.QM

Enhancing generalizability of model discovery across parameter space with multi-experiment equation learning (ME-EQL)

Maria-Veronica Ciocanel, John T. Nardini, Kevin B. Flores, Erica M. Rutter, Suzanne S. Sindi, Alexandria Volkening

Comments 31 pages, 10 figures

2506.05520 2026-03-25 cs.AI cs.MA

Toward Data Systems That Are Business Semantic Centric and AI Agents Assisted

Cecil Pang

Comments Published by IEEE Access

Journal ref IEEE Access, vol. 13, pp. 113752-113762, 2025

DOI: 10.1109/ACCESS.2025.3583260

Abstract

Contemporary businesses operate in dynamic environments requiring rapid adaptation to achieve goals and maintain competitiveness. Existing data platforms often fall short by emphasizing tools over alignment with business needs, resulting in inefficiencies and delays. To address this gap, I propose the Business Semantics Centric, AI Agents Assisted Data System (BSDS), a holistic system that integrates architecture, workflows, and team organization to ensure data systems are tailored to business priorities rather than dictated by technical constraints. BSDS redefines data systems as dynamic enablers of business success, transforming them from passive tools into active drivers of organizational growth. BSDS has a modular architecture that comprises curated data linked to business entities, a knowledge base for context-aware AI agents, and efficient data pipelines. AI agents play a pivotal role in assisting with data access and system management, reducing human effort, and improving scalability. Complementing this architecture, BSDS incorporates workflows optimized for both exploratory data analysis and production requirements, balancing speed of delivery with quality assurance. A key innovation of BSDS is its incorporation of the human factor. By aligning data team expertise with business semantics, BSDS bridges the gap between technical capabilities and business needs. Validated through real-world implementation, BSDS accelerates time-to-market for data-driven initiatives, enhances cross-functional collaboration, and provides a scalable blueprint for businesses of all sizes. Future research can build on BSDS to explore optimization strategies using complex systems and adaptive network theories, as well as developing autonomous data systems leveraging AI agents.

2505.22564 2026-03-25 cs.CV cs.AI cs.LG

PRISM: Video Dataset Condensation with Progressive Refinement and Insertion for Sparse Motion

Jaehyun Choi, Jiwan Hur, Gyojin Han, Jaemyung Yu, Junmo Kim

Comments CVPR 2026

2505.22318 2026-03-25 cs.CL cs.LG

Flying Pigs, FaR and Beyond: Evaluating LLM Reasoning in Counterfactual Worlds

Anish R Joishy, Ishwar B Balappanawar, Vamshi Krishna Bonagiri, Manas Gaur, Krishnaprasad Thirunarayan, Ponnurangam Kumaraguru

2505.20881 2026-03-25 cs.LG cs.AI

Generalizable Heuristic Generation Through LLMs with Meta-Optimization

Yiding Shi, Jianan Zhou, Wen Song, Jieyi Bi, Yaoxin Wu, Zhiguang Cao, Jie Zhang

Comments Accepted at ICLR 2026

2505.18179 2026-03-25 cs.LG cs.AI

GAIA: A Foundation Model for Operational Atmospheric Dynamics

Ata Akbari Asanjan, Olivia Alexander, Tom Berg, Stephen Peng, Jad Makki, Clara Zhang, Matt Yang, Disha Shidham, Srija Chakraborty, William Bender, Cara Crawford, Arun Ravindran, Olivier Raiman, David Potere, David Bell

Comments 22 pages, 11 figures

2505.11897 2026-03-25 cs.CV

FiGKD: Fine-Grained Knowledge Distillation via High-Frequency Detail Transfer

Seonghak Kim

Comments 18 pages, 6 figures

Journal ref Expert Syst. Appl. 319 (2026) 132071

DOI: 10.1016/j.eswa.2026.132071

Abstract

Knowledge distillation (KD) is a widely adopted technique for transferring knowledge from a high-capacity teacher model to a smaller student model by aligning their output distributions. However, existing methods often underperform in fine-grained visual recognition tasks, where distinguishing subtle differences between visually similar classes is essential. This performance gap stems from the fact that conventional approaches treat the teacher's output logits as a single, undifferentiated signal, assuming all contained information is equally beneficial to the student. Consequently, student models may become overloaded with redundant signals and fail to capture the teacher's nuanced decision boundaries. To address this issue, we propose Fine-Grained Knowledge Distillation (FiGKD), a novel frequency-aware framework that decomposes a model's logits into low-frequency (content) and high-frequency (detail) components using the discrete wavelet transform (DWT). FiGKD selectively transfers only the high-frequency components, which encode the teacher's semantic decision patterns, while discarding redundant low-frequency content already conveyed through ground-truth supervision. Our approach is simple, architecture-agnostic, and requires no access to intermediate feature maps. Extensive experiments on CIFAR-100, TinyImageNet, and multiple fine-grained recognition benchmarks show that FiGKD consistently outperforms state-of-the-art logit-based and feature-based distillation methods across a variety of teacher-student configurations. These findings confirm that frequency-aware logit decomposition enables more efficient and effective knowledge transfer, particularly in resource-constrained settings.

2505.11191 2026-03-25 cs.AI cs.RO

Multi-Modal Multi-Task (M3T) Federated Foundation Models for Embodied AI: Potentials and Challenges for Edge Integration

Kasra Borazjani, Payam Abdisarabshali, Fardis Nadimi, Naji Khosravan, Minghui Liwang, Xianbin Wang, Yiguang Hong, Seyyedali Hosseinalipour

Comments Accepted for Publication in IEEE Internet of Things Magazine, 2025

Journal ref IEEE Internet of Things Magazine, 2025

DOI: 10.1109/MIOT.2025.3604330

Abstract

As embodied AI systems become increasingly multi-modal, personalized, and interactive, they must learn effectively from diverse sensory inputs, adapt continually to user preferences, and operate safely under resource and privacy constraints. These challenges expose a pressing need for machine learning models capable of swift, context-aware adaptation while balancing model generalization and personalization. Here, two methods emerge as suitable candidates, each offering parts of these capabilities: multi-modal multi-task foundation models (M3T-FMs) provide a pathway toward generalization across tasks and modalities, whereas federated learning (FL) offers the infrastructure for distributed, privacy-preserving model updates and user-level model personalization. However, when used in isolation, each of these approaches falls short of meeting the complex and diverse capability requirements of real-world embodied AI environments. In this vision paper, we introduce multi-modal multi-task federated foundation models (M3T-FFMs) for embodied AI, a new paradigm that unifies the strengths of M3T-FMs with the privacy-preserving distributed training nature of FL, enabling intelligent systems at the wireless edge. We collect critical deployment dimensions of M3T-FFMs in embodied AI ecosystems under a unified framework, which we name "EMBODY": Embodiment heterogeneity, Modality richness and imbalance, Bandwidth and compute constraints, On-device continual learning, Distributed control and autonomy, and Yielding safety, privacy, and personalization. For each, we identify concrete challenges and envision actionable research directions. We also present an evaluation framework for deploying M3T-FFMs in embodied AI systems, along with the associated trade-offs. Finally, we present a prototype implementation of M3T-FFMs and evaluate their energy and latency performance.

2505.09424 2026-03-25 cs.RO

Exploring Pose-Guided Imitation Learning for Robotic Precise Insertion

Han Sun, Sheng Liu, Yizhao Wang, Zhenning Zhou, Shuai Wang, Haibo Yang, Jingyuan Sun, Qixin Cao

2505.02395 2026-03-25 cs.RO cs.SY eess.SY

A Real-Time Control Barrier Function-Based Safety Filter for Motion Planning with Arbitrary Road Boundary Constraints

Jianye Xu, Chang Che, Bassam Alrifaee

Comments Published version, see https://doi.org/10.1109/ITSC60802.2025.11423203

2505.00333 2026-03-25 cs.LG eess.SP

Two Stage Wireless Federated LoRA Fine-Tuning with Sparsified Orthogonal Updates

Bumjun Kim, Wan Choi

2504.16956 2026-03-25 cs.CL cs.LG q-bio.GN

GeneMamba: An Efficient and Effective Foundation Model on Single Cell Data

Cong Qi, Hanzhang Fang, Siqi Jiang, Xun Song, Tianxing Hu, Wei Zhi

2504.14094 2026-03-25 cs.LG cs.AI stat.ML

Leakage and Interpretability in Concept-Based Models

Enrico Parisini, Tapabrata Chakraborti, Chris Harbron, Ben D. MacArthur, Christopher R. S. Banerji

Comments 39 pages, 25 figures

2503.17937 2026-03-25 cs.CV

Cross-Domain Underwater Image Enhancement Guided by No-Reference Image Quality Assessment: A Transfer Learning Approach

Zhi Zhang, Minfu Li, Lu Li, Daoyi Chen

2503.14553 2026-03-25 cs.CV cs.LG

Redefining non-IID Data in Federated Learning for Computer Vision Tasks: Migrating from Labels to Embeddings for Task-Specific Data Distributions

Kasra Borazjani, Payam Abdisarabshali, Naji Khosravan, Seyyedali Hosseinalipour

Comments Accepted for publication in IEEE Transactions on Artificial Intelligence, 2026

2503.10144 2026-03-25 cs.LG cs.AI

Multiplicative learning from observation-prediction ratios

Han Kim, Hyungjoon Soh, Vipul Periwal, Junghyo Jo

2503.05656 2026-03-25 cs.RO cs.MA

Small-Scale Testbeds for Connected and Automated Vehicles and Robot Swarms: Challenges and a Roadmap

Jianye Xu, Johannes Betz, Armin Mokhtarian, Archak Mittal, Mengchi Cai, Rahul Mangharam, Omar M. Shehata, Catherine M. Elias, Jan-Nico Zaech, Patrick Scheffe, Felix Jahncke, Sangeet Sankaramangalam Ulhas, Kaj Munhoz Arfvidsson, Bassam Alrifaee

Comments Published version

2503.02693 2026-03-25 cs.LG cs.MA

Federated Learning for Data-Driven Feedforward Control: A Case Study on Vehicle Lateral Dynamics

Jakob Weber, Markus Gurtner, Benedikt Alt, Adrian Trachte, Andreas Kugi

Comments Accepted at ECC 2026

2502.07861 2026-03-25 cs.LG cs.AI cs.DS

Streaming Attention Approximation via Discrepancy Theory

Ekaterina Kochetkova, Kshiteej Sheth, Insu Han, Amir Zandieh, Michael Kapralov

2502.01969 2026-03-25 cs.CV cs.AI

Mitigating Object Hallucinations in Large Vision-Language Models via Attention Calibration

Younan Zhu, Linwei Tao, Minjing Dong, Chang Xu

2502.01356 2026-03-25 cs.CV

Quasi-Conformal Convolution: A Learnable Convolution for Deep Learning on Simply Connected Open Surfaces

Han Zhang, Tsz Lok Ip, Lok Ming Lui

2501.02949 2026-03-25 cs.LG eess.SP

MSA-CNN: A Lightweight Multi-Scale CNN with Attention for Sleep Stage Classification

Stephan Goerttler, Yucheng Wang, Emadeldeen Eldele, Min Wu, Fei He

Comments 12 pages, 8 figures, journal paper

Journal ref Biomedical Signal Processing and Control, 120(B), 2026, 110141

2501.01921 2026-03-25 cs.SD eess.AS

Structural and Statistical Audio Texture Knowledge Distillation for Acoustic Classification

Jarin Ritu, Amirmohammad Mohammadi, Davelle Carreiro, Alexandra Van Dine, Joshua Peeples

Comments 13 pages, 6 figures

2412.17963 2026-03-25 cs.CL

Extracting and Following Paths for Robust Relational Reasoning with Large Language Models

Ge Zhang, Mohammad Ali Alomrani, Hongjian Gu, Jiaming Zhou, Yaochen Hu, Bin Wang, Qun Liu, Mark Coates, Yingxue Zhang, Jianye Hao

Journal ref Transactions on Machine Learning Research, 2026

2412.13152 2026-03-25 cs.CV cs.AI

Continuous Patient Monitoring with AI: Real-Time Analysis of Video in Hospital Care Settings

Paolo Gabriel, Peter Rehani, Tyler Troy, Tiffany Wyatt, Michael Choma, Narinder Singh

Comments 21 pages, 9 figures, 3 tables, submitted to Frontiers in Imaging > Imaging Applications > (Research Topic) Deep Learning for Medical Imaging Applications for publication

2412.08686 2026-03-25 cs.CL cs.CY cs.LG

LatentQA: Teaching LLMs to Decode Activations Into Natural Language

Alexander Pan, Lijie Chen, Jacob Steinhardt

Comments ICLR 2026; project page at https://latentqa.github.io

2412.07481 2026-03-25 cs.CV

Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence

Wenbo Huang, Jinghui Zhang, Guang Li, Lei Zhang, Shuoyuan Wang, Fang Dong, Jiahui Jin, Takahiro Ogawa, Miki Haseyama

Comments Accepted by AAAI 2025

2412.05430 2026-03-25 cs.LG q-bio.GN

DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA

Aman Patel, Arpita Singhal, Austin Wang, Anusri Pampari, Maya Kasowski, Anshul Kundaje

Comments NeurIPS Datasets and Benchmarks 2024

2412.04227 2026-03-25 cs.LG cs.CV cs.PF

Foundations of the Theory of Performance-Based Ranking

Sébastien Piérard, Anaïs Halin, Anthony Cioppa, Adrien Deliège, Marc Van Droogenbroeck

2411.14827 2026-03-25 cs.CV cs.AI cs.LG eess.IV

Physically Interpretable Probabilistic Domain Characterization

Anaïs Halin, Sébastien Piérard, Renaud Vandeghen, Benoît Gérin, Maxime Zanella, Martin Colot, Jan Held, Anthony Cioppa, Emmanuel Jean, Gianluca Bontempi, Saïd Mahmoudi, Benoît Macq, Marc Van Droogenbroeck
