arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2510.08176 2026-02-06 cs.SD cs.AI cs.LG eess.AS

Leveraging Whisper Embeddings for Audio-based Lyrics Matching

Eleonora Mancini, Joan Serrà, Paolo Torroni, Yuki Mitsufuji

Comments Accepted at ICASSP 2026 (IEEE International Conference on Acoustics, Speech and Signal Processing)

2510.04660 2026-02-06 cs.LG

An Attention-based Feature Memory Design for Energy-Efficient Continual Learning

Yuandou Wang, Filip Gunnarsson, Rihan Hai

2510.03999 2026-02-06 cs.CL

LH-Deception: Simulating and Understanding LLM Deceptive Behaviors in Long-Horizon Interactions

Yang Xu, Xuanming Zhang, Samuel Yeh, Jwala Dhamala, Ousmane Dia, Rahul Gupta, Sharon Li

Comments ICLR 2026

2510.02345 2026-02-06 cs.CL cs.AI cs.DC cs.LG cs.NE

Breaking the MoE LLM Trilemma: Dynamic Expert Clustering with Structured Compression

Peijun Zhu, Ning Yang, Baoliang Tian, Jiayu Wei, Weihao Zhang, Haijun Zhang, Pin Lv

Comments 10 pages, 2 figures, 8 tables. Under review as a conference paper at ICML 2026

2509.22650 2026-02-06 cs.CV

RefAM: Attention Magnets for Zero-Shot Referral Segmentation

Anna Kukleva, Enis Simsar, Alessio Tonioni, Muhammad Ferjad Naeem, Federico Tombari, Jan Eric Lenssen, Bernt Schiele

Comments Project Page: https://refam-diffusion.github.io/

2509.22352 2026-02-06 cs.LG cs.AI

SurvDiff: A Diffusion Model for Generating Synthetic Data in Survival Analysis

Marie Brockschmidt, Maresa Schröder, Stefan Feuerriegel

2509.21818 2026-02-06 cs.LG math.OC

Sharpness-Aware Minimization Can Hallucinate Minimizers

Chanwoong Park, Uijeong Jang, Ernest K. Ryu, Insoon Yang

2509.21004 2026-02-06 cs.LG

Multi-Agent Inverted Transformer for Flight Trajectory Prediction

Seokbin Yoon, Keumjin Lee

Comments 11 pages, 8 figures, submitted for IEEE Transactions on Intelligent Transportation System

2509.20900 2026-02-06 cs.CL

Learning to Summarize by Learning to Quiz: Adversarial Agentic Collaboration for Long Document Summarization

Weixuan Wang, Minghao Wu, Barry Haddow, Alexandra Birch

2509.19391 2026-02-06 cs.LG cs.AI

TensLoRA: Tensor Alternatives for Low-Rank Adaptation

Axel Marmoret, Reda Bensaid, Jonathan Lys, Vincent Gripon, François Leduc-Primeau

Comments Published at ICASSP 2026. 5 pages, 1 figure, 2 tables. Code can be found at https://github.com/ax-le/TensLoRA

2509.11298 2026-02-06 cs.LG cs.AI cs.CL

When Are Two RLHF Objectives the Same?

Madhava Gaikwad

Comments 21 pages

2509.06690 2026-02-06 cs.CV cs.AI cs.AR

BioLite U-Net: Edge-Deployable Semantic Segmentation for In Situ Bioprinting Monitoring

Usman Haider, Lukasz Szemet, Daniel Kelly, Vasileios Sergis, Andrew C. Daly, Karl Mason

Comments 8 pages, 5 figures, conference-style submission (ICRA 2026). Includes dataset description, BioLite U-Net architecture, benchmark results on edge device (Raspberry Pi 4B)

2509.06505 2026-02-06 cs.LG cs.IT math.IT stat.ML

On optimal solutions of classical and sliced Wasserstein GANs with non-Gaussian data

Yu-Jui Huang, Hsin-Hua Shen, Yu-Chih Huang, Wan-Yi Lin, Shih-Chun Lin

2509.04661 2026-02-06 cs.LG cs.NE

Flexible inference for animal learning rules using neural networks

Yuhan Helena Liu, Victor Geadah, Jonathan Pillow

详情

英文摘要

Understanding how animals learn is a central challenge in neuroscience, with growing relevance to the development of animal- or human-aligned artificial intelligence. However, existing approaches tend to assume fixed parametric forms for the learning rule (e.g., Q-learning, policy gradient), which may not accurately describe the complex forms of learning employed by animals in realistic settings. Here we address this gap by developing a framework to infer learning rules directly from behavioral data collected during de novo task learning. We assume that animals follow a decision policy parameterized by a generalized linear model (GLM), and we model their learning rule -- the mapping from task covariates to per-trial weight updates -- using a deep neural network (DNN). This formulation allows flexible, data-driven inference of learning rules while maintaining an interpretable form of the decision policy itself. To capture more complex learning dynamics, we introduce a recurrent neural network (RNN) variant that relaxes the Markovian assumption that learning depends solely on covariates of the current trial, allowing for learning rules that integrate information over multiple trials. Simulations demonstrate that the framework can recover ground-truth learning rules. We applied our DNN and RNN-based methods to a large behavioral dataset from mice learning to perform a sensory decision-making task and found that they outperformed traditional RL learning rules at predicting the learning trajectories of held-out mice. The inferred learning rules exhibited reward-history-dependent learning dynamics, with larger updates following sequences of rewarded trials. Overall, these methods provide a flexible framework for inferring learning rules from behavioral data in de novo learning tasks, setting the stage for improved animal training protocols and the development of behavioral digital twins.

URL PDF HTML ☆

赞 0 踩 0

2509.03493 2026-02-06 cs.LG cs.AI

On Entropy Control in LLM-RL Algorithms

Han Shen

Comments Updated with ICLR 2026 version

2509.02276 2026-02-06 cs.AI

Rewarding Explainability in Drug Repurposing with Knowledge Graphs

Susana Nunes, Samy Badreddine, Catia Pesquita

Comments 9 pages, 4 figures, accepted at conference IJCAI 2025

Journal ref Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence (IJCAI-25), pp. 4624-4632, 2025

2508.21051 2026-02-06 cs.CL cs.AI cs.CY

Language Models and Logic Programs for Trustworthy Tax Reasoning

William Jurayj, Nils Holzenberger, Benjamin Van Durme

Comments Accepted to AAAI 2026

2508.19842 2026-02-06 cs.LG

Symplectic convolutional neural networks

Süleyman Yıldız, Konrad Janik, Peter Benner

2508.04349 2026-02-06 cs.CL cs.AI

GTPO and GRPO-S: Token and Sequence-Level Reward Shaping with Policy Entropy

Hongze Tan, Zihan Wang, Jianfei Pan, Jinghao Lin, Hao Wang, Yifan Wu, Tao Chen, Zhihang Zheng, Zhihao Tang, Haihua Yang

2508.01151 2026-02-06 cs.CV cs.AI

Personalized Safety Alignment for Text-to-Image Diffusion Models

Yu Lei, Jinbin Bai, Qingyu Shi, Aosong Feng, Hongcheng Gao, Xiao Zhang, Rex Ying

2507.17001 2026-02-06 cs.LG

Should Bias be Eliminated? A General Framework to Use Bias for OOD Generalization

Yan Li, Yunlong Deng, Zijian Li, Anpeng Wu, Zeyu Tang, Kun Zhang, Guangyi Chen

2507.13772 2026-02-06 cs.CV cs.LG

Feature Engineering is Not Dead: Reviving Classical Machine Learning with Entropy, HOG, and LBP Feature Fusion for Image Classification

Abhijit Sen, Giridas Maiti, Bikram K. Parida, Bhanu P. Mishra, Mahima Arya, Denys I. Bondar

2507.13624 2026-02-06 cs.LG cs.DC cs.NI

FedSkipTwin: Digital-Twin-Guided Client Skipping for Communication-Efficient Federated Learning

Daniel Commey, Kamel Abbad, Garth V. Crosby, Lyes Khoukhi

Journal ref 2026 IEEE 23rd Consumer Communications & Networking Conference (CCNC)

2507.13579 2026-02-06 cs.LG cs.AI

Learning to summarize user information for personalized reinforcement learning from human feedback

Hyunji Nam, Yanming Wan, Mickel Liu, Peter Ahnn, Jianxun Lian, Natasha Jaques

Comments 10 pages for main text, 10 pages for appendix

2507.10239 2026-02-06 cs.CV

Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks

Ben Hamscher, Edgar Heinert, Annika Mütze, Kira Maag, Matthias Rottmann

Comments accepted at ECAI 2025

2507.04756 2026-02-06 cs.CL cs.AI

CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering

Hang Lv, Sheng Liang, Hao Wang, Hongchao Gu, Yaxiong Wu, Wei Guo, Defu Lian, Yong Liu, Enhong Chen

2507.01028 2026-02-06 cs.LG cs.AI

Dual Perspectives on Non-Contrastive Self-Supervised Learning

Jean Ponce, Basile Terver, Martial Hebert, Michael Arbel

2506.22186 2026-02-06 cs.LG

Thompson Sampling-Based Learning and Control for Unknown Dynamic Systems

Kaikai Zheng, Dawei Shi, Yang Shi, Long Wang

2506.21996 2026-02-06 cs.AI

AlphaBeta is not as good as you think: a simple class of synthetic games for a better analysis of deterministic game-solving algorithms

Raphaël Boige, Amine Boumaza, Bruno Scherrer

Journal ref The Thirty-ninth Annual Conference on Neural Information Processing Systems, Dec 2025, San Diego, United States

详情

英文摘要

Deterministic game-solving algorithms are conventionally analyzed in the light of their average-case complexity against a distribution of random game-trees, where leaf values are independently sampled from a fixed distribution. This simplified model enables uncluttered mathematical analysis, revealing two key properties: root value distributions asymptotically collapse to a single fixed value for finite-valued trees, and all reasonable algorithms achieve global optimality. However, these findings are artifacts of the model's design: its long criticized independence assumption strips games of structural complexity, producing trivial instances where no algorithm faces meaningful challenges. To address this limitation, we introduce a class of synthetic games generated by a probabilistic model that incrementally constructs game-trees using a fixed level-wise conditional distribution. By enforcing ancestor dependencies, a critical structural feature of real-world games, our framework generates problems with adjustable difficulty while retaining some form of analytical tractability. For several algorithms, including AlphaBeta and Scout, we derive recursive formulas characterizing their average-case complexities under this model. These allow us to rigorously compare algorithms on deep game-trees, where Monte-Carlo simulations are no longer feasible. While asymptotically, all algorithms seem to converge to identical branching factor (a result analogous to that of independence-based models), deep finite trees reveal stark differences: AlphaBeta incurs a significantly larger constant multiplicative factor compared to algorithms like Scout, leading to a substantial practical slowdown. Our framework sheds new light on classical game-solving algorithms, offering rigorous evidence and analytical tools to advance the understanding of these methods under a richer, more challenging, and yet tractable model.

URL PDF HTML ☆

赞 0 踩 0

2506.08629 2026-02-06 cs.CV cs.AI

ECMNet:Lightweight Semantic Segmentation with Efficient CNN-Mamba Network

Feixiang Du, Shengkun Wu

Comments 16 pages, 2 figures, 4 tables

AI 大模型

视觉与机器人

科学与医疗

Leveraging Whisper Embeddings for Audio-based Lyrics Matching

An Attention-based Feature Memory Design for Energy-Efficient Continual Learning

LH-Deception: Simulating and Understanding LLM Deceptive Behaviors in Long-Horizon Interactions

Breaking the MoE LLM Trilemma: Dynamic Expert Clustering with Structured Compression

RefAM: Attention Magnets for Zero-Shot Referral Segmentation

SurvDiff: A Diffusion Model for Generating Synthetic Data in Survival Analysis

Sharpness-Aware Minimization Can Hallucinate Minimizers

Multi-Agent Inverted Transformer for Flight Trajectory Prediction

Learning to Summarize by Learning to Quiz: Adversarial Agentic Collaboration for Long Document Summarization

TensLoRA: Tensor Alternatives for Low-Rank Adaptation

When Are Two RLHF Objectives the Same?

BioLite U-Net: Edge-Deployable Semantic Segmentation for In Situ Bioprinting Monitoring

On optimal solutions of classical and sliced Wasserstein GANs with non-Gaussian data

Flexible inference for animal learning rules using neural networks

On Entropy Control in LLM-RL Algorithms

Rewarding Explainability in Drug Repurposing with Knowledge Graphs

Language Models and Logic Programs for Trustworthy Tax Reasoning

Symplectic convolutional neural networks

GTPO and GRPO-S: Token and Sequence-Level Reward Shaping with Policy Entropy

Personalized Safety Alignment for Text-to-Image Diffusion Models

Should Bias be Eliminated? A General Framework to Use Bias for OOD Generalization

Feature Engineering is Not Dead: Reviving Classical Machine Learning with Entropy, HOG, and LBP Feature Fusion for Image Classification

FedSkipTwin: Digital-Twin-Guided Client Skipping for Communication-Efficient Federated Learning

Learning to summarize user information for personalized reinforcement learning from human feedback

Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks

CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering

Dual Perspectives on Non-Contrastive Self-Supervised Learning

Thompson Sampling-Based Learning and Control for Unknown Dynamic Systems

AlphaBeta is not as good as you think: a simple class of synthetic games for a better analysis of deterministic game-solving algorithms

ECMNet:Lightweight Semantic Segmentation with Efficient CNN-Mamba Network