arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2602.18591 2026-02-24 cs.LG

Ensemble Prediction of Task Affinity for Efficient Multi-Task Learning

Afiya Ayman, Ayan Mukhopadhyay, Aron Laszka

2602.18585 2026-02-24 cs.CV cs.AI

BloomNet: Exploring Single vs. Multiple Object Annotation for Flower Recognition Using YOLO Variants

Safwat Nusrat, Prithwiraj Bhattacharjee

Comments Accepted for publication in 7th International Conference on Trends in Computational and Cognitive Engineering (TCCE-2025)

2602.18583 2026-02-24 cs.CL cs.AI cs.LG

Luna-2: Scalable Single-Token Evaluation with Small Language Models

Vatsal Goel, Rishon Dsouza, Nikhil Ega, Amey Ramesh Rambatla, Rob Friel, Shuai Shao, Yash Sheth

2602.18582 2026-02-24 cs.AI cs.CL cs.HC cs.LG

Hierarchical Reward Design from Language: Enhancing Alignment of Agent Behavior with Human Specifications

Zhiqin Qian, Ryan Diaz, Sangwon Seo, Vaibhav Unhelkar

Comments Extended version of an identically-titled paper accepted at AAMAS 2026

2602.18581 2026-02-24 cs.LG cond-mat.stat-mech physics.soc-ph

Learning Beyond Optimization: Stress-Gated Dynamical Regime Regulation in Autonomous Systems

Sheng Ran

2602.18572 2026-02-24 cs.LG q-fin.ST

Sub-City Real Estate Price Index Forecasting at Weekly Horizons Using Satellite Radar and News Sentiment

Baris Arat, Hasan Fehmi Ates, Emre Sefer

2602.18569 2026-02-24 cs.RO cs.SY eess.SY

Design and Biomechanical Evaluation of a Lightweight Low-Complexity Soft Bilateral Ankle Exoskeleton

Josée Mallah, Zakii Javed, Zafer Azak, Thomas Stone, Luigi G. Occhipinti

2602.18540 2026-02-24 cs.CV cs.AI

Rodent-Bench

Thomas Heap, Laurence Aitchison, Emma Cahill, Adriana Casado Rodriguez

2602.18535 2026-02-24 cs.SD cs.AI

Fairness-Aware Partial-label Domain Adaptation for Voice Classification of Parkinson's and ALS

Arianna Francesconi, Zhixiang Dai, Arthur Stefano Moscheni, Himesh Morgan Perera Kanattage, Donato Cappetta, Fabio Rebecchi, Paolo Soda, Valerio Guarrasi, Rosa Sicilia, Mary-Anne Hartley

Comments 7 pages, 1 figure. Submitted to Pattern Recognition Letters

详情

英文摘要

Voice-based digital biomarkers can enable scalable, non-invasive screening and monitoring of Parkinson's disease (PD) and Amyotrophic Lateral Sclerosis (ALS). However, models trained on one cohort or device often fail on new acquisition settings due to cross-device and cross-cohort domain shift. This challenge is amplified in real-world scenarios with partial-label mismatch, where datasets may contain different disease labels and only partially overlap in class space. In addition, voice-based models may exploit demographic cues, raising concerns about gender-related unfairness, particularly when deployed across heterogeneous cohorts. To tackle these challenges, we propose a hybrid framework for unified three-class (healthy/PD/ALS) cross-domain voice classification from partially overlapping cohorts. The method combines style-based domain generalization with conditional adversarial alignment tailored to partial-label settings, reducing negative transfer. An additional adversarial gender branch promotes gender-invariant representations. We conduct a comprehensive evaluation across four heterogeneous sustained-vowel datasets, spanning distinct acquisition settings and devices, under both domain generalization and unsupervised domain adaptation protocols. The proposed approach is compared against twelve state-of-the-art machine learning and deep learning methods, and further evaluated through three targeted ablations, providing the first cross-cohort benchmark and end-to-end domain-adaptive framework for unified healthy/PD/ALS voice classification under partial-label mismatch and fairness constraints. Across all experimental settings, our method consistently achieves the best external generalization over the considered evaluation metrics, while maintaining reduced gender disparities. Notably, no competing method shows statistically significant gains in external performance.

URL PDF HTML ☆

赞 0 踩 0

2602.18533 2026-02-24 cs.CV

Morphological Addressing of Identity Basins in Text-to-Image Diffusion Models

Andrew Fraser

详情

英文摘要

We demonstrate that morphological pressure creates navigable gradients at multiple levels of the text-to-image generative pipeline. In Study~1, identity basins in Stable Diffusion 1.5 can be navigated using morphological descriptors -- constituent features like platinum blonde,'' beauty mark,'' and 1950s glamour'' -- without the target's name or photographs. A self-distillation loop (generating synthetic images from descriptor prompts, then training a LoRA on those outputs) achieves consistent convergence toward a specific identity as measured by ArcFace similarity. The trained LoRA creates a local coordinate system shaping not only the target identity but also its inverse: maximal away-conditioning produces eldritch'' structural breakdown in base SD1.5, while the LoRA-equipped model produces ``uncanny valley'' outputs -- coherent but precisely wrong. In Study~2, we extend this to prompt-level morphology. Drawing on phonestheme theory, we generate 200 novel nonsense words from English sound-symbolic clusters (e.g., \emph{cr-}, \emph{sn-}, \emph{-oid}, \emph{-ax}) and find that phonestheme-bearing candidates produce significantly more visually coherent outputs than random controls (mean Purity@1 = 0.371 vs.\ 0.209, p<0.00001p < 0.00001 p<0.00001, Cohen's d=0.55d = 0.55 d=0.55). Three candidates -- \emph{snudgeoid}, \emph{crashax}, and \emph{broomix} -- achieve perfect visual consistency (Purity@1 = 1.0) with zero training data contamination, each generating a distinct, coherent visual identity from phonesthetic structure alone. Together, these studies establish that morphological structure -- whether in feature descriptors or prompt-level phonological form -- creates systematic navigational gradients through diffusion model latent spaces. We document phase transitions in identity basins, CFG-invariant identity stability, and novel visual concepts emerging from sub-lexical sound patterns.

URL PDF HTML ☆

赞 0 踩 0

2602.18531 2026-02-24 cs.LG cs.AI cs.DC

Deep Reinforcement Learning for Optimizing Energy Consumption in Smart Grid Systems

Abeer Alsheikhi, Amirfarhad Farhadi, Azadeh Zamanifar

Comments arXiv admin note: text overlap with arXiv:2510.17380 by other authors

2602.18530 2026-02-24 cs.CV

Image-Based Classification of Olive Varieties Native to Turkiye Using Multiple Deep Learning Architectures: Analysis of Performance, Complexity, and Generalization

Hatice Karatas, Irfan Atabas

2602.18528 2026-02-24 cs.LG cs.SD

Audio-Visual Continual Test-Time Adaptation without Forgetting

Sarthak Kumar Maharana, Akshay Mehra, Bhavya Ramakrishna, Yunhui Guo, Guan-Ming Su

2602.18525 2026-02-24 cs.CV cs.LG

Do Generative Metrics Predict YOLO Performance? An Evaluation Across Models, Augmentation Ratios, and Dataset Complexity

Vasile Marian, Yong-Bin Kang, Alexander Buddery

Comments 23 pages, 13 figures, includes appendix

2602.18521 2026-02-24 cs.LG cs.AI

AdaptStress: Online Adaptive Learning for Interpretable and Personalized Stress Prediction Using Multivariate and Sparse Physiological Signals

Xueyi Wang, Claudine J. C. Lamoth, Elisabeth Wilhelm

2602.18520 2026-02-24 cs.CV cs.AI

Sketch2Feedback: Grammar-in-the-Loop Framework for Rubric-Aligned Feedback on Student STEM Diagrams

Aayam Bansal

2602.18519 2026-02-24 cs.LG cs.CV

Wide Open Gazes: Quantifying Visual Exploratory Behavior in Soccer with Pose Enhanced Positional Data

Joris Bekkers

2602.18518 2026-02-24 cs.LG stat.ME stat.ML

Measuring the Prevalence of Policy Violating Content with ML Assisted Sampling and LLM Labeling

Attila Dobi, Aravindh Manickavasagam, Benjamin Thompson, Xiaohan Yang, Faisal Farooq

Comments 8 pages

2602.18515 2026-02-24 cs.LG cs.AI

Weak-Form Evolutionary Kolmogorov-Arnold Networks for Solving Partial Differential Equations

Bongseok Kim, Jiahao Zhang, Guang Lin

2602.18505 2026-02-24 cs.CV

Suppression or Deletion: A Restoration-Based Representation-Level Analysis of Machine Unlearning

Yurim Jang, Jaeung Lee, Dohyun Kim, Jaemin Jo, Simon S. Woo

2602.18504 2026-02-24 cs.CV cs.AI

A Computer Vision Framework for Multi-Class Detection and Tracking in Soccer Broadcast Footage

Daniel Tshiani

Comments Presented at the Robyn Rafferty Mathias Reseaerch Conference. Additional Information available at: https://DGT-International.com

2602.18500 2026-02-24 cs.CV cs.ET cs.HC

Scaling Ultrasound Volumetric Reconstruction via Mobile Augmented Reality

Kian Wei Ng, Yujia Gao, Deborah Khoo, Ying Zhen Tan, Chengzheng Mao, Haojie Cheng, Andrew Makmur, Kee Yuan Ngiam, Serene Goh, Eng Tat Khoo

Comments Submitted to MICCAI 2026

2602.18496 2026-02-24 cs.CV

A Patient-Specific Digital Twin for Adaptive Radiotherapy of Non-Small Cell Lung Cancer

Anvi Sud, Jialu Huang, Gregory R. Hart, Keshav Saxena, John Kim, Lauren Tressel, Jun Deng

详情

英文摘要

Radiotherapy continues to become more precise and data dense, with current treatment regimens generating high frequency imaging and dosimetry streams ideally suited for AI driven temporal modeling to characterize how normal tissues evolve with time. Each fraction in biologically guided radiotherapy(BGRT) treated non small cell lung cancer (NSCLC) patients records new metabolic, anatomical, and dose information. However, clinical decision making is largely informed by static, population based NTCP models which overlook the dynamic, unique biological trajectories encoded in sequential data. We developed COMPASS (Comprehensive Personalized Assessment System) for safe radiotherapy, functioning as a temporal digital twin architecture utilizing per fraction PET, CT, dosiomics, radiomics, and cumulative biologically equivalent dose (BED) kinetics to model normal tissue biology as a dynamic time series process. A GRU autoencoder was employed to learn organ specific latent trajectories, which were classified via logistic regression to predict eventual CTCAE grade 1 or higher toxicity. Eight NSCLC patients undergoing BGRT contributed to the 99 organ fraction observations covering 24 organ trajectories (spinal cord, heart, and esophagus). Despite the small cohort, intensive temporal phenotyping allowed for comprehensive analysis of individual dose response dynamics. Our findings revealed a viable AI driven early warning window, as increasing risk ratings occurred from several fractions before clinical toxicity. The dense BED driven representation revealed biologically relevant spatial dose texture characteristics that occur before toxicity and are averaged out with traditional volume based dosimetry. COMPASS establishes a proof of concept for AI enabled adaptive radiotherapy, where treatment is guided by a continually updated digital twin that tracks each patients evolving biological response.

URL PDF HTML ☆

赞 0 踩 0

2602.18494 2026-02-24 cs.AI cs.LG

On the Dynamics of Observation and Semantics

Xiu Li

2602.18493 2026-02-24 cs.LG cs.AI

Learning to Remember: End-to-End Training of Memory Agents for Long-Context Reasoning

Kehao Zhang, Shangtong Gui, Sheng Yang, Wei Chen, Yang Feng

2602.18487 2026-02-24 cs.CL cs.LG

The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

Ihor Stepanov, Mykhailo Shtopko, Dmytro Vodianytskyi, Oleksandr Lukashov

Comments 13 pages, 1 figure, 4 tables

2602.18486 2026-02-24 cs.LG eess.SP stat.ML

Support Vector Data Description for Radar Target Detection

Jean Pinsolle, Yadang Alexis Rouzoumka, Chengfang Ren, Chistèle Morisseau, Jean-Philippe Ovarlez

Comments 5 pages, 2 figures, to appear in Acoustics, Speech and Signal Processing (ICASSP), 2026 IEEE International Conference on, Barcelona, Spain, May 2026

2602.18472 2026-02-24 cs.LG cs.AI q-bio.QM

Physiologically Informed Deep Learning: A Multi-Scale Framework for Next-Generation PBPK Modeling

Shunqi Liu, Han Qiu, Tong Wang

2602.18465 2026-02-24 cs.LG stat.ML

Revisiting the Seasonal Trend Decomposition for Enhanced Time Series Forecasting

Sanjeev Panta, Xu Yuan, Li Chen, Nian-Feng Tzeng

Comments 5 pages, accepted at 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2026)

2602.18450 2026-02-24 cs.CL cs.IT cs.LG math.IT

Asymptotic Semantic Collapse in Hierarchical Optimization

Faruk Alpay, Bugra Kilictas

Comments 23 pages, 2 figures. Includes a dataset-free benchmark with full metric reporting

AI 大模型

视觉与机器人

科学与医疗

Ensemble Prediction of Task Affinity for Efficient Multi-Task Learning

BloomNet: Exploring Single vs. Multiple Object Annotation for Flower Recognition Using YOLO Variants

Luna-2: Scalable Single-Token Evaluation with Small Language Models

Hierarchical Reward Design from Language: Enhancing Alignment of Agent Behavior with Human Specifications

Learning Beyond Optimization: Stress-Gated Dynamical Regime Regulation in Autonomous Systems

Sub-City Real Estate Price Index Forecasting at Weekly Horizons Using Satellite Radar and News Sentiment

Design and Biomechanical Evaluation of a Lightweight Low-Complexity Soft Bilateral Ankle Exoskeleton

Rodent-Bench

Fairness-Aware Partial-label Domain Adaptation for Voice Classification of Parkinson's and ALS

Morphological Addressing of Identity Basins in Text-to-Image Diffusion Models

Deep Reinforcement Learning for Optimizing Energy Consumption in Smart Grid Systems

Image-Based Classification of Olive Varieties Native to Turkiye Using Multiple Deep Learning Architectures: Analysis of Performance, Complexity, and Generalization

Audio-Visual Continual Test-Time Adaptation without Forgetting

Do Generative Metrics Predict YOLO Performance? An Evaluation Across Models, Augmentation Ratios, and Dataset Complexity

AdaptStress: Online Adaptive Learning for Interpretable and Personalized Stress Prediction Using Multivariate and Sparse Physiological Signals

Sketch2Feedback: Grammar-in-the-Loop Framework for Rubric-Aligned Feedback on Student STEM Diagrams

Wide Open Gazes: Quantifying Visual Exploratory Behavior in Soccer with Pose Enhanced Positional Data

Measuring the Prevalence of Policy Violating Content with ML Assisted Sampling and LLM Labeling

Weak-Form Evolutionary Kolmogorov-Arnold Networks for Solving Partial Differential Equations

Suppression or Deletion: A Restoration-Based Representation-Level Analysis of Machine Unlearning

A Computer Vision Framework for Multi-Class Detection and Tracking in Soccer Broadcast Footage

Scaling Ultrasound Volumetric Reconstruction via Mobile Augmented Reality

A Patient-Specific Digital Twin for Adaptive Radiotherapy of Non-Small Cell Lung Cancer

On the Dynamics of Observation and Semantics

Learning to Remember: End-to-End Training of Memory Agents for Long-Context Reasoning

The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

Support Vector Data Description for Radar Target Detection

Physiologically Informed Deep Learning: A Multi-Scale Framework for Next-Generation PBPK Modeling

Revisiting the Seasonal Trend Decomposition for Enhanced Time Series Forecasting

Asymptotic Semantic Collapse in Hierarchical Optimization