arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.21041 2026-04-24 cs.CV

Projected Gradient Unlearning for Text-to-Image Diffusion Models: Defending Against Concept Revival Attacks

Aljalila Aladawi, Mohammed Talha Alam, Fakhri Karray

详情

英文摘要

Machine unlearning for text-to-image diffusion models aims to selectively remove undesirable concepts from pre-trained models without costly retraining. Current unlearning methods share a common weakness: erased concepts return when the model is fine-tuned on downstream data, even when that data is entirely unrelated. We adapt Projected Gradient Unlearning (PGU) from classification to the diffusion domain as a post-hoc hardening step. By constructing a Core Gradient Space (CGS) from the retain concept activations and projecting gradient updates into its orthogonal complement, PGU ensures that subsequent fine-tuning cannot undo the achieved erasure. Applied on top of existing methods (ESD, UCE, Receler), the approach eliminates revival for style concepts and substantially delays it for object concepts, running in roughly 6 minutes versus the ~2 hours required by Meta-Unlearning. PGU and Meta-Unlearning turn out to be complementary: which performs better depends on how the concept is encoded, and retain concept selection should follow visual feature similarity rather than semantic grouping.

URL PDF HTML ☆

赞 0 踩 0

2604.21036 2026-04-24 cs.AI

Who Defines Fairness? Target-Based Prompting for Demographic Representation in Generative Models

Marzia Binta Nizam, James Davis

2604.21032 2026-04-24 cs.CV

Unlocking Multi-Spectral Data for Multi-Modal Models with Guided Inputs and Chain-of-Thought Reasoning

Dahun Kim, Ganesh Satish Mallya, Anelia Angelova

Comments Accepted to IGARSS 2026

2604.21031 2026-04-24 cs.LG cs.AI

Synthetic Data in Education: Empirical Insights from Traditional Resampling and Deep Generative Models

Tapiwa Amion Chinodakufa, Ashfaq Ali Shafin, Khandaker Mamun Ahmed

2604.21028 2026-04-24 cs.LG cs.AI cs.CV

A Deep U-Net Framework for Flood Hazard Mapping Using Hydraulic Simulations of the Wupper Catchment

Christian Lammers, Fernando Arévalo, Leonie Märker-Neuhaus, Daniel Heinenberg, Christian Förster, Karl-Heinz Spies

Comments 18 Pages, 9 Figures

2604.21027 2026-04-24 cs.AI

HypEHR: Hyperbolic Modeling of Electronic Health Records for Efficient Question Answering

Yuyu Liu, Sarang Rajendra Patil, Mengjia Xu, Tengfei Ma

Comments Accepted by Findings of ACL 2026

2604.21018 2026-04-24 cs.AI

Adaptive Test-Time Compute Allocation with Evolving In-Context Demonstrations

Bowen Zuo, Dongruo Zhou, Yinglun Zhu

2604.21016 2026-04-24 cs.LG cs.AI math.OC

SGD at the Edge of Stability: The Stochastic Sharpness Gap

Fangshuo Liao, Afroditi Kolomvaki, Anastasios Kyrillidis

2604.21011 2026-04-24 cs.CV q-bio.NC

Micro-DualNet: Dual-Path Spatio-Temporal Network for Micro-Action Recognition

Naga VS Raviteja Chappa, Evangelos Sariyanidi, Lisa Yankowitz, Gokul Nair, Casey J. Zampella, Robert T. Schultz, Birkan Tunç

Comments Accepted to International Conference on Automatic Face and Gesture Recognition (FG)

2604.21008 2026-04-24 cs.CV

Linear Image Generation by Synthesizing Exposure Brackets

Yuekun Dai, Zhoutong Zhang, Shangchen Zhou, Nanxuan Zhao

Comments accepted by CVPR2026

2604.21006 2026-04-24 cs.AI cs.LG

Deep FinResearch Bench: Evaluating AI's Ability to Conduct Professional Financial Investment Research

Mirazul Haque, Antony Papadimitriou, Samuel Mensah, Zhiqiang Ma, Zhijin Guo, Joy Prakash Sain, Simerjot Kaur, Charese Smiley, Xiaomo Liu

2604.20993 2026-04-24 cs.LG

Droplet-LNO: Physics-Informed Laplace Neural Operators for Accurate Prediction of Droplet Spreading Dynamics on Complex Surfaces

Ganesh Sahadeo Meshram, Partha Pratim Chakrabarti, Suman Chakraborty

Comments 36 pages, 8 figures

2604.20990 2026-04-24 cs.RO cs.SY eess.SY

A Survey of Legged Robotics in Non-Inertial Environments: Past, Present, and Future

I-Chia Chang, Xinyan Huang, Tzu-Yuan Lin, Sangli Teng, Wenjing Li, Maani Ghaffari, Jingang Yi, Yan Gu

2604.20987 2026-04-24 cs.AI

Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks

Xiyang Wu, Zongxia Li, Guangyao Shi, Alexander Duffy, Tyler Marques, Matthew Lyle Olson, Tianyi Zhou, Dinesh Manocha

Comments 26 pages

2604.20983 2026-04-24 cs.CV cs.AI cs.CL

Thinking Like a Botanist: Challenging Multimodal Language Models with Intent-Driven Chain-of-Inquiry

Syed Nazmus Sakib, Nafiul Haque, Shahrear Bin Amin, Hasan Muhammad Abdullah, Md. Mehedi Hasan, Mohammad Zabed Hossain, Shifat E. Arman

Comments Accepted at ACL 2026 Findings

2604.20949 2026-04-24 cs.LG q-fin.TR stat.ME stat.ML

Early Detection of Latent Microstructure Regimes in Limit Order Books

Prakul Sunil Hiremath, Vruksha Arun Hiremath

Comments 48 pages, 7 figures. Combines theoretical guarantees (identifiability and early-detection bounds), 200-run simulation study, and preliminary real-data evaluation on BTC/USDT limit order books. Code and data available

2604.20944 2026-04-24 cs.LG cs.AI

LAF-Based Evaluation and UTTL-Based Learning Strategies with MIATTs

Yongquan Yang

2604.20943 2026-04-24 cs.LG

SCM: Sleep-Consolidated Memory with Algorithmic Forgetting for Large Language Models

Saish Sachin Shinde

Comments 5 figures. Submitted April 2026

2604.20938 2026-04-24 cs.LG cs.AI

HARBOR: Automated Harness Optimization

Biswa Sengupta, Jinhua Wang

2604.20937 2026-04-24 cs.LG

Sink-Token-Aware Pruning for Fine-Grained Video Understanding in Efficient Video LLMs

Kibum Kim, Jiwan Kim, Kyle Min, Yueqi Wang, Jinyoung Moon, Julian McAuley, Chanyoung Park

Comments Under Review

2604.20935 2026-04-24 cs.LG cs.AI

Data-Driven Open-Loop Simulation for Digital-Twin Operator Decision Support in Wastewater Treatment

Gary Simethy, Daniel Ortiz Arroyo, Petar Durdevic

Comments 18 pages, 10 figures, 9 tables

2604.20933 2026-04-24 cs.LG cs.AI

IRIS: Interpolative Rényi Iterative Self-play for Large Language Model Fine-Tuning

Wenjie Liao, Like Wu, Liangjie Zhao, Shihui Xu, Shigeru Fujimura

2604.20928 2026-04-24 cs.LG cs.AI

Domain-Aware Hierarchical Contrastive Learning for Semi-Supervised Generalization Fault Diagnosis

Junyu Ren, Wensheng Gan, Philip S Yu

Comments Preprint

2604.20925 2026-04-24 cs.LG

Unsupervised Learning of Inter-Object Relationships via Group Homomorphism

Kyotaro Ushida, Takayuki Komatsu, Yoshiyuki Ohmura, Yasuo Kuniyoshi

Comments Preprint. Under review at ICDL 2026

2604.20924 2026-04-24 cs.LG

Clinically Interpretable Sepsis Early Warning via LLM-Guided Simulation of Temporal Physiological Dynamics

Weizhi Nie, Zhen Qu, Weijie Wang, Chunpei Li, Ke Lu, Bingyang Zhou, Hongzhi Yu

2604.20923 2026-04-24 cs.LG

ILDR: Geometric Early Detection of Grokking

Shreel Golwala

详情

英文摘要

Grokking describes a delayed generalization phenomenon in which a neural network achieves perfect training accuracy long before validation accuracy improves, followed by an abrupt transition to strong generalization. Existing detection signals are indirect: weight norm reflects parameter-space regularization and consistently lags the transition, while GrokFast's slow gradient EMA, used without gradient amplification, is unstable across seeds with standard deviation exceeding mean lead time. We propose the Inter/Intra-class Distance Ratio (ILDR), a geometric metric computed on second-to-last layer representations as the ratio of inter-class centroid separation to intra-class scatter. ILDR provides an early detection signal: it rises and crosses a threshold at 2.5 times its baseline before the grokking transition appears in validation accuracy, indicating early geometric reorganization in representation space. Grounded in Fisher's linear discriminant criterion, ILDR requires no eigendecomposition and runs in O(|C|^2 + N). It is evaluated exclusively on held-out data, making it robust to memorization effects. Across modular arithmetic and permutation group composition (S5), ILDR leads the grokking transition by 9 to 73 percent of the training budget, with lead time increasing with task algebraic complexity. Over eight random seeds, ILDR leads by 950 +/- 250 steps with a coefficient of variation of 26 percent, and post-grokking variance drops by 1696 times, consistent with a sharp phase transition in representation space. Using ILDR as an early stopping trigger reduces training by 18.6 percent on average. Optimizer interventions triggered at the ILDR threshold demonstrate bidirectional control over the transition, suggesting ILDR tracks representational conditions underlying generalization rather than a downstream correlate.

URL PDF HTML ☆

赞 0 踩 0

2604.20921 2026-04-24 cs.LG

Validating a Deep Learning Algorithm to Identify Patients with Glaucoma using Systemic Electronic Health Records

John Xiang, Rohith Ravindranath, Sophia Y. Wang

Comments submitted to AMIA Annual Symposium 2026

2604.20920 2026-04-24 cs.LG

Forget, Then Recall: Learnable Compression and Selective Unfolding via Gist Sparse Attention

Yuzhen Mao, Michael Y. Li, Emily B. Fox

2604.20917 2026-04-24 cs.LG cs.AI cs.CL cs.PL cs.SE

The Path Not Taken: Duality in Reasoning about Program Execution

Eshgin Hasanov, Md Mahadi Hassan Sibat, Santu Karmaker, Aashish Yadavally

Comments Accepted to ACL 2026 Main Conference

2604.20915 2026-04-24 cs.LG cs.AI cs.CL cs.SE math.OC

Absorber LLM: Harnessing Causal Synchronization for Test-Time Training

Zhixin Zhang, Shabo Zhang, Chengcan Wu, Zeming Wei, Meng Sun