arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.13247 2026-06-13 cs.AI 新提交

EPIG: Emotion-Based Prompting for Personalised Image Generation

EPIG:基于情感提示的个性化图像生成

Emna Othmen, Mohamed Yassine Landolsi, Lotfi Ben Romdhane

发表机构 * MARS Research Lab LR17ES05, ISITCom, University of Sousse(苏塞大学ISITCom学院MARS研究实验室LR17ES05)

AI总结 提出EPIG方法,利用心理学效价-唤醒模型在提示层面增强情感表达,无需训练即可控制生成图像的唤醒度,在10个多样化提示上平均唤醒误差降低14%-17%。

详情
Comments
Submitted to arXiv. 20 pages, 4 figures. Work on emotion-based prompt engineering for text-to-image diffusion models with applications in personalized image generation
AI中文摘要

文本到图像扩散模型在从自然语言提示合成高质量图像方面取得了令人印象深刻的结果。然而,常用的提示策略仍然相对通用,限制了模型准确表达情感意图和细微情感属性的能力。本文提出EPIG,一种在图像生成之前在提示层面增强情感表达性的方法。基于心理学知情的情感表示(效价-唤醒)并利用结构化的、角色感知的提示丰富化,EPIG在不修改或重新训练图像生成主干的情况下丰富提示的情感相关组件。由此产生的情感感知提示引导生成过程朝向更情感连贯的视觉输出,在控制唤醒方面特别有效。EPIG轻量级、无需训练,非常适合资源受限和个性化图像生成场景。在10个多样化提示的基准测试上的实验结果表明,与强基线(包括朴素插入和基于LLM的提示扩展)相比,EPIG将平均唤醒误差分别降低了14%和12%。这些改进具有统计显著性。EPIG还保持了效价对齐和语义一致性,如CLIPScore所测量并由消融研究所支持。在包含人类、儿童或动物等显式主体的提示上效果更为显著,误差降低达到17%,突出了所提出方法的主题敏感行为。

英文摘要

Text-to-image diffusion models have achieved impressive results in synthesizing high-quality images from natural language prompts. However, commonly used prompting strategies remain relatively generic, limiting the model's ability to accurately express emotional intent and nuanced affective attributes. This work proposes EPIG, a method that enhances emotional expressiveness at the prompt level prior to image generation. Grounded in psychologically informed emotion representations (valence-arousal) and leveraging structured, role-aware prompt enrichment, EPIG enriches emotion-related components of prompts without modifying or retraining the image generation backbone. The resulting emotion-aware prompts guide the generative process toward more emotionally coherent visual outputs, with particular effectiveness in controlling arousal. EPIG is lightweight, training-free, and well suited for resource-constrained and personalized image generation scenarios. Experimental results on a benchmark of 10 diverse prompts show that EPIG reduces mean arousal error compared to strong baselines, including naive insertion and LLM-based prompt expansion, with reductions of 14% and 12%, respectively. These improvements are statistically significant. EPIG also preserves valence alignment and semantic consistency, as measured by CLIPScore and supported by ablation studies. The effect is more pronounced on prompts containing explicit subjects such as humans, children, or animals, where the reduction reaches 17%, highlighting the subject-sensitive behavior of the proposed method.

2606.12651 2026-06-13 cs.LG q-bio.QM 新提交

Physics-Aware Auxiliary Losses Improve Out-of-Distribution Generalization of a GNN Synthesizability Filter

物理感知辅助损失提升图神经网络可合成性滤波器的分布外泛化能力

Riya Bisht, Dhruv Agarwal

AI总结 通过在GNN上添加基于Bertz指数的拓扑复杂度回归和MMFF94力场应变能软惩罚作为辅助损失,在分布外数据上小幅但显著提升了可合成性滤波器的AUC(最高+0.0066)。

详情
AI中文摘要

机器学习药物发现流程越来越依赖生成模型,这些模型提出的分子远离用于训练下游可合成性滤波器的数据。现有滤波器(SAScore、SCScore、RAscore、DeepSA)纯粹基于统计,在分布外(OOD)场景下性能下降。我们探究廉价的闭式物理先验,作为图神经网络(GNN)的辅助监督,是否能改善OOD泛化。我们在GINE骨干网络上添加两个辅助损失:基于Bertz指数的拓扑复杂度回归,以及基于MMFF94力场能量的应变能软惩罚。在由SAScore阈值标注的65,177个分子语料库(HIV、Tox21、COCONUT)上,我们复现了强分布内基线,然后在单源OOD划分(在类药HIV+Tox21上训练,在COCONUT天然产物上测试)上评估4路消融实验(基线/+复杂度/+应变/+两者),重复5个种子并采用配对bootstrap置信区间。所有三个物理感知变体相比基线(平均OOD AUC 0.9774)均带来微小但统计显著的OOD提升:+复杂度Delta = +0.0060(95% CI [+0.0023, +0.0102]),+应变Delta = +0.0032([+0.0008, +0.0052]),+两者Delta = +0.0066([+0.0038, +0.0093]);每个区间均不包含零,且组合效果最佳。各变体在分布内表现无差异,因此效果仅在OOD评估下可见。我们明确指出效果是适度的,并报告一个警示性方法学发现:该实验的单种子版本产生了定性不同(非单调)的故事,未能在多种子评估中复现。

英文摘要

Machine-learning drug-discovery pipelines increasingly rely on generative models that propose molecules far from the data used to train downstream synthesizability filters. Existing filters (SAScore, SCScore, RAscore, DeepSA) are purely statistical and degrade in exactly this out-of-distribution (OOD) regime. We ask whether cheap, closed-form physical priors, used as auxiliary supervision on a graph neural network (GNN), improve OOD generalization. We add two auxiliary losses to a GINE backbone: a topological complexity regression supervised by the Bertz index, and a strain-energy soft penalty supervised by MMFF94 force-field energy. On a 65,177-molecule corpus (HIV, Tox21, COCONUT) labeled by SAScore thresholds we reproduce a strong in-distribution baseline, then evaluate a 4-way ablation (baseline / +complexity / +strain / +both) on a single-source OOD split (train on drug-like HIV+Tox21, test on COCONUT natural products), repeated over 5 seeds with paired bootstrap confidence intervals. All three physics-aware variants give a small but statistically significant OOD improvement over the baseline (mean OOD AUC 0.9774): +complexity Delta = +0.0060 (95% CI [+0.0023, +0.0102]), +strain Delta = +0.0032 ([+0.0008, +0.0052]), +both Delta = +0.0066 ([+0.0038, +0.0093]); every interval excludes zero, and the combination is best. The variants are indistinguishable in-distribution, so the effect is visible only under OOD evaluation. We are explicit that the effects are modest, and we report a cautionary methodological finding: a single-seed version of this experiment produced a qualitatively different (non-monotone) story that did not survive multi-seed evaluation.

2412.13012 2026-06-13 cs.LG cond-mat.mtrl-sci cond-mat.str-el

Deep Learning Based Superconductivity: Prediction and Experimental Tests

Daniel Kaplan, Adam Zhang, Joanna Blawat, Rongying Jin, Robert J. Cava, Viktor Oudovenko, Gabriel Kotliar, Anirvan M. Sengupta, Weiwei Xie

详情
Journal ref
Eur. Phys. J. Plus (2025) 140:58
Comments
14 pages + 2 appendices + references. EPJ submission
英文摘要

The discovery of novel superconducting materials is a longstanding challenge in materials science, with a wealth of potential for applications in energy, transportation, and computing. Recent advances in artificial intelligence (AI) have enabled expediting the search for new materials by efficiently utilizing vast materials databases. In this study, we developed an approach based on deep learning (DL) to predict new superconducting materials. We have synthesized a compound derived from our DL network and confirmed its superconducting properties in agreement with our prediction. Our approach is also compared to previous work based on random forests (RFs). In particular, RFs require knowledge of the chemical properties of the compound, while our neural net inputs depend solely on the chemical composition. With the help of hints from our network, we discover a new ternary compound $\textrm{Mo}_{20} \textrm{Re}_{6} \textrm{Si}_{4}$, which becomes superconducting below 5.4 K. We further discuss the existing limitations and challenges associated with using AI to predict and, along with potential future research directions.

2606.13408 2026-06-13 cs.SC 新提交

A catalog of fast matrix multiplication algorithms with frontier-closure search

基于前沿闭包搜索的快速矩阵乘法算法目录

Benoit Chatain Lacelle

AI总结 提出前沿闭包搜索方法,统一编录至32×32×32的快速矩阵乘法算法,通过轴翻转、Kronecker积等重组操作,并明确非重叠属性以区分双线性核心发现与组合。

详情
AI中文摘要

2022-2026年间小规模矩阵乘法领域的活跃研究(AlphaTensor 2022, AlphaEvolve 2025, Schwartz–Zwecher 2025)产生了引人注目的个体结果,但这些结果分散在不同领域、归属约定和序列化格式中。另一条互补的工作线——Perminov的开源翻转图框架~\cite{perminov2026fast,perminov2025fast}——则在大规模格式空间上大规模驱动现有构造方法,特别是翻转图和元翻转图搜索,发现了许多新的低秩方案(包括三元整数方案),进一步丰富了本目录必须统一整合的景观。我们提出了一个统一的、机器可检查的目录,涵盖有理数、整数、实数、复数和二元域上形状至\nmpshape{32}{32}{32}的算法,并单独设置交换算法轴(Waksman 1970, Makarov 1986, Rosowski 2019)。该目录的推导通过前沿闭包搜索执行,该搜索通过轴翻转、Kronecker积、轴拼接、偶然乘积、带分配的重组(可选输出剥离和配对融合)以及向下投影来重组目录条目。一个核心方法论要点是非重叠属性:我们的重组不会且无法重新发现手工构造(Strassen, Laderman, Smirnov, AlphaTensor)所围绕的共享双线性乘积。这清晰划分了“寻找更巧妙的双线性核心”与“组合已知核心”两条进展轴线,并解决了文献中的若干归属难题。我们更新了DIS09比较表,按领域拆分并添加交换列,并提供工具以在目录演进时自动重新生成这些表。

英文摘要

The 2022--2026 burst of activity in small-format matrix multiplication (AlphaTensor 2022, AlphaEvolve 2025, Schwartz--Zwecher 2025) has produced striking individual results but scattered them across different fields, attribution conventions, and serialisation formats. A complementary line of work -- Perminov's open-source flip-graph framework~\cite{perminov2026fast,perminov2025fast} -- instead drives existing construction methods, notably flip-graph and \emph{meta-flip-graph} search, at scale across large format spaces, discovering many new low-rank schemes (including ternary-integer ones) that further enrich the landscape this catalog must unify. We present a unified, machine-checkable catalog covering shapes up to \nmpshape{32}{32}{32} over \Rationals, \Integers, \Reals, \Complex, and \Ftwo, with a separate axis for commutative algorithms (Waksman 1970, Makarov 1986, Rosowski 2019). Derivation over this catalog is performed by a \emph{frontier-closure search} that recombines catalog entries by axis-flip, Kronecker, axis concatenation, serendipitous products, recombination-with-allocation (with optional output peeling and pair fusion), and downward projection. A central methodological point is the \emph{non-overlap property}: our recombination does not, and cannot, rediscover the shared bilinear products that hand-crafted constructions (Strassen, Laderman, Smirnov, AlphaTensor) are built around. This draws a clean line between the ``find a cleverer bilinear core'' and ``compose known cores'' axes of progress, and resolves several attribution puzzles in the literature. We refresh the DIS09 comparison tables, split per field and with a commutative column, and provide the tooling to regenerate them automatically as the catalog evolves.

2606.12773 2026-06-13 math.RA 新提交

Near-Trinions: Complete Classification of Unital Three-Dimensional Real Associative Algebras

近三元数:三维实结合代数的完全分类

Joel A. Shelton

AI总结 本文完全分类了三维实结合代数(称为近三元数),得到六个同构类,通过根维数分层和皮尔斯分解给出规范表示与不变量。

详情
AI中文摘要

Frobenius定理明确解决了三维实可除代数的问题:不存在这样的代数。但它留下了一个问题——也是本文要解决的——哪些三维实结合代数实际上存在。我们称这些代数为“近三元数”,并在$\RR$-代数同构意义下完全分类它们。答案恰好是六个同构类;我们提供了规范代表元、显式乘法表以及一组足以区分每一对代数的模论不变量。分类通过根维数$d$进行分层:Wedderburn–Artin定理解决了半单层;在$d=1$层,Peirce分解将根生成元定位在对角或非对角角落,分别产生交换和非交换分支,并通过特征符号障碍排除了剩余域$\CC$;在$d=2$层,根据$\Jac(A)^2$是否为零分裂。论证不仅恢复了六个类,还解释了为什么根维数取值于$\{0,1,2\}$,为什么$d=1$情形在交换性上分裂,以及为什么不存在剩余域为$\CC$的近三元数。最后提出了一个后续研究方向。

英文摘要

Frobenius' theorem settles, definitively, the question of three-dimensional real division algebras: there are none. What it leaves open -- and what this paper addresses -- is the question of which three-dimensional real associative algebras actually exist. We call these \emph{near-trinions} and classify them completely, up to $\mathbb{R}$-algebra isomorphism. The answer is exactly six isomorphism classes; we provide canonical representatives, explicit multiplication tables, and a collection of module-theoretic invariants sufficient to distinguish every pair. The classification proceeds by stratifying on the radical dimension $d$: Wedderburn--Artin resolves the semisimple stratum; in the $d=1$ stratum, the Peirce decomposition locates the radical generator in either a diagonal or off-diagonal corner, producing the commutative and non-commutative branches respectively and ruling out the residue field $\mathbb{C}$ by a characteristic sign obstruction; and the $d=2$ stratum splits on whether Jac$(A)^2$ vanishes. The argument not only recovers the six classes but explains why the radical dimension takes values in $\{0,1,2\}$, why the $d=1$ case splits on commutativity, and why no near-trinion with residue field $\mathbb{C}$ can exist. One direction for subsequent work is proposed.