广西科学

引用本文：

梁骁,黄文明,姚俊,温雅媛,邓珍荣.结合多注意力和条件变分自编码器的宋词生成模型[J].广西科学,2022,29(2):308-315. [点击复制]
LIANG Xiao,HUANG Wenming,YAO Jun,WEN Yayuan,DENG Zhenrong.Song Ci Generation Model Based on Multi-attention and Conditional Variational Auto-encoder[J].Guangxi Sciences,2022,29(2):308-315. [点击复制]

【打印本页】【在线阅读全文】【下载PDF全文】【查看/发表评论】【下载PDF阅读器】【关闭】

本文已被：浏览 348次下载 391次	码上扫一扫！
结合多注意力和条件变分自编码器的宋词生成模型
梁骁¹, 黄文明^1,2, 姚俊³, 温雅媛⁴, 邓珍荣^1,2
0 字体:加大+\|默认\|缩小-
(1.桂林电子科技大学计算机与信息安全学院, 广西桂林 541004;2.广西图形图像与智能处理重点实验室, 广西桂林 541004;3.广西壮族自治区高级人民法院, 广西南宁 530000;4.广西师范大学电子工程学院, 广西桂林 541004)

摘要:

现有的诗词生成方法忽略了风格的重要性。另外，由于宋词大部分词牌词句较多，逐句生成宋词的过程中容易产生上下文缺乏连贯性的现象，在上下文连贯性方面仍存在提升空间。针对这两个问题，在编码解码的文本生成框架基础上，引入自注意力机制的句子表示算法计算多注意力权重矩阵，用于提取词句的多种重要语义特征，让模型更多地关注上文的显著信息来提高上下文连贯性。引入条件变分自编码器(CVAE)将每条宋词数据转化为隐空间中不同风格特征的高维高斯分布，从各自的分布中采样隐变量来控制宋词的风格。由于自构建的宋词语料库缺少完整风格分类标签，使用具有风格标签的宋词微调BERT模型，将其作为风格分类器标注全部的宋词数据。在上述关键技术的基础上实现了宋词生成模型，生成上下文连贯的婉约词以及豪放词。通过与其他4种基准方法进行对比实验，结果表明引入自注意力机制的句子表示算法和条件变分自编码器，在上下文连贯性和风格控制方面有一定的提升。

关键词: 条件变分自编码器宋词风格宋词生成 Bi-GRU 自注意力机制

DOI：10.13656/j.cnki.gxkx.20220526.011

投稿时间：2021-04-16

基金项目:广西科技计划项目(桂科AB20238013)，广西图像图形与智能处理重点实验室培育基地(桂林电子科技大学)开放基金项目(GIIP2011)和广西高校中青年教师科研基础能力提升项目(2019KY0238)资助。

Song Ci Generation Model Based on Multi-attention and Conditional Variational Auto-encoder

LIANG Xiao¹, HUANG Wenming^1,2, YAO Jun³, WEN Yayuan⁴, DENG Zhenrong^1,2

(1.College of Computer Science&Information Security, Guilin University of Electronic Technology, Guilin, Guangxi, 541004, China;2.Guangxi Key Laboratory of Image and Graphic, Intelligent Processing, Guilin, Guangxi, 541004, China;3.Guangxi Zhuang Autonomous Region Higher People Court, Nanning, Guangxi, 530000, China;4.College of Electronic Engineering, Guangxi Normal University, Guilin, Guangxi, 541004, China)

Abstract:

The existing poetry generation methods ignore the importance of style.In addition,since there are the large number of Cipai and phrases in most of Song Ci,it is easy to produce the phenomenon that the context lacks coherence in the process of generating Song Ci sentence by sentence,and there is still space for improvement in context coherence.To solve these two problems,based on the encoding and decoding text generation framework,a sentence representation algorithm based on self-attention mechanism is introduced to calculate multi-attention weight matrix,which is used to extract a variety of important semantic features of words and sentences,so that the model can pay more attention to the significant information above to improve context coherence.The conditional variational self-encoder is introduced to transform each Song Ci data into a high-dimensional Gaussian distribution with different style features in the hidden space,and the hidden variables are sampled from the respective distributions to control the style of Song Ci.Since the self-constructed corpus of Song Ci lacks a complete style classification label,a fine-tuning BERT model of Song Ci with style label is used as a style classifier to label all the data of Song Ci.Based on the above key technologies,the generation model of Song Ci is realized,which generates euphemistic poetry and bold poetry with coherent context.Compared with the other four benchmark methods,the results show that the sentence representation algorithm with self-attention mechanism and the conditional variation self-encoder have certain improvements in context coherence and style control.

Key words: conditional variational auto-encoder Song Ci style generation of Song Ci Bi-GRU self-attention mechanism

用微信扫一扫