帮助关于我们

返回检索结果

基于预训练语言模型的健康谣言检测
Health Rumor Detection based on Pre-Trained Language Model

查看参考文献17篇

许诺赵薇尚柯源陈浩宇

文摘	当前大多数谣言检测主要面向社交媒体数据,所处理文本序列较短,然而面向包含多个句子的段落或长序列文本篇章输入时,因不能提取有效特征进而影响模型识别效果.为获取谣言检测的有效信息,文章提出基于I-BERT-BiLSTM (Improved-BERT-BiLSTM)的健康类谣言检测方法,通过提取文档级长序列文本的摘要,并输入到以多层注意力机制为框架的深层神经网络进行特征提取,最后输入到BiLSTM进行谣言分类.实验结果表明:文章提出的I-BERT-BiLSTM模型在自建健康类谣言数据集与公开数据集上达到了97.75%和91.15%的准确率.
其他语种文摘	Currently,most studies on rumor detection mainly focus on social media data and the length of text sequence is short.We argue that existing methods could not capture effective features from health rumors with long texts and then affect the validity of methods.To solve this,we propose an improved BERT-BiLSTM model (I-BERT-BiLSTM),which leverages effective information extracted from texts with long sequences for the health rumor detection.We first conduct text summarization from document-level text.The results are regarded as the input of the deep network model with multi-layer self-attention mechanisms for feature extraction.Finally,we feed the output into BiLSTM for rumor classification.The experimental results show that the model we proposed in this paper achieves 97.75% and 91.15% accuracy on the self-built health rumor data and public data.
来源	系统科学与数学 ,2022,42(10):2582-2589 【核心库】
DOI	10.12341/jssms22646KSS
关键词	谣言检测 ; 预训练语言模型 ; 摘要提取 ; I-BERT-BiLSTM
地址	中国传媒大学, 北京, 100024
语种	中文
文献类型	研究性论文
ISSN	1000-0577
学科	自动化技术、计算机技术
基金	中国传媒大学中央高校基本科研业务费专项
文献收藏号	CSCD:7356056

参考文献共 17 共1页

引证文献 2 篇

1 王友卫基于事件-词语-特征异质图的微博谣言检测新方法中文信息学报,2023,37(9):161-174
CSCD被引 0 次

2 张奕林基于BERT的短文本分类模型及在铁路CIR设备故障诊断中的应用系统科学与数学,2024,44(1):115-131
CSCD被引 0 次

显示所有2篇文献

论文科学数据集

PlumX Metrics

相关文献
作者相关关键词相关参考文献相关

版权所有 ©2008 中国科学院文献情报中心制作维护：中国科学院文献情报中心
地址：北京中关村北四环西路33号邮政编码：100190 联系电话：(010)82627496 E-mail:cscd@mail.las.ac.cn 京ICP备05002861号-4 | 京公网安备11010802043238号