帮助关于我们

返回检索结果

一种分布式用户浏览点击模型算法
A Distributed User Browse Click Model Algorithm

查看参考文献22篇

张浩盛伦 ^1,2 李翀 ¹ ^* 柯勇 ¹ 张士波 ¹

文摘	为从海量搜索点击日志中快速挖掘用户行为,提出一种分布式用户浏览点击模型(UBM)算法。原始 UBM 算法求出的检验度参数E 只与搜索结果文档所在排序位置以及上一文档的点击位置有关,且非常稳定,基于此特性,将EM 迭代求解转换为抽样估计检验度以求解吸引度的分布式UBM 算法。在Spark 数据平台上进行仿真,结果表明,与原始UBM 算法相比,该算法能够解决点击日志中存在的严重数据倾斜问题,且运行效率较高。
其他语种文摘	A distributed User Browse Click Model(UBM) algorithm is proposed to quickly mine user behavior from massive search click logs.The validation parameter E derived from the original UBM algorithm is only related to the ranking position of the search results and the click position of the previous document,and is very stable.Based on this characteristic,the EM iteration solution is transformed into a distributed UBM algorithm which estimates the test degree by sampling to solve the attraction degree.Results of simulation on Spark data platform show that compared with the original UBM algorithm,the proposed algorithm can solve the serious data skew problem in click log,and has higher efficiency.
来源	计算机工程 ,2019,45(3):1-6 【扩展库】
DOI	10.19678/j.issn.1000-3428.0050119
关键词	点击日志 ; 点击模型 ; 用户浏览点击模型算法 ; 搜索引擎 ; Spark 平台
地址	1. 中国科学院计算机网络信息中心, 北京, 100190 2. 中国科学院大学, 北京, 100190
语种	中文
文献类型	研究性论文
ISSN	1000-3428
学科	自动化技术、计算机技术
基金	中国科学院信息化专项
文献收藏号	CSCD:6504267

参考文献共 22 共2页

引证文献 1 篇

1 宋匡时一个轻量级分布式机器学习系统的设计与实现计算机工程,2020,46(1):201-207
CSCD被引 0 次

显示所有1篇文献

论文科学数据集

PlumX Metrics

相关文献
作者相关关键词相关参考文献相关

版权所有 ©2008 中国科学院文献情报中心制作维护：中国科学院文献情报中心
地址：北京中关村北四环西路33号邮政编码：100190 联系电话：(010)82627496 E-mail:cscd@mail.las.ac.cn 京ICP备05002861号-4 | 京公网安备11010802043238号