
Research on the Acquisition Technology of Film and Television Program Flat Tags
Citation: YIN Fu-lian, XU Rong-ge, LIU Zhi-xin, JI Mei-qi. Research on the Acquisition Technology of Film and Television Program Flat Tags[J]. Introduction of Educational Technology (教育技术导刊), 2019, 18(7): 150-153.
Authors: YIN Fu-lian  XU Rong-ge  LIU Zhi-xin  JI Mei-qi
Institution: School of Information and Communication Engineering, Communication University of China, Beijing 100024, China
Funding: National Natural Science Foundation of China (61801441); National Undergraduate Innovation and Entrepreneurship Training Program (JG18110205)
Abstract: Manual collection of film and television program tags is time-consuming and labor-intensive, and traditional tree-structured tag systems are redundant and incomplete. To address these problems, this paper proposes an automatic tag acquisition technique. Raw Internet data related to a program is first collected by web crawling, then analyzed and mined through text analysis, synonym matching, and database matching to obtain flat program tags. Experimental results show that when 8 to 10 tags are selected, the algorithm achieves an accuracy of 84.3% to 92.4% and a recall of 53.4% to 63.1%, indicating that the acquired tags describe film and television programs well.
Keywords: flat tags  automatic tag acquisition  automatic Web information collection  tag library matching
Received: 2018-11-15
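
The abstract names the pipeline stages (synonym matching, tag library matching, selection of a fixed number of tags) and reports accuracy and recall, but gives no implementation details. The following is a minimal sketch of how such stages might fit together, not the authors' method. The synonym table, the tag library contents, the frequency-based ranking, and the function names `extract_flat_tags` and `precision_recall` are all assumptions made for illustration; it also assumes the reported "accuracy" is the share of selected tags that are correct (precision).

```python
from collections import Counter

# Hypothetical synonym table: maps variant spellings to one canonical tag.
SYNONYMS = {
    "sci-fi": "science fiction",
    "scifi": "science fiction",
    "romcom": "romantic comedy",
}

# Hypothetical tag library: only terms found here are accepted as tags.
TAG_LIBRARY = {"science fiction", "romantic comedy", "space", "adventure", "family"}


def extract_flat_tags(candidate_terms, k=10):
    """Normalize terms, keep those in the tag library, return the k most frequent."""
    normalized = (SYNONYMS.get(t.lower(), t.lower()) for t in candidate_terms)
    counts = Counter(t for t in normalized if t in TAG_LIBRARY)
    return [tag for tag, _ in counts.most_common(k)]


def precision_recall(predicted, reference):
    """Precision: correct share of predicted tags. Recall: covered share of reference tags."""
    predicted, reference = set(predicted), set(reference)
    hits = len(predicted & reference)
    precision = hits / len(predicted) if predicted else 0.0
    recall = hits / len(reference) if reference else 0.0
    return precision, recall


# Terms as they might come out of text analysis of crawled pages (made-up data).
terms = ["Sci-Fi", "space", "scifi", "adventure", "space", "trailer", "science fiction"]
tags = extract_flat_tags(terms, k=2)
precision, recall = precision_recall(
    tags, {"science fiction", "space", "adventure", "family"}
)
print(tags, precision, recall)
```

Because the tags form a flat set rather than a tree, the evaluation reduces to set overlap. Selecting fewer tags tends to raise precision and lower recall, which is consistent with the abstract reporting both figures as ranges over 8 to 10 tags.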