LDA单词图像表示的蒙古文古籍图像关键词检索方法 LDA-Based Word Image Representation for Keyword Spotting on Historical Mongolian Documents期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

LDA单词图像表示的蒙古文古籍图像关键词检索方法

引用本文：	白淑霞,鲍玉来.LDA单词图像表示的蒙古文古籍图像关键词检索方法[J].现代情报,2017,37(7):51.

作者姓名：	白淑霞鲍玉来

作者单位：	内蒙古大学图书馆, 内蒙古呼和浩特 010021

基金项目：	国家自然科学基金项目“基于领域本体的蒙古文数字资源整合机制研究”（项目编号：71163029）。

摘要：	目的]为了克服传统视觉词袋方法（Bag-of-Visual-Words）中忽略视觉单词间的空间关系和语义信息等问题。方法]本文提出一种与视觉语言模型相结合的基于LDA主题模型，并采用查询似然模型实现检索。结果]实验数据表明，本文所提出的基于LDA的表示方法可以高效、准确地解决蒙古文古籍的关键词检索问题。结论]同时，该方法的性能比BoVW方法有显著提高。
关键词：	隐含狄利克雷分配（LDA）主题模型视觉语言模型蒙古文古籍关键词检索查询似然模型
LDA-Based Word Image Representation for Keyword Spotting on Historical Mongolian Documents

Authors:	Bai Shuxia Bao Yulai

Institution:	Library, Inner Mongolia University, Hohhot 010021, China

Abstract:	Objective] In order to overcome the problem of ignoring the spatial relations and semantic information between visual words in traditional visual word bag (Bag-of-Visual-Words).Methods] In this paper, a LDA-based topic model was adopted which was linearly combined with a visual language model for each word image. And the basic query likelihood model was used for realizing the procedure of retrieval.Results] Theexperimental results on our dataset showed that the proposed LDA-based representation approach could effi ciently and accurately attain to the aim of keyword spotting on a collection of historical Mongolian documents.Conclusions] Meanwhile, the proposed approach improved the performance signifi cantly than the original BoVW approach.

Keywords:	latent dirichlet allocation (LDA) topic model visual language model historical Mongolian documents keyword spotting query likelihood model

	点击此处可从《现代情报》浏览原始摘要信息
	点击此处可从《现代情报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏