首页 | 本学科首页   官方微博 | 高级检索  
文章检索
  按 检索   检索词:      
出版年份:   被引次数:   他引次数: 提示:输入*表示无穷大
  收费全文   27篇
  免费   0篇
  国内免费   1篇
教育   14篇
科学研究   3篇
综合类   1篇
信息传播   10篇
  2018年   3篇
  2013年   3篇
  2012年   1篇
  2011年   3篇
  2010年   5篇
  2009年   1篇
  2008年   2篇
  2007年   1篇
  2006年   1篇
  2005年   5篇
  2003年   1篇
  2002年   1篇
  2000年   1篇
排序方式: 共有28条查询结果,搜索用时 15 毫秒
1.
学术文献保存格式除了传统的PDF外,还包括CAJ、KDH、NH、CAA、TEB等格式,CAJViewer作为一款专门的文献浏览软件,通吃这些文献格式,具有浏览页面、查找文字、文本识别、邮件传输等九大功能,可取代AdobeReader阅读器对PDF的浏览功能,也不需要单独使用OCR软件而进行文本识别,灵活使用其功能,可极大地方便用户对学术文献的浏览和使用。  相似文献   
2.
学术文献保存格式除了传统的PDF外,还包括CAJ、KDH、NH、CAA、TEB等格式,CAJViewer作为一款专门的文献浏览软件,通吃这些文献格式,具有浏览页面、查找文字、文本识别、邮件传输等九大功能,可取代Adobe Reader阅读器对PDF的浏览功能,也不需要单独使用OCR软件而进行文本识别,灵活使用其功能,可极大地方便用户对学术文献的浏览和使用。  相似文献   
3.
提出一种新的用于识别视频中字幕文字的方法。鉴于视频中文字的大小、颜色、渲染风格和分辨率的不同,以及视频中各种复杂背景的影响,识别视频中的叠加文字是一个尚未解决的问题。目前,大多数视频叠加文字识别方法都基于视频文字的二值化和传统OCR引擎的结合。然而,二值化过程容易引入噪声和文字笔划信息的丢失。另外,传统OCR技术主要专注于高分辨率的扫描打印文档,这些文档具有背景单一、噪声少和笔划信息较完整的特点。因此,传统OCR引擎用于识别叠加文字二值化后的结果可能不够鲁棒。为解决这个问题,直接从未二值化的叠加视频文字图像中提取Gabor特征用于训练二层字符识别器。实验结果表明,本文提出的方法在多字体视频叠加中文文字识别上有良好的效果。  相似文献   
4.
文章通过对小语种文献数据处理解决方案的研究与应用回顾,展示了国家科技图书文献中心(NSTL)在小语种文献加工及多语种数据处理方面的技术成果。该方案的实施,进一步提升了NSTL网络服务系统的功能,使得NSTL在国内文献服务领域率先解决了小语种文献的数字化加工和网上文献服务的多语种显示、检索等问题,对于网络服务系统多语种信息集成揭示具有重要的实践和示范意义。  相似文献   
5.
Modern OCR engines incorporate some form of error correction, typically based on dictionaries. However, there are still residual errors that decrease performance of natural language processing algorithms applied to OCR text. In this paper, we present a statistical learning model for post-processing OCR errors, either in a fully automatic manner or followed by minimal user interaction to further reduce error rate. Our model employs web-scale corpora and integrates a rich set of linguistic features. Through an interdependent learning pipeline, our model produces and continuously refines the error detection and suggestion of candidate corrections. Evaluated on a historical biology book with complex error patterns, our model outperforms various baseline methods in the automatic mode and shows an even greater advantage when involving minimal user interaction. Quantitative analysis of each computational step further suggests that our proposed model is well-suited for handling volatile and complex OCR error patterns, which are beyond the capabilities of error correction incorporated in OCR engines.  相似文献   
6.
周雪莹 《编辑学报》2012,24(6):592-593
以方正书版文件转换所得的几类常见的PDF文件为素材,基于OCR技术和PDF文件编辑技术,探索出2类制作可检索式双层PDF文件的方法。用Readiris法制作的Image-Text型双层PDF操作简便、文件很小、可生成索引书签;用FoxitPDF Editor法制作的Graphic-Text型双层PDF清晰度高、文本精准。这2种双层PDF文件均可以很好地满足网络期刊文献检索的需要。  相似文献   
7.
ABSTRACT

This paper reports on a project undertaken at the American Museum of Natural History Library in 1997 and intended to enhance access to materials in the library's collection by using scanning and OCR software to digitize and add monograph tables of contents to the OPAC bibliographic records. Initially, conference proceedings already in the collection were used, but, as the project developed, other types of materials were also used. The rationale for the project is explained, the procedure developed is described, and the lessons learned from using this particular technology are outlined.  相似文献   
8.
A protection system using a multi-agent concept for power distribution networks is pro- posed. Every digital over current relay(OCR) is developed as an agent by adding its own intelli- gence, self-tuning and communication ability. The main advantage of the multi-agent concept is that a group of agents work together to achieve a global goal which is beyond the ability of each individual agent. In order to cope with frequent changes in the network operation condition and faults, an OCR agent, proposed in this paper, is able to detect a fault or a change in the network and find its optimal parameters for protection in an autonomous manner considering information of the whole network obtained by communication between other agents.Through this kind of coordi- nation and information exchanges, not only a local but also a global protective scheme is com- pleted. Simulations in a simple distribution network show the effectiveness of the proposed protec- tion system.  相似文献   
9.
目前国内的女书文字大多采用手写的方式保存。介绍了女书OCR技术,讨论了女书OCR的整体流程,具体包括二值化,文字分割,特征提取和文字识别等方法,最终实现了对手写女书文字的识别和存储。  相似文献   
10.
INTRODUCTION The Bibliotheca Alexandrina (BA) has develop- ed a workflow for turning printed books into digital books. The process starts with selection of books to be digitized, which is done mainly by BA’s Library Service Department. Books metadata is entered into the Digital Lab database. Metadata entry is followed by three core phases: scanning phase, in which the digital copy is generated; processing phase, in which image enhancement is performed; and OCR phase, in which text i…  相似文献   
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号