首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
《图书馆管理杂志》2013,53(1-2):9-18
Abstract

Both photocopied tables of contents and originals were scanned by optical character recognition (OCR). (The threshold for acceptable OCR conversion is a break-even point of 94% accuracy; it is cheaper to rekey data than fix corrupted data if rates fall below this point.) The results suggest that OCR recognition rates are likely to support improved capture productivity for as many as 75% of tables of contents. More than half (55%) of the sample can be processed effectively.  相似文献   

2.
纸质档案数字化成果的原始分辨率是光学字符识别(OCR)精度的重要影响因素,为保证档案图像的原始分辨率大于等于300dpi,在档案预检阶段需检测出较低原始分辨率被篡改为较高分辨率的档案图像。针对图像文件头篡改、图像内容插值等多种分辨率篡改手段,分析它们的篡改机理,设计了由文件头检测、图像频率域分析、图像质量客观评价多方案组成的综合的篡改检测策略。在宁波市档案局的测试数据库上显示,所提出的原始分辨率篡改检测方法取得了很好的效果。  相似文献   

3.

This study examined factors that were predicted to be related to the effect that attitude dissimilarity has on interpersonal attraction in the first few minutes of an initial interaction. In this study, 114 participants engaged in dyadic interactions with same‐sex partners that varied in attitude similarity. Prior to and after interacting with one another, participants rated their partner's personal character and whether they believed that a future interaction would be a positive one. These data revealed that the perceived quality of future interactions significantly improved over time and character attributions partially improved. The global predictions made in this study were framed and supported as an expectancy violation explanation.  相似文献   

4.
手写文字识别是指计算机自动识别手写体汉字、数字、字母和符号等,在办公自动化、信息处理以及图书情报界有着广阔的应用前景。本文概括了手写文字识别技术在中国的发展及应用状况,提出了九十年在图书馆自动化中的应用目标.  相似文献   

5.
王充两度担任文书档案官员,在他两次任职的离职期间,撰写了84篇20余万言的《论衡》.王充高尚的品德、卓越的见识、渊博的学问、非凡的写作才华,能给从事文书档案史学工作的人们诸多启示.  相似文献   

6.
[目的/意义]从学术期刊中抽取其中的理论是对文献进行内容分析的前提,实现理论名称识别的自动化可以提高内容分析的效率。[方法/过程]将理论识别视为一类命名实体识别问题,总结现有的命名实体识别的常用方法,提出一个基于语义泛化思想的命名实体识别方法,选取词性、知网义原等外部知识,采用CRF模型对《情报学报》1822篇论文的标题和摘要进行实验。[结果/结论]实验表明,识别准确率最高达到95.38%,但召回率较低;训练语料规模对性能影响较大,不同程度的语义泛化方法对准确率和召回率有复杂影响。如何选择语义特征、语义标注和语义消歧是需要解决的新问题。  相似文献   

7.
This article looks at the emergence and potentials of a Balkan cultural studies. I argue that the productivity of a Balkan cultural studies lies in its willingness to engage with popular culture as a very real political force in the revolutionary transformations from the 1980s on. Some recent cultural developments are presented here to show how the mix of culture and cultural studies contributes to the political relevance and academic vibrancy of Balkan cultural studies, which captures the imagination of students in the region precisely because of its engaged character and its contemporary relevance.  相似文献   

8.
This paper explains the character code recognition with the Boolean classifier. The binary values are used both for inputs and outputs, while the learning of the circuit with a set of patterns is done by modified algorithms used in some Boolean neural networks. The use of the fuzzy logic approach offers the possibility of creating a character recognition theory which is fault-tolerant and applicable to all sorts of typefaces and fonts. It provides several examples of patterns scanned with different resolutions and learned with a part of the same set of samples which demonstrates the quality of the fuzzy Boolea classifier.  相似文献   

9.
Building on the persuasion knowledge model, this study examines how audience characteristics and native advertising recognition influence the covert persuasion process. Among a nationally representative sample of U.S. adults (N = 738), we examined digital news readers’ recognition of a sponsored news article as advertising. Although fewer than 1 in 10 readers recognized the article as advertising, recognition was most likely among younger, more educated consumers who engaged with news media for informational purposes. Recognition led to greater counterarguing, and higher levels of informational motivation also led to less favorable evaluations of the content among recognizers. News consumers were most receptive to native advertising in a digital news context when publishers were more transparent about its commercial nature. Beyond theoretical insights into the covert persuasion process, this study offers practical utility to the advertisers, publishers, and policymakers who wish to better understand who is more likely to be confused by this type of advertising so that they can take steps to minimize deception.  相似文献   

10.
A content analysis was conducted to examine sexual references and consequences among lesbian, gay, bisexual (LGB), and heterosexual characters on television. The sample was composed of programs portraying an LGB lead or reoccurring character. Results showed that heterosexual and LGB characters engaged in sexual talk and behavior in similar contexts. When discussing LGB sexualities, however, heterosexual characters were disproportionately likely to make jokes; LGB characters were disproportionately likely to discuss coming out. LGB characters depicted in sexual references were more likely to be in dialogue with a heterosexual character than another LGB character. Sexual consequences were more common for heterosexual characters than LGB characters. No gender differences existed in frequency of sexual references or consequences among LGB characters, evidence that the sexual double standard found in previous research may not apply to LGB characters. Results are discussed in terms of potential effects of exposure.  相似文献   

11.
《Journalism Practice》2013,7(10):1311-1331
Climate change frames in the media affect the political and public debate. However, focusing on the frames in texts, most framing research overlooks the factors which influence frame-building by reporters. However, this is crucial for a fuller understanding of the potential implications and meanings of frames. Besides, the existing frame-building research is exclusively engaged with mainstream media. Also, visual frame-building is under-researched. Therefore, we have conducted interviews with 26 climate journalists, photo editors, chiefs and opinion-makers, working for three mainstream and two progressive alternative outlets in northern Belgium. The findings were combined with the outcome of a deductive framing analysis of 114 climate articles. The results show a strong overlap among journalist frames and news frames. Anthropocentric Subframes prevail in the mainstream news articles and among the reporters. A mixture of Biocentric and Anthropocentric Subframes was found in the context of the alternative outlets. We explain this by presenting the studied mainstream newsrooms as machines and the (progressive) alternative newsrooms as organisms. We conclude that the mainstream journalists are guided towards Anthropocentric Subframes by various (internalised) pressures. The practices in the alternative media liberate reporters to introduce a broader variety of frames.  相似文献   

12.
藏族人名汉译名识别研究   总被引:2,自引:0,他引:2  
藏族人名汉译名识别属于人名识别的范畴,但现有的人名识别方法并不能完全切合藏族人名命名特点:藏族人名具有浓厚的宗教文化内涵,字(串)特征和内部构成复杂;其次,藏族人名中含有大量高频单字,使得藏族人名和普通词语之间歧义冲突变得十分突出,同时也使得藏族人名和上下文之间的边界变得非常模糊.本文在大规模藏族人名实例和语料库调查基础上,统计分析了藏族人名的用字(串)特征,并构建了藏族人名属性特征库;通过藏族人名的命名规则及属性特征将藏族人名形式化表示,实现了藏族人名汉译名自动识别系统.真实语料库开放测试F值达到87.12%.  相似文献   

13.
分析中文自动分词的现状,介绍和描述几种不同的分词思想和方法,提出一种基于字位的分词方法。此分词方法以字为最小单位,根据字的概率分布得到组合成词的概率分布,因此在未登录词识别方面比其它方法有更优秀的表现。使用最大熵的机器学习方法来进行实现并通过两个实验得出实验结果的比较分析。  相似文献   

14.
路茂林 《晋图学刊》2010,(1):43-44,63
以高校图书馆执行力培育和提升为研究对象。执行力是图书馆工作的生命力,是提高图书馆服务效率以及科学管理的重要手段,是提高读者满意度和信任度的关键,同时也是一项值得研究的管理科学。针对现状就如何培育和提升图书馆的执行力,提出了方法和应对措施。  相似文献   

15.
[目的/意义] 针对LDA模型主题识别结果通常包含噪声主题的问题,建立科学有效的主题过滤方法,排除噪声主题,确保主题识别及后续演化分析的准确性。[方法/过程] 基于关键词之间的共现关系,构建关键词关联度指标(KRI),借助定量手段进行主题筛选和过滤。以单细胞研究领域为例,计算各主题-关键词分布的KRI值,与人工判读结果进行对比分析。[结果/结论] 实验结果表明,该方法能够有效排除LDA模型识别结果中的噪声主题,提高主题识别的准确性,也在一定程度上降低了主题识别过程对人工判读的依赖性。  相似文献   

16.
��[Purpose/significance] The identification results of the LDA model is sometimes unsatisfactory due to some meaningless topics mixed together. Therefore, it's quite necessary to establish an effective topic filtering method to eliminate these noise topics and to ensure the accuracy of subsequent evolution analysis.[Method/process] Based on the co-occurrence relationship between keywords, keywords relevance index (KRI) was constructed. Taking the field of single cell research as an example, KRI values of the distribution of theme-keywords were calculated and compared with the results of manual interpretation.[Result/conclusion] Experimental results show that this method can effectively eliminate meaningless noise topics in the LDA model recognition results, which can improve the accuracy of topic recognition and the subsequent topic evolution analysis. It also helps to reduce the dependence on manual interpretation in the process of topic identification through the topic model method.  相似文献   

17.
林德明  王宇开  丁堃 《情报学报》2020,39(2):178-185
以国家知识产权战略的政策工具选择为研究对象,利用融合深度学习和政策文献计量的方法对三份纲领性文件的知识产权战略目标、指导性政策工具与各年度知识产权战略推进计划中的政策工具进行匹配度计算,从而对国家知识产权战略围绕着战略目标调整和战略执行两方面的政策工具选择进行全面分析。结果表明,知识产权战略执行中的政策工具选择较好地匹配了各阶段的战略目标以及指导性政策工具,但是过于集中在强制型工具;其中与战略目标高度匹配的政策工具包括开展专利信息检索和分析、提高知识产权审查效率及质量等,与指导性政策工具高度匹配的有增强知识产权的司法保护、积极参与国际知识产权合作交流等,而知识产权专业人才培养及相关基地建设、知识产权试点建设等政策工具匹配度较低,农林业知识产权的政策工具尚显不足,需要进一步调整完善。  相似文献   

18.
由于自然语言的复杂性,使得情感挖掘仍存在一些问题需要解决,如情感词的领域依赖性、隐式特征识别、同指特征处理和特征极性计算等。为解决这些问题,提出一种基于语义的情感挖掘方法,该方法以主题图为指导进行特征及情感词的识别和情感极性强度计算,充分利用特征之间及其特征与情感词之间的语义关系,可以在一定程度上提高意见挖掘的准确性。  相似文献   

19.
In this work we investigate the sensitivity of individual researchers’ productivity rankings to the time of citation observation. The analysis is based on observation of research products for the 2001–2003 triennium for all research staff of Italian universities in the hard sciences, with the year of citation observation varying from 2004 to 2008. The 2008 rankings list is assumed the most accurate, as citations have had the longest time to accumulate and thus represent the best possible proxy of impact. By comparing the rankings lists from each year against the 2008 benchmark we provide policy-makers and research organization managers a measure of trade-off between timeliness of evaluation execution and accuracy of performance rankings. The results show that with variation in the evaluation citation window there are variable rates of inaccuracy across the disciplines of researchers. The inaccuracy results negligible for Physics, Biology and Medicine.  相似文献   

20.
赵华茗  钱力  余丽 《图书情报工作》2020,64(11):108-115
[目的/意义] 探索科研命名实体及其关系的识别与抽取,提升其在长句等复杂情况下的识别效果,为进一步的应用提供参考与借鉴。[方法/过程] 以依存句法特征分析为基础,提出一种科研命名实体关系抽取方法,过程包括:①使用Standford Tagger工具对目标文本进行词性标注;②基于标注结果,围绕核心谓词和SAO结构,将目标文本分割为结构规范的语义片段;③通过依存句法分析,找出与核心谓词语义相关的主语和宾语,构成(实体,关系,实体)三元组。[结果/结论] 与Ollie、Reverb等主流算法进行的对比测试表明,该方法可以有效提升科研命名实体识别的准确性。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号