Similar literature
 Found 20 similar documents (search time: 29 ms)
1.
Web spam pages exploit the biases of search engine algorithms to get higher than their deserved rankings in search results by using several types of spamming techniques. Many web spam demotion algorithms have been developed to combat spam via the use of the web link structure, from which the goodness or badness score of each web page is evaluated. Those scores are then used to identify spam pages or punish their rankings in search engine results. However, most of the published spam demotion algorithms differ from their base models by only very limited improvements and still suffer from some common score manipulation methods. The lack of a general framework for this field makes the task of designing high-performance spam demotion algorithms very inefficient. In this paper, we propose a unified score propagation model for web spam demotion algorithms by abstracting the score propagation process of relevant models with a forward score propagation function and a backward score propagation function, each of which can further be expressed as three sub-functions: a splitting function, an accepting function and a combination function. On the basis of the proposed model, we develop two new web spam demotion algorithms named Supervised Forward and Backward score Ranking (SFBR) and Unsupervised Forward and Backward score Ranking (UFBR). Our experiments, conducted on three large-scale public datasets, show that (1) SFBR is very robust and apparently outperforms other algorithms and (2) UFBR can obtain results comparable to some well-known supervised algorithms in the spam demotion task even though UFBR is unsupervised.
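
The abstract above names three sub-functions (splitting, accepting, combination) but does not define them. As a rough illustration only, the following Python sketch shows one way such a decomposition could look for a single propagation step. Everything here is an assumption made for illustration and is not taken from the paper: the function names, the uniform split, the summing accept step, the damped combination, and the damping value 0.85 (all borrowed from familiar PageRank/TrustRank-style models). The sketch also assumes every link target appears as a key in the graph.

```python
from typing import Callable, Dict, List

Graph = Dict[str, List[str]]   # page -> pages it links to (every target must be a key)
Scores = Dict[str, float]


def uniform_split(score: float, n_targets: int) -> float:
    """Hypothetical splitting function: share a score evenly among link targets."""
    return score / n_targets if n_targets else 0.0


def sum_accept(incoming: List[float]) -> float:
    """Hypothetical accepting function: add up all shares a page receives."""
    return sum(incoming)


def damped_combine(accepted: float, seed: float, damping: float = 0.85) -> float:
    """Hypothetical combination function: mix accepted score with the seed score."""
    return damping * accepted + (1.0 - damping) * seed


def propagate_once(
    graph: Graph,
    scores: Scores,
    seeds: Scores,
    split: Callable[[float, int], float] = uniform_split,
    accept: Callable[[List[float]], float] = sum_accept,
    combine: Callable[[float, float], float] = damped_combine,
) -> Scores:
    """Run one propagation step along the direction of the edges in `graph`."""
    inbox: Dict[str, List[float]] = {page: [] for page in graph}
    for page, targets in graph.items():
        share = split(scores[page], len(targets))
        for target in targets:
            inbox[target].append(share)
    return {page: combine(accept(inbox[page]), seeds[page]) for page in graph}


def reverse(graph: Graph) -> Graph:
    """Flip every edge, so the same step can be run against the link direction."""
    reversed_graph: Graph = {page: [] for page in graph}
    for page, targets in graph.items():
        for target in targets:
            reversed_graph[target].append(page)
    return reversed_graph


if __name__ == "__main__":
    web = {"a": ["b", "c"], "b": ["c"], "c": []}
    good_seeds = {"a": 1.0, "b": 0.0, "c": 0.0}   # "a" is assumed to be a trusted page
    bad_seeds = {"a": 0.0, "b": 0.0, "c": 1.0}    # "c" is assumed to be a known spam page
    print(propagate_once(web, good_seeds, good_seeds))          # along the links
    print(propagate_once(reverse(web), bad_seeds, bad_seeds))   # against the links
```

Swapping in different `split`, `accept`, or `combine` callables is the point of the decomposition: in this sketch, running the same step on the reversed graph spreads a "badness" score from known spam pages back to the pages that link to them.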

2.
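To address the proliferation of spam comments in the blogosphere, this paper presents a semi-supervised scheme for detecting web spam comments. Based on a statistical analysis of comment content, it designs five feature indicators: relevance, phrase repetition rate, number of hyperlinks, degree of obscenity, and sentence length. It then gives a framework for a spam comment detection system and validates it experimentally. The experimental results show that the method can effectively detect spam comments in the blogosphere and has good practical value.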

3.
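Classifying documents and identifying spam is a research area of considerable practical value, and a growing number of websites are paying attention to this technology. Intelligent algorithms are used to analyze spam effectively and trace its producers; web logs and posted content are used to determine which users are advertisers and spam publishers, who are then removed. The paper regards spam identification as a process of dividing information into useful and useless information, and tries a Bayesian classification algorithm to assign information to different classes. To address the shortcomings of rule-based classification and of methods that weed out spam by analyzing advertising link URLs, it presents a Bayesian classification algorithm and a machine training method. According to the experimental results, this method outperforms rule-based classification.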
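
The abstract above does not say which Bayesian variant or features the authors used. As a generic illustration of the idea of dividing messages into useful and useless classes, here is a minimal sketch of a multinomial naive Bayes classifier with add-one (Laplace) smoothing. The function names, the token lists, and the "spam"/"ham" labels are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict
from typing import Dict, List, Set, Tuple

Model = Tuple[Counter, Dict[str, Counter], Set[str]]


def train(docs: List[Tuple[List[str], str]]) -> Model:
    """Count documents per class and word occurrences per class."""
    class_counts: Counter = Counter()
    word_counts: Dict[str, Counter] = defaultdict(Counter)
    vocab: Set[str] = set()
    for tokens, label in docs:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def classify(tokens: List[str], model: Model) -> str:
    """Return the class with the highest log posterior for the given tokens."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = "", -math.inf
    for label, n_docs in class_counts.items():
        # log prior of the class
        score = math.log(n_docs / total_docs)
        total_words = sum(word_counts[label].values())
        for token in tokens:
            # add-one smoothing keeps unseen words from zeroing the probability
            count = word_counts[label][token]
            score += math.log((count + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


if __name__ == "__main__":
    # Toy, made-up training data purely for illustration.
    training = [
        (["cheap", "pills", "buy"], "spam"),
        (["buy", "cheap", "now"], "spam"),
        (["meeting", "notes", "attached"], "ham"),
        (["project", "meeting", "today"], "ham"),
    ]
    model = train(training)
    print(classify(["cheap", "buy"], model))
    print(classify(["meeting", "today"], model))
```

Unlike a hand-written rule set, the per-class word counts are learned from labeled examples, which is the kind of contrast with rule-based classification that the abstract draws.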

4.
Predatory publishers (those who do not adhere to rigorous standards of academic practice such as peer review) are increasingly infiltrating biomedical databases, to the detriment of the wider scientific community. These publishers frequently send unsolicited ‘spam’ emails to generate submissions to their journals, with early career researchers (ECRs) particularly susceptible to these practices because of pressures such as securing employment and promotion. This analysis sought to record and characterize the emails received over the course of a PhD and post-doctoral position (~8 years), as well as attempts to unsubscribe from such emails, in a progressive, step-wise manner. A total of 1,280 emails identified as academic spam were received (990 journal invitations, 220 conference invitations, 70 ‘other’). The first email was received 3 months after registration for an international conference. Attempts at unsubscribing were somewhat effective: implying that the sender would be reported to the respective authorities resulted in a 43% decrease in emails, although this did not eliminate them completely, and therefore alternative approaches to eliminating academic spam may be needed. Ongoing education about predatory publishers, as well as action by key academic stakeholders, should look to reduce the impact these predatory publishers have upon the wider literature base.

5.
Serial Killer     
An unsolicited spam email message leads to musings on the improbable correspondence between an attempt at extortion and the future of academic journals. Literal and figurative meanings of “serial killer” are explored.

6.
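E-mail is a widely used tool in online reference services, but the flood of spam seriously disrupts normal reference work, so how to apply anti-spam technology has become an important issue. This paper discusses in detail how anti-spam technology can be applied in online reference services.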

7.
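Application of an improved KNN algorithm to spam e-mail filtering   Total citations: 1, self-citations: 1, citations by others: 1
This paper proposes an improved KNN algorithm and applies it to the problem of spam e-mail filtering. Experiments show that the improved algorithm reduces the influence of the value of K and of the distribution of training texts on filtering results, reduces both false positives and missed spam, and achieves good filtering performance.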

8.
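A simulation study of tag retrieval quality in social tagging systems   Total citations: 1, self-citations: 0, citations by others: 1
Social tagging systems have developed rapidly in recent years, and the accompanying proliferation of spam tagging cannot be ignored. Taking tag retrieval quality in social tagging systems as its research object, this paper refines the tagging behavior of ordinary users, builds a simulated social tagging system, and explicitly defines the system's user structure, tagging rules, retrieval strategies, and retrieval quality algorithm. It evaluates empirically how changes in the number of users, the user structure, and the volume of tagging per user, as well as spam users' choice of attack strategy, affect tag retrieval quality. The paper is of significance for improving tag retrieval algorithms in social tagging systems and for enhancing user experience.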

9.
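Social tagging is used ever more widely on the web, and it offers a brand-new model for indexing, organizing, and retrieving information resources. Researchers outside China have studied the indexing function and indexing methods of social tagging, the role of social tagging systems in information retrieval, and information retrieval techniques based on social tagging. They have achieved a series of results, although shortcomings remain. Research trends in this area include standardizing the expression of social tags, removing tag noise and spam, and ordering tags and organizing them into hierarchies.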

10.
This study examines the perceived impacts of electronic government or e-government adoption on U.S. cities. This research conducted a survey of Texas and Florida city managers in the fall of 2005 to find out their opinions on the impact of e-government on their city government. The results indicated that e-government is having a positive impact on management, stakeholder involvement, needs and collaboration, and procurement in American cities. There are, however, concerns over spam or unsolicited e-mail and the ability of e-government to reduce the level of staffing. The results of this study imply that, according to city managers' perceptions, e-government adoption in American city governments is positively viewed as having an impact on their organizations and communities.

11.
The TREC 2009 web ad hoc and relevance feedback tasks used a new document collection, the ClueWeb09 dataset, which was crawled from the general web in early 2009. This dataset contains 1 billion web pages, a substantial fraction of which are spam: pages designed to deceive search engines so as to deliver an unwanted payload. We examine the effect of spam on the results of the TREC 2009 web ad hoc and relevance feedback tasks, which used the ClueWeb09 dataset. We show that a simple content-based classifier with minimal training is efficient enough to rank the “spamminess” of every page in the dataset using a standard personal computer in 48 hours, and effective enough to yield significant and substantive improvements in the fixed-cutoff precision (estP10) as well as rank measures (estR-Precision, StatMAP, MAP) of nearly all submitted runs. Moreover, using a set of “honeypot” queries, the labeling of training data may be reduced to an entirely automatic process. The results of classical information retrieval methods are particularly enhanced by filtering, moving from among the worst to among the best.

12.
The attractiveness of social networking sites (SNSs) has extended to almost all professionals in numerous human organizations, including the library. As a result of this development, librarians are now making use of these sites to connect to other libraries and librarians both within and outside their environment. However, the use of social networking sites by Nigerian librarians in general, and by those in academic libraries in particular, and the benefits they derive from them, have not been well documented. It is against this backdrop that this study examined the use of social networking sites by both libraries and librarians in selected academic libraries in six Nigerian States. A survey research design was adopted. Using simple random sampling, the study drew upon 200 academic librarians from academic libraries across the six selected States. Five research questions were raised and answered by the study. The results demonstrate that Facebook and Twitter are the sites most used by academic librarians. Academic librarians make use of SNSs on a weekly basis and, to a lesser extent, on a daily basis. Many potential benefits of SNSs were indicated for both the librarians and their libraries, such as the opportunity to connect with people across the globe, including people one has never met and might otherwise never come into contact with. It was also found that SNSs give academic libraries an opportunity to create more interactive, user-centered library and information services. The drawbacks identified with SNSs include sexual harassment, cybercrime, fraud, and the spreading of spam. It is expected that the outcomes of this study will serve as pioneer data upon which future related studies will be anchored.

13.
This paper outlines the story of the country's second-oldest natural history museum from its founding in 1825 to the present. Its history includes seven name changes reflecting the young society's struggle to survive, the changing cultural environment, and the extension of its audiences from the immediate Worcester neighborhood to the New England regional area. The article also reviews its scope from several cabinets of specimens to displays of wildlife, exhibitions about ecology, astronomy, and technology, along with comprehensive education programming.

14.
Terry Cook 《Archival Science》2005,5(2-4):101-161
Macroappraisal as developed in Canada has had significant currency in archival literature over the past decade, and aspects of its program and ideas have been implemented in other jurisdictions. For the first time, this essay probes the theoretical and practical origins of macroappraisal in Canada since 1950 and why its originators no longer found convincing the predominant status quo on appraisal as articulated by T.R. Schellenberg. The essay then summarizes the theory of macroappraisal as articulated at the National Archives of Canada, and the strategic and program infrastructure developed in the 1990s to turn the new theory into operational reality. As no archival concept is universally locked in time, the evolution and changes in the macroappraisal program, both in theory and strategy, are also analysed in its Canadian home base over its first decade, as well as some internal and external criticisms of it. The essay intends to illuminate the deeper context of macroappraisal, so that an international audience may better understand its strengths and weaknesses. As the author is the principal architect of macroappraisal, the essay consists of equal parts of archival history, theoretical analysis, and personal reflection.

15.
《The Reference Librarian》2013,54(38):201-220
The evaluation of reference service is complex and subjective, because reference service itself is complex and subjective. Reference service is more easily evaluated if its facets are judged against corresponding, known criteria. The core facet of reference service, for an automated library, is its online public access catalog (OPAC). Although the library literature contains numerous papers on the functional and performance evaluation of OPACs, as well as on the evaluation of many facets of reference service, it presents little assistance for the evaluation of OPACs as the central facet of reference service. In order to alleviate this lack, this paper evaluates OPACs as if they were any other reference tool, judging them against Norman D. Stevens' classic eighteen criteria for the evaluation of reference books. A selective bibliography of works on both OPAC and reference book evaluation is included.

16.
In this article, the author offers a contextualist approach to contemporary debates about new (and old) media in different historical times and geographical places. This approach, rather than starting with the internal essence of a technology and then attempting to deduce its effects from its technical specifications, begins with an analysis of the interactional and cultural systems in play in a particular context and then investigates how any particular technology is fitted into them. Building on his previous work in microanalyses of technology use in the home, and drawing on recent debates in technology studies and media anthropology, he further develops the implications of this approach, at a macro level, in terms of temporal and cultural contexts. The article concludes by reviewing the outstanding problems that still confront our field, in respect to its deeply ingrained presumptions concerning the universal relevance of what are, in fact, specifically Western (and thus contingent) relations between television, technology and national cultures.

17.
Bibliografiia, founded in 1929, is the oldest professional journal in Russia. The author, its editor-in-chief, discusses some important points in its past, current directions, and plans for the future. Among other future directions, the editors hope to increase the number of articles by foreign authors.

18.
In the late nineteenth-century USA, technological developments in paper production (a shift from a reliance on scarce cotton rag to plentiful wood) drastically reduced the price of newsprint. That decline helped overturn the reigning economics of the daily newspaper and resulted in the rise of new cheap papers with vastly expanded circulation. This novel mass press encompassed almost all Americans in the public sphere as represented by its pages. Focusing on newspapers in Detroit, this study examines the manifold consequences this shift had for the press's economics, its news agenda, and the implicit identity of the audience it addressed. The rise of a mass press in the late nineteenth century, however, was not specific to Detroit or the USA. As comparative historians have highlighted, the emergence of a mass press in Europe and elsewhere was a turning point that deeply marked the historical evolution of press systems around the globe.

19.
Consumer Health Informatics (CHI) means different things to patients, health professionals, and health care systems. A broader perspective on this new and rapidly developing field will enable us to understand and better apply its advances. This article provides an overview of CHI discussing its evolution and driving forces, along with advanced applications such as Personal Health Records, Internet transmission of personal health data, clinical e-mail, online pharmacies, and shared decision-making tools. Consumer Health Informatics will become integrated with medical care, electronic medical records, and patient education to impact the whole process and business of health care.

20.
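Over the past 50 years, the Journal of Library Science in China (《中国图书馆学报》) has held to its editorial mission of putting scholarship first and serving society and its people, and has carried forward an editorial spirit of rigor, realism, and devotion to the journal and its community, achieving results to be proud of. It is hoped that the journal will keep innovating and reach new heights.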