Similar literature
1.
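Research on Web information retrieval applies the results of text retrieval research and combines them with ideas from Web graph theory to study information retrieval on the Web; it is an effective route to Web knowledge discovery. The traditional HITS method yields information of rather low precision, while PageRank, as a general-purpose search method, cannot be applied to acquiring information on a specific topic. Based on a thorough analysis of existing algorithms such as PageRank and HITS and of similarity computation methods for Web documents, the RG-HITS algorithm is proposed for discovering information relevant to a specific topic on the Web. It combines Web hyperlinks, the information relevance of Web page knowledge representations, and the HITS method to search for topic-specific knowledge on the Web.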
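For orientation, the plain HITS iteration that this abstract builds on can be sketched as follows. This is a minimal sketch of the standard hub/authority update only, not of RG-HITS, whose relevance weighting the abstract does not specify; the function name `hits` and the dictionary-based link graph are assumptions made for illustration.

```python
from math import sqrt


def hits(links, iterations=50):
    """Standard HITS on a link graph (illustrative; not the RG-HITS variant).

    links: dict mapping each page to the list of pages it links to.
    Returns (hub, authority) score dicts, each L2-normalized.
    """
    pages = set(links) | {dst for targets in links.values() for dst in targets}
    hub = {p: 1.0 for p in pages}
    auth = {p: 1.0 for p in pages}
    for _ in range(iterations):
        # Authority score: sum of hub scores of the pages that link in.
        auth = {p: 0.0 for p in pages}
        for src, targets in links.items():
            for dst in targets:
                auth[dst] += hub[src]
        norm = sqrt(sum(v * v for v in auth.values())) or 1.0
        auth = {p: v / norm for p, v in auth.items()}
        # Hub score: sum of authority scores of the pages linked to.
        hub = {p: sum(auth[dst] for dst in links.get(p, ())) for p in pages}
        norm = sqrt(sum(v * v for v in hub.values())) or 1.0
        hub = {p: v / norm for p, v in hub.items()}
    return hub, auth


# Hypothetical three-page graph: "a" and "b" both link to "c".
hub, auth = hits({"a": ["c"], "b": ["c"], "c": []})
```

In this toy graph all authority mass ends up on "c", while "a" and "b" share the hub score equally. A topic-sensitive variant would presumably weight these sums by content relevance, but that step is not described in the abstract.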

2.
3.
This paper reports findings from an analysis of medical or health queries to different web search engines. We report results: (i) comparing samples of 10000 web queries taken randomly from 1.2 million query logs from the AlltheWeb.com and Excite.com commercial web search engines in 2001 for medical or health queries, (ii) comparing the 2001 findings from Excite and AlltheWeb.com users with results from a previous analysis of medical and health related queries from the Excite Web search engine for 1997 and 1999, and (iii) on medical or health advice-seeking queries beginning with the word 'should'. Findings suggest: (i) a small percentage of web queries are medical or health related, (ii) the top five categories of medical or health queries were: general health, weight issues, reproductive health and puberty, pregnancy/obstetrics, and human relationships, and (iii) over time, the medical and health queries may have declined as a proportion of all web queries, as the use of specialized medical/health websites and e-commerce-related queries has increased. Findings provide insights into medical and health-related web querying and suggest some implications for the use of the general web search engines when seeking medical/health information.

4.
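An Analysis of the Query Characteristics of Chinese Search Engine Users   Total citations: 2, self-citations: 0, citations by others: 2
Ma Han, Feng Jinling, 《情报学报》, 2005, 24(6): 718-722
This study collected more than seven thousand queries from four Chinese search engines (Baidu, Yisou, Zhongsou, and Sogou) and analyzed the search behavior of Chinese search engine users in terms of term frequency, vocabulary size, and category. The results have practical value for user education and for search service design.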

5.
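A Study of the Overlap Rate of Search Results among Chinese Search Engines   Total citations: 1, self-citations: 0, citations by others: 1
The purpose of this study is to measure the degree of overlap and difference among the search results of mainstream Chinese search engines. Baidu, Google, and Yahoo! China were tested with a sample of 11,171 queries from real users; the search results of Chinese search engines were found to differ greatly, with a very low overlap rate. Across all first-page results, results unique to any one of the three engines accounted for 89.34% of the total, results shared by any two engines for 8.11%, and results shared by all three engines for 2.54%. The overlap across the first two pages of results was even lower. Comparison with existing overlap data for English search engines shows that overlap rates are very low for both Chinese and English search engines and are very close to each other.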
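The three-way breakdown reported above (results found by one, two, or all three engines) can be computed along the following lines. This is only a sketch under assumptions: the abstract does not say how URLs were matched or whether shares were taken over distinct URLs, so the function `overlap_shares`, the exact-match comparison, and the sample URLs are all hypothetical.

```python
from collections import Counter


def overlap_shares(results_by_engine):
    """Share of distinct URLs returned by exactly k engines, for k = 1..n.

    results_by_engine: dict mapping an engine name to the URLs it returned.
    Assumes URLs are compared by exact string match (a simplification).
    """
    engines_per_url = Counter()
    for urls in results_by_engine.values():
        for url in set(urls):  # count each engine at most once per URL
            engines_per_url[url] += 1
    total = len(engines_per_url)
    if total == 0:
        return {}
    urls_per_k = Counter(engines_per_url.values())
    return {
        k: urls_per_k.get(k, 0) / total
        for k in range(1, len(results_by_engine) + 1)
    }


# Hypothetical first-page results for one query.
shares = overlap_shares({
    "engine_a": ["u1", "u2", "u3"],
    "engine_b": ["u3", "u4"],
    "engine_c": ["u3", "u4", "u5"],
})
```

Here `shares[1]`, `shares[2]`, and `shares[3]` play the role of the unique, two-engine, and three-engine percentages in the abstract. A real study would also need URL normalization, for example for trailing slashes and redirects.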

6.
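Applications of Search Engines in Web Link Analysis Research   Total citations: 7, self-citations: 0, citations by others: 7
This paper compares the search engines mainly used to collect link analysis data in terms of their application in domestic research practice, evaluations after use, and query construction. Drawing on search practice, it also discusses the various problems that search engines present. It concludes that: (1) in data collection for Web link analysis research, search engines involve great uncertainty, and researchers must consider the consequences of this deficiency; (2) Alltheweb is currently a relatively good search engine for Chinese Web link analysis research; (3) dedicated search engines for Web link analysis research need to be developed further.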

7.
The study reports on a longitudinal and comparative evaluation of Greek language searching on the web. Ten engines, five global (A9, AltaVista, Google, MSN Search, and Yahoo!) and five Greek (Anazitisi, Ano-Kato, Phantis, Trinity, and Visto), were evaluated using (a) navigational queries in 2004 and 2006; and (b) by measuring the freshness of the search engine indices in 2005 and 2006. Homepage finding queries for known Greek organizations were created and searched. Queries included the name of the organization in its Greek and non-Greek, English or transliterated equivalent forms. The organizations represented ten categories: government departments, universities, colleges, travel agencies, museums, media (TV, radio, newspapers), transportation, and banks. The freshness of the indices was evaluated by examining the status of the returned URLs (live versus dead) from the navigational queries, and by identifying if the engines have indexed 32480 active (live) Greek domain URLs. Effectiveness measures included (a) qualitative assessment of how engines handle the Greek language; (b) precision at 10 documents (P@10); (c) mean reciprocal rank (MRR); (d) Navigational Query Discounted Cumulative Gain (NQ-DCG), a new heuristic evaluation measure; (e) response time; (f) the ratio of the dead URL links returned; and (g) the presence or absence of URLs and the decay observed over the period of the study. The results report on which of the global and Greek search engines perform best; and if the performance achieved is good enough from a user’s perspective.
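Two of the measures named above, P@10 and MRR, have standard textbook definitions that can be sketched as follows. The function names and the sample data are assumptions for illustration; NQ-DCG is specific to the study and is not reproduced here, since the abstract does not define it.

```python
def precision_at_k(ranked, relevant, k=10):
    """Fraction of the top-k results that are relevant (P@k)."""
    top = ranked[:k]
    return sum(1 for url in top if url in relevant) / k


def reciprocal_rank(ranked, relevant):
    """1 / rank of the first relevant result, or 0.0 if none is returned."""
    for rank, url in enumerate(ranked, start=1):
        if url in relevant:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(runs):
    """Average reciprocal rank over (ranked, relevant) pairs, one per query."""
    runs = list(runs)
    if not runs:
        return 0.0
    return sum(reciprocal_rank(ranked, rel) for ranked, rel in runs) / len(runs)


# Hypothetical navigational queries, each with one target homepage.
mrr = mean_reciprocal_rank([
    (["a", "b"], {"b"}),  # target found at rank 2
    (["x"], {"x"}),       # target found at rank 1
])
```

For homepage-finding queries, where each query has essentially one correct answer, MRR is usually the more informative of the two measures, because P@10 can never exceed 0.1 when only one result is relevant.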

8.
Transaction logs of NAVER, a major Korean Web search engine, were analyzed to track the information-seeking behavior of Korean Web users. These transaction logs include more than 40 million queries collected over 1 week. This study examines current transaction log analysis methodologies and proposes a method for log cleaning, session definition, and query classification. A term definition method which is necessary for Korean transaction log analysis is also discussed. The results of this study show that users behave in a simple way: they type in short queries with a few query terms, seldom use advanced features, and view few results pages. Users also behave in a passive way: they seldom change search environments set by the system. It is of interest that users tend to change their queries totally rather than adding or deleting terms to modify the previous queries. The results of this study might contribute to the development of more efficient and effective Web search engines and services.

9.
This study assesses the effectiveness of New Zealand government Web sites in providing equitable and appropriate access to government information to all citizens. A range of government Web sites was evaluated, and visitors to approximately half of these sites were surveyed to determine their perceptions of the effectiveness of the sites. Results show that there are several key issues for the government to address in formulating effective policy for government Web sites. These include the need for: a clear statement of purpose; good meta-data; good contacts for feedback and update of information; clear statements and adequate provision for confidentiality and privacy of personal data, liability, and copyright; access for disabled users; availability of publications in both electronic and print formats. Key issues to emerge from the user survey focus on the need for better search engines, indexes, and site maps to help people find out quickly if the information they are wanting is likely to be there, and to locate it. Users also need to be assured that the information on government Web sites is accurate and up-to-date. The authors concluded that there is a major gap in government policy emerging from this research that needs urgently to be addressed.

10.
The problem of language in Web searching has been discussed primarily in the area of cross-language information retrieval (CLIR). However, much CLIR research centers on investigation of the effectiveness of automatic translation techniques. The case study reported here explored bilingual user behaviors, perceptions, and preferences with respect to the capability of the Web as a multilingual information resource. Twenty-eight bilingual academic users from Myongji University in Korea were recruited for the study. Findings show that the subjects did not use Web search engines as multilingual tools. For search queries, they selected a language that represents their information need most accurately depending on the types of information task rather than choosing their first language. Subjects expressed concerns about the accuracy of machine translation of scholarly terminologies and preferred to have user control over multilingual Web searches.

11.
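An Evaluation and Comparison of Chinese and English Web Search Tools   Total citations: 10, self-citations: 1, citations by others: 9
Two Web search tools, AltaVista and Sohu, were tested with 10 queries drawn from actual reference inquiries, and were reviewed and compared against evaluation criteria covering five aspects: index database composition, search functions, retrieval effectiveness, display of search results, and user burden. Because the total amount of relevant information on the Web cannot be estimated, the calculation of recall R is omitted; instead, two new indicators, the duplication rate Rr and the dead-link rate Rd, are adopted as criteria for evaluating retrieval effectiveness. Building on this evaluation, the paper analyzes where Chinese search tools fall short, and how the choice of evaluation criteria should differ according to the characteristics of Chinese and English search tools.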

12.
Query suggestion, which enables the user to revise a query with a single click, has become one of the most fundamental features of Web search engines. However, it has not been clear what circumstances cause the user to turn to query suggestion. In order to investigate when and how the user uses query suggestion, we analyzed three kinds of data sets obtained from a major commercial Web search engine, comprising approximately 126 million unique queries, 876 million query suggestions and 306 million action patterns of users. Our analysis shows that query suggestions are often used (1) when the original query is a rare query, (2) when the original query is a single-term query, (3) when query suggestions are unambiguous, (4) when query suggestions are generalizations or error corrections of the original query, and (5) after the user has clicked on several URLs in the first search result page. Our results suggest that search engines should provide better assistance especially when rare or single-term queries are input, and that they should dynamically provide query suggestions according to the searcher’s current state.

13.
This article examined the search functions for all individual EAD Web sites listed on the Library of Congress Web site in 2003. In particular, the type of search engine, search modes, options for searching, search results display, search feedback, and other features of the search systems were studied. The data analysis suggests that there have been some improvements for EAD finding aids within Web sites, but also that problems persist. In addition, the functionality of search systems on Web sites varied considerably and the advantages of EAD finding aids for hierarchical searching have not been fully realized. The article also offers observations about cooperative EAD projects, the issue of search queries, and the relationship between Google and EAD Web sites.

14.
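Children and teenagers are gradually becoming a major group of Internet users. Many fairly full-featured children's search engines have appeared abroad, yet Chinese-language children's search engines remain rare. The article first analyzes the need for building children's search engines, then introduces the foreign children's search engines KidRex and Yahoo! Kids and how effectively they block harmful websites, as well as the better-known Chinese children's search engines "少儿信息港" and 小蕃薯. Finally, it compares Chinese and foreign children's search engines and offers suggestions for building children's search engines.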

15.
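To reduce the large amount of duplicate and redundant information returned by ineffective member search engines in a metasearch engine, lighten the burden of later result processing, and improve system precision, the article proposes a reward-based scheduling strategy for member search engines. The strategy introduces Agent technology, quantifies how important each member search engine Agent is to a query, and selects the several member search engines with the best retrieval performance for scheduling. Experimental results show that this reward-based scheduling strategy outperforms traditional member search engine scheduling strategies in improving precision, shortening query time, and reducing the metasearch engine's later result-processing burden.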

16.
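Research Progress on Search Engines for WWW Network Information Resources   Total citations: 6, self-citations: 0, citations by others: 6
Xia Xu, Li Jiankang, Fang Ping, 《图书馆论坛》, 2000, 20(5): 32-35, 68
The YAHOO subject directory, launched in 1994 by Jerry Yang (杨致远) and colleagues, opened the era of WWW information retrieval and made research on Web search engines and subject directories a current hot topic in China and abroad. A large body of literature reports on comparative studies of domestic and foreign search engines, their development and use, the evaluation of search engine quality and performance, and the selection of search engines. This paper reviews the research progress in these areas.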

17.
When searching for health information, results quality can be judged against available scientific evidence: Do search engines return advice consistent with evidence-based medicine? We compared the performance of domain-specific health and depression search engines against a general-purpose engine (Google) on both relevance of results and quality of advice. Over 101 queries, to which the term ‘depression’ was added if not already present, Google returned more relevant results than those of the domain-specific engines. However, over the 50 treatment-related queries, Google returned 70 pages recommending for or against a well-studied treatment, of which 19 strongly disagreed with the scientific evidence. A domain-specific index of 4 sites selected by domain experts was only wrong in 5 of 50 recommendations. Analysis suggests a tension between relevance and quality. Indexing more pages can give a greater number of relevant results, but selective inclusion can give better quality.

18.
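A Survey and Analysis of Foreign Semantic Search Engines   Total citations: 1, self-citations: 0, citations by others: 1
Guo Weining, Si Li, 《图书情报工作》, 2013, 57(23): 121-129
Thirteen representative foreign semantic search engines, including Hakia, SenseBot, Swoogle, and Kngine, were collected. From the perspective of user experience, and through website visits and hands-on tests, the study investigates and analyzes in depth their functional positioning, search scope, and search methods, and especially their semantic search technologies and ways of presenting results. The study shows that, compared with the link lists of traditional search engines, semantic search engines can provide more precise and direct answers to queries and can offer users more intelligent, user-friendly, and knowledge-oriented services. Finally, the problems, limitations, and development trends of semantic search engines are discussed.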

19.
The availability of web search engines offers opportunities in addition to those provided by bibliographic databases for identifying academic literature, but their usefulness for retrieving research is uncertain. A rigorous literature search was undertaken to investigate whether web search engines might replace bibliographic databases, using empirical research in health and social care as a case study. Eight databases and five web search engines were searched between 20 July and 6 August 2015. Sixteen unique studies which compared at least one database with at least one web search engine were examined, and lessons were drawn from the authors’ own search process. Web search engines were limited in that the searcher cannot be certain that the principles of Boolean logic apply and they were more limited than bibliographic databases in their functions, such as exporting abstracts. Recommendations are made for improving the rigour and quality of reporting studies of academic literature searching.

20.
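This paper introduces a new implementation of a site search engine, built as secondary development on top of large general-purpose search engines such as Google and Baidu. Compared with similar applications, it has two advantages: the results page is clean, free of advertisements, promotional information, and other extraneous content; and multiple domain names can be specified at once, allowing simultaneous searching across a main site, its subsites, and related sites.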
