首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
基于Web数据挖掘的个性化信息智能Agent挖掘系统模型   总被引:1,自引:0,他引:1  
张莉 《科技广场》2006,31(8):53-55
个性化智能挖掘是近几年出现的一个崭新的研究方向,它是人工智能与数据挖掘技术在Web或Internet环境下相互融合的产物。大部分个性化信息挖掘都只是建立在纯粹的Web数据挖掘之上,然而面对大部分的智能化技术的出现,面对用户能够快速准确地检索自己最想要的信息的需求,Web数据挖掘要进行相应的扩展,通过将Web数据挖掘技术和智能Agent技术相结合,从而满足用户的需求。本文主要提出两个模型:典型的个性化Web挖掘模型和个性化Agent智能挖掘模型。  相似文献   

2.
Web3.0环境自主学习平台建设模式研究   总被引:1,自引:0,他引:1  
随着互联网的迅速普及与信息量的日益增长,自主学习显得日渐重要。针对现有自主学习平台方案个性化、自主性不强的缺陷,结合web3.0的个性化、智能化等特点,提出新的自主学习平台建设方案。详细介绍了Web3.0环境自主学习平台的实施方案与主要功能实现,论述了基于Web3.0技术的自主学习平台所具有的独特优势,以期为个性化自主学习平台建设提供总体技术参考。  相似文献   

3.
Drawing on the ideas of the Sense-Making approach, the ways in which people face and bridge gaps in Web searching are analyzed. The empirical study is based on videotaped Web searches conducted by seven participants. Altogether 11 gaps and 13 search tactics of various types were identified. The gaps faced by the searchers originated from three major factors: problematic content of information, insufficient search competence and problems caused by the search environment. Of individual gaps, no relevant material available, inaccessible content and confusion were most frequent. Of the search tactics used in gap-bridging, following links and activating the Back button were most popular. Gaps related to the problematic content of information led the informants to redirect the search to find Web pages that focus better on the search topic. If the movement was stopped by insufficient search competence, the searchers tended to return to material that was familiar from earlier use contexts in order to regain control of the search process. Alternatively, they tried to specify the search terms. In cases where the search was interrupted by technical problems or other factors originating from the search system, gap-bridging aimed at returning to familiar and technically reliable links. The Sense-Making theory provides relevant conceptual tools to approach the dynamic and discontinuous nature of Web searching in terms of gap-facing and gap-bridging. The concept of gap-facing enables a context-sensitive analysis of the ways in which Web search processes may be stopped. Gap-bridging indicates a general level motive to find alternative ways to continue searching.  相似文献   

4.
This paper presents not only mycommunityinfo.ca (MCI) as an innovative World Wide Web (WWW)-based community information (CI) site, but also how its unique approach to facilitating online CI searching on the Web reveals through empirical data how people use such information and communication technologies (ICTs) to address their everyday information needs. The geographic focus for this study is on three communities in Southwestern Ontario. MCI collects unobtrusively query data that are logged daily from its own Web site, the Web sites of three municipal governments, and one municipal agency from this region. One year’s worth of these data was supplied to determine the types of CI that are sought through Web searching. A content analysis of a large purposive sample of all of MCI’s query data reveals more specific and diverse conceptual CI needs between and within communities than those reported in other studies employing different data collection methods. As a result, using a centralized approach to online CI access via the WWW by other CI providers such as the 211 network may be a disservice to its users. Additionally, the findings demonstrate how a thorough analysis of such data may improve the informational content and overall design of municipal government Web sites. The analysis of these data also has the potential of improving current CI taxonomies.  相似文献   

5.
Understanding users’ navigation on the Web is important towards improving the quality of information and the speed of accessing large-scale Web data sources. Clustering of users’ navigation into sessions has been proposed in order to identify patterns and similarities which are then managed in the context of Web users oriented applications (searching, e-commerce, etc.). This paper deals with the problem of assessing the quality of user session clusters in order to make inferences regarding the users’ navigation behavior. A common model-based clustering algorithm is used to result in clusters of Web users’ sessions. These clusters are validated by using a statistical test, which measures the distances of the clusters’ distributions to infer their dissimilarity and distinguishing level. Furthermore, a visualization method is proposed in order to interpret the relation between clusters. Using real data sets, we illustrate how the proposed analysis can be applied in popular application scenarios to reveal valuable associations among Web users’ navigation sessions.  相似文献   

6.
针对传统的基于Web图的垂直搜索策略Authorities and Hubs,提出了一种融合了网页内容评价和Web图的启发式垂直搜索策略,此外,引入向量空间模型进行针对网页内容的主题相关度判断,进一步提高主题网页下载的准确率.实验表明,文中算法有效地提高了主题网页的聚合程度,且随着网页下载数量的增加,垂直搜索引擎的准确率逐渐递增,并在下载网页达到一定数量后,准确率趋于稳定,算法具有较好的鲁棒性,可以应用到相关垂直搜索引擎系统中.  相似文献   

7.
基于XML本体描述语言的数字图书馆Web信息资源整合   总被引:2,自引:0,他引:2  
王军 《现代情报》2007,27(11):84-86
信息资源整合是实现数字图书馆建设目标的关键.而信息资源的整合涉及诸多快速发展的新技术和新方法。本文从概述数字图书馆Web信息资源整合入手,论述了XML作为与本体语言紧密相关的关键性技术,在数字图书馆Web信息资源整合方面的优势;探讨了基于XML本体语言描述的数字图书馆Web信息资源整合系统的功能及其实现途径;提出了基于该系统的数字图书馆Web信息资源整合及其配套工作,需要进一步研究和实践的几个主要问题。  相似文献   

8.
Frequent requests from users to search engines on the World Wide Web are to search for information about people using personal names. Current search engines only return sets of documents containing the name queried, but, as several people usually share a personal name, the resulting sets often contain documents relevant to several people. It is necessary to disambiguate people in these result sets in order to to help users find the person of interest more readily. In the task of name disambiguation, effective measurement of similarities in the documents is a crucial step towards the final disambiguation. We propose a new method that uses web directories as a knowledge base to find common contexts in documents and uses the common contexts measure to determine document similarities. Experiments, conducted on documents mentioning real people on the web, together with several famous web directory structures, suggest that there are significant advantages in using web directories to disambiguate people compared with other conventional methods.  相似文献   

9.
This article presents conceptual navigation and NavCon, an architecture that implements this navigation in World Wide Web pages. NavCon architecture makes use of ontology as metadata to contextualize user search for information. Based on ontologies, NavCon automatically inserts conceptual links in Web pages. By using these links, the user may navigate in a graph representing ontology concepts and their relationships. By browsing this graph, it is possible to reach documents associated with the user desired ontology concept. This Web navigation supported by ontology concepts we call conceptual navigation. Conceptual navigation is a technique to browse Web sites within a context. The context filters relevant retrieved information. The context also drives user navigation through paths that meet his needs. A company may implement conceptual navigation to improve user search for information in a knowledge management environment. We suggest that the use of an ontology to conduct navigation in an Intranet may help the user to have a better understanding about the knowledge structure of the company.  相似文献   

10.
沈奎林  杜瑾 《现代情报》2010,30(10):140-143,154
以2004-2009年间CNKI数据库收录的关于web2.0的文献为研究对象,运用文献计量学方法,分别进行载文分析、著者分析、期刊分析和主题分析。基于这些分析,结合web2.0产生的历史和发展的现状,提出在该领域研究的核心作者和核心期刊等,指出目前的研究重点和存在的不足之处,进而探讨未来该领域研究的发展方向,以期对国内图书情报界对于web2.0在图书馆的应用方面的研究提供借鉴。  相似文献   

11.
In this paper, we define and present a comprehensive classification of user intent for Web searching. The classification consists of three hierarchical levels of informational, navigational, and transactional intent. After deriving attributes of each, we then developed a software application that automatically classified queries using a Web search engine log of over a million and a half queries submitted by several hundred thousand users. Our findings show that more than 80% of Web queries are informational in nature, with about 10% each being navigational and transactional. In order to validate the accuracy of our algorithm, we manually coded 400 queries and compared the results from this manual classification to the results determined by the automated method. This comparison showed that the automatic classification has an accuracy of 74%. Of the remaining 25% of the queries, the user intent is vague or multi-faceted, pointing to the need for probabilistic classification. We discuss how search engines can use knowledge of user intent to provide more targeted and relevant results in Web searching.  相似文献   

12.
刘志辉 《情报探索》2020,(6):132-134
[目的/意义]对社会科学视角下Web主题结构挖掘研究的最新著作进行述评。[方法/过程]从数据采集技术、主题信息抽取和专题Web信息资源组织等方面,介绍了《学术Web主题结构挖掘研究》一书的核心内容,并点评了其主要研究贡献。[结果/结论]该书为宏观Web主题结构分析研究提供了系统化的解决方案,是近年来这一研究方向为数不多的专著。  相似文献   

13.
随着网络的发展和广泛应用,Internet中的信息已经由静态方式发展为动态的交互方式,网站与数据库的连接也显得越来越重要。先简要介绍了目前流行的几种Web数据库访问技术,进而详细探讨了数据库技术在专题学习网站中的应用。  相似文献   

14.
张云坤 《现代情报》2010,30(10):74-76
互联网和信息数字化的高度发展为信息查询带来了新的挑战,如何改善查询模式使用户从海量的数据中快速找到真正需要的信息成为了当前业界亟待解决的课题。本文描述了一种个性化信息服务模型,使用DeepWeb数据集成技术设计了一个图书馆数字资源个性化信息服务系统,实现了以用户为中心的图书馆信息服务。  相似文献   

15.
主要介绍了主题搜索引擎、网络爬虫的基本概念和Heritrix系统的体系结构,分析了Heritrix的工作流程,在Heritrix框架的基础上进行扩展和优化。通过一个实例,实现了对京东网图书信息的抓取,为建立面向图书信息的垂直搜索引擎提供了网页信息资源。  相似文献   

16.
随着Web应用的普及,其安全面临着前所未有的挑战。针对Windows平台中广泛使用的IIS服务器,利用ISAPI在Web应用层设计并实现安全防护系统,可对常见的Web攻击,如SQL注入、XSS攻击、Cookies攻击进行防御。实验表明,该安全防护系统可以在一定程度上保护网站的应用层安全,有效提高网站安全性。  相似文献   

17.
对网络信息资源的评价与管理一直是人们研究和讨论的热点之一.本文提出可用网站被引分析法对学术网站进行评价,并对网站被引分析法的特点、相关评价步骤和指标进行了阐述.以国内100所大学的网站为例进行实证分析,得出大学网站同被引的相关数据,通过多维尺度分析和Pajek,对网站之间的关系进行了可视化展示.表明从同被引的角度可以较好地评价大学网站间的关系,网站被引分析法是一种值得深入研究与发展的评价方法.  相似文献   

18.
用户生成内容是Web2.0下由网络用户创作的互联网信息,分析其相关理论研究进展从而为UGC的进一步研究及商业利用提供思路具有重要的参考意义。本文在界定了UGC产生的背景基础上,从UGC基本范畴、生成动机、质量问题和法律问题4方面,分析了UGC研究动态;探究UGC情感分析中情感词语和情感量化问题,归纳现有情感分析涉及的算法应用。在研究现状分析基础上,指出下一步应重点研究促进网络用户生产出高质量UGC的激励和约束政策,建立更加完善、准确的推荐模型,设计符合不同应用领域的情感词量化方法,构建保护用户隐私的UGC生产、利用机制。  相似文献   

19.
A critical challenge for Web search engines concerns how they present relevant results to searchers. The traditional approach is to produce a ranked list of results with title and summary (snippet) information, and these snippets are usually chosen based on the current query. Snippets play a vital sensemaking role, helping searchers to efficiently make sense of a collection of search results, as well as determine the likely relevance of individual results. Recently researchers have begun to explore how snippets might also be adapted based on searcher preferences as a way to better highlight relevant results to the searcher. In this paper we focus on the role of snippets in collaborative web search and describe a technique for summarizing search results that harnesses the collaborative search behaviour of communities of like-minded searchers to produce snippets that are more focused on the preferences of the searchers. We go on to show how this so-called social summarization technique can generate summaries that are significantly better adapted to searcher preferences and describe a novel personalized search interface that combines result recommendation with social summarization.  相似文献   

20.
科技信息的网络动态监测和信息自动获取技术研究   总被引:1,自引:0,他引:1  
赵燕平  朱东华 《科学学研究》2003,21(Z1):230-237
文章综述了国内外基于Robot的Web信息检索(IR)、定题检索、智能信息检索及其代理以及相关的信息自动获取技术的有关理论、方法和技术,探讨了适合技术预测与评估的需要、用于科学领域专题信息采集的、网络科技信息智能获取的方法和策略。构建了网络科技信息动态监测和信息自动获取系统的总体框架,实现了一个基于Web的系统原型(简称BIT),并分析了该系统的特点。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号