期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Document replication strategies for geographically distributed web search engines

Enver Kayaaslan B. Barla Cambazoglu Cevdet Aykanat 《Information processing & management》2013

Large-scale web search engines are composed of multiple data centers that are geographically distant to each other. Typically, a user query is processed in a data center that is geographically close to the origin of the query, over a replica of the entire web index. Compared to a centralized, single-center search engine, this architecture offers lower query response times as the network latencies between the users and data centers are reduced. However, it does not scale well with increasing index sizes and query traffic volumes because queries are evaluated on the entire web index, which has to be replicated and maintained in all data centers. As a remedy to this scalability problem, we propose a document replication framework in which documents are selectively replicated on data centers based on regional user interests. Within this framework, we propose three different document replication strategies, each optimizing a different objective: reducing the potential search quality loss, the average query response time, or the total query workload of the search system. For all three strategies, we consider two alternative types of capacity constraints on index sizes of data centers. Moreover, we investigate the performance impact of query forwarding and result caching. We evaluate our strategies via detailed simulations, using a large query log and a document collection obtained from the Yahoo! web search engine. 相似文献

2.

Chat mining: Predicting user and message attributes in computer-mediated communication

Tayfun Kucukyilmaz B. Barla Cambazoglu Cevdet Aykanat Fazli Can 《Information processing & management》2008

The focus of this paper is to investigate the possibility of predicting several user and message attributes in text-based, real-time, online messaging services. For this purpose, a large collection of chat messages is examined. The applicability of various supervised classification techniques for extracting information from the chat messages is evaluated. Two competing models are used for defining the chat mining problem. A term-based approach is used to investigate the user and message attributes in the context of vocabulary use while a style-based approach is used to examine the chat messages according to the variations in the authors’ writing styles. Among 100 authors, the identity of an author is correctly predicted with 99.7% accuracy. Moreover, the reverse problem is exploited, and the effect of author attributes on computer-mediated communications is discussed. 相似文献

3.

Performance of query processing implementations in ranking-based text retrieval systems using inverted indices

B. Barla Cambazoglu Cevdet Aykanat 《Information processing & management》2006

Similarity calculations and document ranking form the computationally expensive parts of query processing in ranking-based text retrieval. In this work, for these calculations, 11 alternative implementation techniques are presented under four different categories, and their asymptotic time and space complexities are investigated. To our knowledge, six of these techniques are not discussed in any other publication before. Furthermore, analytical experiments are carried out on a 30 GB document collection to evaluate the practical performance of different implementations in terms of query processing time and space consumption. Advantages and disadvantages of each technique are illustrated under different querying scenarios, and several experiments that investigate the scalability of the implementations are presented. 相似文献

4.

Limiting GPR in a two-layer soil model via genetic algorithms

Baris Gursu Author Vitae Melih Cevdet Ince Author Vitae 《Journal of The Franklin Institute》2009,346(8):768-783

In this paper, an optimum grounding grid that provides the conditions of GPR<E_touch and minimum cost in the structures of two-layer soil model is designed and the length of total conductor and the quantity of ground rod are calculated via Genetic Algorithms (GA). A new approach is presented for the calculation of total conductor length. At the same time, the subject regarding in which layer the ground conductors and rods that form the grounding grid in a substation are to be placed in two-layer soil is analysed using GA. With this as the goal, the depth of optimum grid burial is determined. Our study is compared with the design study for a two-layer soil model in the literature. As a result, the high performance of optimum grid design that is achieved using GA is emphasized by varied applications. 相似文献

5.

The development of TPACK,Technology Integrated Self-Efficacy and Instructional Technology Outcome Expectations of pre-service physical education teachers

Cevdet Cengiz 《Asia-Pacific Journal of Teacher Education》2015,43(5):411-422

相似文献

6.

Architecture of a grid-enabled Web search engine

B. Barla Cambazoglu Evren Karaca Tayfun Kucukyilmaz Ata Turk Cevdet Aykanat 《Information processing & management》2007

Search Engine for South-East Europe (SE4SEE) is a socio-cultural search engine running on the grid infrastructure. It offers a personalized, on-demand, country-specific, category-based Web search facility. The main goal of SE4SEE is to attack the page freshness problem by performing the search on the original pages residing on the Web, rather than on the previously fetched copies as done in the traditional search engines. SE4SEE also aims to obtain high download rates in Web crawling by making use of the geographically distributed nature of the grid. In this work, we present the architectural design issues and implementation details of this search engine. We conduct various experiments to illustrate performance results obtained on a grid infrastructure and justify the use of the search strategy employed in SE4SEE. 相似文献