SEMANTIC FOCUSED WEB CRAWLER FOR SERVICE DISCOVERY USING DATA MINING TECHNIQUE

  • Ruchika Patel
  • Pooja Bhatt

Abstract

Data mining is the process of extracting hidden predictive information from large databases. It is a new technology with great potential to help companies focus on the most important information in their data warehouses. Web mining is a data mining technique that automatically discovers information from web documents. The volume of data on the World Wide Web (WWW) and its dynamic nature make it impossible to crawl the Web completely, and the growing popularity of the Internet makes finding meaningful information among billions of resources increasingly difficult. The challenge for a crawler is therefore to fetch only the relevant pages. A focused crawler addresses this relevancy problem by restricting itself to web pages on a given topic or set of topics. This paper surveys data mining techniques for finding relevant information on the World Wide Web using a web crawler.
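
The idea of a focused crawler can be illustrated with a minimal sketch. It is not the method proposed in the paper: the in-memory toy web (`TOY_WEB`), the topic term set (`TOPIC`), the keyword-overlap `relevance` score, and the `threshold` value are all assumptions chosen for illustration. The sketch performs a best-first crawl that follows links only from pages judged relevant to the topic.

```python
import heapq

# Hypothetical in-memory "web": URL -> (page text, outgoing links).
TOY_WEB = {
    "a": ("transport service logistics freight", ["b", "c"]),
    "b": ("freight transport service booking", ["d"]),
    "c": ("celebrity gossip and movie news", ["e"]),
    "d": ("logistics service provider directory", []),
    "e": ("movie reviews", []),
}

# Assumed topic description: a plain set of keywords.
TOPIC = {"transport", "service", "logistics", "freight"}


def relevance(text, topic=TOPIC):
    """Fraction of topic terms that appear in the page text."""
    words = set(text.lower().split())
    return len(words & topic) / len(topic)


def focused_crawl(seed, web=TOY_WEB, threshold=0.25, max_pages=10):
    """Best-first crawl that expands only pages meeting the threshold."""
    frontier = [(-1.0, seed)]  # heapq is a min-heap, so priorities are negated
    seen = {seed}
    relevant = []
    while frontier and len(relevant) < max_pages:
        _, url = heapq.heappop(frontier)
        text, links = web[url]
        score = relevance(text)
        if score < threshold:
            continue  # off-topic page: its links are never followed
        relevant.append((url, score))
        for link in links:
            if link not in seen and link in web:
                seen.add(link)
                # A link inherits its parent's score as its crawl priority.
                heapq.heappush(frontier, (-score, link))
    return relevant


if __name__ == "__main__":
    for url, score in focused_crawl("a"):
        print(url, score)
```

In this toy graph, page "c" is off-topic, so the page it links to is never fetched. A real crawler would replace the keyword overlap with a trained classifier or an ontology-based similarity measure, and the dictionary lookup with HTTP requests.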

Published
2015-10-07
How to Cite
Patel, R., & Bhatt, P. (2015). SEMANTIC FOCUSED WEB CRAWLER FOR SERVICE DISCOVERY USING DATA MINING TECHNIQUE. COMPUSOFT: An International Journal of Advanced Computer Technology, 4(7). Retrieved from https://www.ijact.in/index.php/ijact/article/view/21