SEMANTIC FOCUSED WEB CRAWLER FOR SERVICE DISCOVERY USING DATA MINING TECHNIQUE
AbstractData mining is the process of extraction of hidden predictive information from the huge databases. It is a new technology with great latent to help companies focus on the most important information in their data warehouses. Web mining is a data mining techniques which automatically discover information from web documents. The amount of data and its dynamicity makes it impossible to crawl the World Wide Web (WWW) completely. Itâ€™s a challenge in front of crawlers to crawl only the relevant pages from this information explosion. Thus a focused crawler solves this issue of relevancy by focusing on web pages for some given topic or a set of topics. Nowadays finding meaningful information among the billions of information resources on the World Wide Web is a difficult task due to growing popularity of the Internet. This paper basically focuses on study of the various techniques of data mining for finding the relevant information from World Wide Web using web crawler.
Lu LIU, Tao PENG â€œClustering-based topical Web crawling using CFu-tree guided by link-contextâ€ in Higher Education Press and Springer-Verlag Berlin Heidelberg 2014
Hai Dong, Farookh Khadeer Hussain, and Elizabeth Chang â€œOntology-Learning-Based Focused Crawling for Online Service Advertising Information Discovery and Classificationâ€ in Springer-Verlag Berlin Heidelberg 2012
Rodolfo Zunino, Roberto Surlinelli â€œAn Analyst-Adaptive Approach to Focused Crawlersâ€ in 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Hai Dong, Farookh Khadeer Hussain â€œSelf-Adaptive Semantic Focused Crawler for Mining Services Information Discoveryâ€ in IEEE Transactions On Industrial Informatics, Vol. 10, No. 2, May 2014
Hardik P. Trivedi, Gaurav N. Daxini, Jignesh A. Oswal, Vinay D. Gor, Swati Mali â€œAn Approach to Design Personalized Focused Crawlerâ€ in International Journal of Computer Science and Engineering Volume-2, Issue-3 E-ISSN: 2347-2693
Bireshwar Ganguly, Devashri Raich â€œPerformance Optimization of Focused Web Crawling Using Content Block Segmentationâ€™â€™ in 978-1-4799-2102-7/14 $31.00 Â© 2014 IEEE DOI 10.1109/ICESC.2014.69
Hai Dong, Farookh Khadeer Hussain, Elizabeth Chang â€œA Transport Service Ontology based Focused Crawlerâ€ in 2008 IEEE
R.Eswaramoorthy, M.Jayanthi â€œA Survey on Detection of Mining Service Information Discovery Using SASF Crawlerâ€ in International Journal of Innovative Research in Computer and Communication Engineering Vol. 2, Issue 10, October 2014
Boser BE, Guyon IM, Vapnik VN. A training algorithm for optimal margin classifiers. Proceedings of the Fifth Annual Workshop on Computational Learning Theory, ACM: Pennsylvania, United States, 1992; 144-152.
Osmar R. ZaÃ¯ane, â€œIntroduction to Data Miningâ€ in CMPUT690 Principles of Knowledge Discovery in Databases
Trupti V. Udapure, Ravindra D. Kale, Rajesh C. Dharmik, â€œStudy of Web Crawler and its Different Typesâ€ in IOSR Journal of Computer Engineering (IOSR-JCE)
The submitter hereby warrants that the Work (collectively, the “Materials”) is original and that he/she is the author of the Materials. To the extent the Materials incorporate text passages, figures, data or other material from the works of others, the undersigned has obtained any necessary permissions. Where necessary, the undersigned has obtained all third party permissions and consents to grant the license above and has all copies of such permissions and consents.
The submitter represents that he/she has the power and authority to make and execute this assignment. The submitter agrees to indemnify and hold harmless the COMPUSOFT from any damage or expense that may arise in the event of a breach of any of the warranties set forth above. For authenticity, validity and originality of the research paper the author/authors will be totally responsible.