TEXT MINING MENGGUNAKAN GENERATE ASSOCIATION RULE WITH WEIGHT (GARW) ALGORITHM UNTUK ANALISIS TEKS WEB CRAWLER

Zulkifli Arsyad

Abstract


Text mining is widely used to find hidden patterns and information in a large number of semi and unstructured texts. Text mining extracts interesting patterns to explore knowledge from textual data sources. Association rule extraction GARW (Generating Association Rule using Weighting Scheme) can be used to find knowledge from a collection of web content without having to read all the web content manually from the many search results of crawlers. The GARW algorithm is a development of a priori to produce relevant association rules. From the results of this knowledge discovery can facilitate netizens users in finding relevant information from search keywords without having to review one by one web content generated from search engine searches.

Keywords


Text Mining; Association Rule; GARW; Web Crawler

Full Text:

PDF

References


J. R and V. S, "Text Mining Reserach: A Survey," International Journal of Innovative Research in Computer and Communication Engineering, p. 6544, 2016.

R. Talib, M. K. Hanif, S. Ayesha, and F. Fatima, "Text Mining : Techniques, Applications and Issues," International Journal of Advanced Computer Science and Applications, vol. 7, no. 11, p. 414, 2016.

A. Kaushik and S. Naithani, "A Comprehensive Study of Text Mining Approach ," International Journal of Computer Science and Networking Security, vol. 16, no. 2, p. 69, Feb. 2016.

M. Kulkarni and S. Kulkarni, "KNowledge Discovery in Text Mining using Association Role Extraction," International Journal of Computer Application, vol. 143, no. 12, p. 31, Jun. 2016.

V. Bhujade and N. J. Janwe, "Knowledge Discovery in Text Mining Technique Using Association Rules Extraction," International Conference on Computational Intelligence and Communication Systems, pp. 498-502, 2011.

C. Olston and M. Najork, "Web Crawling," p. 176, 2010.

R. Feldman and J. Sanger, The Text Mining Handbook Advanced Approaches in Analyzing Unstructured Data, 1, Ed. Cambrige, UK: Cambrige University, 2007.

Tala, F.Z, "A Study of Stemming Effects on Information Retrieval in Bahasa IndonesiaM.S. thesis. M.Sc. Thesis. Master of Logic Project. Institute for Logic, Language and Computation," Universiteti van Amsterdam The Netherlands Thesis, 2003.

G. Bathla and R. Jindal, "Similarity Measures of Research Papers and Patents," International Journal of Computer Applications , vol. 33, p. 10, Nov. 2011.

H. Mahgoub, D. Rösner, and N. Ismail, "A Text Mining Technique Using Association Rules Extraction," International Journal of Information and Mathematical Sciences, pp. 21-28, 2008.

M. Najork and A. Heydon, High Performance Web Crawling. California, USA: SRC, 2001.

N. N. Pawar and Rajeswari, "Study Of Different Focused Web Crawler to Search Domain Spesific Information," International Journal of Computer Applications, vol. 136, no. 11, p. 1, Feb. 2016.

D. T. V. R. Md. Umar Khan, "XWADF: Architectural Pattern for Improving Performance of Web Application," IJCSI, vol. 11, no. 2, p. 1, 2014.

F. Chen, et al., "Data Mining for the Internet of Thinks: Literature Review and challanges," International Journal of Distributed Sensor Network, p. 14, 2015.




DOI: https://doi.org/10.32627/internal.v2i2.86

Refbacks

  • There are currently no refbacks.


Copyright (c) 2020 Zulkifli Arsyad

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Journal INTERNAL telah terindeks pada :