COMPARISON OF K-MEANS CLUSTERING METHOD AND K-MEDOIDS ON TWITTER DATA

Authors

  • Cahyani Oktarina Department of Statistics, IPB University, Indonesia
  • Khairil Anwar Notodiputro Department of Statistics, IPB University, Indonesia
  • Indahwati Indahwati Department of Statistics, IPB University, Indonesia

DOI:

https://doi.org/10.29244/ijsa.v4i1.599

Keywords:

text mining, clustering, k-means, k-medoids, twitter

Abstract

The presidential election is one of the political events that occur in Indonesia once in five years. Public satisfaction and dissatisfaction with political issues have led to an increase in the number of political opinion tweets. The purpose of this study is to examine the performance of the k-means and k-medoids method in the Twitter data and to tweet about the presidential election in 2019. The data used in this study are primary data taken from Muhyi's research, then mining the text against data obtained. Because this data has been processed by Muhyi to analyze the electability of the 2019 presidential candidate pairs, for this journal needs a preprocessing was carried out to analyze the tendency of tweets to side with the candidate pairs of one or two. The difference in the pre-processing of this research with previous research is that there is a cleaning of duplicate data and normalizing. The results of this study indicate that the optimal number of clusters resulting from the k-means method and the k-medoid method are different.

Downloads

Download data is not yet available.

Downloads

Published

2020-02-28

How to Cite

Oktarina, C., Notodiputro, K. A., & Indahwati, I. (2020). COMPARISON OF K-MEANS CLUSTERING METHOD AND K-MEDOIDS ON TWITTER DATA. Indonesian Journal of Statistics and Its Applications, 4(1), 189–202. https://doi.org/10.29244/ijsa.v4i1.599

Issue

Section

Articles