  • Paperback

Product description
This work investigates a parallel K-Means clustering algorithm, based on the MapReduce programming model, as a way to reduce the response time of data mining tasks. The algorithm's performance was evaluated in terms of SpeedUp and ScaleUp. To this end, experiments were run on a Hadoop cluster of six computers with standard hardware, using 3, 4, and 6 machines. The clustered data are measurements from flux towers in agricultural regions, provided by the AmeriFlux network. The results showed that performance improved as machines were added: the best time was obtained with six machines, reaching a SpeedUp of 3.25. The application also scaled well when the data size and the number of machines in the cluster were increased proportionally, with similar performance across those tests.
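
For readers unfamiliar with the approach, one K-Means iteration in MapReduce style might look roughly like the sketch below. It is a single-process illustration in plain Python, not the book's Hadoop implementation: the function names, the toy points, and the timing values are all hypothetical. The map step assigns each point to its nearest centroid, and the reduce step averages the points of each cluster to get the new centroids. SpeedUp is taken here as baseline time divided by time on the larger cluster, which is the common definition; the description does not state which baseline the book uses.

```python
import math


def nearest(point, centroids):
    """Return the index of the centroid closest to point (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda i: math.dist(point, centroids[i]))


def map_phase(points, centroids):
    """Map: emit (cluster_id, (point, 1)) for every input point."""
    for p in points:
        yield nearest(p, centroids), (p, 1)


def reduce_phase(pairs, centroids):
    """Reduce: sum points and counts per cluster, then average them."""
    sums = {}
    for cid, (p, n) in pairs:
        total, count = sums.get(cid, ([0.0] * len(p), 0))
        sums[cid] = ([t + x for t, x in zip(total, p)], count + n)
    new_centroids = list(centroids)  # clusters with no points keep their old centroid
    for cid, (total, count) in sums.items():
        new_centroids[cid] = tuple(t / count for t in total)
    return new_centroids


def kmeans(points, centroids, iterations=10):
    """Run a fixed number of map/reduce rounds (no convergence check in this sketch)."""
    for _ in range(iterations):
        centroids = reduce_phase(map_phase(points, centroids), centroids)
    return centroids


def speedup(t_baseline, t_parallel):
    """Assumed definition: baseline run time divided by run time on more machines."""
    return t_baseline / t_parallel


# Hypothetical toy data, not the AmeriFlux measurements.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 10.0), (10.0, 12.0)]
result = kmeans(points, [(0.0, 0.0), (10.0, 10.0)])
```

On a real Hadoop cluster the map calls run in parallel over splits of the data and the framework groups the emitted pairs by cluster id before the reduce step; the sketch keeps only the data flow.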
About the author
She is currently a doctoral student in Computer Science at the Pontifical Catholic University of Paraná (PUCPR). She obtained a master's degree in Applied Computing from the State University of Ponta Grossa in 2015. She has a bachelor's degree in Systems Analysis and Development from the Federal Technological University of Paraná (2012).