Week 8: Data Stream Mining
In this practical, we use the stream R package for analysing stream data. Please install the stream package to complete the practical.
I. Creating a data stream
-
We firstly create a generator to generate stream data points that will belong to one of three clusters
(k=3)
. Each data point will have 2 dimensions(d=2)
. The data points will follow Gaussian distribution with 5% noise. When a new data point is requested from this data generator, a cluster will be chosen randomly using the probability weights inp
.library("stream") stream <- DSD_Gaussians(k = 3, d = 2, noise = .05, p = c(.5, .3, .1)) stream
-
Generate 5 data points using the generator.