Blog dataset

Legocolor: a computer vision dataset for learning data science
The Legocolor dataset is a new dataset designed to test common data science techniques such as k-nearest neighbour and decision trees. The dataset consists of color samples (red, green and blue values) from real-world images of Lego bricks, together with the official Lego color of each brick, and the goal is to train a model to…
Sieuwert van Otterloo
The Utrecht housing dataset – example dataset for prediction
The Utrecht housing dataset is a freely available dataset that students can use to learn about data science and machine learning. It is a synthetic dataset derived from actual data about the Dutch housing market. Since it is synthetic, it contains no noise, offers high-quality data, and comes in three different…
Sieuwert van Otterloo