Blog dataset

The Utrecht Housing dataset: A housing appraisal dataset
Van Otterloo, S and Burda, P. 2025. The Utrecht Housing dataset: A housing appraisal dataset. Computers and Society Research Journal (2025), 1 DOI: https://doi.org/10.54822/QVHM1662 Abstract This paper introduces a real-world dataset for analysing and predicting house prices. The dataset consists of actual data on the Dutch housing market collected in 2024 for a total of 153…
Sieuwert van Otterloo
Legocolor: a computer vision dataset for learning datascience
The Legocolor dataset is a new dataset designed to test common data science techniques such as k-nearest neighbour and decision trees. The dataset consists of color samples (red, green and blue value) from real world images of Lego and the official lego color of the brick, and the goal is to train a model to…
Sieuwert van Otterloo
The Utrecht housing dataset – example dataset for prediction
The Utrecht housing dataset is a freely available dataset that can be used by students to learn about data science and machine learning. It is suitable for multiple techniques, including decision trees, linear regression, logistic regression and neural networks. Note: the dataset was updated in 2025, see a detailed description of the latest 2025 version…
Sieuwert van Otterloo