To build your movie recommendation engine, we will be using one of the MovieLens
datasets, which are available in several sizes.
These datasets are made available by the GroupLens
Research © group, which collected the rating data from the MovieLens
web site over various periods of time.
The dataset that we will be using for this series is the small version of the MovieLens
Latest Datasets, downloadable here.
This dataset, thanks to its size, can easily be used with your SAP HANA MDC instance on the SAP Cloud Platform developer/trial account.
If you have an SAP Cloud Platform productive account with a larger SAP HANA instance, you can run this tutorial series with the larger datasets, but note that the validation steps were built for the “small dataset”.
Before using these datasets, please review the README file for the usage licenses and other details.