The purpose of this project is to analyse YouTubes public data sets. This work will be carried out by first breaking the data sets down by region. Then using the Python programming language paired with machine learning algorithms to retrieve the required results.
- The most common languages in each region excluding its native language?
- This will give an estimated percentage of its populations nationality, allowing more specific marketing strategies.
- Algorithm to automatically generate tags based on analysis of top performing videos.
- This will allow a user to generate the most effective tags to increase their videos reach.
- Most popular categories in each region based on interactions with the video? i.e. likes and comments.
- This will allow individuals to choose the most effective category for video growth as interactions directly correlate to a videos popularity.