Simple projects during my self-learning of Machine Learning in Python. Except for Spark files, run the other files in Jupyter Notebooks for better results.
Classfies mails as Spam or Not Spam(Ham).
Clustering people based on income and age.
Useful for hiring considering the factors if the candidate was employed before, has done an internship, has attended top-tier school.
Finding all similar movies from a data-set.
Make movie recommendations to a person, based on the movie watched priorly by that person, using Item Based Classification.
Using K-Nearest-Neighbours algorithm for making movie reccomendations in a more effective way.
Find the shortest way for a taxi, from an existing point to pickup point and from pickup point to drop point using Q-Learning.
Cleaning web access log to figure out most viewed pages on a website from it.
To execute: spark-submit FILENAME.py
You give a word, it tells you which is the best article in wikipedia you can read to learn about that word.
How to perform linear regression using Spark.