This project focuses on predicting whether the patients in the dataset will suffer from a stroke or not, by building a machine learning algorithm. I used three different big data technologies: Logistic Regression, Random Forest, and Artificial Neural Networks.