Heart Disease Website

Models

To view more information about the models, please click on the image.

Decision Trees

Decision tree models use supervised learning and can be used for classification and regression. They classify data by passing it through different nodes based the data's attributes. The data is then classified once it reaches a “leaf”, or a node that does not split.
Random Forest

Random Forests are an ensemble learning method that utilizes multiple decision trees to reach an outcome. The individual decision trees classify the data and then vote on the final outcome.
Adaboost

Adaboost is a ensemble learning method that uses multiple decision stumps, which have one node and two leaves as opposed to trees which contain multiple, to classify data.
Logistic Regression

Logistic Regression models make binary predictions based on the relationship between one or more independent variables.
Support Vector Classifier

Support Vector Classification Model uses two groups of data to categorize new data. It does this by plotting the data onto a graph, and then using a hyperplane to split the data according to the group that the data belongs to.
Naive Bayes

Naive Bayes classifies a point by its given x values and figures out the probability of y given the x variables, in order to classify the data point.
K Nearest Neighbors

KNN stands for K-Nearest-Neighbors. It finds the data points that the input are closest to and then uses that data to classifies the input.
Neural Network

A neural network tries to simulate the human brain through layers and nodes. The data passes through the an input layer, hidden layers, and ultimately an output layer which are all connected through nodes.
Model Scores

We evaluated all these models based on their accuracy score, precision score, recall score, and f1 score. Click on the image to view both the scores and what they mean for each model.

Conclusion

In conclusion, we wanted to make different machine learning models to see if we can predict heart disease, and we did exactly that. In the end, we found that adaboost, random forest, and k nearest neighbors were our best models, while mlp and random forest grid were our worst models.If we were to improve our project for the future, we would try to reduce the number of false negatives since they are the most harmful when it comes to trying to diagnose heart disease. We would also try and use more complex models that could possibly be better at finding a trend and thus improve our accuracy.

Our Amazing Team

We spent three weeks learning how to code and work together.

Anthony Campbell

Hi, my name is Anthony Campbell and I am the leader of the Frenzied Physicists. I received a double major in Computer Science and Statistics at the University of Massachusetts Amherst. I’m really proud of the work that my team has done over the past 3 weeks, from building a website to utilizing machine learning models to predict heart failure. I’m sure that everyone is going to do great things in life and I wish them all the best!!!

Thomas Li

I am from Massachusetts and have loved AI and coding since a young age. I like eating ice cream and playing my instrument

Jeffrey Wang

Hi, I'm Jeffrey Wang, a member of the Frenzied Physicists. I do some coding for fun, along with the piano and robotics at my school.

Rhone Galchen

I am from New York and am interested in coding and AI, and enjoy playing chess and soccer

Emma Morales

Hi! My name is Emma and I am from Texas. I am interested in computer science and AI. I also love to play music and learn new things.

Halli Hewett

Hello! My name is Halli. I am from Texas and I had fun coding and working on this project. In my free time I read and play piano.

Kenju Tomita

Hi, I am Kenju. I am from New York and I am interested in coding and AI, and in this project I learned a lot about the finer details of AI, as well as on how to create models and visualizations based on data.