Heart Disease can be complicated and hard to diagnose. We decided to make our project goal about determining whether someone has heart disease, to make it easier for both doctors and regular people to handle heart disease.
Using machine learning models such as neural networks.
Decision tree models use supervised learning and can be used for classification and regression. They classify data by passing it through different nodes based the data's attributes. The data is then classified once it reaches a “leaf”, or a node that does not split.
Random Forests are an ensemble learning method that utilizes multiple decision trees to reach an outcome. The individual decision trees classify the data and then vote on the final outcome.
Adaboost is a ensemble learning method that uses multiple decision stumps, which have one node and two leaves as opposed to trees which contain multiple, to classify data.
Logistic Regression models make binary predictions based on the relationship between one or more independent variables.
Support Vector Classification Model uses two groups of data to categorize new data. It does this by plotting the data onto a graph, and then using a hyperplane to split the data according to the group that the data belongs to.
Naive Bayes classifies a point by its given x values and figures out the probability of y given the x variables, in order to classify the data point.
KNN stands for K-Nearest-Neighbors. It finds the data points that the input are closest to and then uses that data to classifies the input.
A neural network tries to simulate the human brain through layers and nodes. The data passes through the an input layer, hidden layers, and ultimately an output layer which are all connected through nodes.
We evaluated all these models based on their accuracy score, precision score, recall score, and f1 score. Click on the image to view both the scores and what they mean for each model.
In conclusion, we wanted to make different machine learning models to see if we can predict heart disease, and we did exactly that. In the end, we found that adaboost, random forest, and k nearest neighbors were our best models, while mlp and random forest grid were our worst models.If we were to improve our project for the future, we would try to reduce the number of false negatives since they are the most harmful when it comes to trying to diagnose heart disease. We would also try and use more complex models that could possibly be better at finding a trend and thus improve our accuracy.
Anthony Campbell
Hi, my name is Anthony Campbell and I am the leader of the Frenzied Physicists. I received a double major in Computer Science and Statistics at the University of Massachusetts Amherst. I’m really proud of the work that my team has done over the past 3 weeks, from building a website to utilizing machine learning models to predict heart failure. I’m sure that everyone is going to do great things in life and I wish them all the best!!!
Thomas Li
I am from Massachusetts and have loved AI and coding since a young age. I like eating ice cream and playing my instrument
Jeffrey Wang
Hi, I'm Jeffrey Wang, a member of the Frenzied Physicists. I do some coding for fun, along with the piano and robotics at my school.
Rhone Galchen
I am from New York and am interested in coding and AI, and enjoy playing chess and soccer
Emma Morales
Hi! My name is Emma and I am from Texas. I am interested in computer science and AI. I also love to play music and learn new things.
Halli Hewett
Hello! My name is Halli. I am from Texas and I had fun coding and working on this project. In my free time I read and play piano.
Kenju Tomita
Hi, I am Kenju. I am from New York and I am interested in coding and AI, and in this project I learned a lot about the finer details of AI, as well as on how to create models and visualizations based on data.