Diabetes-data-analysis

Abstract

In this project, we plan to analyze the problem of predicting hospital readmission rates among diabetic patients using the "Diabetes 130-US hospitals" dataset. Tradi- tionally, this problem is dealt with by using statistical machine learning algorithms like Naive Bayes, K-Nearest Neighbors, and Logistic regression. These algorithms are known to not perform well on non-separable and high-dimensional datasets. To overcome these pitfalls, we will explore advanced techniques such as random forests, ensemble methods, and neural networks. Missing data, overfitting, and feature engineering are some of the challenges that we will encounter. The ideal outcome of the project would be to gain deeper insights into hospital readmission rates and investigate robust methods that can make improved predictions than the statistical methods. Our experiments show that Random forests performed better than other methods in the predictions.Attributes like gender, race, total number of medications, lab procedures, admission type, time in hospital of the patient had a significant influence in these predictions.

Authors

Rajasekhar Mekala([email protected])

Agniraj Baikani([email protected])

Shravan Balamurugan([email protected])

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
ML_project.ipynb		ML_project.ipynb
README.md		README.md
Report.pdf		Report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diabetes-data-analysis

Abstract

Authors

About

Uh oh!

Releases

Packages

Languages

rajasekharmekala/Diabetes-data-analysis

Folders and files

Latest commit

History

Repository files navigation

Diabetes-data-analysis

Abstract

Authors

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages