Project
I wanted to create a complete pipeline for a decently sized dataset that trains and test many types of model families through grid search and stratified cross validation and outputs the best model to be used in a simple app. For this project I used a diabetes dataset that contains 100,000 data points.
Technologies
Python
Pytorch
Scikit Learn
AWS EC2
AWS ECR
Github Actions
Back