The dataset can be downloaded from kaggle under the name of IDC_regular_ps50_idx5 which is a dataset of over 200000 images already structured into zero and one format specifying negative and positive cancer results. We have used this dataset and then seperated into randomly training,testing,validation and then processed it as per our needs.