Classification of IDC in Breast Cancer Histopathology Images

Keywords: Image classification, Convolutional neural network, Optimizers, Machine learning algorithms, Visualization, Image processing

1. Breast cancer is the most common form of cancer in women, and invasive ductal carcinoma (IDC) is the most common form of breast cancer. The goal was to identify IDC when it was present in otherwise unlabeled histopathology images

2. The dataset contained 5547 breast histopathology images, each with a pixel size of 50 x 50 x 3. Among these images, there were 2759 negative (-) IDC counts and 2788 positive (+) IDC counts

3. Classification was assessed across multiple ML algorithms and six different convolutional neural networks with each network architecture paired with three different optimizers, namely, Adam, Adadelta and RMSprop

4. Accuracy and loss for each network were observed in order to conclude better performing convolutional neural network

5. The Support Vector Machine performed well in predicting IDC labels among 6 other machine learning models, followed by the Random Forest Classifier. In neural networks, the CNN with Adam optimizer outperformed other models, ranking as the top performer

IDC

cnn architecture

Python Jupyter Notebook Google Colab Keras OpenCV