Tomás Baidal
This is the capstone project repository for the AWS Machine Learning Engineer Nanodegree from Udacity. The project is a comprehensive exercise in deploying and using Machine Learning models for object detection in images. Specifically, it uses a ResNet-50 based detector built on Apache MXNet, an open-source deep learning framework notable for its ability to define, train, and deploy deep neural networks in a versatile and efficient manner, with a strong focus on cloud environments.
This project focuses on developing an object detection model using MXNet to identify airplanes in images, benchmarked against a TensorFlow reference model. Subsequently, the MXNet model was deployed on AWS, enabling real-time detection through a web application developed with Dash, AWS Lambda and AWS API Gateway.
This project has been developed entirely in Python 3 on AWS SageMaker. We also used AWS Lambda, AWS API Gateway, and the Python Dash framework to build a pipeline and an interface where the user can run inference against the deployed model.
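As a rough illustration of that pipeline, the Lambda function behind API Gateway forwards the uploaded image to the SageMaker endpoint and returns the detections. The handler below is only a minimal sketch: the endpoint name and payload format are placeholders, and the scripts actually used in the project are in the AWS folder.

import base64
import json
import boto3

runtime = boto3.client("sagemaker-runtime")
ENDPOINT_NAME = "airplane-detection-endpoint"  # placeholder, not the real endpoint name

def lambda_handler(event, context):
    # API Gateway (proxy integration) passes the base64-encoded image in the request body
    image_bytes = base64.b64decode(event["body"])
    response = runtime.invoke_endpoint(
        EndpointName=ENDPOINT_NAME,
        ContentType="image/jpeg",
        Body=image_bytes,
    )
    predictions = json.loads(response["Body"].read())
    return {"statusCode": 200, "body": json.dumps(predictions)}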
To run the notebooks and the app, please install the requirements.
pip install -r requirements.txt
The main libraries used are:
Data Manipulation and Analysis
import pandas as pd
import numpy as np
Utility and System
import os
import shutil
import json
import importlib
import zipfile
import glob
Data Visualization
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
Image Processing and Handling
from PIL import Image
import cv2
Machine Learning and Data Splitting
from sklearn.model_selection import train_test_split
AWS Services and Machine Learning Deployment
import boto3
import sagemaker
from sagemaker import get_execution_role
from sagemaker import image_uris
from sagemaker.tuner import CategoricalParameter, ContinuousParameter, HyperparameterTuner
from sagemaker.debugger import Rule, rule_configs, ProfilerRule
Dash App Development
import dash
import dash_bootstrap_components as dbc
from dash import Dash, dcc, html, Input, Output, State, callback
This project is composed of several notebooks and folders, organized as follows:
utils: folder with the functions and scripts used in the notebooks and pipeline.
data: folder with images and docs.
AWS: folder with the snapshots and scripts related to AWS Lambda and API Gateway.
1 - Data EDA Visualization: Notebook that contains exploratory data analysis, first filtering, and data visualization.
2 - Data Preprocessing: First data preprocessing, test dataset creation, and creation of the filtered CSVs.
3 - Feature Modelling: CSV modelling for the creation of the .lst files.
4 - Data Augmentation: Data augmentation process.
4.1 - Data Augmentation Demo: Explanation and demo of the applied data augmentation.
5 - Benchmark Model: TensorFlow model inference.
6 - MXNet RecordIO Engineering: Generation of the .rec files for our model.
7 - Hyperparameter Tuning - Model Deployment - Inference: Hyperparameter tuning jobs, model deployment, and inference with our new model (see the sketch after this list).
8 - Validation and Comparison of TensorFlow and MXNet Models: Analysis of inferences and model comparison.
9 - Model Selection and Endpoint Deployment: New model endpoint deployment and inference testing.
9.1 - Test API Endpoint Model: Script to test the pipeline locally (Lambda <> API Gateway <> Model Endpoint).
10 - Python Dash App Development: Dash app development code. The app will run in the notebook or locally at http://127.0.0.1:8050.
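For notebook 7, the tuning and deployment flow looks roughly like the sketch below, assuming the SageMaker built-in object detection algorithm (MXNet based, ResNet-50 backbone). The S3 paths, hyperparameter ranges, job counts, and instance types are illustrative placeholders, not the exact values used in the project.

import sagemaker
from sagemaker import image_uris, get_execution_role
from sagemaker.estimator import Estimator
from sagemaker.tuner import CategoricalParameter, ContinuousParameter, HyperparameterTuner

session = sagemaker.Session()
role = get_execution_role()
training_image = image_uris.retrieve("object-detection", session.boto_region_name, version="latest")

estimator = Estimator(
    training_image,
    role=role,
    instance_count=1,
    instance_type="ml.p3.2xlarge",
    output_path="s3://<bucket>/output",  # placeholder bucket
    sagemaker_session=session,
)
estimator.set_hyperparameters(
    base_network="resnet-50",
    use_pretrained_model=1,
    num_classes=1,               # single class: airplane
    epochs=30,                   # placeholder
    num_training_samples=1000,   # placeholder
)

tuner = HyperparameterTuner(
    estimator,
    objective_metric_name="validation:mAP",
    objective_type="Maximize",
    hyperparameter_ranges={
        "learning_rate": ContinuousParameter(1e-4, 1e-2),
        "mini_batch_size": CategoricalParameter([16, 32]),
    },
    max_jobs=4,
    max_parallel_jobs=2,
)
tuner.fit({"train": "s3://<bucket>/train", "validation": "s3://<bucket>/validation"})
tuner.wait()  # block until the tuning job finishes

# Deploy the best training job as a real-time endpoint
predictor = tuner.deploy(initial_instance_count=1, instance_type="ml.m5.xlarge")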
The Dash app starts when running the notebook Python Dash App Development. Images can be uploaded or dragged and dropped into the app to get an inference. The app runs on http://127.0.0.1:8050/.
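A minimal sketch of the app's upload-and-predict flow is shown below; the API Gateway URL is a hypothetical placeholder, and the complete application is in the Python Dash App Development notebook.

import requests
from dash import Dash, dcc, html, Input, Output

API_URL = "https://<api-id>.execute-api.<region>.amazonaws.com/prod/predict"  # placeholder URL

app = Dash(__name__)
app.layout = html.Div([
    dcc.Upload(id="upload-image", children=html.Button("Upload or drop an image")),
    html.Pre(id="prediction"),
])

@app.callback(Output("prediction", "children"), Input("upload-image", "contents"))
def predict(contents):
    if contents is None:
        return "Drop an image to run inference."
    # dcc.Upload returns "data:image/jpeg;base64,<payload>"
    _, payload = contents.split(",", 1)
    response = requests.post(API_URL, data=payload)  # the Lambda decodes the base64 body
    return response.text

if __name__ == "__main__":
    app.run_server(debug=True)  # serves on http://127.0.0.1:8050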
When creating the RecordIO files, it is necessary to run the following commands beforehand in order to use the im2rec.py script. The script code is licensed under the Apache License, Version 2.0. We are grateful to the authors for making their code available.
import distro
if distro.id() == "debian":
    !apt-get update
    !apt-get install ffmpeg libsm6 libxext6 -y
import sys
!{sys.executable} -m pip install opencv-python
!{sys.executable} -m pip install mxnet
It is also important to downgrade the numpy version when running the script, as it uses a deprecated numpy function:
pip install numpy==1.23.0
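With those dependencies in place, im2rec.py can be invoked roughly as follows. The paths, list prefixes, and resize value are placeholders; --pack-label is needed because the .lst files carry bounding-box labels.

# Placeholder paths: adjust the location of im2rec.py, the .lst prefixes, and the image root
!python utils/im2rec.py --resize 512 --pack-label data/train data/images/   # picks up data/train.lst
!python utils/im2rec.py --resize 512 --pack-label data/val data/images/     # picks up data/val.lst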
Evaluation Metrics:
- https://jonathan-hui.medium.com/map-mean-average-precision-for-object-detection-45c121a31173
- https://www.researchgate.net/figure/mAP-Comparison-results-on-each-network-in-COCO-dataset_tbl6_348825339
- https://towardsdatascience.com/on-object-detection-metrics-with-worked-example-216f173ed31e
Object Detection:
- https://medium.com/visionwizard/object-detection-4bf3edadf07f
- https://jonathan-hui.medium.com/object-detection-series-24d03a12f904
- https://pub.towardsai.net/maximizing-the-impact-of-data-augmentation-effective-techniques-and-best-practices-c4cad9cd16e4
Deployment:
- https://austinlasseter.medium.com/deploying-a-dash-app-with-elastic-beanstalk-console-27a834ebe91d
- https://medium.com/@data.science.enthusiast/invoke-sagemaker-model-endpoint-with-aws-api-gateway-and-lambda-3d0c085dccb8
MXNet: