A Strassen-based CUDA Implementation of AtA Matrix Multiplication For detailed information about the implementation, please read the report.pdf file.