SclaedYOLOwithViT Improvement on Object Detection by Self-Attention and Receptive Field Expansion; YOLOv4s(scaled-yolo) with Vision Transformer.