ocr_it only returns the first box in boxes #7

jontio · 2022-12-17T20:34:17Z

If an image has more than one number plate then ocr_it will only return one of them...

def ocr_it(image, detections, detection_threshold, region_threshold):
    
    # Scores, boxes and classes above threhold
    scores = list(filter(lambda x: x> detection_threshold, detections['detection_scores']))
    boxes = detections['detection_boxes'][:len(scores)]
    classes = detections['detection_classes'][:len(scores)]
    
    # Full image dimensions
    width = image.shape[1]
    height = image.shape[0]
    
    # Apply ROI filtering and OCR
    for idx, box in enumerate(boxes):
        roi = box*[height, width, height, width]
        region = image[int(roi[0]):int(roi[2]),int(roi[1]):int(roi[3])]
        reader = easyocr.Reader(['en'])
        ocr_result = reader.readtext(region)
        
        text = filter_text(region, ocr_result, region_threshold)
        
        plt.imshow(cv2.cvtColor(region, cv2.COLOR_BGR2RGB))
        plt.show()
        print(text)
        return text, region

The return shouldn't be there. It should be more like...

def ocr_it(image, detections, detection_threshold, region_threshold):
    
    # We may have more than one number plate in an image
    texts = []
    regions = []

    # Scores, boxes and classes above threhold
    scores = list(filter(lambda x: x> detection_threshold, detections['detection_scores']))
    boxes = detections['detection_boxes'][:len(scores)]
    classes = detections['detection_classes'][:len(scores)]
    
    # Full image dimensions
    width = image.shape[1]
    height = image.shape[0]
    
    # Apply ROI filtering and OCR
    for idx, box in enumerate(boxes):
        roi = box*[height, width, height, width]
        region = image[int(roi[0]):int(roi[2]),int(roi[1]):int(roi[3])]
        reader = easyocr.Reader(['en'])
        ocr_result = reader.readtext(region)
        
        text = filter_text(region, ocr_result, region_threshold)

        if(text!=[]):
            texts.append(text[0])
            regions.append(region)
    
    return texts, regions

That means the function save_results won't work and also needs to be change to something like...

def save_result(text, region, csv_filename, folder_path):
    img_name = '{}.jpg'.format(uuid.uuid1())
    
    cv2.imwrite(os.path.join(folder_path, img_name), region)
    
    with open(csv_filename, mode='a', newline='') as f:
        csv_writer = csv.writer(f, delimiter=',', quotechar='"', quoting=csv.QUOTE_MINIMAL)
        csv_writer.writerow([img_name, text])

def save_results(texts, regions, csv_filename, folder_path):
    for idx, region in enumerate(regions):
        save_result(texts[idx], region, csv_filename, folder_path)

There is also a hard coded detection threshold in the live detection that I changed to the variable detection_threshold

I realize if you do a merge then your video will be out of step with the code but I thought that I would send you one anyway.

Cheers,
Jonti

…rom real time section

jontio added 9 commits December 16, 2022 12:35

return all number plates in an image

fc157b9

test to text mistake

cfc14b3

want just an array of strings

304fbfa

empty array test

9b893ad

saving results of multiple plates in image

df38231

multi plates for real time section and missing detection_threshhold f…

d94e1f4

…rom real time section

save_results changed to save_result

55a8b66

csv_filename and folder_path to save_results

0743d9a

missed tab

7d9652e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ocr_it only returns the first box in boxes #7

ocr_it only returns the first box in boxes #7

jontio commented Dec 17, 2022

ocr_it only returns the first box in boxes #7

Are you sure you want to change the base?

ocr_it only returns the first box in boxes #7

Conversation

jontio commented Dec 17, 2022