Data Loader #1

andrealoddo · 2023-06-10T15:35:12Z

Hi everyone, and thanks for sharing your outstanding contribution!

I would like to know how the datasets are supposed to be organized.
We are getting some troubles with the init function of the DataLoader because we need help understanding how the images should be managed. Can you help us with this? I really appreciate any help you can provide.

RahelehSalehi · 2023-06-13T10:58:58Z

Dear Andrealoddo,
Thanks for reaching out, I would be happy to know how our method performs on your data,
Your data loader class should have two main methods of get_item and len. If you have 1000 cells, "len" should return 1000, and "get item" returns the following for a given index between 0 and 999:

1- Feature vector obtained from Mask R-CNN (feats in code) with a shape of 256x14x14
2- Cropped RGB image of the cell using the bounding box obtained from Mask R-CNN (roi_cropped) with a shape of 3x128x128
3- The label of the given cell indicating which class it belongs to (label)
4- The dataset the cell comes from (ds). This should be one hot encoded vector of size 1 x number of datasets you have. in our case 1 x 3
5- I returned key which is the unique identifier of the cell for later retrieval. But not necessary for training.

Please check the get_item code for details about normalization of feature vectors and bounding boxes.
Based on how your data is stored, you need to rewrite the init function such that it can accommodate get_item calls. I wrote init function such that it loads everything into memory first and stores them in a dictionary data structure.

Best,
Raheleh

costantin0 · 2023-06-19T14:21:04Z

Dear Raheleh,

Thank you for the prompt response. Absolutely we will let you know how your method performs with our data!

The indications you have provided are clear and we thank you for that.

However, we are stuck on the following point: the readme says "To train the model, please run train.py, then to extract the features, you can use FeatureExtraction.py code and finally to evaluate the quantitatively of the extracted features by AE-CFE, please run RandomForest.py.".
Therefore, we tried to train the model by running train.py. However, train.py calls the DataLoader and we have some troubles in lines 84 - 111 of DataLoader.py. In particular, what do the first two fors do? In lines 91-101, it seems that you tried to load feature data but it cannot be extracted using FeatureExtraction.py because it also needs the DataLoader. How can we address this issue?

Many thanks again,
for your help and response,
Best,
Costantino and Andrea

RahelehSalehi · 2023-06-20T08:07:24Z

Dear Andrea,
Sorry for the misunderstanding about the extracted features' names.
Here is an explain in a little bit more details:

We trained the MaskRCNN on almost 1000 images of the Matek-19 dataset. Then feature vector was obtained from Mask R-CNN with a shape of 256x14x14. Unfortunately, the trained model and the extracted Mask R-CNN feature code are not in the github.
Feature Extraction code in github repo extracts the features representations from the latent space when the model is trained in an unsupervised way. So these are two different sets of feature vectors with different sizes and sources. Then random forest model classifies these latent representations .

I hope this helps. Please don't hesitate if you have any further questions.

costantin0 · 2023-07-07T18:10:53Z

Hi Raheleh,

Thanks for uploading the feature code here, it will be of great help to us.
There is another thing that we would like to ask you. We are having some trouble with the program dependencies, since the latest versions of tensorflow and of other modules are giving us errors (for example, the latest version of tensorflow gives the error "no module named keras.engine", while the version 2.12 does not).
Would it be possible to also have a requirements.txt file and know the Python version that you are using, in order to avoid any sort of ambiguity?

Many thanks again

RahelehSalehi · 2023-07-10T09:37:58Z

Hi,
I updated the README file. Please find it there.
Best,
Raheleh

RahelehSalehi · 2023-07-10T12:05:37Z

Hi, I used tensorflow 1 and it worked good, there are ways to get this working on newer tensorflow versions, please consult https://github.com/matterport/Mask_RCNN Best Regards, Raheleh

…

On Mon, Jul 10, 2023 at 12:50 PM costantino2000 ***@***.***> wrote: Hi, Thanks for the requirements, what Python version should we use? We tried Python 3.7 with a clean install of the requirements, but the mask feature extraction script gives the error "module 'keras.engine' has no attribute 'Layer'", which is one of the problems we were facing before. — Reply to this email directly, view it on GitHub <#1 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AZ3TMIETDG74W4PJB6BUBQDXPPM55ANCNFSM6AAAAAAZBX2MPA> . You are receiving this because you commented.Message ID: ***@***.***>

-- Best Regards Raheleh Salehi

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Data Loader #1

Data Loader #1

andrealoddo commented Jun 10, 2023

RahelehSalehi commented Jun 13, 2023

costantin0 commented Jun 19, 2023

RahelehSalehi commented Jun 20, 2023

costantin0 commented Jul 7, 2023 •

edited

Loading

RahelehSalehi commented Jul 10, 2023 •

edited

Loading

RahelehSalehi commented Jul 10, 2023 via email

Data Loader #1

Data Loader #1

Comments

andrealoddo commented Jun 10, 2023

RahelehSalehi commented Jun 13, 2023

costantin0 commented Jun 19, 2023

RahelehSalehi commented Jun 20, 2023

costantin0 commented Jul 7, 2023 • edited Loading

RahelehSalehi commented Jul 10, 2023 • edited Loading

RahelehSalehi commented Jul 10, 2023 via email

costantin0 commented Jul 7, 2023 •

edited

Loading

RahelehSalehi commented Jul 10, 2023 •

edited

Loading