@georgeamccarthy georgeamccarthy commented Aug 10, 2021

PR type

Purpose

  • Allows the model and tokenizer to be stored locally; they will be downloaded if not found.

Why?

  • The indexer is unable to download the model from within the Flow on GCP (deployment).

Extra info

A new protein_search/models directory stores the models:

models/
└── prot_bert
    ├── model
    │   ├── config.json
    │   └── pytorch_model.bin
    └── tokenizer
        ├── special_tokens_map.json
        ├── tokenizer_config.json
        └── vocab.txt

Models were downloaded from Hugging Face and then moved into these directories by hand.
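The "load locally, download if not found" logic could be sketched like this. This helper is hypothetical (it is not in the PR); it just picks the local directory when the expected files are present and otherwise falls back to the hub id, so either return value can be passed straight to `from_pretrained()`.

```python
from pathlib import Path

# Hypothetical helper (not from the PR): prefer the local copy of the model,
# falling back to the Hugging Face hub id so from_pretrained() downloads it.
def resolve_model_path(
    local_dir,
    hub_id="Rostlab/prot_bert",
    required=("config.json", "pytorch_model.bin"),
):
    local = Path(local_dir)
    # Only use the local directory if every required file is actually there.
    if all((local / name).is_file() for name in required):
        return str(local)
    return hub_id
```

For example, `resolve_model_path("protein_search/models/prot_bert/model")` would return the local path inside the repo layout above when the weights are present, and `"Rostlab/prot_bert"` otherwise.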

Feedback required over

  • A quick pair of 👀 on the code
  • Discussion on the technical approach

@georgeamccarthy

Not sure if I'm going to merge this, but I need it on GCP without the Dockerization merged. Could probably use a simpler model file structure: https://huggingface.co/Rostlab/prot_bert/tree/main

@georgeamccarthy

There may be a simpler way to get around the issue. If I try to download the model with a simple script:

from transformers import BertModel, BertTokenizer

model_path = "Rostlab/prot_bert"

print("Loading tokenizer.")
tokenizer = BertTokenizer.from_pretrained(model_path, do_lower_case=False)
print("Loading model.")
model = BertModel.from_pretrained(model_path)

print("Done.")

then the system runs out of RAM at around 1 GB and the process is terminated with Killed (the OOM killer).

To monitor RAM usage: ps -m -o %cpu,%mem,command

Instead of downloading the repo I might just be able to configure the download to use a disk cache.
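A sketch of that disk-cache idea. `TRANSFORMERS_CACHE` is a real environment variable honoured by the transformers library, and `from_pretrained()` also accepts a `cache_dir=` keyword for the same purpose; the path below is an assumption for illustration, not anything from this PR.

```python
import os

# Hypothetical persistent-disk path on the GCP instance (an assumption).
CACHE_DIR = "/mnt/models/hf_cache"

# Point the Hugging Face download cache at the persistent disk so weights
# are fetched once and reused across restarts instead of re-downloaded.
os.environ["TRANSFORMERS_CACHE"] = CACHE_DIR

# Equivalently, the cache can be set per call:
#   BertModel.from_pretrained("Rostlab/prot_bert", cache_dir=CACHE_DIR)
```

This avoids shipping the weights inside the repo entirely, which would make the models/ directory above unnecessary.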
