load_pretrained_model dependency issues #522

IFenton · 2023-02-14T18:13:49Z

I've been working on getting the Scivision example gallery notebooks running (#83), and I've come across an issue with load_pretrained_model that is causing some problems.

Running load_pretrained_model(<url>, allow_install=True) when the model is not present currently force-installs both the model and its dependencies. As I understand it, this was an intentional feature to make sure that the dependencies were updated to the required versions (#223). However, the forced reinstall of dependencies is leading to issues with some of the scivision-examples-gallery notebooks (e.g. scivision-basic-usage-example #5, plant-phenotyping-classification #8). In both cases, running load_pretrained_model installs the latest version of a package (tensorflow and timm respectively), and these versions cause errors when running the notebooks.

There seem to be a couple of possible options to fix this:

Install the model separately, whether in the environment.yml or in the terminal before running the notebook
Update the model requirements.txt to a pinned version of the package
Change load_pretrained_model so it doesn't force reinstall dependencies by default

Does anyone have any preferences on which is the best option?


edwardchalstrey1 · 2023-02-15T11:47:15Z

This is a good question, and a bit of a fundamental one for scivision! My opinion has been that installing models via the scivision package is too inherently problematic for reasons like this and we should no longer support this - see #346 (which proved controversial).

I think the tension here is we could either:

Make the postdocs' notebooks just work somehow, e.g. via one of @IFenton's first two bullet points
Take some time to consider what scivision "best practice" should be for installing models in a Python environment, then conform all the project notebooks to this way of working (which could still be the first option, via environment.yml)

edwardchalstrey1 · 2023-02-15T11:48:53Z

There could even be a rule like "We only accept models for the scivision catalog that have pinned package versions" or, as @ots22 has discussed previously, some kind of installation checks that run in GitHub Actions

ots22 · 2023-02-15T12:41:01Z

There are a few general issues with model-loading-as-package-installation as noted by Ed. Fixing this bug should at least let us maintain a reproducible environment without load_pretrained_model clobbering it, even without addressing these wider issues, so some comments in that spirit.

It might not be appropriate to completely pin dependencies in a model repository (or use a lockfile from the model if one exists) for the usual reasons - the right place to pin things would seem to be in the environment.yml in the gallery example. A model should ensure that its dependencies are specified correctly.

We should pin the version of the model itself in the gallery example. The options for doing that are:

1. as part of the load_pretrained_model command (e.g. include the commit hash when loading it)
2. in environment.yml (in this case, allow_install would not need to be set when loading the model)

We should allow for either possibility if we can: option 2 means someone running the model doesn't have the installation appearing inline, and option 1 is there if we want to explicitly show off this behaviour. (Some guidance on this could be useful to capture, perhaps something like: when working interactively you might want to use allow_install=True, but as soon as you start specifying dependencies in a virtual environment of some kind you should include your models there too. I'm not sure if that's right, but it's probably one for a separate discussion.)

It seemed like (from our discussion the other day) the 'force' flag of pip is inherited when installing dependencies and this is the real culprit, since it is then no longer possible to use option 1 above (it might result in an 'upgraded' version of the package for instance, different from the version in environment.yml). Have you tried passing the 'force' flag to pip only when allow_install='force'? I think we had assumed that there was no downside to this flag if the package wasn't installed.
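A rough sketch of that suggestion, not the actual scivision code: the function name `build_pip_command` and its signature are hypothetical, and only pip's `--force-reinstall` flag is real. The flag is added only when `allow_install == 'force'`, so a plain `allow_install=True` should leave already-satisfied dependencies alone.

```python
import sys


def build_pip_command(package_url, allow_install):
    """Build the pip invocation for installing a model package.

    Hypothetical helper for illustration; not scivision's implementation.
    """
    if allow_install not in (True, "force"):
        raise ValueError("installation is not allowed")
    # Use the current interpreter's pip so the right environment is targeted.
    command = [sys.executable, "-m", "pip", "install"]
    if allow_install == "force":
        # --force-reinstall reinstalls every package, dependencies included,
        # which is what can replace versions pinned in environment.yml.
        command.append("--force-reinstall")
    command.append(package_url)
    return command
```

The returned list could then be handed to something like `subprocess.run(command, check=True)`.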

So I'd say it would be nice to end up with:

load_pretrained_model(..., allow_install=True) not clobbering versions of other packages in the environment if the environment is consistent with the model dependencies
having pinned versions of everything in the gallery examples, including the model - both options above should work - making a decision separately about which is preferable
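One way to picture "consistent with the model dependencies" is a check that runs before any install. The sketch below is an assumption for illustration only: `unmet_pins` and the example pins are hypothetical, it handles exact pins only (no version ranges), and it is not how scivision decides what to install. It relies on the standard library's `importlib.metadata`.

```python
from importlib import metadata


def unmet_pins(pins, get_version=metadata.version):
    """Return {name: (wanted, found)} for each pin the environment misses.

    `pins` maps a distribution name to the exact version wanted.
    `found` is None when the distribution is not installed.
    `get_version` is injectable so the check can be tried without
    touching the real environment.
    """
    unmet = {}
    for name, wanted in pins.items():
        try:
            found = get_version(name)
        except metadata.PackageNotFoundError:
            found = None
        if found != wanted:
            unmet[name] = (wanted, found)
    return unmet


# Hypothetical pins; an empty result would mean nothing needs reinstalling.
print(unmet_pins({"timm": "0.6.12"}))
```

If the result is empty, an installer could skip touching dependencies altogether and install only the model package.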

ots22 · 2023-08-29T11:17:34Z

Did #533 fix this?

(@IFenton)

IFenton · 2023-08-29T15:51:10Z

It fixed the specific problem of force updating to the latest versions, although I think the broader discussion of how to handle the dependencies is still unresolved

ots22 · 2023-08-29T15:58:56Z

Okay, will close based on having fixed the force install problem.

Is there a summary of the broader problem somewhere?

IFenton added the bug label Feb 15, 2023

IFenton assigned ots22, edwardchalstrey1 and IFenton Feb 15, 2023

acocac added this to the Core features milestone Apr 6, 2023

ots22 added this to Scivision fortnightly planning: 2023-08-22 to 2023-09-05 Aug 29, 2023

ots22 closed this as completed Aug 29, 2023

github-project-automation bot moved this to Done in Scivision fortnightly planning: 2023-08-22 to 2023-09-05 Aug 29, 2023
