
Fix huggingface inference endpoint name #1011

Merged
merged 3 commits into NVIDIA:main from fix/hf-endpoint-name on Nov 19, 2024

Conversation

jmartin-tech
Collaborator

fix #998

The name provided during construction is populated by the call to `super().__init__()`; access to `self` attributes is required for any value populated by `Configurable`.
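A minimal sketch of the pattern behind the fix; the class and attribute names below are simplified stand-ins for garak's actual `Configurable` plumbing, not the real implementation:

```python
class Configurable:
    def __init__(self, name, config_root=None):
        # In garak, config loading here may populate or overwrite
        # instance attributes, including self.name.
        self.name = name


class InferenceAPI(Configurable):
    # The real public base uri; its use here as a class constant is illustrative.
    URI = "https://api-inference.huggingface.co/models/"

    def __init__(self, name="", config_root=None):
        super().__init__(name, config_root=config_root)
        # Fix: read the attribute *after* super().__init__() runs,
        # rather than using the raw constructor argument `name`,
        # so config-populated values are respected.
        self.uri = self.URI + self.name


api = InferenceAPI("microsoft/Phi-3-mini-4k-instruct")
print(api.uri)
```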

Verification

List the steps needed to make sure this thing works

  • Execute against public InferenceAPI:
python -m garak --model_type huggingface.InferenceAPI --model_name microsoft/Phi-3-mini-4k-instruct --probes malwaregen.Evasion
  • Verify valid responses are received from the endpoint.
  • Verify added automation tests pass.
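The automation-test step above might be sketched roughly as follows; the stub class is hypothetical and only mirrors the uri/name behavior being checked, not garak's real generator or test suite:

```python
# Hypothetical stand-in for the generator under test.
class StubInferenceAPI:
    URI = "https://api-inference.huggingface.co/models/"  # assumed base uri

    def __init__(self, name=""):
        self.name = name
        self.uri = self.URI + self.name


def test_uri_and_name_populate_as_expected():
    gen = StubInferenceAPI("microsoft/Phi-3-mini-4k-instruct")
    # Both the name and the derived uri should reflect the constructor value.
    assert gen.name == "microsoft/Phi-3-mini-4k-instruct"
    assert gen.uri == StubInferenceAPI.URI + gen.name


test_uri_and_name_populate_as_expected()
```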

the name provided during construction is populated by
the call to `super().__init__()`; access to `self` attributes is required
for any value populated by `Configurable`.

Signed-off-by: Jeffrey Martin <[email protected]>
Add validation test to ensure uri values and names populate as expected

Signed-off-by: Jeffrey Martin <[email protected]>
Collaborator

@erickgalinkin erickgalinkin left a comment


LGTM. Did we test on an actual HF InferenceAPI just to be sure it works as intended? If not, I can spin one up tomorrow.

@jmartin-tech
Collaborator Author

I tested the public endpoint referenced in the issue; I have not done an end-to-end test with a private endpoint.

@ppietikainen

ppietikainen commented Nov 19, 2024

Tested the fix and it works for what we've been trying out. Also
python3 -m garak --model_type huggingface.InferenceEndpoint --model_name https://api-inference.huggingface.co/models/microsoft/Phi-3-mini-4k-instruct --probes malwaregen.Evasion
seems to send queries to the right place, so I would assume private ones would work too.

I guess (and it seemed to work, though it's getting late) that

def __init__(self, name="", config_root=None):
    super().__init__(name, config_root=config_root)
    self.uri = ... + self.name

in both would make it a bit easier to follow; the `self.name = name` confused me when trying to figure the code out, as I thought it implied `super()` won't do anything to it :-)

@jmartin-tech
Collaborator Author

jmartin-tech commented Nov 19, 2024

@ppietikainen thanks for the extra testing. We have tried to add documentation on how `Configurable` classes' config values are expected to be prioritized. The assignment of `name` as provided to the constructor relates to how the precedence of values is treated: constructor values are held above configuration file values that are injected by `super()`.
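The precedence rule described above can be sketched as follows; this is illustrative only, and the `CONFIG` dict is a stand-in for values injected from a configuration file, not garak's actual mechanism:

```python
class Configurable:
    # Stand-in for values that would be injected from a configuration file.
    CONFIG = {"name": "name-from-config-file"}

    def __init__(self, name=""):
        # A value passed to the constructor wins; the config value
        # only fills the gap when the caller supplied nothing.
        self.name = name or self.CONFIG["name"]


explicit = Configurable(name="passed-to-constructor")
fallback = Configurable()
print(explicit.name)  # constructor value takes precedence
print(fallback.name)  # config file value fills the gap
```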

Also, `name` has competing usage in the different classes. For `InferenceAPI` it is expected to be just the model name and is appended to the public base uri defined in a constant; in `InferenceEndpoint` the full uri must be supplied. I do think there may be some consistency to be gained at some point in the future. I don't have a chosen path forward in mind, so for now this can land to at least restore the functionality to its state before the regression.
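The competing uses of `name` might be sketched like this; the class names mirror garak's generators but the bodies are simplified, hypothetical illustrations of only the uri handling:

```python
# Real public base uri, used here as an illustrative module constant.
API_BASE = "https://api-inference.huggingface.co/models/"


class InferenceAPI:
    def __init__(self, name):
        self.name = name
        # `name` is just the model id, appended to the public base uri.
        self.uri = API_BASE + name


class InferenceEndpoint:
    def __init__(self, name):
        self.name = name
        # `name` must already be the full endpoint uri.
        self.uri = name


a = InferenceAPI("microsoft/Phi-3-mini-4k-instruct")
b = InferenceEndpoint("https://example.endpoints.huggingface.cloud/my-model")
```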

@jmartin-tech jmartin-tech merged commit c7a9fa6 into NVIDIA:main Nov 19, 2024
9 checks passed
@jmartin-tech jmartin-tech deleted the fix/hf-endpoint-name branch November 19, 2024 21:22
@github-actions github-actions bot locked and limited conversation to collaborators Nov 19, 2024

Successfully merging this pull request may close these issues.

Huggingface InferenceAPI stopped working after 0.9.0.13 due to name parameter not being set