BUG harm categories for PKU-SafeRLHF Dataset aren't added yet #736
Labels: bug, datasets, help wanted
Similar to this issue, but the harm categories are explicitly included in the repo: #730
The `fetch_adv_bench_dataset` function currently does not apply any harm categories to its prompts. We want to be able to use this dataset with harm category filters, which requires copying the values from the dataset's category labels onto the prompts used in PyRIT. The original Hugging Face dataset has `harm_categories` that we can take them from.
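A minimal sketch of the requested behavior, to make the goal concrete. It does not use PyRIT or the Hugging Face `datasets` library. `PromptEntry`, `attach_harm_categories`, `filter_by_harm_category`, and the record layout (a `prompt` string plus a `harm_categories` mapping of label to boolean) are all hypothetical names and shapes chosen for illustration; the real dataset fields and PyRIT classes may differ.

```python
from dataclasses import dataclass, field


@dataclass
class PromptEntry:
    """Hypothetical stand-in for a PyRIT prompt object (not the real class)."""
    value: str
    harm_categories: list[str] = field(default_factory=list)


def attach_harm_categories(records):
    """Build one entry per record, keeping only the labels flagged True.

    Assumes each record looks like
    {"prompt": str, "harm_categories": {label: bool}}; this layout is an
    assumption, not the confirmed schema of the upstream dataset.
    """
    entries = []
    for record in records:
        flags = record.get("harm_categories", {})
        labels = sorted(name for name, flagged in flags.items() if flagged)
        entries.append(PromptEntry(value=record["prompt"], harm_categories=labels))
    return entries


def filter_by_harm_category(entries, wanted):
    """Return entries that carry at least one of the wanted labels."""
    wanted = set(wanted)
    return [entry for entry in entries if wanted & set(entry.harm_categories)]


# Made-up records for illustration only; not taken from the real dataset.
sample_records = [
    {"prompt": "example prompt A", "harm_categories": {"Privacy": True, "Violence": False}},
    {"prompt": "example prompt B", "harm_categories": {"Violence": True}},
    {"prompt": "example prompt C", "harm_categories": {}},
]

entries = attach_harm_categories(sample_records)
violent = filter_by_harm_category(entries, ["Violence"])
print([entry.value for entry in violent])
```

Once the labels are stored on each prompt, filtering is a simple set intersection, which is the capability the issue asks for.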