You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Right now we assume no feedback between adversary and classifier.
What if the adversary has access to the labels? What if the adversary has access to the raw probabilities? What is the adversary has access to some observation that can be linked back to the label or probability?
These are very broad, and while some have been addressed in machine learning literature, there are many possible takes on this as it specifically applies to text classification.
Potential ideas (this list will grow):
Use Lime to identify words that are important to classification results and apply targeted attacks
Simulate a sequence of back-and-forths between classifier and adversary
The text was updated successfully, but these errors were encountered:
Right now we assume no feedback between adversary and classifier.
What if the adversary has access to the labels? What if the adversary has access to the raw probabilities? What is the adversary has access to some observation that can be linked back to the label or probability?
These are very broad, and while some have been addressed in machine learning literature, there are many possible takes on this as it specifically applies to text classification.
Potential ideas (this list will grow):
The text was updated successfully, but these errors were encountered: