Description
Right now we assume no feedback between the adversary and the classifier.
What if the adversary has access to the labels? What if the adversary has access to the raw probabilities? What if the adversary has access to some observation that can be linked back to the label or probability?
These questions are very broad, and while some have been addressed in the machine learning literature, there are many possible takes on them as they specifically apply to text classification.
Potential ideas (this list will grow):
- Use LIME to identify words that are important to the classification result and apply targeted attacks to those words
- Simulate a sequence of back-and-forths between classifier and adversary
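As a rough illustration of the first idea, here is a minimal sketch of a LIME-style attack. Instead of the actual `lime` package, it uses a simple leave-one-out approximation of word importance (drop each word, measure the change in the predicted probability), then replaces the highest-impact words. The `toy_predict` classifier, the `replacement` token, and the `budget` parameter are all hypothetical stand-ins, not part of any existing code here.

```python
def word_importances(text, predict_proba):
    """Leave-one-out importance: how much the score drops when a word is removed."""
    words = text.split()
    base = predict_proba(text)
    scores = []
    for i in range(len(words)):
        perturbed = " ".join(words[:i] + words[i + 1:])
        scores.append((words[i], base - predict_proba(perturbed)))
    # Most influential words first
    return sorted(scores, key=lambda pair: pair[1], reverse=True)

def targeted_attack(text, predict_proba, budget=2, replacement="xx"):
    """Replace up to `budget` of the most important words with a filler token."""
    important = {w for w, _ in word_importances(text, predict_proba)[:budget]}
    out, replaced = [], 0
    for w in text.split():
        if w in important and replaced < budget:
            out.append(replacement)
            replaced += 1
        else:
            out.append(w)
    return " ".join(out)

# Toy "spam" classifier: probability rises with the number of trigger words.
def toy_predict(text):
    triggers = {"free", "winner", "cash"}
    hits = sum(1 for w in text.split() if w in triggers)
    return min(1.0, 0.2 + 0.3 * hits)

msg = "claim your free cash prize now"
adv = targeted_attack(msg, toy_predict)  # trigger words get masked, score drops
```

With the real LIME explainer the importance step would be a locally weighted linear fit over many perturbed samples rather than single-word deletion, but the attack loop would look the same. The back-and-forth idea in the second bullet could then be simulated by alternating this attack with classifier retraining.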