Text Preprocessing #56

Open · someshsingh22 opened this issue May 13, 2020 · 2 comments
Labels: Priority: Medium, question (Further information is requested)

Comments

someshsingh22 (Member) commented:
To implement a common black box, we need text loading, extraction of the words to be attacked, perturbations, distance metrics, and models.

Text loading needs to be uniform and universal: it should encapsulate all common practices, including embeddings, tokenizers, and batch loaders, and it should support commonly used libraries such as nltk, spaCy, BERT, etc.

We need to settle on this design before implementing our first attack.
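
To make the discussion concrete, here is a minimal sketch of what a uniform tokenizer interface could look like, assuming we wrap nltk and spaCy behind a common abstract class. All class names below are placeholders for this issue, not existing code in the repo:

```python
# Illustrative sketch only — class names are placeholders, not repo code.
from abc import ABC, abstractmethod
from typing import List


class Tokenizer(ABC):
    """Common interface so attacks never depend on a specific NLP library."""

    @abstractmethod
    def tokenize(self, text: str) -> List[str]:
        ...


class NLTKTokenizer(Tokenizer):
    def tokenize(self, text: str) -> List[str]:
        # Requires nltk and its "punkt" tokenizer data to be installed.
        from nltk.tokenize import word_tokenize
        return word_tokenize(text)


class SpacyTokenizer(Tokenizer):
    def __init__(self, model: str = "en_core_web_sm"):
        # Requires spaCy and the named model to be downloaded.
        import spacy
        self.nlp = spacy.load(model)

    def tokenize(self, text: str) -> List[str]:
        return [token.text for token in self.nlp(text)]
```

Embeddings and batch loaders could follow the same pattern: one abstract base class per practice, with thin adapters over nltk, spaCy, or BERT-style tokenizers underneath.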

someshsingh22 added the question (Further information is requested) label on May 13, 2020
someshsingh22 added this to In progress in Character level attacks on May 13, 2020
someshsingh22 moved this from In progress to To do in Character level attacks on May 13, 2020
parantak (Contributor) commented:
@someshsingh22, I believe we could start by creating a common class for this inside decepticonlp/preprocess and then implement a class for each of the separate practices. Does that sound good, or do you want to take a different approach?
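
Purely as an illustration of the "common class plus one class per practice" idea, a sketch under the assumption that nothing like this exists in decepticonlp/preprocess yet (all names are hypothetical):

```python
# Hypothetical structure for decepticonlp/preprocess — names are placeholders.
from abc import ABC, abstractmethod
from typing import Any, List


class Preprocessor(ABC):
    """Base class that every preprocessing practice would subclass."""

    @abstractmethod
    def __call__(self, data: Any) -> Any:
        ...


class Lowercase(Preprocessor):
    def __call__(self, text: str) -> str:
        return text.lower()


class WhitespaceTokenize(Preprocessor):
    def __call__(self, text: str) -> List[str]:
        return text.split()


class Pipeline(Preprocessor):
    """Chains individual steps so attacks see one uniform entry point."""

    def __init__(self, steps: List[Preprocessor]):
        self.steps = steps

    def __call__(self, data: Any) -> Any:
        for step in self.steps:
            data = step(data)
        return data


# Example: Pipeline([Lowercase(), WhitespaceTokenize()])("Hello World")
# returns ["hello", "world"]
```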

parantak (Contributor) commented:

Refer to #75
