This is browser plug-in that filters out most important clauses from any list of Terms & Conditions(T&Cs) and also summarizes and visualizes results.
- It aims to empower web users to make informed decisions online by knowing the tradeoffs mentioned in the Terms of Service.
- The main objective is to eduate the users about what bounds they have agreed to abide by and how their data is being managed by a service.
- JavaScript and Python are used to scrape the docuement from the website.
- The dataset is created with contribution from the community via a website (inspired by the Common Voice project)
- The dataset is compiled to a suitable format for use by anyone.
- This dataset will be open source too.
- A Machine Learning model is trained with a suitable data-set built by the community.
- Bigger and more relevant the data-set, better the Classifier.
- More training = Accurate filtering.
- User clicks on the extension icon
- Extension scrapes out the T&C document and feeds it to the trained model.
- The Classifier system picks out a few clauses that require the user's attention the most.
- The resulting few important clauses are displayed in a pop-up.
- An NLP engine is trained with the help of existing datasets.
- The engine then summarizes the sections under the document.
- The Summarized content is made available in the form of an API.
- The summarized data is visualized with the help of flow diagrams and associativity diagrams.
- Currently, it will be developed as a firefox add-on.
- Chrome extension to follow quickly as we plan to use the WebExtensions standard.