Skip to content

AndrewMicallef/clean-read

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

clean-read

This is the first step in what I am calling Project Lucidity

The driving philosphy is to Eschew Obfuscation and Enunciate Lucidity.

#To install in firefox

Get it from the add-ons list here: https://addons.mozilla.org/en-US/firefox/addon/clean-read/

development versions: In firefox address bar type: about:debugging

  • click Load Temporary Add On
  • select any file in directory

##TODO:

  • Easy:
    • Think of a better name and come up with a logo (really not important)
    • Replace acronymns with expanded phrases
    • Make a toggle button for Firefox plugin
    • Make a toggle switch to highlight what will be hidden to confirm we aren't losing actual page content
    • Make it actually do something when you install it
  • Less Easy:
    • Compress inline references
  • Hard:
    • Graph inline numerical statements
    • Make a database which holds the type of Journal and the place to find the body text ?
    • Alternativley (bad) allow the user to select paragraphs to be filtered (I don't like that solution as it increases the transaction cost for the user and makes them less likely to want to lucidify the text)

##Mechanism of action

The basic principle is this program grabs all text nested under the HTML p (for paragraph) tags, removes all other markup, and then searches for patterns matching the "Bracket, ((name, comma,) x I date) x J, close Bracket" format.

This is not a robust look up, and have already found it fails on a few sites. Good nes is that you can bring back the content by reloading the page. Bad news is you have to reload the page

A Better idea:

Query: the style repository for the journal name, use the resulting .csl file to build an appropriate regex pattern!!!!

Perhaps leveraging Zotero won't work for this purpose. Adam Smith advises that the zotero look up would be slow. I would need to translate the pages meta-data and then pull out the journal title then pass that to a custom script for searching the repository. I suppose it comes down to working out how slow this would be in the worst case, after all it should only be done once per page. It might even be possible to stor a lite-weight lookup table that remembers the regex's for common sites. Or better [optionally] stores the look ups for the sites visited by the user in a cookie?

The zotero style repository appears to be a fork of this repsitory. I should use this for searching for the styles. Then perhaps citeproc-js might help for rendering the regex?


Regex pattern to capture name dates: (([a-z]*[;&,\s]){1,} (\d{4}))

... Because I need to be able to use any unicode letter for peoples names, and regex in Mozilla does not support the \p{L} character lookup, which in PERL would get a letter from any language, I will need to find a different more complex strategy...

About

A firefox add on to filter clutter and focus on arguments

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published