GooDiff core software for fetching, processing and storing the retrieved web pages.
License
quuxlabs/goodiff-core
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
master
Could not load branches
Nothing to show
Could not load tags
Nothing to show
{{ refName }}
default
Name already in use
A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code
-
Clone
Use Git or checkout with SVN using the web URL.
Work fast with our official CLI. Learn more about the CLI.
- Open with GitHub Desktop
- Download ZIP
Sign In Required
Please sign in to use Codespaces.
Launching GitHub Desktop
If nothing happens, download GitHub Desktop and try again.
Launching GitHub Desktop
If nothing happens, download GitHub Desktop and try again.
Launching Xcode
If nothing happens, download Xcode and try again.
Launching Visual Studio Code
Your codespace will open once ready.
There was a problem preparing your codespace, please try again.
GooDiff is a consumer-oriented service for keeping track of changes of important documents – and indirectly the services described by these documents – provided by selected Internet service providers. Our “mission” is to increase transparency for end consumers such as you and I. GooDiff is released under the GNU Affero General Public License (AGPL) version 3. The GooDiff software is a work-in-progress separated in various modules to provide an unique way to track documents on the Internet. The module is called : goodiff-core == Description == goodiff-core is the software module for fetching, processing and storing the retrieved web pages. The configuration is located in ./config/goodiffmonitor.ini and the monitored services are defined in ./config/providers.xml . Don't forget to create two Subversion repositories : - One for the HTML source of the web pages fetched and - One for the text version of the HTML document. == Requirements == * Python >= 2.4 * BeautifulSoup (http://www.crummy.com/software/BeautifulSoup/) * pysvn (http://pysvn.tigris.org/) == Authors == Michael G. Noll - http://www.michael-noll.com/ Alexandre Dulaunoy - http://www.foo.be/ Software also includes : === html2text (http://www.aaronsw.com/2002/html2text/) === Aaron Swartz - http://www.aaronsw.com/ == License == Copyright (C) 2006-2009 Alexandre Dulaunoy - http://www.foo.be/ Copyright (C) 2006-2009 Michael G. Noll - http://www.michael-noll.com/ This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details. You should have received a copy of the GNU Affero General Public License along with this program. If not, see <http://www.gnu.org/licenses/>.
About
GooDiff core software for fetching, processing and storing the retrieved web pages.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published