Skip to content
This repository was archived by the owner on Nov 7, 2018. It is now read-only.
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
25 changes: 25 additions & 0 deletions CONTRIBUTING.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
## Team Processes

* Use PEP8 as the coding standard for Python.
* Pull requests for all commits, even typos.
* Don't merge your own pull request. Find a friend to review your code and merge your pull request.
* Pull requests some contain some tests. Ideally they would contain decent test coverage.
* If you make changes to the API, please help update the API documentation.

When creating a new pull request:

* If the pull request is still a work-in-progress and should not be merged, say so in the description and then **assign the PR to yourself**. When the PR is ready to be merged, **unassign yourself and add a comment**.
* If a new pull request is ready for review, **leave it unassigned**. This is the assumed state of new PRs, but work-in-progress PRs are quite welcome.
* If you decide to review a pull request with the intent of merging it (or deciding what still needs to be done before merge), then **assign the PR to yourself** so that it's clear that someone's grabbed it.
* Anyone is welcome to informally review a PR and comment on it at any time, no matter who is assigned.

## Public domain

This project is in the public domain within the United States, and
copyright and related rights in the work worldwide are waived through
the [CC0 1.0 Universal public domain dedication](https://creativecommons.org/publicdomain/zero/1.0/).

All contributions to this project will be released under the CC0
dedication. By submitting a pull request, you are agreeing to comply
with this waiver of copyright interest.

34 changes: 34 additions & 0 deletions LICENSE.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,34 @@
# Public domain

As a work of the United States Government, this project is in the
public domain within the United States.

Additionally, we waive copyright and related rights in the work
worldwide through the CC0 1.0 Universal public domain dedication.

## CC0 1.0 Universal Summary

This is a human-readable summary of the
[Legal Code (read the full text)](https://creativecommons.org/publicdomain/zero/1.0/legalcode).

### No Copyright

The person who associated a work with this deed has dedicated the work to
the public domain by waiving all of his or her rights to the work worldwide
under copyright law, including all related and neighboring rights, to the
extent allowed by law.

You can copy, modify, distribute and perform the work, even for commercial
purposes, all without asking permission.

### Other Information

In no way are the patent or trademark rights of any person affected by CC0,
nor are the rights that other persons may have in the work or in how the
work is used, such as publicity or privacy rights.

Unless expressly stated otherwise, the person who associated a work with
this deed makes no warranties about the work, and disclaims liability for
all uses of the work, to the fullest extent permitted by applicable law.
When using or citing the work, you should not imply endorsement by the
author or the affirmer.
12 changes: 12 additions & 0 deletions setup.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
from distutils.core import setup

readme = open('README.md').read()

setup(
name="TextExtraction",
version="0.0",
packages=['textextraction',],
description="Extract or OCR text from a PDF document",
license="Public Domain",
long_description=readme,
)
Empty file added tests/__init__.py
Empty file.
File renamed without changes.
6 changes: 3 additions & 3 deletions tests.py → tests/test_extraction.py
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
import unittest
import doc_process_toolkit as dpt
from textextraction import doc_process_toolkit as dpt


class TestDocProcessToolkit(unittest.TestCase):
Expand All @@ -21,10 +21,10 @@ def test_check_for_text(self):
Check if check_for_text returns True when document contains text
"""

doc_path = "fixtures/record_text.pdf"
doc_path = "tests/fixtures/record_text.pdf"
self.assertTrue(dpt.check_for_text(doc_path))

doc_path = "fixtures/record_no_text.pdf"
doc_path = "tests/fixtures/record_no_text.pdf"
self.assertFalse(dpt.check_for_text(doc_path))

if __name__ == '__main__':
Expand Down
Empty file added textextraction/__init__.py
Empty file.