Skip to content

Commit

Permalink
update readme
Browse files Browse the repository at this point in the history
  • Loading branch information
Shreeshrii committed Jun 9, 2018
1 parent 4290951 commit a01d160
Showing 1 changed file with 5 additions and 3 deletions.
8 changes: 5 additions & 3 deletions unlvtests/README.md
Expand Up @@ -40,10 +40,12 @@ wget -O spa.stopwords.txt https://raw.githubusercontent.com/stopwords-iso/stopwo
```
Edit ~/ISRI-OCRtk/stopwords/spa.stopwords.txt
wordacc uses a space delimited stopwords file, not line delimited.
s/\n/ /g

Edit *~/ISRI-OCRtk/spn.3B/pages*
delete the line containing the following imagename as it crashes tesseract.
7733_005.3B.tif
Edit ~/ISRI-OCRtk/spn.3B/pages
Delete the line containing the following imagename as it [crashes tesseract](https://github.com/tesseract-ocr/tesseract/issues/1647#issuecomment-395954717).

7733_005.3B 3

### Step 3: Download the modified ISRI toolkit, make and install the tools :
These will be installed in /usr/local/bin.
Expand Down

0 comments on commit a01d160

Please sign in to comment.