From a01d1604c301cac64f0e243c413846abf3553f77 Mon Sep 17 00:00:00 2001 From: Shree Devi Kumar Date: Sat, 9 Jun 2018 14:44:54 +0000 Subject: [PATCH] update readme --- unlvtests/README.md | 8 +++++--- 1 file changed, 5 insertions(+), 3 deletions(-) diff --git a/unlvtests/README.md b/unlvtests/README.md index 4522ab5bcb..d98df97fc8 100644 --- a/unlvtests/README.md +++ b/unlvtests/README.md @@ -40,10 +40,12 @@ wget -O spa.stopwords.txt https://raw.githubusercontent.com/stopwords-iso/stopwo ``` Edit ~/ISRI-OCRtk/stopwords/spa.stopwords.txt wordacc uses a space delimited stopwords file, not line delimited. +s/\n/ /g -Edit *~/ISRI-OCRtk/spn.3B/pages* -delete the line containing the following imagename as it crashes tesseract. -7733_005.3B.tif +Edit ~/ISRI-OCRtk/spn.3B/pages +Delete the line containing the following imagename as it [crashes tesseract](https://github.com/tesseract-ocr/tesseract/issues/1647#issuecomment-395954717). + +7733_005.3B 3 ### Step 3: Download the modified ISRI toolkit, make and install the tools : These will be installed in /usr/local/bin.