PDFToTextConverter: [WinError 2] The system can't find the specified file #1347
-
Hi guys, I think what you are doing is very interesting. I am currently struggling with the data preprocessing tutorial (Tutorial 8), where PDFToTextConverter fails with this error: [WinError 2] The system can't find the specified file. Unfortunately, I have not yet found a solution for it. Can you guide me?
Replies: 3 comments 5 replies
-
Hi @ehsanVIP, sorry for taking so long to reply to your question. Could this be a problem with read permissions? Or maybe the path to the file is wrong? Does the filename end with the .pdf extension?
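In case it helps, here is a minimal sketch for checking the three things asked about above (path, extension, read permission) before handing the file to the converter. The helper name describe_pdf_path and the example path are made up for illustration and are not part of Haystack; the sketch also checks for the %PDF- header that valid PDF files start with.

```python
from pathlib import Path


def describe_pdf_path(path):
    """Return a list of problems found with a PDF path (empty list if none)."""
    p = Path(path)
    problems = []
    # The filename should end in .pdf (case-insensitive).
    if p.suffix.lower() != ".pdf":
        problems.append(f"unexpected extension: {p.suffix!r}")
    if not p.exists():
        # resolve() shows the absolute path Python actually looked at,
        # which helps spot a wrong working directory.
        problems.append(f"file not found: {p.resolve()}")
    elif not p.is_file():
        problems.append(f"not a regular file: {p.resolve()}")
    else:
        try:
            with p.open("rb") as fh:
                header = fh.read(5)
            # Valid PDF files begin with the bytes %PDF-
            if header != b"%PDF-":
                problems.append("file does not start with the %PDF- header")
        except PermissionError:
            problems.append("no read permission")
    return problems


if __name__ == "__main__":
    # Hypothetical path; replace it with the file you pass to the converter.
    print(describe_pdf_path("data/sample.pdf"))
```

An empty list means the path itself looks fine, so the cause is more likely somewhere else, for example in the converter's own dependencies.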
-
Hey @ehsanVIP and also @Chance-Obondo,
My guess would be that you did not specify the path correctly, or possibly that the file is not actually a valid PDF.
-
@Chance-Obondo It seems your system has an issue with the pdftotext installation. Can you please try https://pypi.org/project/pdftotext/ or use TIKA (https://github.com/deepset-ai/haystack/blob/master/haystack/file_converter/tika.py) to convert the PDF to text?
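To check the installation, a quick sketch like the one below can tell you whether a pdftotext executable is visible to Python at all. This assumes the converter launches a command-line program named pdftotext; if that holds, [WinError 2] would point at the missing executable rather than at your PDF. The function name find_pdftotext is made up for illustration.

```python
import shutil


def find_pdftotext(executable="pdftotext"):
    """Return the full path of the executable, or None if it is not on PATH."""
    # shutil.which searches the directories listed in the PATH environment
    # variable, roughly the way the operating system does when a program
    # is launched by name.
    return shutil.which(executable)


if __name__ == "__main__":
    location = find_pdftotext()
    if location is None:
        print("pdftotext was not found on PATH")
    else:
        print(f"pdftotext found at: {location}")
```

If this reports that nothing was found, installing the tool or adding its folder to PATH (and then restarting the terminal or notebook) is the first thing to try.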