Skip to content

Conversation

Amit366
Copy link
Contributor

@Amit366 Amit366 commented Mar 25, 2021

Description

Converts a pdf file to text file

Fixes #608

Have you read the Contributing Guidelines on Pull Requests?

Yes

Copy link

@Kushal997-das Kushal997-das left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Amit366 if i put my path like the below it will give me the error . It occurs because i used This a normal string as a path. To overcome this just put r before your normal string it converts normal string to raw string. So change this so that everyone can use your project .

gssoc1 (2)

Moreover , there could be more error i think . Please @kaustubhgupta check this carefully thank you :)

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 25, 2021

@Kushal997-das on running the script it asks the user to enter the url. But as I can see you have changed input part.

Copy link
Contributor

@kaustubhgupta kaustubhgupta left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why there are flappy bird game files in this PR?

@kaustubhgupta kaustubhgupta added the bug Something isn't working label Mar 25, 2021
@Amit366
Copy link
Contributor Author

Amit366 commented Mar 25, 2021

@kaustubhgupta I am really sorry

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 25, 2021

I will remove that

@kaustubhgupta
Copy link
Contributor

I will remove that

Don't remove them one by one. Just delete this branch, and create new PR from new branch

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 25, 2021

@kaustubhgupta I already deleted and also commited it

@Kushal997-das
Copy link

@Kushal997-das on running the script it asks the user to enter the url . But as I can see you have changed input part.

It doesn't matter if you give input runtime or not it gives you same error as same input .

@Kushal997-das
Copy link

@Kushal997-das on running the script it asks the user to enter the url . But as I can see you have changed input part.

It doesn't matter if you give input runtime or not it gives you same error as same input .

@Kushal997-das in run time it doesn't give the error

Could You please make a demo video of it ? And moreover in worst case scenario your program should run in both cases .

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 25, 2021

@Kushal997-das I made the changes

@Kushal997-das
Copy link

@Kushal997-das I made the changes

@Amit366 Yeah i saw . @kaustubhgupta Could you please run this code in your local because in my system this program is not working properly .

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 25, 2021

@Kushal997-das is it showing any error??


Sit back and Relax. Let the Script do the Job. ☕

### Requirements
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
### Requirements
## Requirements


st += page.getPage(i).extractText()

with open('./PDF2Text/pages/text.txt','w') as f:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will not work as there is no folder named "pages".

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 26, 2021

@kaustubhgupta made the changes

Copy link
Contributor

@kaustubhgupta kaustubhgupta left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No text gets extracted from PDFs. Only blank test files were generated. Test the script locally and fix the bugs.

@Kushal997-das
Copy link

No text gets extracted from PDFs. Only blank test files were generated. Test the script locally and fix the bugs.

Same problem i also faced @kaustubhgupta

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 27, 2021

@kaustubhgupta it's working fine for me

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 27, 2021

@kaustubhgupta do I thing. Place the PDF file into the script folder. Then try once.

@kaustubhgupta
Copy link
Contributor

@kaustubhgupta do I thing. Place the PDF file into the script folder. Then try once.

It's not about the path, it' about: no text was present in the files. I tested this for my resume, the document with text only

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 27, 2021

@kaustubhgupta then how is it working for me

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 27, 2021

I will send the pics.

@kaustubhgupta
Copy link
Contributor

kaustubhgupta commented Mar 27, 2021

No text gets extracted from PDFs. Only blank test files were generated. Test the script locally and fix the bugs.

Same problem i also faced @kaustubhgupta

I am not the only one @Amit366

@Kushal997-das
Copy link

I will send the pics.

Send one gif of this even in my case i faced same problem .

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 27, 2021

Screenshot (127)

Screenshot (128)

Screenshot (129)

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 27, 2021

Screenshot (131)

Screenshot (132)

@kaustubhgupta
Copy link
Contributor

@Amit366 Okay got it. Include these Screenshots in the readme file too. Name that section: "Expected Output"

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 27, 2021

@kaustubhgupta I have placed the pics into the readme

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 27, 2021

@kaustubhgupta please review it.

Copy link
Contributor

@kaustubhgupta kaustubhgupta left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Images not rendered:

image

@kaustubhgupta kaustubhgupta added minor-change-not-bug Suggested a minor change in code, not a bug and removed bug Something isn't working labels Mar 27, 2021
@Amit366
Copy link
Contributor Author

Amit366 commented Mar 27, 2021

@kaustubhgupta please review it

@kaustubhgupta kaustubhgupta added next review needed Approved by some mentors, more approvals needed and removed minor-change-not-bug Suggested a minor change in code, not a bug labels Mar 28, 2021
@santushtisharma10 santushtisharma10 added Approved PR Approved and Ready to Merge gssoc23 Issues created for/by the GirlScript Summer of Code'23 Participants level1 Wiki or Major Documentation Work and removed next review needed Approved by some mentors, more approvals needed labels Mar 28, 2021
Copy link

@Kushal997-das Kushal997-das left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 28, 2021

@kaustubhgupta it was a level 2 issue but it is given level 1. Please check.

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 29, 2021

@kaustubhgupta can u please check the level

@kaustubhgupta
Copy link
Contributor

@kaustubhgupta can u please check the level

Yes, the level was initially 2 but as you submitted the code, 3 days ago, we changed it to level 1 as the code doesn't work for all types of files plus the Code Quality was not up to level2

@Amit366
Copy link
Contributor Author

Amit366 commented Mar 29, 2021

@kaustubhgupta if the script is run in Vs code it will run fine .Once try it out, it works for all types of files (pdf files)

@avinashkranjan avinashkranjan merged commit c4e0720 into avinashkranjan:master Mar 30, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Approved PR Approved and Ready to Merge gssoc23 Issues created for/by the GirlScript Summer of Code'23 Participants level1 Wiki or Major Documentation Work
Projects
None yet
Development

Successfully merging this pull request may close these issues.

PDF2text
6 participants