Saving a PDF document #116

remil19 · 2013-08-09T15:50:37Z

J'ai essayé d'ajouter un pdf qui s'était ouvert dans mon navigateur via pdf.js à poche, mais celui-ci l'enregistre comme du texte brut et sans titre. Ce serait bien qu'il enregistre un titre et qu'il l'ouvre dans un onglet quand on clique dessus.

nicosomb · 2014-01-30T14:57:51Z

Hello @remil19!
Sorry to answer so late.

I don't know if this feature has to be implemented in wallabag (our new name).
If anyone has an idea?

tcitworld · 2014-01-30T17:59:26Z

In fact, with a PDF file, no informations about the title are transmitted by HTTP. You can test it with any pdf document online with http://web-sniffer.net/.
So the only information we could get would be the filename, and sometimes they're not at all speaking to the user. But maybe if Wallabag v2 allows to change the title, something can be made.
However, a detection should be made (with the Content-Type : application/pdf http header) to say to Wallabag "this isn't a regular page".

Of course, we can also use an external library to read metadata informations.

nicosomb · 2014-01-31T13:47:58Z

I just added the "Plugin" label to this issue.

remil19 · 2014-02-21T23:47:13Z

Sorry to answer so late but the first line of a PDF usually begin with %PDF so i guess we could look for this string when a entry is created (and i think it could be usefull to be implemented directly in Wallabag : many long and interesting documents are published in PDF and it may not require a lot of modification : maybe just a boolean in the database or a type attribute and you also could just use this viewer: https://github.com/mozilla/pdf.js).

mariroz · 2014-02-22T07:31:56Z

hi, @remil19 , yes, of course, pdf should be handled when entry is imported. But not by parsing itself, but 1 step ahead: by checking document http headers. (see related issue #444 about plain text handling). I hope, that now or later we will implement this. Anyway I will try :).

tcitworld · 2014-02-22T08:46:31Z

Yes, detection with parsing first bytes of files isn't really easy to made, compared to http headers detection. Although, I don't know if all servers serve pdf properly.

tcitworld · 2014-06-07T14:59:01Z

Going for v2.x.

nicosomb · 2014-10-11T20:53:05Z

Assigned to Tender discussion #4.

j0k3r · 2015-09-15T07:05:57Z

For now, instead of storing the pdf itself, we provide a text version of it: j0k3r/graby#16

nicosomb · 2016-04-08T12:25:12Z

Done by @j0k3r in graby 👍

mdimura · 2016-04-16T16:41:28Z

I tried saving PDF-url with wallabag v2.0.1, but I get " wallabag can't retrieve contents for this article. Please report this issue to us. " error. Would be great if wallabag downloaded the original PDF and stored it locally for future reading.

nicosomb · 2016-04-18T07:51:01Z

Can you open a new issue for that please?

toobluescientist · 2024-02-28T05:29:23Z

I tried saving PDF-url with wallabag v2.0.1, but I get " wallabag can't retrieve contents for this article. Please report this issue to us. " error. Would be great if wallabag downloaded the original PDF and stored it locally for future reading.

Hey, I have the same trouble as you. For me, I still cannot save PDF from a PDF url onto Wallabag. If it saves, sometimes the text is just unreadable. How is it now for you?

nicosomb assigned mariroz Feb 22, 2014

nicosomb modified the milestones: 1.7.0, 2.0 Feb 22, 2014

nicosomb added Feature and removed Plugin labels Feb 22, 2014

tcitworld modified the milestones: 1.8.0, 1.7.0 Apr 24, 2014

tcitworld mentioned this issue May 13, 2014

Future plans for wallabag #687

Closed

tcitworld modified the milestones: 2.0, 1.8.0 Jun 7, 2014

nicosomb removed the Question label Jul 30, 2014

j0k3r unassigned mariroz Sep 15, 2015

nicosomb closed this as completed Apr 8, 2016

nicosomb removed this from the 2.1.0 milestone Sep 14, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Saving a PDF document #116

Saving a PDF document #116

remil19 commented Aug 9, 2013

nicosomb commented Jan 30, 2014

tcitworld commented Jan 30, 2014

nicosomb commented Jan 31, 2014

remil19 commented Feb 21, 2014

mariroz commented Feb 22, 2014

tcitworld commented Feb 22, 2014

tcitworld commented Jun 7, 2014

nicosomb commented Oct 11, 2014

j0k3r commented Sep 15, 2015

nicosomb commented Apr 8, 2016

mdimura commented Apr 16, 2016 •

edited

nicosomb commented Apr 18, 2016

toobluescientist commented Feb 28, 2024

Saving a PDF document #116

Saving a PDF document #116

Comments

remil19 commented Aug 9, 2013

nicosomb commented Jan 30, 2014

tcitworld commented Jan 30, 2014

nicosomb commented Jan 31, 2014

remil19 commented Feb 21, 2014

mariroz commented Feb 22, 2014

tcitworld commented Feb 22, 2014

tcitworld commented Jun 7, 2014

nicosomb commented Oct 11, 2014

j0k3r commented Sep 15, 2015

nicosomb commented Apr 8, 2016

mdimura commented Apr 16, 2016 • edited

nicosomb commented Apr 18, 2016

toobluescientist commented Feb 28, 2024

mdimura commented Apr 16, 2016 •

edited