Need an example of apply watermark #110

StevenLOL · 2020-07-15T15:17:03Z

hi,

Need an example of applying watermark.

Tested with following code, It doesn't work as I exptected, the "watermark" was not on the right location.

from reportlab.pdfgen import canvas
from pikepdf import Array, Dictionary, Name, Pdf, PdfMatrix, Stream


INPUT_PDF="./test3.pdf"
WATERMARK_PDF='./test4.pdf'
OUTPUT_PDF='./test3_test4.pdf'

def generate_watermark(msg,fileName,x=55,y=220):
    c = canvas.Canvas(fileName, bottomup=0)
    c.setFontSize(32)
    c.setFillColorCMYK(0, 0, 0, 0, alpha=0.7)
    c.rect(204, 199, 157, 15, stroke=0, fill=1)
    c.setFillColorCMYK(0, 0, 0, 100, alpha=0.7)
    c.drawString(x, y,msg )
    c.save()
   
# generate two pdfs
generate_watermark('file3',INPUT_PDF,100,100)
generate_watermark('file4',WATERMARK_PDF)


with pikepdf.open(INPUT_PDF) as input_pdf, \
            pikepdf.open(WATERMARK_PDF) as watermark_pdf, \
            open(OUTPUT_PDF, 'wb') as output_stream:
        
        # Create new output PDF
        output_pdf = pikepdf.new()


        for i in range(len(input_pdf.pages)):
            #load and insert watermark
            input_pdf.pages[i].page_contents_add(watermark_pdf.pages[0].Contents)
            input_pdf.pages[i].page_contents_coalesce()

        output_pdf.pages.extend(input_pdf.pages)
        output_pdf.save(output_stream)  # save to a new file

jbarlow83 · 2020-07-21T20:53:24Z

It's usually best to capture a page as a Form XObject:

dictpage = watermark_pdf.pages[0]
page = pikepdf.Page(dictpage)
formx = page.as_form_xobject()

Then attach it to input_pdf as a resource and draw it.

The reason it's not working is likely the alpha channel. Transparency, like so many PDF features, was awkwardly bolted on after the original spec. The alpha channel information goes into a /Resources /ExtGState object (extended graphics state), and the content stream will activate it. However, if you merge with the input page, you need to check its resource dictionary for name conflicts and possibly edit the content stream. It may be that reportlab put other interesting details in to ExtGState as well.

pmg007 · 2020-12-24T00:36:21Z

Hello @jbarlow83
I am trying to add a watermark to a pdf.
I have created a watermark pdf using reportlab and now I am trying to add that to the input pdf using page_contents_add almost just like mentioned in the code snippet above by Steven. I am having no luck getting it to watermark all pages, the first page which is simple one gets watermarked while others do not.
Referred following:
#110
#95
#42
#43

All of them are trying to use page_contents_add but have no luck. I see that it is recommended in some comments which say that we need to somehow add that as a resource or copy it into the input pdf in Resources section, but I am not sure how to achieve that. The usage is not clear. Could you please provide some examples or sample code snippet/template on how to do that?
Thanks.
References:
#42 (comment)
#110 (comment)

Also, @StevenLOL if you were able to get the pdf watermarked, any of your help is appreciated!

jbarlow83 · 2020-12-24T08:43:49Z

Here's an improved, fully functioning example. This is also an improved over the past versions that combined the content streams. This one instead captures the watermark in a "form XObject" (sort of like a sub-page) to isolate it from the rest of the document and makes it less likely to alter the rest of the document.

In a future release I will add add_resource to pikepdf's codebase (or some variation thereof) which will simplify this.

import pikepdf
from pikepdf import Name, Pdf, Object, Dictionary
from reportlab.pdfgen import canvas
from typing import Optional

def generate_watermark(msg,xy):
    x, y = xy
    buf = BytesIO()
    c = canvas.Canvas(buf, bottomup=0)
    c.setFontSize(32)
    c.setFillColorCMYK(0, 0, 0, 0, alpha=0.7)
    c.rect(204, 199, 157, 15, stroke=0, fill=1)
    c.setFillColorCMYK(0, 0, 0, 100, alpha=0.7)
    c.drawString(x, y, msg)
    c.save()
    buf.seek(0)
    return buf

wm = generate_watermark('Watermark', (100, 100))
txt = generate_watermark('Document text', (200, 200))

with pikepdf.open(wm) as pdf_wm, pikepdf.open(txt) as pdf_txt:
    wm_page = pikepdf.Page(pdf_wm.pages[0])
    wm_formx = wm_page.as_form_xobject()

    formx = pdf_txt.copy_foreign(wm_formx)
    page = pdf_txt.pages[0]
    formx_page = pikepdf.Page(page)
    formx_name = formx_page.add_resource(formx, Name.XObject)
    
    draw_watermark_content_stream = pdf_txt.make_stream(b'q 1 0 0 1 0 0 cm %s Do Q' % formx_name)

    pdf_txt.pages[0].page_contents_add(draw_watermark_content_stream, prepend=True)
    pdf_txt.save('out.pdf')

pmg007 · 2020-12-28T15:22:43Z

Thanks @jbarlow83, I will try this out.

pmg007 · 2021-01-05T20:01:05Z

Hi,
I tried this template out and it has helped up to an extent, thanks! Although for some pdfs I see that I am not able to see the watermark, when I search using Cmd+F I can see that text is present at the bottom of all pages in the pdf but is not visible to the eye. Any hints or guess so as to what might be the cause for that?

jbarlow83 · 2021-01-05T22:02:31Z

If a document completely draws it over the page it will hide the watermark. It might do this by drawing an image or filled rectangle. You could draw the watermark last, as a transparent overlay, or make whatever is on top of the watermark use a transparent background. Could also be a page with an unusual weird MediaBox or CropBox.

…

On Tue., Jan. 5, 2021, 12:01 pmg007, ***@***.***> wrote: Hi, I tried this template out and it has helped up to an extent, thanks! Although for some pdfs I see that I am not able to see the watermark, when I search using Cmd+F I can see that text is present at the bottom of the pdf but is not visible to eye. Any hints or guess so as to what might be the cause for that? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#110 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAN5YM4AJCKCDPJ4M7W2NNDSYNVZHANCNFSM4O2UL2MQ> .

pmg007 · 2021-01-06T16:23:23Z

Thanks for the quick reply. What do you mean by adding the watermark last? In the code template above, we are adding a watermark to the existing page afterward, or am I mistaken?

jbarlow83 · 2021-01-06T16:42:50Z

prepend=True, so the watermark is placed before all other drawing operations.

…

On Wed., Jan. 6, 2021, 08:23 pmg007, ***@***.***> wrote: Thanks for the quick reply. What do you mean by adding the watermark last? In the code template above, we are adding a watermark to the existing page afterward, or am I mistaken? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#110 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAN5YM73SJEX2M6RQM4BMMTSYSFAXANCNFSM4O2UL2MQ> .

pmg007 · 2021-01-06T17:04:12Z

Got it thanks, it mostly works now except in one of the faulty PDFs where it shows a mirror image of the watermark on the next page. Anyways, thanks a lot @jbarlow83

jbarlow83 · 2021-01-06T19:49:53Z

I updated the example to demonstrate the use pikepdf.Page.add_resource which is available in pikepdf 2.3.0.

pmg007 · 2021-01-06T21:34:24Z

That's great! Thanks for the new release with add_resource in it!

basileos · 2021-01-29T15:49:01Z

Thank you very much, @jbarlow83 you rock!

Sofa0908 · 2021-04-20T01:14:46Z

Thank you so much @jbarlow83, This helped out a lot in my case as well.

However I'm curious of the purpose and meaning of the b'q 1 0 0 1 0 0 cm %s Do Q' % formx_name in
draw_watermark_content_stream = pdf_txt.make_stream(b'q 1 0 0 1 0 0 cm %s Do Q' % formx_name)

I've checked the documentation on make_stream, but the examples or descriptions don't even come close to what you have written here. Could you maybe show me a pointer to where can I read more about this kind of usages and what do they mean respectively?

Please and thank you.

jbarlow83 · 2021-04-20T02:01:20Z

It's PDF content stream operators. q - push graphics stack 1 0 0 1 0 0 cm - multiply current transform matrix by identify matrix %s Do - draw named object here Q - pop stack The identity matrix isn't strictly necessary. It's a clue that you could use a different matrix to perform rotation, scale, translation. See PDF manual on content streams.

…

On Mon., Apr. 19, 2021, 18:15 Vincent Yang, ***@***.***> wrote: Thank you so much @jbarlow83 <https://github.com/jbarlow83>, This helped out a lot in my case as well. However I'm curious of the purpose and meaning of the b'q 1 0 0 1 0 0 cm %s Do Q' % formx_name in draw_watermark_content_stream = pdf_txt.make_stream(b'q 1 0 0 1 0 0 cm %s Do Q' % formx_name) I've checked the documentation on make_stream <https://pikepdf.readthedocs.io/en/latest/api/main.html?highlight=make_stream#pikepdf.Pdf.make_stream>, but the examples or descriptions don't even come close to what you have written here. Could you maybe show me a pointer to where can I read more about this kind of usages and what do they mean respectively? Please and thank you. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#110 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAN5YM53CRCCP6YBM2BNPQ3TJTIRLANCNFSM4O2UL2MQ> .

jbarlow83 · 2021-07-07T08:16:54Z

Now implemented officially https://pikepdf.readthedocs.io/en/latest/topics/overlays.html

StevenLOL changed the title ~~Unable~~ Need an example of apply watermark Jul 15, 2020

jbarlow83 added the question label Jul 21, 2020

StevenLOL closed this as completed Apr 23, 2021

mara004 mentioned this issue Sep 18, 2021

Add support for adding images overlay pdfarranger/pdfarranger#530

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Need an example of apply watermark #110

Need an example of apply watermark #110

StevenLOL commented Jul 15, 2020 •

edited by jbarlow83

jbarlow83 commented Jul 21, 2020

pmg007 commented Dec 24, 2020 •

edited

jbarlow83 commented Dec 24, 2020 •

edited

pmg007 commented Dec 28, 2020

pmg007 commented Jan 5, 2021 •

edited

jbarlow83 commented Jan 5, 2021 via email

pmg007 commented Jan 6, 2021

jbarlow83 commented Jan 6, 2021 via email

pmg007 commented Jan 6, 2021

jbarlow83 commented Jan 6, 2021

pmg007 commented Jan 6, 2021

basileos commented Jan 29, 2021

Sofa0908 commented Apr 20, 2021

jbarlow83 commented Apr 20, 2021 via email

jbarlow83 commented Jul 7, 2021

Need an example of apply watermark #110

Need an example of apply watermark #110

Comments

StevenLOL commented Jul 15, 2020 • edited by jbarlow83

jbarlow83 commented Jul 21, 2020

pmg007 commented Dec 24, 2020 • edited

jbarlow83 commented Dec 24, 2020 • edited

pmg007 commented Dec 28, 2020

pmg007 commented Jan 5, 2021 • edited

jbarlow83 commented Jan 5, 2021 via email

pmg007 commented Jan 6, 2021

jbarlow83 commented Jan 6, 2021 via email

pmg007 commented Jan 6, 2021

jbarlow83 commented Jan 6, 2021

pmg007 commented Jan 6, 2021

basileos commented Jan 29, 2021

Sofa0908 commented Apr 20, 2021

jbarlow83 commented Apr 20, 2021 via email

jbarlow83 commented Jul 7, 2021

StevenLOL commented Jul 15, 2020 •

edited by jbarlow83

pmg007 commented Dec 24, 2020 •

edited

jbarlow83 commented Dec 24, 2020 •

edited

pmg007 commented Jan 5, 2021 •

edited