抓取精选文件过多导致pdfkit生成不了pdf文件 #1

xingstarx · 2019-05-10T14:02:39Z

def make_pdf():
    html_files = []
    for index, html in enumerate(range(0, 400)):
        file = str(index) + ".html"
        html_files.append(file)
        

    options = {
        "user-style-sheet": "test.css",
        "page-size": "Letter",
        "margin-top": "0.75in",
        "margin-right": "0.75in",
        "margin-bottom": "0.75in",
        "margin-left": "0.75in",
        "encoding": "UTF-8",
        "custom-header": [("Accept-Encoding", "gzip")],
        "cookie": [
            ("cookie-name1", "cookie-value1"), ("cookie-name2", "cookie-value2")
        ],
        "outline-depth": 10,
    }
    try:
        pdfkit.from_file(html_files, "xxx电子书.pdf", options=options)
    except Exception as e:
        pass

    for file in html_files:
        os.remove(file)

    print("已制作电子书在当前目录！")

xingstarx · 2019-05-10T14:06:20Z

可以考虑先获取所有的html文件，然后生成多个pdf文件(xxx-1.pdf, xxx-2.pdf类似这样的)，那么我们就可以考虑修改make_pdf的方法了，在内部记得修改为上面的代码(很简单的，for循环)， for index, html in enumerate(range(0, 400)): 类似这种，自己控制文件的起始序号到结束序号。然后执行程序即可。

if __name__ == '__main__':
    make_pdf()

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

抓取精选文件过多导致pdfkit生成不了pdf文件 #1

抓取精选文件过多导致pdfkit生成不了pdf文件 #1

xingstarx commented May 10, 2019

xingstarx commented May 10, 2019

抓取精选文件过多导致pdfkit生成不了pdf文件 #1

抓取精选文件过多导致pdfkit生成不了pdf文件 #1

Comments

xingstarx commented May 10, 2019

xingstarx commented May 10, 2019