
pdfsail 🚒

pdfsail is an open-source PDF processing library designed to simplify manipulating, parsing, and converting PDF files. Whether you need to convert a PDF to another format or extract information from one, pdfsail makes it easy.

🚀 Features

  • PDF Conversion: Convert PDF files to images or text.
  • Text Extraction: Extract text, metadata, images, etc., from PDFs.
  • Page Operations: Merge, split, rotate, crop pages, and more.
  • Cross-platform: Supports all major operating systems (Windows, macOS, Linux).
  • High Performance: Optimized to run smoothly even with large files.

📦 Installation

Install via Python’s package manager pip:

pip install pdfsail

💻 Usage Examples

Here are a few simple examples to get you started with pdfsail.

1. Convert PDF to Images

from pdfsail import PDFToImage

# Convert PDF to images (one image per page)
pdf_to_image = PDFToImage("example.pdf")
pdf_to_image.convert("output_folder/")

2. Extract Text from PDF

from pdfsail import PDFTextExtractor

# Extract text from PDF
pdf_text = PDFTextExtractor("example.pdf")
text = pdf_text.extract_text()
print(text)

3. Merge PDF Files

from pdfsail import PDFMerger

# Merge multiple PDFs into a single file
pdf_merger = PDFMerger()
pdf_merger.merge(["file1.pdf", "file2.pdf"], "output_merged.pdf")

πŸ› οΈ Tech Stack

  • Python: Main development language.
  • PyPDF2: For basic PDF operations.
  • Pillow: For converting PDF pages to images.
  • PDFMiner: For text extraction.
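
To give a rough idea of what the PyPDF2 layer listed above might look like for merging, here is a minimal sketch. It does not show pdfsail's actual internals: `merge_pdfs` is a hypothetical helper written for illustration, and it assumes PyPDF2 2.x or 3.x, where the merger class is named `PdfMerger`.

```python
from pathlib import Path


def merge_pdfs(input_paths, output_path):
    """Concatenate the given PDFs, in order, into output_path.

    Hypothetical helper for illustration; not part of pdfsail's documented API.
    """
    paths = [Path(p) for p in input_paths]
    if not paths:
        raise ValueError("need at least one input PDF")
    missing = [str(p) for p in paths if not p.is_file()]
    if missing:
        raise FileNotFoundError("input PDF(s) not found: " + ", ".join(missing))

    # Imported here so the checks above still run where PyPDF2 is not installed.
    # Assumes PyPDF2 2.x/3.x; older releases name this class PdfFileMerger.
    from PyPDF2 import PdfMerger

    merger = PdfMerger()
    try:
        for path in paths:
            merger.append(str(path))  # appends every page of this file
        with open(output_path, "wb") as out:
            merger.write(out)
    finally:
        merger.close()  # release the file handles held by the merger
    return Path(output_path)
```

Calling `merge_pdfs(["file1.pdf", "file2.pdf"], "output_merged.pdf")` would produce a single file containing the pages of both inputs in the order given. Validating the inputs before touching PyPDF2 keeps the error messages clear when a path is mistyped.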

📄 Documentation & Help

🤝 Contributing

We welcome contributions in any form! If you have suggestions or find a bug, feel free to submit a Pull Request or open an Issue.

  1. Fork the project and clone it to your local machine.
  2. Create a new branch.
  3. Commit your changes.
  4. Submit a Pull Request.

πŸ“ License

The pdfsail project is licensed under the MIT License.
