Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Recognize txt #836

Open
wants to merge 3 commits into
base: master
Choose a base branch
from

Commits on Dec 22, 2023

  1. Add test samples with various text encodings

    Test-driven development: want to correctly detect these as text in ftguess.
    Already use future ftguess text type.
    
    Since we're at it: slightly improve output of unittest
    christian-intra2net committed Dec 22, 2023
    Configuration menu
    Copy the full SHA
    7eb14b4 View commit details
    Browse the repository at this point in the history
  2. Implement detection of plain text

    This is not so simple since various text encodings can look rather
    "binary", but a few simple heuristics will deal with many text types (at
    least those encountered here in Europe).
    
    Of course, all xml is text as well, so use checks for "is this text" only
    after more specialized tests like "is this xml".
    christian-intra2net committed Dec 22, 2023
    Configuration menu
    Copy the full SHA
    9560015 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    929d2c0 View commit details
    Browse the repository at this point in the history