bdfr2text

Converts output files from Bulk Downloader for Reddit into pretty text files like this:

Issues and PRs are welcome.

Usage

$ git clone https://github.com/DownrightNifty/bdfr2text.git
$ cd bdfr2text
$ python3 bdfr2text.py INPUT_DIR OUTPUT_DIR

INPUT_DIR is the output dir of bdfr archive. See python3 bdfr2text.py -h.

Only JSON or YAML (not XML) output from BDFR is supported. If converting YAML files, PyYAML is necessary (but this should already have been installed by BDFR). Otherwise, no dependencies.

Parsable output

The --parsable-out (-p) option produces a parsable output by escaping delimiters used by bdfr2text found in the Reddit posts. It replaces [ with [, ] with ], and --- with ┄.

Searching through output

--parsable-out makes the output text files searchable with your favorite programs. Personally, I use Sublime Text, which can search entire folders and supports regex. For example, you could use the following regex to search for the string "query" within Reddit comments (excluding metadata blocks): query(?=[^\]]+\[)

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bdfr2text.py		bdfr2text.py
demo.png		demo.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bdfr2text

Usage

Parsable output

Searching through output

See also

About

Releases

Packages

Languages

License

DownrightNifty/bdfr2text

Folders and files

Latest commit

History

Repository files navigation

bdfr2text

Usage

Parsable output

Searching through output

See also

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages