Skip to content
Python tool like unix csplit but which splits binary files based on content.
Python
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
csplitb
scripts
.gitignore Initial commit Oct 31, 2013
LICENSE
MANIFEST.in
README.md
setup.cfg
setup.py

README.md

csplitb

Command-line tool like unix csplit but which splits binary files based on content boundaries rather than index boundaries.

This is particularly useful for salvaging jpegs from a corrupted disk image.

Example usage:

$ csplitb --prefix photo --suffix .jpg --number 4 ffd8ffe1 corrupted-disk-image.raw

This will split the disk image, keeping the header, into photo0000.jpg, photo0001.jpg, etc. Because a disk image represents a potentially fragmented file system, some (or all) of the jpeg files may be corrupted.

The "ffd8ffe1" here is a jpeg header. I'm not an expert on the jpeg format, so it's possible that you may need to use a different hex search term. Use a hex editor or viewer such as xxd to find what content you want to split on.

Installation

pip install csplitb

You can’t perform that action at this time.