Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Better modularisation #124
First off, love the project and the work that's gone into it, SO useful.
I've been working on a project recently where a binary file (dll) was read as UTF8, and so invalid bytes were corrupted with the utf-8 replacement character. As you could imagine, this makes it rather hard to parse.
I've been slowly making progress with pefile and a ton of reference material, but it's highlighted a point to me that I thought I would raise.
As it currently is, pefile is REALLY good at 'point and click' for a valid binary file, but nowhere near as simple to use piecemeal as an analysis toolkit. Since it will basically error out at the first problem it finds, it never really gives you a chance to properly use what it's already parsed/manually correct things before trying to parse further.
So I decided maybe I could just call each relevant section directly, and build up my own 'parse'. In doing so, I found that the
What I think would end up being awesome, is modularising the relevant parsing sections (including the little checks/fixes along the way), and making them callable without needing an instance of
This is getting a little long and is a bit mind dumpy, so i'll leave it here, but more than happy to further discuss how I think this would work/help implement it.