Hint which parser to apply first using the filename #135

julik · 2019-07-06T23:56:34Z

In some situations we spend way more time applying parsers because the file is in a less-popular format - for example a WAV file. We spend quite a bit of time going through other parsers - some of which are slow. So before we have matched it as a WAV we have to walk through the JPEG parser, then through the ZIP parser (which is quite eager and slow-ish as well), and only at the end we are going to match it as a WAV file.

However, if we know the filename upfront we can do it smarter and apply the parser that is more likely to match first. Note that we won't be assuming the file is of a certain format just based on the filename - we are merely going to rearrange the order of application of the parsers, to optimise it for this particular file.

linkyndy · 2019-07-08T07:49:54Z

.travis.yml

@@ -1,9 +1,7 @@
 rvm:
 - 2.2.0
- 2.3.0
- 2.4.2
- 2.5.0


I would keeps supporting all Ruby versions that didn't reach their EOL.

What I wanted to do is to have sufficient "spread" of versions but run less builds. So I've set the lowest supported and the newest supported, assuming that versions in between will kinda work 🕵🏻‍♂️ Do you think it is safe enough, or would you prefer us test with all minor Ruby versions?

I would do with all latest minor versions TBH:

- 2.4.6 - 2.5.5 - 2.6.3 - jruby

(I'm not up to date with jruby)

Okay, let's do that until we bump major

lib/format_parser.rb

Keep 2.2 because our major version still should be compatible with it - this is a guarantee we give our users

julik added 7 commits July 7, 2019 01:24

Add filename-based hinting methods

572b3fb

Add a little test to make sure we are ++

c9f7c5d

Add filename_hint to parse() etc.

f891ac9

Satisfy the rubocop

5f3173a

Like so

b103f68

Reduce the number of Rubies we test

45e63a1

And this?

ce258e2

julik requested a review from linkyndy July 7, 2019 00:11

julik changed the title ~~Hint which parser should be applied first using filename hinting~~ Hint which parser should be applied first using the filename Jul 7, 2019

julik changed the title ~~Hint which parser should be applied first using the filename~~ Hint which parser to apply first using the filename Jul 7, 2019

Improve comments a little

366a4a9

linkyndy reviewed Jul 8, 2019

View reviewed changes

Re-add more Rubies

dc68e3b

Keep 2.2 because our major version still should be compatible with it - this is a guarantee we give our users

linkyndy approved these changes Jul 8, 2019

View reviewed changes

julik merged commit 87a3d05 into master Jul 8, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hint which parser to apply first using the filename #135

Hint which parser to apply first using the filename #135

julik commented Jul 6, 2019

linkyndy Jul 8, 2019

julik Jul 8, 2019

linkyndy Jul 8, 2019 •

edited

julik Jul 8, 2019

@@ @@ -1,9 +1,7 @@ @@
               rvm:
               - 2.2.0
-              - 2.3.0
-              - 2.4.2
-              - 2.5.0

Hint which parser to apply first using the filename #135

Hint which parser to apply first using the filename #135

Conversation

julik commented Jul 6, 2019

linkyndy Jul 8, 2019

Choose a reason for hiding this comment

julik Jul 8, 2019

Choose a reason for hiding this comment

linkyndy Jul 8, 2019 • edited

Choose a reason for hiding this comment

julik Jul 8, 2019

Choose a reason for hiding this comment

linkyndy Jul 8, 2019 •

edited