WIP: Performance improvements #85

aaronrenner · 2016-11-19T01:01:05Z

I think we can achieve a lot of performance gains in this library by using pattern matching to quickly narrow down the list of possible phone number parsers and then using regular expressions to extract the necessary data and parse the phone number. I tried the following benchmark earlier today to compare pattern matching vs regular expressions and found there's a huge speed difference.

defmodule PhoneBench do
  use Benchfella

  @andorra_phone_number "376123456"

  def phone_number_matches?("376"<>_), do: true
  def phone_number_matches?(_), do: false

  bench "pattern match with prefix" do
    true = phone_number_matches?(@andorra_phone_number)
  end

  bench "match with regex" do
    true = Regex.match?(~r/^(376)()(.{6})/, @andorra_phone_number)
  end
end

## PhoneBench
benchmark name                              iterations   average time
pattern match with prefix                    100000000   0.04 µs/op
match with regex                               1000000   1.24 µs/op

That makes me think if we can leverage elixir's pattern matching, we have a tone of room for performance improvements.

This is my first try at using pattern matching, and I think there are a lot more gains we can make. I changed how you set the the match type for each country module (to eventually allow for overriding functions) and added a number_prefix option that helps with the pattern matching in the generated match? function. I would recommend looking at the commits one at a time and letting me know what you think.
Here's the before and after performance metrics.

Before

## PhoneBench
benchmark name                              iterations   average time
Phone.parse/1 with an Andorra phone number        5000   673.05 µs/op
Phone.parse/1 with a Zimbabwe phone number        5000   691.99 µs/op
Phone.parse/1 with an NANP phone number           1000   1047.04 µs/op

With these changes

## PhoneBench
benchmark name                              iterations   average time
Phone.parse/1 with an Andorra phone number       10000   294.79 µs/op
Phone.parse/1 with a Zimbabwe phone number       10000   298.49 µs/op
Phone.parse/1 with an NANP phone number           5000   713.80 µs/op

Difference

## PhoneBench
Phone.parse/1 with a Zimbabwe phone number    0.43
Phone.parse/1 with an Andorra phone number    0.44
Phone.parse/1 with an NANP phone number       0.68

I'm curious to see if we could have even bigger gains if we could move the phone number parsing into one giant (generated) parse function with one definition for each prefix. I'm not sure how we'd do it, but it would be interesting to see how fast we can make this library.

This change makes the dynamically generated functions be written first, allowing overridable functions to be defined. Previously, when the match macro was run as the last statement, it would overwrite any previously defined functions.

Using pattern matching on the start of the string significantly increases the performance. Before ## PhoneBench benchmark name iterations average time Phone.parse/1 with an Andorra phone number 5000 673.05 µs/op Phone.parse/1 with a Zimbabwe phone number 5000 691.99 µs/op Phone.parse/1 with an NANP phone number 1000 1047.04 µs/op With these changes ## PhoneBench benchmark name iterations average time Phone.parse/1 with an Andorra phone number 10000 294.79 µs/op Phone.parse/1 with a Zimbabwe phone number 10000 298.49 µs/op Phone.parse/1 with an NANP phone number 5000 713.80 µs/op Difference ## PhoneBench Phone.parse/1 with a Zimbabwe phone number 0.43 Phone.parse/1 with an Andorra phone number 0.44 Phone.parse/1 with an NANP phone number 0.68 I still think we can do better, especially on NANP phone numbers, but this is a good start.

coveralls · 2016-11-19T01:02:47Z

Coverage decreased (-20.5%) to 55.407% when pulling fa4ff52 on aaronrenner:ar-performance-improvements into d60bc5e on fcevado:master.

coveralls · 2016-11-19T01:02:47Z

Coverage decreased (-20.5%) to 55.407% when pulling fa4ff52 on aaronrenner:ar-performance-improvements into d60bc5e on fcevado:master.

aaronrenner · 2016-11-19T01:08:39Z

Regarding doing phone number parsing with multiple function definitions, the MIME library shows a great example of how to do this: https://github.com/elixir-lang/mime/blob/master/lib/mime.ex#L93-L100. I think I'll look into that and see if I can put together something.

fcevado · 2016-11-19T03:10:49Z

Great to know that pattern matching even with string partials are faster than regex. I'll take a look at it and try to come up with some strategy to have less pattern matching possible.

fcevado · 2016-11-19T12:52:44Z

I've done some tests with pure pattern matching and pure regex match, I've come up with those results:

benchmark name    iterations   average time 
pattern matching   100000000   0.04 µs/op
pure regex          10000000   0.72 µs/op

I did it with only one case of pattern matching, I think if that number increase the performance would drop, but the difference is huge. I thought of changing the Phone.Countries module, so we could pattern match to identify the country instead of the current iteration way. Problems would come with some countries that have the same international code, for example Russia and Kazakhstan. But I think that can be solved with the iteration method only for those cases.
I'll take your commit that add benchmarks and try to come up with something.

aaronrenner · 2016-11-19T15:44:06Z

Ideally I'm thinking we could use metaprogramming to generate one big parse function with multiple definitions like this.

def parse("1970" <> <<number::byte_size(7)>>) do
  # Run regex just to be sure
  Regex.match?(...)
  {:ok, %{a2: "US", a3: "USA", country: "United States", international_code: "1", area_code: "970", number: number, area_abbreviation: "CO", area_type: "state", area_name: "Colorado"}}
end
def parse("1303" <> <<number::byte_size(7)>>) do
  # Run regex just to be sure
  Regex.match?(...)
  {:ok, %{a2: "US", a3: "USA", country: "United States", international_code: "1", area_code: "303", number: number, area_abbreviation: "CO", area_type: "state", area_name: "Colorado"}}
end

My thought is calling one big function will be faster than calling a function for each country (but we should benchmark that to see if it's true). The trick would be storing all of this data so you could generate this function at compile time. Also, I wonder if it would be faster to generate separate function definitions for each area code in a state or use the in guard clause like this:

def parse("1"<<area_code::byte_size(3)>> <> <<number::byte_size(7)>>) when area_code in ~w(303 719 720 970) do
  # Run regex just to be sure
  Regex.match?(...)
  {:ok, %{a2: "US", a3: "USA", country: "United States", international_code: "1", area_code: area_code, number: number, area_abbreviation: "CO", area_type: "state", area_name: "Colorado"}}
end

This sure is exciting to find room for such a huge performance gain. I don't think we have to do all of the checking via pattern matching, just enough to quickly exclude the other countries so we only have to run 1 regex.

Thanks again for looking at this. Please let me know how I can help.

fcevado · 2016-11-19T21:32:12Z

I understand that performance is a huge issue, but I think the main objective is try to keep the lib easy to contribute(phone rules use to change a lot, even more in small countries). My first idea was to change the Phone.Countries module, so it would have a build/1 and build/2 function to pattern match every country code and call their modules directly. I'm thinking of doing it too with the Phone.NANP area codes too(but just for identify the countries) since it has a more stable numbering plan. If we can keep it at hundreds of µs is a good thing.

edit:
I was thinking just before I wrote that, that I can come up with some idea to make the call of the area codes inside the countries modules paralel, this way we can have a better performance even with countries with many areas.

fcevado · 2016-11-19T21:47:09Z

@aaronrenner if you agree and want to spend more time with that, we could do it both ways, you do it the way you said and I create a pr with the way I'm thinking, and we compare the performances and readability of the code.
I think it would help both of us to learn how to achieve better performances understanding how it works on the BEAM.

aaronrenner · 2016-11-21T14:47:59Z

@fcevado I'd be happy to spend a little more time on this so we can try it both ways. Looking forward to the learning experience! 😄

fcevado · 2017-04-02T22:16:42Z

@aaronrenner I'll close those PRs, I worked on that this weekend and I believe that i reached it, follows my results:

## PhoneBench
benchmark name                              iterations   average time 
Phone.parse/1 with an Andorra phone number      500000   7.93 µs/op
Phone.parse/1 with a Zimbabwe phone number      500000   7.99 µs/op
Phone.parse/1 with an NANP phone number         200000   9.95 µs/op

In a counterpart the complexity was thrown to the compiler, the compilation time is really big.

aaronrenner added 3 commits November 18, 2016 16:41

Added benchmarks

8b3e6ba

Use Helper.Country now defines the match type

eb1bbf2

This change makes the dynamically generated functions be written first, allowing overridable functions to be defined. Previously, when the match macro was run as the last statement, it would overwrite any previously defined functions.

fcevado mentioned this pull request Mar 25, 2017

phoenix 1.3 support ? #91

Closed

fcevado closed this Apr 2, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Performance improvements #85

WIP: Performance improvements #85

aaronrenner commented Nov 19, 2016

coveralls commented Nov 19, 2016 •

edited

coveralls commented Nov 19, 2016

aaronrenner commented Nov 19, 2016

fcevado commented Nov 19, 2016

fcevado commented Nov 19, 2016 •

edited

aaronrenner commented Nov 19, 2016

fcevado commented Nov 19, 2016 •

edited

fcevado commented Nov 19, 2016

aaronrenner commented Nov 21, 2016

fcevado commented Apr 2, 2017

WIP: Performance improvements #85

WIP: Performance improvements #85

Conversation

aaronrenner commented Nov 19, 2016

coveralls commented Nov 19, 2016 • edited

coveralls commented Nov 19, 2016

aaronrenner commented Nov 19, 2016

fcevado commented Nov 19, 2016

fcevado commented Nov 19, 2016 • edited

aaronrenner commented Nov 19, 2016

fcevado commented Nov 19, 2016 • edited

fcevado commented Nov 19, 2016

aaronrenner commented Nov 21, 2016

fcevado commented Apr 2, 2017

coveralls commented Nov 19, 2016 •

edited

fcevado commented Nov 19, 2016 •

edited

fcevado commented Nov 19, 2016 •

edited