[elixir] Version and code updates #9198

atavistock · 2024-08-08T01:48:29Z

Updates the version of Erlang and Elixir for most current. There have been significant performance changes over the last year which should have a noticeable impact on performance. This didn't require and changes to the web/test code itself.
Updates the version of Phoenix and some other libraries to most current. Again there were some optimizations which should have an impact on the overall performance. This did require a small confirmation changes to how libraries are loaded.
Remove older cowboy server in favor of the now default bandit server.
Some smaller optimizations to use sort_by instead of sort.
Fix problem with update endpoint which intermittently returned errors.
Fix caching that was not working.

josevalim · 2024-09-28T08:56:01Z

frameworks/Elixir/phoenix/lib/hello_web/controllers/page_controller.ex

-    fortunes = [additional_fortune | Repo.all(Fortune)]
+    fortunes =
+      [additional_fortune | Repo.all(Fortune)]
+      |> Enum.sort_by(& &1.message)


Have you benchmarked this one? I actually expect sort to be more efficient, because sort_by needs to do additional passes on the data.

josevalim · 2024-09-28T08:57:50Z

frameworks/Elixir/phoenix/lib/hello_web/controllers/page_controller.ex

@@ -29,7 +30,7 @@ defmodule HelloWeb.PageController do
    worlds =
      Stream.repeatedly(&random_id/0)
      |> Stream.uniq()
-      |> Stream.map(fn idx -> Repo.get(World, idx) end)
+      |> Stream.map(&Repo.get(World, &1))
      |> Enum.take(size(params["queries"]))


We can optimize this in two ways:

Wrap the whole operation on a Repo.checkout, so we avoid checking out the connection multiple times

Use Task.async/await or Task.async_stream so we run the queries in parallel

josevalim · 2024-09-28T08:59:11Z

frameworks/Elixir/phoenix/lib/hello_web/controllers/page_controller.ex

+      # If this is not sorted it sometimes generates
+      #  FAIL for http://tfb-server:8080/updates/20
+      #  Only 20470 executed queries in the database out of roughly 20480 expected.
+      |> Enum.sort_by(& &1.id)


Do you know why we need to sort it? I can't see anything that would explain it. :D

I'm not sure on this. I was digging into why this intermittently failed, found the Rails version did a sort at the end, and found comments in other languages about this exact issue.

FrameworkBenchmarks/frameworks/Rust/tide/src/handlers.rs

Lines 124 to 137 in db30328

// This line is required to pass test verification otherwise we get:

//

// ```

// FAIL for http://tfb-server:8080/updates/20

// Only 20470 executed queries in the database out of roughly 20480 expected.

// PASS for http://tfb-server:8080/updates/20

// Rows read: 10128/10240

// FAIL for http://tfb-server:8080/updates/20

// Only 10118 rows updated in the database out of roughly 10240 expected.

// ```

//

// I don't know why this is the case, but I copied it from the actix

// implementation and here it shall stay.

loaded_worlds.sort_by_key(|loaded_world| loaded_world.id);

Thanks for sharing. I don't understand why but at least it is "consistent". :) I will try to dig deeper later.

And, if possible, please let me know if the parallelism or Repo.checkout changes I suggest above are allowed. I think they could improve the timing of a few endpoints. :)

I'm not affiliated with this project other than enjoying Elixir wanting to make it show well. ;)

Your feedback is greatly appreciated. IMHO using Repo.checkout and Tasks are within both the rules of many of the tests and within spirit of the benchmarks. I can make a PR for those but won't be able to until ~7 hours from now.

I see, thank you! ❤️

After thinking a bit about this, I would probably write this as:

Stream.repeatedly(&random_id/0) |> Stream.uniq() |> Enum.take(size(params["queries"])) |> Enum.sort() |> Enum.map(fn id -> world = Repo.get(World, id) %{id: world.id, randomnumber: :rand.uniform(@random_max)} end)

The first three parts of the pipeline can be abstracted into a function which we can reuse across actions:

def sample(params_count) do Stream.repeatedly(&random_id/0) |> Stream.uniq() |> Enum.take(size(params_count)) end

EDIT 1: I also think we can remove the plug :accepts from the router. :)

EDIT 2: I would probably start with the Repo.checkout version. However, I am also worried that the contention we are seeing on single query for c=512 is related to the pool, I will have to investigate that on the Ecto side.

I've done some iterations of running benchmarks and these are my observations.

The greatest impact seems to be Repo.checkout where there are a lot of queries. On examples with a very large number of queries and concurrency reduced the time by 200%

There was a small but consistent reduction simply by building the ids first.

Parallelization using Task seems to have had a negative effect. I'm not confident on this because the difference was in the margin of error kind of range and inconsistent. I'm guessing because the the db is so small and on the same docker container that running the queries is nearly no cost?

Putting in a new PR with the changes that have a clear benefit.

atavistock added 5 commits August 7, 2024 17:23

Updates for Phoenix, Elixir, Erlang, and Docker image

43a68c3

Missed on mix file and template should be heex

46ef818

Add back fortunes with heex

f4f0914

Update plug libraries

f646094

Code based performance improvements

f127890

atavistock changed the title ~~Patch/elixir updates~~ Elixir: Version and code updates Aug 11, 2024

atavistock changed the title ~~Elixir: Version and code updates~~ [elixir] Version and code updates Aug 11, 2024

NateBrady23 merged commit 1b2f23a into TechEmpower:master Aug 13, 2024
3 checks passed

josevalim reviewed Sep 28, 2024

View reviewed changes

atavistock mentioned this pull request Sep 30, 2024

[Elixir/phoenix] Implementing suggestions from @josevalim #9302

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[elixir] Version and code updates #9198

[elixir] Version and code updates #9198

atavistock commented Aug 8, 2024 •

edited

Loading

josevalim Sep 28, 2024

josevalim Sep 28, 2024

josevalim Sep 28, 2024

atavistock Sep 28, 2024

josevalim Sep 28, 2024

atavistock Sep 28, 2024

josevalim Sep 29, 2024 •

edited

Loading

atavistock Sep 30, 2024 •

edited

Loading

	// This line is required to pass test verification otherwise we get:
	//
	// ```
	// FAIL for http://tfb-server:8080/updates/20
	// Only 20470 executed queries in the database out of roughly 20480 expected.
	// PASS for http://tfb-server:8080/updates/20
	// Rows read: 10128/10240
	// FAIL for http://tfb-server:8080/updates/20
	// Only 10118 rows updated in the database out of roughly 10240 expected.
	// ```
	//
	// I don't know why this is the case, but I copied it from the actix
	// implementation and here it shall stay.
	loaded_worlds.sort_by_key(\|loaded_world\| loaded_world.id);

[elixir] Version and code updates #9198

[elixir] Version and code updates #9198

Conversation

atavistock commented Aug 8, 2024 • edited Loading

josevalim Sep 28, 2024

Choose a reason for hiding this comment

josevalim Sep 28, 2024

Choose a reason for hiding this comment

josevalim Sep 28, 2024

Choose a reason for hiding this comment

atavistock Sep 28, 2024

Choose a reason for hiding this comment

josevalim Sep 28, 2024

Choose a reason for hiding this comment

atavistock Sep 28, 2024

Choose a reason for hiding this comment

josevalim Sep 29, 2024 • edited Loading

Choose a reason for hiding this comment

atavistock Sep 30, 2024 • edited Loading

Choose a reason for hiding this comment

atavistock commented Aug 8, 2024 •

edited

Loading

josevalim Sep 29, 2024 •

edited

Loading

atavistock Sep 30, 2024 •

edited

Loading