Preparation for benchmark in hpc #22
As the number of processes increases, the average number of nodes each query point visits increases, but the total time decreases.
We need to choose a reasonable value of b with respect to the local n. Update: b = 0.3 * log2(n / world_size)
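The updated rule for b can be written down directly. The following is only a minimal sketch: the function name `b_value` and the `scale` parameter are illustrative names, not taken from the project, and the thread does not say whether b is rounded before use, so the value is left as a float.

```python
import math


def b_value(n: int, world_size: int, scale: float = 0.3) -> float:
    """Return b = scale * log2(n / world_size).

    n / world_size is the local n, i.e. the number of points per process.
    The name and the `scale` parameter are hypothetical; only the formula
    itself comes from the discussion.
    """
    if n <= 0 or world_size <= 0:
        raise ValueError("n and world_size must be positive")
    local_n = n / world_size
    return scale * math.log2(local_n)


if __name__ == "__main__":
    # Placeholder sizes for illustration only, not measured configurations.
    for world_size in (1, 2, 4, 8):
        print(world_size, b_value(4096, world_size))
```

Because the rule depends only on n / world_size, b shrinks slowly as more processes are added for a fixed total n.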
We can also measure how long it takes to calculate each local knn based on the tree, and how long each data transaction takes, to search for possible bottlenecks.
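One way to collect such per-phase timings is a small accumulator keyed by phase name. This is a sketch under assumptions: `PhaseTimer` and the phase labels `local_knn` and `data_transfer` are hypothetical names, and the workloads in the loop are stand-ins rather than the project's code. In an MPI program the same pattern would typically wrap the calls with `MPI_Wtime` instead of `time.perf_counter`.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class PhaseTimer:
    """Accumulate wall-clock time per named phase (hypothetical helper)."""

    def __init__(self):
        self.total = defaultdict(float)  # seconds spent in each phase
        self.count = defaultdict(int)    # number of times each phase ran

    @contextmanager
    def phase(self, name):
        start = time.perf_counter()
        try:
            yield
        finally:
            # Record the elapsed time even if the body raised.
            self.total[name] += time.perf_counter() - start
            self.count[name] += 1

    def mean(self, name):
        """Average time per invocation, or 0.0 if the phase never ran."""
        runs = self.count.get(name, 0)
        return self.total[name] / runs if runs else 0.0


timer = PhaseTimer()
for _ in range(3):
    with timer.phase("local_knn"):        # stand-in for the local knn search
        sum(i * i for i in range(10_000))
    with timer.phase("data_transfer"):    # stand-in for one data transaction
        time.sleep(0.001)

for name in ("local_knn", "data_transfer"):
    print(name, timer.count[name], timer.mean(name))
```

Comparing the mean time per phase across process counts would show whether computation or communication dominates as the number of processes grows.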
v1: Almost perfect speedup (check cache misses).
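For reference, "perfect" speedup here can be read in the usual sense: speedup S = T_1 / T_p equals the number of processes p, so the parallel efficiency S / p equals 1. A minimal sketch of both quantities follows; the function names are illustrative and the timings in the example are placeholders, not measurements from this project.

```python
def speedup(t_serial: float, t_parallel: float) -> float:
    """Speedup S = T_1 / T_p."""
    if t_serial <= 0 or t_parallel <= 0:
        raise ValueError("timings must be positive")
    return t_serial / t_parallel


def efficiency(t_serial: float, t_parallel: float, processes: int) -> float:
    """Parallel efficiency E = S / p; E == 1.0 means perfect speedup."""
    if processes <= 0:
        raise ValueError("processes must be positive")
    return speedup(t_serial, t_parallel) / processes


if __name__ == "__main__":
    # Placeholder timings in seconds, chosen only to illustrate the formulas.
    print(speedup(8.0, 2.0), efficiency(8.0, 2.0, 4))
```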
Sure, we could choose some of our datasets and analyze them in depth.
I will perform an in-depth cache-miss and branch-miss analysis for the single-threaded instance of the problem, and maybe a simple profiling report for the speedup analysis.
We could compare v1 versus v2 average communication time and analyze the tree sending time.
Yes, each procedure in the algorithm should be analyzed to some extent. Communication times, We should also take
If we take b and y block size into account too, we will need 10 pages for the report.
Let's move the discussion to #23 to avoid clutter here.
No description provided.