You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am running an application with 12 of them launched on 12 different cores vs 1 core. I expected when all 12 process run on single core latency should increase due to obvious but turns out vma performs much better than 12 cores case. What is the shared resource here? i am not even running vma. Also from mlx_fs_dump i always see 12 queues. Is there any info on details of vma architecture?
The text was updated successfully, but these errors were encountered:
I am running an application with 12 of them launched on 12 different cores vs 1 core. I expected when all 12 process run on single core latency should increase due to obvious but turns out vma performs much better than 12 cores case. What is the shared resource here? i am not even running vma. Also from mlx_fs_dump i always see 12 queues. Is there any info on details of vma architecture?
The text was updated successfully, but these errors were encountered: