
Reduce memory consumption in node creation with many threads #2316

Merged: 4 commits merged into nest:master on Mar 7, 2022

Conversation

hakonsbm (Contributor)

Creating nodes of different models while using many threads consumes an excessive amount of memory. For example, creating 10 populations of 5000 nodes each, with a different model for each population:

Threads | Memory [MB]
------- | -----------
1       | 135
256     | 2255

This is because node creation relies on memory management with sli::pool:

std::vector< sli::pool > memory_;

By replacing the sli::pool with an std::vector, the memory consumption is kept at a more manageable level:

Threads | Memory [MB]
------- | -----------
1       | 129
256     | 306
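
For illustration, a rough sketch of what the change amounts to; the allocate() body below is simplified and assumed, not the exact diff (see the review excerpts further down for the real change):

// Before: one pool allocator per thread, owning the node objects.
// std::vector< sli::pool > memory_;

// After: one vector of node pointers per thread.
std::vector< std::vector< Node* > > memory_;

inline Node*
Model::allocate( thread t )
{
  // Simplified sketch: create a new node for thread t and remember its
  // pointer, so the model keeps per-thread ownership of its nodes.
  Node* node = allocate_( t );
  memory_[ t ].push_back( node );
  return node;
}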

@hakonsbm added labels on Feb 25, 2022: S: Normal (handle this with default priority), T: Maintenance (work to keep up the quality of the code and documentation), I: No breaking change (previously written code will work as before; no one should notice anything changing aside from the fix).
@hakonsbm added this to PRs in progress in Kernel via automation on Feb 25, 2022.
@heplesser (Contributor)

@hakonsbm That is an impressive reduction in memory use. It would be interesting to see construction and simulation times for realistic network sizes as well. And could memory consumption be reduced even further using BlockVector without noticeable performance loss?

@hakonsbm (Contributor, Author) commented on Mar 1, 2022

@heplesser For connection and simulation the memory_ vector is actually not used. The node pointer is added to local_nodes_, which is the one used for connection and simulation:

// Create the node via the model's per-thread storage.
Node* node = model.allocate( t );
node->set_node_id_( node_id );
node->set_nc_( nc_ptr );
node->set_model_id( model.get_model_id() );
node->set_thread( t );
node->set_vp( kernel().vp_manager.thread_to_vp( t ) );
node->set_local_device_id( num_thread_local_devices_[ t ] - 1 );
node->set_initialized();
// Register the node for use during connection and simulation.
local_nodes_[ t ].add_local_node( *node );

Therefore, connection and simulation times should be unaffected, which is supported by a run of hpc_benchmark at scale 20.

Using BlockVector instead of std::vector actually uses more memory (696 MB with BlockVector vs. 306 MB with std::vector). That is because a BlockVector initializes 1024 elements when it is created, and this happens 256 times (once for each thread) for every model, leading to a large overhead.
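
As a rough illustration of that overhead: 1024 pre-allocated elements × 256 threads is already 262,144 elements per model before a single node has been created.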

@heplesser (Contributor)

@hakonsbm Good to know that building and simulation times are not affected. But I think memory_ is used in the sense that this is where the neuron objects are stored, so if the memory layout had an effect on performance, we might have seen changes. Given that vector stores objects contiguously, this is presumably the optimal memory layout.

@heplesser (Contributor) left a review:

Looks generally good, just a few comments/questions.

* Initialize the pool allocator with the Node specific values.
*/
virtual void init_memory_( sli::pool& ) = 0;

/**
* Allocate a new object at the specified memory position.
Review comment (Contributor):
Comment is outdated.

@@ -250,22 +232,17 @@ class Model
   /**
    * Memory for all nodes sorted by threads.
    */
-  std::vector< sli::pool > memory_;
+  std::vector< std::vector< Node* > > memory_;
 };


inline Node*
Model::allocate( thread t )
{
Review comment (Contributor):

Are allocate() and allocate_() actually good names? Wouldn't create or clone be more readable?


#pragma omp parallel
{
  const index t = kernel().vp_manager.get_thread_id();

  try
  {
    model.reserve_additional( t, max_new_per_thread );
Review comment (Contributor):
Could it make sense to keep reserve_additional to avoid unnecessary automatic vector resizings?
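
If it were kept, a minimal sketch of reserve_additional on top of the new std::vector-based storage could look like this (signature and body are assumptions, not part of this PR):

void
Model::reserve_additional( thread t, size_t additional_nodes )
{
  // Reserve capacity for the nodes about to be created on thread t, so
  // the push_backs during node creation do not trigger repeated
  // automatic reallocations of the underlying vector.
  memory_[ t ].reserve( memory_[ t ].size() + additional_nodes );
}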

@jougs (Contributor) left a review:

I'm fine with the changes here. Many thanks!

More fundamentally, I'm wondering if we wouldn't be better off with simple thread-local vectors of nodes of a certain type in the GenericModel, instead of packing them into custom pool-based allocators that were never really tested or designed for performance... But that's probably for another time.
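
Purely as an illustration of that idea (not part of this PR; GenericModel exists in the code base, but the member shown here and the template parameter name are assumptions):

template < typename ElementT >
class GenericModel : public Model
{
  // ... existing interface unchanged ...

  // Hypothetical: store the concrete node objects themselves, one vector
  // per thread, instead of going through a pool-based allocator. Pointer
  // stability across resizes would need care, e.g. reserving capacity up
  // front or using a block-based container.
  std::vector< std::vector< ElementT > > nodes_;
};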

Review comment on nestkernel/model.h (outdated, resolved).
Kernel automation moved this from PRs in progress to PRs approved Mar 7, 2022
Co-authored-by: Jochen Martin Eppler <jougs@gmx.net>
@heplesser merged commit bd79d7c into nest:master on Mar 7, 2022
Kernel automation moved this from PRs approved to Done (PRs and issues) Mar 7, 2022
@hakonsbm deleted the sli_pool_replacement branch on Mar 8, 2022