Adding coding standards and practices to Docs (#592)
john-science committed Mar 11, 2022
1 parent edaf4d5 commit 2c4c6df
Showing 3 changed files with 193 additions and 440 deletions.
5 changes: 4 additions & 1 deletion armi/reactor/components/__init__.py
@@ -118,6 +118,9 @@ class UnshapedComponent(Component):
A component with undefined dimensions.
Useful for situations where you just want to enter the area directly.
For instance, in filler situations where the exact shape of this component is unknown but you have some
left-over space between other components filled with a known material you might need to model.
"""

pDefs = componentParameters.getUnshapedParameterDefinitions()
@@ -154,7 +157,7 @@ def getComponentArea(self, cold=False):
Parameters
----------
cold : bool, optional
Ignored for this component
"""
return self.p.area

168 changes: 80 additions & 88 deletions doc/developer/parallel_coding.rst
@@ -2,42 +2,41 @@
Parallel Code in ARMI
#####################

ARMI code can be parallelized. In most cases, the `mpi4py <http://mpi4py.scipy.org/docs/usrman/index.html>`_
module is used to accomplish this. You should go there and read about collective and point-to-point communication
if you want to understand everything in-depth.

The OS-level ``mpiexec`` command is used to run ARMI on, say, 10 parallel processors. This fires up 10 identical
and independent runs of ARMI; they do not share memory. If you change the reactor on one process, the reactors
don't change on the others.
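
To see this independence for yourself, here is a minimal stand-alone ``mpi4py`` sketch (not part of ARMI; the
file name is just an example) that you could launch with something like ``mpiexec -n 10 python hello_mpi.py``.
Each process reports its own rank and keeps its own copy of the data::

    from mpi4py import MPI

    comm = MPI.COMM_WORLD
    rank = comm.Get_rank()

    # every process has its own, independent copy of this variable
    reactorName = "reactor-on-rank-{}".format(rank)
    print("Process {} of {} sees {}".format(rank, comm.Get_size(), reactorName))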

Never fear. You can communicate between these processes using the Message Passing Interface (MPI) driver
via the Python ``mpi4py`` module. In fact, ARMI is set up to do a lot of the MPI work for you, so if you follow
these instructions, you can have your code working in parallel in no time. In ARMI, there's the primary processor
(which is the one that does most of the organization) and then there are the worker processors, which do whatever
you need them to in parallel.

MPI communication crash course
------------------------------
First, let's do a crash course in MPI communications. We'll only discuss a few important ideas; you can read
about more of them on the ``mpi4py`` web page. The first method of communication is called the ``broadcast``, which
happens when the primary processor sends information to all others. An example of this would be when you want to
sync up the settings object (``self.cs``) among all processors. An even more common example is when you want to
send a simple string command to all other processors. This is used all the time to inform the workers what they
are expected to do next.

Here is an example::

    if rank == 0:
        # The primary node will send the string 'bob' to all others
        cmd = 'bob'
        comm.bcast(cmd, root=0)
    else:
        # these are the workers. They receive a value and set it to the variable cmd
        cmd = comm.bcast(None, root=0)

Note that the ``comm`` object comes from the ``mpi4py`` module and wraps the MPI drivers. The value of ``cmd``
on each processor before and after the ``bcast`` call is shown in the table.

============ ===== ===== ===== =====
``cmd``      Proc1 Proc2 Proc3 Proc4
============ ===== ===== ===== =====
Before bcast 'bob' 4     'sam' 3.14
After bcast  'bob' 'bob' 'bob' 'bob'
============ ===== ===== ===== =====

The second important type of communication is the ``scatter``/``gather`` combo. These are used when you have a
big list of work you'd like to get done in parallel and you want to farm it off to a bunch of processors. To do
this, set up a big list of work to get done on the primary. In real cases the list might contain things like
run control parameters, assemblies, or blocks. For a trivial example, let's add a bunch of values in parallel.
First, let's create 1000 pairs of random numbers to add::

    import random
    workList = [(random.random(), random.random()) for _i in range(1000)]

Now we want to distribute this work to each of the worker processors (and take one for the primary too, so it's
not just sitting around waiting). This is what ``scatter`` will do. But ``scatter`` requires a list that has
length exactly equal to the number of processors available. You have some options here. Assuming there are 10
CPUs, you can either pass the first 10 values out of the list and keep sending groups of 10 values until they
are all sent (multiple sets of transmissions) or you can split the data up into 10 evenly-populated groups (a single
transmission to each CPU). This is called *load balancing*.
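
To make that second option concrete, here is a rough plain-Python sketch of the even-split idea (``splitEvenly``
is just an illustration; ARMI's own helpers, discussed next, do this for you)::

    def splitEvenly(items, nChunks):
        """Split ``items`` into ``nChunks`` roughly equal groups (illustration only)."""
        chunkSize, remainder = divmod(len(items), nChunks)
        groups, start = [], 0
        for i in range(nChunks):
            end = start + chunkSize + (1 if i < remainder else 0)
            groups.append(items[start:end])
            start = end
        return groups

    # splitEvenly(list(range(7)), 3) -> [[0, 1, 2], [3, 4], [5, 6]]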

ARMI has utilities that can help, namely :py:func:`armi.utils.chunks` and :py:func:`armi.iterables.flatten`.
Given an arbitrary list, ``chunks`` breaks it up into a certain number of chunks and ``flatten`` does the
opposite to reassemble the original list after processing. Check it out::

    if rank == 0:
        # primary. Make data and send it.
        workListLoadBalanced = iterables.split(workList, nCpu, padWith=())
        # this list looks like:
        # [[v1,v2,v3,v4...], [v5,v6,v7,v8,...], ...]
        # And there's one set of values for each processor
        myValsToAdd = comm.scatter(workListLoadBalanced, root=0)
        # now myValsToAdd is the first entry from the work list, or [v1,v2,v3,v4,...].
    else:
        # workers. Receive data. Pass a dummy variable to scatter (None)
        myValsToAdd = comm.scatter(None, root=0)
        # now for the first worker, myValsToAdd==[v5,v6,v7,v8,...]
        # and for the second worker, it is [v9,v10,v11,v12,...] and so on.
        # Recall that in this example, each vn is a tuple like (randomnum, randomnum)
    # all processors do their bit of the work
    results = []
    for num1, num2 in myValsToAdd:
        results.append(num1 + num2)

    # now results is a list of results with one entry per myValsToAdd, or
    # [r1,r2,r3,r4,...]

    # all processors call gather to send their results back. it all assembles on the primary processor.
    allResultsLoadBalanced = comm.gather(results, root=0)
    # So we now have a list of lists of results, like this:
    # [[r1,r2,r3,r4,...], [r5,r6,r7,r8,...], ...]

    # primary processor does stuff with the results, like print them out.
    if rank == 0:
        # first take the individual result lists and reassemble them back into the big list.
        # These results correspond exactly to workList from above. All ordering has been preserved.
        allResults = iterables.flatten(allResultsLoadBalanced)
        # allResults now looks like: [r1,r2,r3,r4,r5,r6,r7,...]
        print('The total sum is: {0:10.5f}'.format(sum(allResults)))

Remember that this code is running on all processors. So it's just the ``if rank == 0`` statements that differentiate
between the primary and the workers. Try writing this program as a script and submitting it to a cluster via the command
line to see if you really understand what's going on. You will have to add some MPI imports before you can do that
(see :py:mod:`twr_shuffle.py <armi.twr_shuffle>` in the ARMI code for a major hint!).
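
For reference, a complete stand-alone version of the parallel adder might look something like the sketch below.
The ``from armi.utils import iterables`` import path is an assumption on our part; check where ``split`` and
``flatten`` live in your copy of the code. You would launch it with something like ``mpiexec -n 4 python adder.py``::

    import random

    from mpi4py import MPI
    from armi.utils import iterables  # assumed import path; see note above

    comm = MPI.COMM_WORLD
    rank = comm.Get_rank()
    nCpu = comm.Get_size()

    if rank == 0:
        # primary: build the work and split it into one chunk per processor
        workList = [(random.random(), random.random()) for _i in range(1000)]
        myValsToAdd = comm.scatter(iterables.split(workList, nCpu, padWith=()), root=0)
    else:
        # workers: receive this processor's share of the work
        myValsToAdd = comm.scatter(None, root=0)

    # every processor adds its share (skipping any empty padding entries)
    results = [sum(pair) for pair in myValsToAdd if pair]

    # gather everything back on the primary and print the answer there
    allResultsLoadBalanced = comm.gather(results, root=0)
    if rank == 0:
        allResults = iterables.flatten(allResultsLoadBalanced)
        print('The total sum is: {0:10.5f}'.format(sum(allResults)))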

.. _mpiInterface:

MPI Communication within ARMI
-----------------------------
Now that you understand the basics, here's how you should get your :doc:`code interfaces </developer/dev_task_support/interfaces>`
to run things in parallel in ARMI.

You don't have to worry too much about the ranks, etc. because ARMI will set that up for you. Basically,
the interfaces are executed by the primary node unless you say otherwise. All workers are stalled in an ``MPI.bcast`` waiting
for your command! The best coding practice is to create an :py:class:`~armi.mpiActions.MpiAction` subclass and override
the :py:meth:`~armi.mpiActions.MpiAction.invokeHook` method. ``MpiAction`` objects can be broadcast, gathered, etc., and
within the :py:meth:`~armi.mpiActions.MpiAction.invokeHook` method they have ``o``, ``r``, and ``cs`` attributes available.
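
As a minimal sketch (``DoSomeWork`` and ``computeSomething`` are placeholder names, not ARMI API), such a
subclass might look like::

    class DoSomeWork(mpiActions.MpiAction):
        """Hypothetical action; ``computeSomething`` is a placeholder."""

        def invokeHook(self):
            # self.o, self.r, and self.cs are available here on every processor
            results = []
            for a in self.mpiIter(self.r):
                results.append(computeSomething(a))
            # gather the per-processor results back on the primary node
            return self.gather(results)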

.. warning::

When communicating raw Blocks or Assemblies, all references to parents are lost. If a whole reactor is needed,
use ``DistributeStateAction`` and ``syncMpiState`` (shown in the last example). Additionally, note that if a ``self.r``
exists on the ``MpiAction`` prior to transmission, it will be removed when ``invoke()`` is called.

If you have a bunch of blocks that you need independent work done on, always remember that unless you explicitly
MPI transmit the results, they will not survive on the primary node. For instance, if each CPU computes and sets
a block parameter (e.g. ``b.p.paramName = 10.0``), these **will not** be set on the primary! There are a few
mechanisms that can help you get the data back to the primary reactor.

.. note:: If you want similar capabilities for objects that are not blocks, take another look at :py:func:`armi.utils.chunks`.

Example using ``bcast``
***********************

Some actions that perform the same task are best distributed through a broadcast. This makes sense if you are
parallelizing code that is a function of an individual assembly or block. In the following example, the interface
simply creates an ``Action`` and broadcasts it as appropriate::

    class SomeInterface(interfaces.Interface):

        def interactEveryNode(self, cycle, node):
            action = BcastAction()
            armi.MPI_COMM.bcast(action)
            results = action.invoke(self.o, self.r, self.cs)

            # results is a list with one entry per item in self.r; apply them
            for a, aResult in zip(self.r, results):
                a.p.someParam = aResult

    class BcastAction(mpiActions.MpiAction):
        def invokeHook(self):
            # do something with the local self.r, self.o, and self.cs.
            results = []
            for a in self.mpiIter(self.r):
                results.append(someFunction(a))

            # in this usage, it makes sense to gather the results
            allResults = self.gather(results)

            # Only primary node has allResults
            if allResults:
                # Flatten results returns the original order after having
                # made lists of mpiIter results.
                return iterables.flatten(allResults)
.. warning::

Currently, there is no guarantee that the reactor state is the same across all nodes. Consequently, the above code
should really contain a ``mpiActions.DistributeStateAction.invokeAsMaster`` call prior to broadcasting the
``action``. See example below.


Example using ``scatter``
*************************

When running independent actions at the same time, you can use ``scatter`` to distribute the work. The following
example shows how different operations can be performed in parallel::

    class SomeInterface(interfaces.Interface):

        def interactEveryNode(self, cycle, node):
            actions = []
            for opt in self.cs['someSetting']:
                actions.append(factory(opt))
            distrib = mpiActions.DistributeStateAction()
            distrib.broadcast()
            # this replaces any existing reactor on the workers to ensure consistency
            distrib.invoke(self.o, self.r, self.cs)
            # the 3 lines above are equivalent to:
            # mpiActions.DistributeStateAction.invokeAsMaster(self.o, self.r, self.cs)
            results = mpiActions.runActions(self.o, self.r, self.cs, actions)

    class WhatAction(mpiActions.MpiAction):

        def invokeHook(self):
            # do this processor's share of the work here, collecting the
            # outputs in a local ``results`` list
            results = []
            # gather the per-processor results back on the primary node
            return self.gather(results)



A simplified approach
*********************

Transferring state to and from a Reactor can be complicated and add a lot of code. An alternative approach is to ensure
that the reactor state is synchronized across all nodes, and then use the reactor instead of raw data::

    class SomeInterface(interfaces.Interface):

        def interactEveryNode(self, cycle, node):
            actions = []
            for opt in self.cs['someSetting']:
                actions.append(factory(opt))
            mpiActions.DistributeStateAction.invokeAsMaster(self.o, self.r, self.cs)
            results = mpiActions.runActions(self.o, self.r, self.cs, actions)

    class WhatAction(mpiActions.MpiAction):

        def invokeHook(self):
            for b in self.mpiIter(self.r.core.getBlocks()):
                b.p.someParam = func(b)

            # notice we don't return a value, but instead just sync the state,
            # which updates the primary node with the params that the workers changed.
            self.r.syncMpiState()
self.r.syncMpiState()
.. warning::

Only parameters that are set are synchronized to the master node. Consequently if a mutable
Only parameters that are set are synchronized to the primary node. Consequently if a mutable
parameter (e.g. ``b.p.depletionMatrix`` which is of type ``BurnMatrix``) is changed, it will
not natively be synced. To flag it to be synced, ``b.p.paramName`` must be set, even if it is
to the same object. For this reason, setting parameters to mutable objects should be avoided.
Further, if the mutable object has a reference to a large object, such as a composite or
cross section library, it can be very computationally expensive to pass all this data to the master node.
cross section library, it can be very computationally expensive to pass all this data to the primary node.
See also: :py:mod:`armi.reactor.parameters`

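
As a small, hedged illustration of that last warning (``updateSomehow`` is a made-up stand-in for any in-place
mutation), re-assigning the parameter is what flags it for synchronization::

    # inside an MpiAction.invokeHook on a worker processor
    matrix = b.p.depletionMatrix
    matrix.updateSomehow()        # hypothetical in-place change; NOT flagged for sync
    b.p.depletionMatrix = matrix  # re-setting the parameter flags it to be synced
    self.r.syncMpiState()         # now the primary node will receive the change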