New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
'daxctl list' should display whether a system-ram device is online or offline #139
Comments
How about instead of a boolean flag add 2 fields, "total_memblocks" and "online_memblocks", so that it's clear when a memory range is only partially online. |
That's a much better idea. Given a memory block can be online or offline in the ZONE_NORMAL (kernel) or ZONE_MOVABLE domains, we could remove the current
A memblock may not mean anything to a system administrator, so we should display the size in bytes (perhaps even dropping the number of memblocks in favour of *_size). Including an "offline_memblocks" makes it more obvious in the partial-online case for those needing to troubleshoot, monitor, and report the system state. Example:
Could we also include the memory block range in the list output to make it easy to map the NUMA node to an address range shown in
Example:
|
Only took a year.. :) But I was doing something related to this, and remembered this request. With the libdaxctl APIs we have, it is easy enough to add (and I've got a patch for this that I'll post shortly):
I'm hesitant to add fields like 'online_size' - the most common case is that the entire range will be online or offline, and the main device size should be enough for that. Partial online/offline states are more of a problematic state rather than a normal use case - and especially considering daxctl doesn't support actively putting a device in such a state, I'm not sure how useful it will be adding extra information related to that. For splitting out online counts by zone - movable vs. kernel - that does seem reasonable, but would need some reworks in libdaxctl. The example I listed above is easy to implement with current APIs. Is there a strong need to split counts by zone? |
Posted patch here: http://lore.kernel.org/r/20210331195619.533491-1-vishal.l.verma@intel.com |
Fir daxctl device listings, if in 'system-ram' mode, it is useful to know whether the memory associated with the device is online or not. Since the memory is comprised of a number of 'memblocks', and it is possible (albeit rare) to have a subset of them online, and the rest offline, we can't just use a boolean online-or-offline flag for the state. Add a couple of counts, one for the total number of memblocks associated with the device, and another for the ones that are online. Link: #139 Cc: Dan Williams <dan.j.williams@intel.com> Cc: Dave Hansen <dave.hansen@linux.intel.com> Reported-by: Steve Scargall <steve.scargall@intel.com> Signed-off-by: Vishal Verma <vishal.l.verma@intel.com>
It would be very useful to know whether a system-ram device is online or not. Currently, we do not display this information, eg:
For the NUMA node, we can look at the state for each memory block, eg:
A proposal would be to include an 'online' boolean in the output, eg:
We may need to consider the case where some of the memory blocks within the node are online and some are offline. Such a situation may occur if the memory blocks are manipulated outside of daxctl or when 'daxctl offline-memory' offlines some, but not all, memory blocks because they are in-use.
The text was updated successfully, but these errors were encountered: