I replied to your comments to the spec however missed this email.

Please see my replies in line.

03.01.2019, 21:31, "Jay Pipes" <jaypipes@gmail.com>:

I agree the more precise the better as far as resource tracking is concerned.

However, as for persistent memory, it usually comes out in large capacity --

terabytes are normal. And the targeting applications are also expected to use

persistent memory in that quantity. GB is a reasonable unit not to make

the number too nasty.

First let's talk about "block mode" v.s. "persistent memory mode".

They are not tiered up, they are counterparts. Each of them describes an access

method to the unlerlying hardware. Quote some sectors from

inside the dash line block.

------------------------------8<-------------------------------------------------------------------

Why BLK?
--------

While PMEM provides direct byte-addressable CPU-load/store access to
NVDIMM storage, it does not provide the best system RAS (recovery,
availability, and serviceability) model.  An access to a corrupted
system-physical-address address causes a CPU exception while an access
to a corrupted address through an BLK-aperture causes that block window
to raise an error status in a register.  The latter is more aligned with
the standard error model that host-bus-adapter attached disks present.
Also, if an administrator ever wants to replace a memory it is easier to
service a system at DIMM module boundaries.  Compare this to PMEM where
data could be interleaved in an opaque hardware specific manner across
several DIMMs.

PMEM vs BLK
BLK-apertures solve these RAS problems, but their presence is also the
major contributing factor to the complexity of the ND subsystem.  They
complicate the implementation because PMEM and BLK alias in DPA space.
Any given DIMM's DPA-range may contribute to one or more
system-physical-address sets of interleaved DIMMs, *and* may also be
accessed in its entirety through its BLK-aperture.  Accessing a DPA
through a system-physical-address while simultaneously accessing the
same DPA through a BLK-aperture has undefined results.  For this reason,
DIMMs with this dual interface configuration include a DSM function to
store/retrieve a LABEL.  The LABEL effectively partitions the DPA-space
into exclusive system-physical-address and BLK-aperture accessible
regions.  For simplicity a DIMM is allowed a PMEM "region" per each
interleave set in which it is a member.  The remaining DPA space can be
carved into an arbitrary number of BLK devices with discontiguous
extents.

------------------------------8<-------------------------------------------------------------------

You can see that "block mode" does not provide "direct access", thus not the best

performance. That is the reason "persistent memory mode" is proposed in the spec.

However, people can still create a block device out of a "persistent memory mode"

namespace. And further more, create a file system on top of that block device.

Applications can map files from that file system into their memory namespaces,

and if the file system is DAX (direct-access) capable. The application's access to

the hardware is still direct-access which means direct byte-addressable

CPU-load/store access to NVDIMM storage.

This is perfect so far, as one can think of why not just track the DAX file system

and let the VM instances map the files of the file system?

However, this usage model is reported to have severe issues with hardware

pass-ed through. So the recommended model is still mapping namespaces

of "persistent memory mode" into applications' address space.