Serial EEPROM or Serial Flash?| page 3

Reply by pozz ●June 15, 20182018-06-15

Il 15/06/2018 19:11, Richard Damon ha scritto:
 > On 6/15/18 11:20 AM, pozz wrote:
 >> Il 15/06/2018 16:21, David Brown ha scritto:
 >>> On 15/06/18 15:10, pozz wrote:
 >>>> Il 15/06/2018 11:25, David Brown ha scritto:
 >>>>> On 15/06/18 09:38, pozz wrote:
 >>>>>> Il 14/06/2018 12:49, David Brown ha scritto:
 >>>>>>> On 14/06/18 12:20, pozz wrote:
 >>>>>>>> I need to save some data on a non-volatile memory. They are some
 >>>>>>>> parameters that the user could change infrequently, for example 10
 >>>>>>>> times
 >>>>>>>> per day at a maximum.  In the range 100-2kB.
 >>>>>>>>
 >>> <snip>
 >>>>>>>
 >>>>>>> Multiple copies (at least 2) are key, along with timestamps or
 >>>>>>> counters
 >>>>>>
 >>>>>> Two should be sufficient to prevent corruption caused by 
interruption
 >>>>>> during the writing of a block of data.
 >>>>>> Maybe three (or more) are needed to face memory *physical* 
corruption
 >>>>>> (maybe for many writings).
 >>>>>> I think I can ignore this event in my actual project.
 >>>>>>
 >>>>>
 >>>>> You use more for wear levelling.  Often your flash chip is /way/ 
bigger
 >>>>> than you need - perhaps by a factor of 1000 simply because that's 
what
 >>>>> you have on stock, or that's the cheapest device in the package you
 >>>>> want.  Spread your writes over 1000 blocks instead of 2, and you have
 >>>>> 500 times the endurance.  Use a good checksum and accept that 
sometimes
 >>>>> a block will be worn out and move onto the next one, and you have
 >>>>> perhaps a million times the endurance (because most blocks last a lot
 >>>>> longer than the guaranteed minimum).
 >>>>
 >>>> So the writing process should be:
 >>>>
 >>>> 1. write the data at block i+1 (where i is the block of the current
 >>>>     data in RAM)
 >>>> 2. read back the block i+1 and check if checksum is ok
 >>>> 3. if ok, writing process is finished
 >>>> 4. if not, go to block i+2 and start again from 1.
 >>>>
 >>>
 >>> Yes.
 >>>
 >>> Then comes step 5 (for flash with separate erasing) :
 >>>
 >>> 5. If you have written block x, check if block x+1 (modulo the size of
 >>> the device) is erased.  If not, then erase it ready for the next write.
 >>>
 >>> Note that it does not matter if the erase block size is bigger than the
 >>> program block size - if it is, then your "erase block x+1" command will
 >>> cover the next few program blocks.
 >>>
 >>>>>>> and checksums.
 >>>>>>
 >>>>>> This is interesting.  Why do I need a checksum?  My approach is 
to use
 >>>>>> only a magic number plus a counter... and two memory areas.
 >>>>>> At first startup magic number isn't found on any areas, so the 
device
 >>>>>> starts with default and write data on Area1 (magic/0, where 0 is the
 >>>>>> counter).
 >>>>>
 >>>>> You use checksums to ensure that you haven't had a power-out or
 >>>>> reset in
 >>>>> the middle of writing,
 >>>>
 >>>> Only for this thing, you can write the counter as the last byte. 
If the
 >>>> writing is interrupted in the middle, counter hasn't written yet, 
so the
 >>>> block is not valid (because considered too old or empty).
 >>>
 >>> Nope.  You can't rely on that, unless you are absolutely sure that you
 >>> have a simple, old-fashioned E&#4294967295;PROM device.  Many devices are /not/ 
like
 >>> that, even if they provide an interface that matches it logically.
 >>>
 >>> A common structure for a modern device is to have 32-byte pages as a
 >>> compromise between density, cost, and flexibility.  (Bigger pages are
 >>> more efficient in device area and cost.)  When you send a command to
 >>> write a byte, the device reads the old 32-byte page into ram, 
erases the
 >>> old page, updates the ram with the new data, the writes the whole 32
 >>> byte page back in.
 >>>
 >>> The write process is done by a loop that writes all the data, reads it
 >>> back at a low voltage to see if it has stuck, and writes again as 
needed
 >>> until the data looks good.  Then it writes again a few times for safety
 >>> - either a fixed number, or a percentage of the number of writes taken.
 >>>
 >>> So it is /entirely/ possible for an interrupted write to give you a
 >>> valid counter, but invalid data.  It is also entirely possible to get
 >>> some bits of the counter as valid while others are still erased (giving
 >>> ones on most devices).
 >>>
 >>> And that is just for simple devices that don't do any fancy wear
 >>> levelling, packing, garbage collection, etc.
 >>>
 >>>
 >>>>
 >>>>> and that the flash has not worn out.
 >>>>>
 >>>>>>
 >>>>>> When the configuration is changed, Area2 is written with magic/1,
 >>>>>> being
 >>>>>> careful to save magic/1 only at the end of area writing.
 >>>>>>
 >>>>>> At startup magics and counters from both area are loaded, and 
one area
 >>>>>> is chosen (magic should be valid and counter should be the maximum).
 >>>>>>
 >>>>>> I think this approach works, even when the area writing is 
interrupted
 >>>>>> at the middle.
 >>>>>>
 >>>>>> Why do I need checksum?  The only thing that comes in mind is to
 >>>>>> prevent
 >>>>>> writing errors: for example, I want to write 0x00 but the value 
really
 >>>>>> written is 0x01, maybe for a noise on the serial bus.
 >>>>>>
 >>>>>> To solve this situation, I need checksum... but also I need to 
re-read
 >>>>>> and re-calculate the checksum at *every* area writing... and start
 >>>>>> a new
 >>>>>> writing if something was wrong.
 >>>>>>
 >>>>>> Do you have a better strategy?
 >>>>>
 >>>>> You calculate the checksum for a block before writing it, and you 
check
 >>>>> it when reading it.  Simple.
 >>>>
 >>>> Do you calc the checksum of all the data block in RAM, including 
padding
 >>>> bytes?
 >>>
 >>> Yes, of course.  The trick is not to have unknown padding bytes.  I 
make
 >>> a point of /never/ having compiler-generated padding in my structs.
 >>>
 >>> So you have something like this:
 >>>
 >>> #define sizeOfRawBlock 32
 >>> #define noOfRawBlocks 4
 >>> #define magicNumber 0x9185be91
 >>> #define dataStructVersionExpected 0x0001
 >>>
 >>> typedef union {
 >>>      uint8_t raw8[sizeOfRawBlock * noOfRawBlocks];
 >>>      uint16_t raw16[sizeOfRawBlock * noOfRawBlocks / 2];
 >>>      struct {
 >>>          uint32_t magic;
 >>>          uint16_t dataStructVersion;
 >>>          uint16_t crc;
 >>>          uint32_t count;
 >>>
 >>>          // real data
 >>>      }
 >>> } nvmData_t;
 >>>
 >>> static_assert(sizeof(nvmData_t) == (sizeofRawBlock * noOfRawBlocks),
 >>>       "Check size of nvnData!");
 >>
 >> Why raw8[]?
 >>
 >> I think you can avoid raw16[] too. If you have the function:
 >>
 >>     uint16_t calc_crc(const void *data, size_t size);
 >>
 >> you can simply call:
 >>
 >>     nvmData.crc =
 >>       calc_crc( ((unsigned char *)&nvmData) + 8, sizeof(nvmData) );
 >>
 >>
 >
 > I typically also do something like this. The data structure is a union
 > of the basic data structure with a preamble that includes (in very fixed
 > locations) a data structure version, checksum/crc, and if a versioning
 > store a timestamp/data generation number. A 'Magic Number' isn't often
 > needed unless it is a removable media as it will either be or not the
 > expected data, nothing else could be there (if the unit might have
 > different sorts of programs, then a piece of the data version would be a
 > program ID.)
 >
 > Often I will have TWO copies of the data packet.

TWO copies in EEPROM or in RAM?


 > Often organized something like this:
 >
 > struct SystemParameters {
 >     // List of data parameters that need to be saved
 > } parms;
 >
 > union {
 >    uint8_t	raw_data[DATA_SIZE];
 >    struct {
 >        struct StandardHeader header;
 >        struct SystemParameters parms;
 >
 > } flash_parms;
 >
 > /* Check that sizeof(flash_parms) == DATA_SIZE, and that DATA_SIZE is
 > a multiple of the flash sector size */

Do you need raw_data[] array only to set a well-known size to flash_parms?
Do you alloc an entire (or a multiple of) flash sector in RAM?  Even if 
your params in flash take only 1/5 or 1/2 or 1/10 of a sector?


 > And a request to save the data will fill flash_parms with a 'NULL'
 > fill (0 or all ones, or whatever I want unused flash locations to be),

Is it really necessary?  What happens if you don't care about unused
bytes in flash_parms?  I think... nothing.


 > then
 > copy the parms into flash_parms.parms, then fill in the header, and then
 > program the flash. Only the part that copys parms to flash_parms.parms
 > needs to be protected from other operation in the system updating the
 > main parms structure.
 >
 > The flash_parms structure can be further modified to be a union of the
 > current data structure of the SystemParameters structure and any
 > previous 'incompatible' version of it. (A version from just prior to
 > adding elements at the end doesn't need to be listed). This way at
 > startup I can read the values into that same structure, and if it was
 > old, parse to old version in flash_parms to get the new version in
 > parms, defaulting new parameters as needed.

It seems difficult, if the changes from one version to the other are 
"important".

struct {
   uint16_t foo;
   uint16_t bar;
   uint16_t dummy;
} SystemParameters;

struct {
   uint8_t foo;
   uint8_t bar;
   uint8_t dummy;
} SystemParametersOld;

union {
    uint8_t	raw_data[DATA_SIZE];
    struct {
        struct StandardHeader header;
        struct SystemParameters parms;
        struct SystemParametersOld parms_old;
} flash_parms;


for (uint8_t sector = 0; sector < SECTORS_NUM; sector++) {
   uint16_t memory_address = OFFSET + sector * SECTOR_SIZE;
   sermem_read(&flash_params, memory_address, sizeof(flash_params));
   uint16_t chk_read = flash_parms.header.crc;
   flash_parms.header.crc = 0xFFFF;
   if (checksum_calc(&flash_parms, sizeof(flash_parms)) == chk_read) {
     /* Datablock is valid */
     if (flash_parms.header.counter > prev_counter) {
       /* Datablock is newer */
       prev_counter = flash_parms.header.counter;
       if (flash_parms.header.version == FLASH_PARMS_OLD) {
         /* How to convert old version in new version in place? */
         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
       } else {
         /* Ok, new version. No convertion is needed. */
       }
     }
   }
}

As I wrote in the comment, how to convert old version to new version, 
using the same union in RAM?  Do I need another union?

 > Checksums / CRC are always over the whole data array (skipping over,
 > or using a fixed value, the value of it stored in the Header).
 >
 >
 >>> When you want to store the data, run your crc calculation over the
 >>> raw16[] from index 4 upwards, and set it in the crc field.  Set the
 >>> magic field with a different fixed value per program.  Use the
 >>> dataStructVersion to track versions of the structure of the data - so
 >>> that updated firmware can recognise old data and quietly update it.
 >>
 >> This is interesting.  Suppose you have a new struct version and you are
 >> reading an old version from the memory.  How do you read it?  Do you put
 >> it completely in RAM and after copy it in the final struct making the
 >> changes from one version to the other?  In this case you need a double
 >> space in RAM, for old and new version.
 >>
 >> Or you declare a "unsigned char reserved_for_future[1024]" big array in
 >> the struct, so the CRC is calculated over the full area?
 >>
 >>
 > (see above)
 >
 >>> Set the count to a new number each time.
 >>>
 >>> Leave plenty of room to add new fields to your nvm structure.
 >>>
 >>>
 >>>>
 >>>>
 >>>>>>>> This means I need to copy&paste an entire block everytime, even
 >>>>>>>> for a
 >>>>>>>> single byte change.  And this is similar to Flash approach, 
where I
 >>>>>>>> *need* a sector erase before changing a single byte.
 >>>>>>>
 >>>>>>> Yes.
 >>>>>>>
 >>>>>>> Of course, it is possible to have multiple smaller blocks 
rather than
 >>>>>>> just one big one.
 >>>>>>
 >>>>>> This is good to reduce writing time of a block of data.
 >>>>>> In my case I have 2kB of data, around 62 32-bytes EEPROM pages,
 >>>>>> that is
 >>>>>> 5*62=300ms.  I don't like to block a task (in a cooperative OS) 
for so
 >>>>>> long time.
 >>>>>
 >>>>> Your writing should be done in the background.  The background writer
 >>>>> task is blocked - the system is not.
 >>>>
 >>>> Are you thinking of a preemption OS?  I'm using a cooperative OS.
 >>>>
 >>>
 >>> It doesn't matter.  In a pre-emptive OS, your background writer is:
 >>                             ^^^^^^^^^^^
 >> You mean cooperative.
 >>
 > You can use cooperative or pre-emptive. If cooperative then your flash
 > write routine should yield (like the example seems to show) while it is
 > waiting for the flash to do the actual operations. (If pre-emptive you
 > just need to block for a bit and let the pre-emption do its thing, or
 > make the flash routine a background level operation so a spin wait
 > doesn't impact other operations)
 >
 >>>
 >>> static enum { idle, writing, checking, erasing } writerState;
 >>>
 >>> void doWriter(void) {
 >>>      switch (writerState) {
 >>>          case idle :
 >>>              if (writeTriggered) {
 >>>                  writeTriggered = false;
 >>>                  startWriting();
 >>>                  writerState = writing;
 >>>              }
 >>>              break;
 >>>          case writing :
 >>>              bool stillWorking = pollNVMdevice();
 >>>              if (stillWorking) return;
 >>>              startChecking();
 >>>              writerState = checking;
 >>>              break;
 >>>          case checking :
 >>>              bool stillWorking = pollNVMdevice();
 >>>              if (stillWorking) return;
 >>>              startErasingNextBlock();
 >>>              writerState = erasing;
 >>>              break;
 >>>          case erasing :
 >>>              bool stillWorking = pollNVMdevice();
 >>>              if (stillWorking) return;
 >>>              writerState = idle;
 >>>              break;
 >>>      }
 >>> }
 >>>
 >>> There - half your program is done :-)
 >>>
 >>> Cooperative multitasking does not mean busy waiting for long-running
 >>> tasks.  It means using state machines, interrupt-driven SPI/I&#4294967295;C 
drivers,
 >>> polling, etc., so that you can mix fast and slow tasks.
 >>
 >> Yes, ok.
 >>
 >

Reply by Richard Damon ●June 15, 20182018-06-15

On 6/15/18 4:54 PM, pozz wrote:
> Il 15/06/2018 19:11, Richard Damon ha scritto:
>> I typically also do something like this. The data structure is a union
>> of the basic data structure with a preamble that includes (in very fixed
>> locations) a data structure version, checksum/crc, and if a versioning
>> store a timestamp/data generation number. A 'Magic Number' isn't often
>> needed unless it is a removable media as it will either be or not the
>> expected data, nothing else could be there (if the unit might have
>> different sorts of programs, then a piece of the data version would be a
>> program ID.)
>>
>> Often I will have TWO copies of the data packet.
> 
> TWO copies in EEPROM or in RAM?
> 
> 

As I described, normally both.

>> Often organized something like this:
>>
>> struct SystemParameters {
>>&#4294967295;&#4294967295;&#4294967295;&#4294967295; // List of data parameters that need to be saved
>> } parms;
>>
>> union {
>>&#4294967295;&#4294967295;&#4294967295; uint8_t&#4294967295;&#4294967295;&#4294967295; raw_data[DATA_SIZE];
>>&#4294967295;&#4294967295;&#4294967295; struct {
>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct StandardHeader header;
>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParameters parms;
>>
>> } flash_parms;
>>
>> /* Check that sizeof(flash_parms) == DATA_SIZE, and that DATA_SIZE is
>> a multiple of the flash sector size */
> 
> Do you need raw_data[] array only to set a well-known size to flash_parms?
> Do you alloc an entire (or a multiple of) flash sector in RAM?&#4294967295; Even if
> your params in flash take only 1/5 or 1/2 or 1/10 of a sector?
> 
> 

One purpose of the raw_data array is to fix the size of the flash data.
Another purpose is to provide an easy way to access to pre-'zero' the data
Another purpose is to provide an easy way to compute the CRC/Checksum.

Yes, the last two could be done via type casting via void* parameters.
>> And a request to save the data will fill flash_parms with a 'NULL'
>> fill (0 or all ones, or whatever I want unused flash locations to be),
> 
> Is it really necessary?&#4294967295; What happens if you don't care about unused
> bytes in flash_parms?&#4294967295; I think... nothing.
> 
> 

Setting them to what an erased byte is can sometimes save you power and
time for programming. Having them be a know value can also make it
easier to read the contents, and

>> then
>> copy the parms into flash_parms.parms, then fill in the header, and then
>> program the flash. Only the part that copys parms to flash_parms.parms
>> needs to be protected from other operation in the system updating the
>> main parms structure.
>>
>> The flash_parms structure can be further modified to be a union of the
>> current data structure of the SystemParameters structure and any
>> previous 'incompatible' version of it. (A version from just prior to
>> adding elements at the end doesn't need to be listed). This way at
>> startup I can read the values into that same structure, and if it was
>> old, parse to old version in flash_parms to get the new version in
>> parms, defaulting new parameters as needed.
> 
> It seems difficult, if the changes from one version to the other are
> "important".
> 
> struct {
> &#4294967295; uint16_t foo;
> &#4294967295; uint16_t bar;
> &#4294967295; uint16_t dummy;
> } SystemParameters;
> 
should be struct SystemParameters { or typedef struct { you are defining
a type here, not a varaiable.

> struct {
> &#4294967295; uint8_t foo;
> &#4294967295; uint8_t bar;
> &#4294967295; uint8_t dummy;
> } SystemParametersOld;

ditto.
> 
> union {
> &#4294967295;&#4294967295; uint8_t&#4294967295;&#4294967295;&#4294967295; raw_data[DATA_SIZE];
> &#4294967295;&#4294967295; struct {
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct StandardHeader header;
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParameters parms;
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParametersOld parms_old;
> } flash_parms;
> 
should be:

union {
   uint8_t raw_data[DATA_SIZE];
   struct {
      struct StandardHeader header;
      union {
         struct SystemParameters parms_new;
         struct SystemParametersOld parms_old;
      } parms;
   } data;
} flash_parms;

I actually don't use labels like Old but like Rev1, Rev2 etc.
For some compilers you can use an 'anonymous union' an omit the parms
name here and in the code.
> 
> for (uint8_t sector = 0; sector < SECTORS_NUM; sector++) {
> &#4294967295; uint16_t memory_address = OFFSET + sector * SECTOR_SIZE;
> &#4294967295; sermem_read(&flash_params, memory_address, sizeof(flash_params));
> &#4294967295; uint16_t chk_read = flash_parms.header.crc;
> &#4294967295; flash_parms.header.crc = 0xFFFF;
> &#4294967295; if (checksum_calc(&flash_parms, sizeof(flash_parms)) == chk_read) {
> &#4294967295;&#4294967295;&#4294967295; /* Datablock is valid */
> &#4294967295;&#4294967295;&#4294967295; if (flash_parms.header.counter > prev_counter) {
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* Datablock is newer */
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; prev_counter = flash_parms.header.counter;
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (flash_parms.header.version == FLASH_PARMS_OLD) {
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* How to convert old version in new version in place? */
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; } else {
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* Ok, new version. No convertion is needed. */
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; }
> &#4294967295;&#4294967295;&#4294967295; }
> &#4294967295; }
> }
> 
> As I wrote in the comment, how to convert old version to new version,
> using the same union in RAM?&#4294967295; Do I need another union?

You do NOT convert 'in place', but in transfering from the flash_parms
block to the main usage block. Only the flash parm save/load routine has
access to flash_parms, and they are used just for save and load. If you
are tight on memory, it could be allocated on the heap only for load
and/or save and freed after, but you then you do need to be sure the
memory IS available when you need it. maybe something else turns off for
a bit when you need to do it.

sample code:

parms.foo = flash_parms.data.parms.parms_old.foo;
parms.bar = flash_parms.data.parms.parms_old.bar;
parms.dummy = flash_parms.data.parms.parms_old.dummy;

and for the new code, you do

parms = flash_parms.data.parms.parms_new;

you can do a single line and let the compiler move it all.

Reply by pozz ●June 16, 20182018-06-16

Il 16/06/2018 00:25, Richard Damon ha scritto:
 > On 6/15/18 4:54 PM, pozz wrote:
 >> Il 15/06/2018 19:11, Richard Damon ha scritto:
 >>> I typically also do something like this. The data structure is a union
 >>> of the basic data structure with a preamble that includes (in very 
fixed
 >>> locations) a data structure version, checksum/crc, and if a versioning
 >>> store a timestamp/data generation number. A 'Magic Number' isn't often
 >>> needed unless it is a removable media as it will either be or not the
 >>> expected data, nothing else could be there (if the unit might have
 >>> different sorts of programs, then a piece of the data version would 
be a
 >>> program ID.)
 >>>
 >>> Often I will have TWO copies of the data packet.
 >>
 >> TWO copies in EEPROM or in RAM?
 >
 > As I described, normally both.

So you are so lucky that you have abundant RAM.


 >>> Often organized something like this:
 >>>
 >>> struct SystemParameters {
 >>>       // List of data parameters that need to be saved
 >>> } parms;
 >>>
 >>> union {
 >>>      uint8_t    raw_data[DATA_SIZE];
 >>>      struct {
 >>>          struct StandardHeader header;
 >>>          struct SystemParameters parms;
 >>>
 >>> } flash_parms;
 >>>
 >>> /* Check that sizeof(flash_parms) == DATA_SIZE, and that DATA_SIZE is
 >>> a multiple of the flash sector size */
 >>
 >> Do you need raw_data[] array only to set a well-known size to 
flash_parms?
 >> Do you alloc an entire (or a multiple of) flash sector in RAM?  Even if
 >> your params in flash take only 1/5 or 1/2 or 1/10 of a sector?
 >>
 >
 > One purpose of the raw_data array is to fix the size of the flash data.

Sincerely I can't understand why you need a fixed data size in RAM.
Suppose release 1.0 needs 2kB, but I'm so smart to provide a double 
space for new params in future releases. So I decide to use 4kB blocks 
*on the serial memory*.  Maybe this big space will never be used in all 
the future releases.

Why the hell I should load all 4kB datablock in RAM at startup?  IMHO it 
is sufficient to load only 2kB.

The only reason I see is to simplify the checksum calculation.
CRC must be calculated over the entire fixed-size 4kB block, otherwise 
how to know the datablock size if it could change from one release to 
the other?
However you can calc the checksum of 4kB block by reading small blocks 
multiple times (for example, 16 sections of 256 bytes).
After CRC validation passed, you can read in RAM *only* 2kB of data that 
the application 1.0 needs.

I know, maybe this process is slower, but it happens only at startup.
During writing of 4kB datablock, the checksum can be calculated on-the-fly.


 > Another purpose is to provide an easy way to access to pre-'zero' the 
data
 > Another purpose is to provide an easy way to compute the CRC/Checksum.
 >
 > Yes, the last two could be done via type casting via void* parameters.

Yes, indeed.


 >>> And a request to save the data will fill flash_parms with a 'NULL'
 >>> fill (0 or all ones, or whatever I want unused flash locations to be),
 >>
 >> Is it really necessary?  What happens if you don't care about unused
 >> bytes in flash_parms?  I think... nothing.
 >>
 >>
 >
 > Setting them to what an erased byte is can sometimes save you power and
 > time for programming. Having them be a know value can also make it
 > easier to read the contents, and
 >
 >>> then
 >>> copy the parms into flash_parms.parms, then fill in the header, and 
then
 >>> program the flash. Only the part that copys parms to flash_parms.parms
 >>> needs to be protected from other operation in the system updating the
 >>> main parms structure.
 >>>
 >>> The flash_parms structure can be further modified to be a union of the
 >>> current data structure of the SystemParameters structure and any
 >>> previous 'incompatible' version of it. (A version from just prior to
 >>> adding elements at the end doesn't need to be listed). This way at
 >>> startup I can read the values into that same structure, and if it was
 >>> old, parse to old version in flash_parms to get the new version in
 >>> parms, defaulting new parameters as needed.
 >>
 >> It seems difficult, if the changes from one version to the other are
 >> "important".
 >>
 >> struct {
 >>    uint16_t foo;
 >>    uint16_t bar;
 >>    uint16_t dummy;
 >> } SystemParameters;
 >>
 > should be struct SystemParameters { or typedef struct { you are defining
 > a type here, not a varaiable.

Yes, you're right.


 >> struct {
 >>    uint8_t foo;
 >>    uint8_t bar;
 >>    uint8_t dummy;
 >> } SystemParametersOld;
 >
 > ditto.

Of course.


 >> union {
 >>     uint8_t    raw_data[DATA_SIZE];
 >>     struct {
 >>         struct StandardHeader header;
 >>         struct SystemParameters parms;
 >>         struct SystemParametersOld parms_old;
 >> } flash_parms;
 >>
 > should be:
 >
 > union {
 >     uint8_t raw_data[DATA_SIZE];
 >     struct {
 >        struct StandardHeader header;
 >        union {
 >           struct SystemParameters parms_new;
 >           struct SystemParametersOld parms_old;
 >        } parms;
 >     } data;
 > } flash_parms;

Again you're right.


 > I actually don't use labels like Old but like Rev1, Rev2 etc.

Yes, it was only for the example here.


 > For some compilers you can use an 'anonymous union' an omit the parms
 > name here and in the code.
 >>
 >> for (uint8_t sector = 0; sector < SECTORS_NUM; sector++) {
 >>    uint16_t memory_address = OFFSET + sector * SECTOR_SIZE;
 >>    sermem_read(&flash_params, memory_address, sizeof(flash_params));
 >>    uint16_t chk_read = flash_parms.header.crc;
 >>    flash_parms.header.crc = 0xFFFF;
 >>    if (checksum_calc(&flash_parms, sizeof(flash_parms)) == chk_read) {
 >>      /* Datablock is valid */
 >>      if (flash_parms.header.counter > prev_counter) {
 >>        /* Datablock is newer */
 >>        prev_counter = flash_parms.header.counter;
 >>        if (flash_parms.header.version == FLASH_PARMS_OLD) {
 >>          /* How to convert old version in new version in place? */
 >>          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
 >>        } else {
 >>          /* Ok, new version. No convertion is needed. */
 >>        }
 >>      }
 >>    }
 >> }
 >>
 >> As I wrote in the comment, how to convert old version to new version,
 >> using the same union in RAM?  Do I need another union?
 >
 > You do NOT convert 'in place', but in transfering from the flash_parms
 > block to the main usage block. Only the flash parm save/load routine has
 > access to flash_parms, and they are used just for save and load. If you
 > are tight on memory, it could be allocated on the heap only for load
 > and/or save and freed after, but you then you do need to be sure the
 > memory IS available when you need it. maybe something else turns off for
 > a bit when you need to do it.
 >
 > sample code:
 >
 > parms.foo = flash_parms.data.parms.parms_old.foo;
 > parms.bar = flash_parms.data.parms.parms_old.bar;
 > parms.dummy = flash_parms.data.parms.parms_old.dummy;
 >
 > and for the new code, you do
 >
 > parms = flash_parms.data.parms.parms_new;
 >
 > you can do a single line and let the compiler move it all.

Now I understand your process.  You have an image in RAM of datablock 
saved in serial memory *and* you have another "operative" datablock in 
RAM (maybe identical, maybe differently organized) that is actually used 
by the code.

I don't like to waste so much space in valuable RAM memory, maybe 
because I started with very limited 8-bit MCUs where the RAM was very small.

One good compromise could be to change datablock layout with great care, 
only:
* adding variables at the end
* nullifying no-more used parameters by simply changing their name with 
"_deprecated" (or "_old") suffix (in order to be sure the code will 
never use them again).
Immediately after an upgrade, the "_old" params will be converted in new 
params, if needed (for example if they change from uint8_t in uint16_t).

Reply by Richard Damon ●June 16, 20182018-06-16

On 6/16/18 2:26 AM, pozz wrote:
> Il 16/06/2018 00:25, Richard Damon ha scritto:
>> On 6/15/18 4:54 PM, pozz wrote:
>>> Il 15/06/2018 19:11, Richard Damon ha scritto:
>>>> I typically also do something like this. The data structure is a union
>>>> of the basic data structure with a preamble that includes (in very
> fixed
>>>> locations) a data structure version, checksum/crc, and if a versioning
>>>> store a timestamp/data generation number. A 'Magic Number' isn't often
>>>> needed unless it is a removable media as it will either be or not the
>>>> expected data, nothing else could be there (if the unit might have
>>>> different sorts of programs, then a piece of the data version would
> be a
>>>> program ID.)
>>>>
>>>> Often I will have TWO copies of the data packet.
>>>
>>> TWO copies in EEPROM or in RAM?
>>
>> As I described, normally both.
> 
> So you are so lucky that you have abundant RAM.

If you can't spare the space for the two buffers, then you become forced
to 'lock' the system (or at least the parameter table) for the entire
time of the flash write, otherwise you end up with corrupted (CRC error)
data blocks. It also makes writing the format uprev code much easier.
You comment about how to update 'in place' becomes a REAL question which
needs to be answered, and that answer will generally require detailed
understanding of how the elements are physically stored and making sure
that you do things in the right order (which may require limiting the
optimizations the compiler can do for that code, as you are likely to do
actions the Standard defines as Undefined Behavior).
> 
> 
>>>> Often organized something like this:
>>>>
>>>> struct SystemParameters {
>>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; // List of data parameters that need to be saved
>>>> } parms;
>>>>
>>>> union {
>>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; uint8_t&#4294967295;&#4294967295;&#4294967295; raw_data[DATA_SIZE];
>>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct {
>>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct StandardHeader header;
>>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParameters parms;
>>>>
>>>> } flash_parms;
>>>>
>>>> /* Check that sizeof(flash_parms) == DATA_SIZE, and that DATA_SIZE is
>>>> a multiple of the flash sector size */
>>>
>>> Do you need raw_data[] array only to set a well-known size to
> flash_parms?
>>> Do you alloc an entire (or a multiple of) flash sector in RAM?&#4294967295; Even if
>>> your params in flash take only 1/5 or 1/2 or 1/10 of a sector?
>>>
>>
>> One purpose of the raw_data array is to fix the size of the flash data.
> 
> Sincerely I can't understand why you need a fixed data size in RAM.
> Suppose release 1.0 needs 2kB, but I'm so smart to provide a double
> space for new params in future releases. So I decide to use 4kB blocks
> *on the serial memory*.&#4294967295; Maybe this big space will never be used in all
> the future releases.
> 
> Why the hell I should load all 4kB datablock in RAM at startup?&#4294967295; IMHO it
> is sufficient to load only 2kB.
> 
> The only reason I see is to simplify the checksum calculation.
> CRC must be calculated over the entire fixed-size 4kB block, otherwise
> how to know the datablock size if it could change from one release to
> the other?
> However you can calc the checksum of 4kB block by reading small blocks
> multiple times (for example, 16 sections of 256 bytes).
> After CRC validation passed, you can read in RAM *only* 2kB of data that
> the application 1.0 needs.
> 
> I know, maybe this process is slower, but it happens only at startup.
> During writing of 4kB datablock, the checksum can be calculated on-the-fly.
> 
A big reason for having a copy in RAM being an exact copy of the flash
memory is design isolation. It allows me to define a checksum routine to
compute the value of an arbitrary buffer, and flash routines that
transfer arbitrary data without it needing full understanding of the
parameter storage.

As to the use of extra memory, If I anticipate that I am going to
possibly need a 4k parameter block in the future, but don't have the ram
memory now to read in the 4k block, most assuredly in the future when I
do need that bigger parameter block, I am not going to have more ram on
THIS processor. Unless the flash memory is a removable media (which
presents a totally different set of issues) there is no need to reserve
the extra space for this processor.

> 
>> Another purpose is to provide an easy way to access to pre-'zero' the
> data
>> Another purpose is to provide an easy way to compute the CRC/Checksum.
>>
>> Yes, the last two could be done via type casting via void* parameters.
> 
> Yes, indeed.
> 
> 
>>>> And a request to save the data will fill flash_parms with a 'NULL'
>>>> fill (0 or all ones, or whatever I want unused flash locations to be),
>>>
>>> Is it really necessary?&#4294967295; What happens if you don't care about unused
>>> bytes in flash_parms?&#4294967295; I think... nothing.
>>>
>>>
>>
>> Setting them to what an erased byte is can sometimes save you power and
>> time for programming. Having them be a know value can also make it
>> easier to read the contents, and
>>
>>>> then
>>>> copy the parms into flash_parms.parms, then fill in the header, and
> then
>>>> program the flash. Only the part that copys parms to flash_parms.parms
>>>> needs to be protected from other operation in the system updating the
>>>> main parms structure.
>>>>
>>>> The flash_parms structure can be further modified to be a union of the
>>>> current data structure of the SystemParameters structure and any
>>>> previous 'incompatible' version of it. (A version from just prior to
>>>> adding elements at the end doesn't need to be listed). This way at
>>>> startup I can read the values into that same structure, and if it was
>>>> old, parse to old version in flash_parms to get the new version in
>>>> parms, defaulting new parameters as needed.
>>>
>>> It seems difficult, if the changes from one version to the other are
>>> "important".
>>>
>>> struct {
>>>&#4294967295;&#4294967295;&#4294967295; uint16_t foo;
>>>&#4294967295;&#4294967295;&#4294967295; uint16_t bar;
>>>&#4294967295;&#4294967295;&#4294967295; uint16_t dummy;
>>> } SystemParameters;
>>>
>> should be struct SystemParameters { or typedef struct { you are defining
>> a type here, not a varaiable.
> 
> Yes, you're right.
> 
> 
>>> struct {
>>>&#4294967295;&#4294967295;&#4294967295; uint8_t foo;
>>>&#4294967295;&#4294967295;&#4294967295; uint8_t bar;
>>>&#4294967295;&#4294967295;&#4294967295; uint8_t dummy;
>>> } SystemParametersOld;
>>
>> ditto.
> 
> Of course.
> 
> 
>>> union {
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295; uint8_t&#4294967295;&#4294967295;&#4294967295; raw_data[DATA_SIZE];
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct {
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct StandardHeader header;
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParameters parms;
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParametersOld parms_old;
>>> } flash_parms;
>>>
>> should be:
>>
>> union {
>>&#4294967295;&#4294967295;&#4294967295;&#4294967295; uint8_t raw_data[DATA_SIZE];
>>&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct {
>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct StandardHeader header;
>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; union {
>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParameters parms_new;
>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParametersOld parms_old;
>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; } parms;
>>&#4294967295;&#4294967295;&#4294967295;&#4294967295; } data;
>> } flash_parms;
> 
> Again you're right.
> 
> 
>> I actually don't use labels like Old but like Rev1, Rev2 etc.
> 
> Yes, it was only for the example here.
> 
> 
>> For some compilers you can use an 'anonymous union' an omit the parms
>> name here and in the code.
>>>
>>> for (uint8_t sector = 0; sector < SECTORS_NUM; sector++) {
>>>&#4294967295;&#4294967295;&#4294967295; uint16_t memory_address = OFFSET + sector * SECTOR_SIZE;
>>>&#4294967295;&#4294967295;&#4294967295; sermem_read(&flash_params, memory_address, sizeof(flash_params));
>>>&#4294967295;&#4294967295;&#4294967295; uint16_t chk_read = flash_parms.header.crc;
>>>&#4294967295;&#4294967295;&#4294967295; flash_parms.header.crc = 0xFFFF;
>>>&#4294967295;&#4294967295;&#4294967295; if (checksum_calc(&flash_parms, sizeof(flash_parms)) == chk_read) {
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* Datablock is valid */
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (flash_parms.header.counter > prev_counter) {
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* Datablock is newer */
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; prev_counter = flash_parms.header.counter;
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (flash_parms.header.version == FLASH_PARMS_OLD) {
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* How to convert old version in new version in place? */
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; } else {
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* Ok, new version. No convertion is needed. */
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; }
>>>&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; }
>>>&#4294967295;&#4294967295;&#4294967295; }
>>> }
>>>
>>> As I wrote in the comment, how to convert old version to new version,
>>> using the same union in RAM?&#4294967295; Do I need another union?
>>
>> You do NOT convert 'in place', but in transfering from the flash_parms
>> block to the main usage block. Only the flash parm save/load routine has
>> access to flash_parms, and they are used just for save and load. If you
>> are tight on memory, it could be allocated on the heap only for load
>> and/or save and freed after, but you then you do need to be sure the
>> memory IS available when you need it. maybe something else turns off for
>> a bit when you need to do it.
>>
>> sample code:
>>
>> parms.foo = flash_parms.data.parms.parms_old.foo;
>> parms.bar = flash_parms.data.parms.parms_old.bar;
>> parms.dummy = flash_parms.data.parms.parms_old.dummy;
>>
>> and for the new code, you do
>>
>> parms = flash_parms.data.parms.parms_new;
>>
>> you can do a single line and let the compiler move it all.
> 
> Now I understand your process.&#4294967295; You have an image in RAM of datablock
> saved in serial memory *and* you have another "operative" datablock in
> RAM (maybe identical, maybe differently organized) that is actually used
> by the code.
> 
> I don't like to waste so much space in valuable RAM memory, maybe
> because I started with very limited 8-bit MCUs where the RAM was very
> small.
> 
> One good compromise could be to change datablock layout with great care,
> only:
> * adding variables at the end
> * nullifying no-more used parameters by simply changing their name with
> "_deprecated" (or "_old") suffix (in order to be sure the code will
> never use them again).
> Immediately after an upgrade, the "_old" params will be converted in new
> params, if needed (for example if they change from uint8_t in uint16_t).
> 

Yes, for VERY small processors this doesn't work. But programs on very
small processors tend to have small parameter blocks and flashes with
small sectors (at the very least you are going to probably need at least
one memory block reserved the size of a flash sector, and the
flash_parms structure can fit in that.

And yes, most of the time when you revise your needs in the parameter
block, you add new items at the end, and if you stop using an item, it
gets renamed with an distinctive suffix. When possible I try to maintain
it having a reasonable representation of its value, so if you 'downrev'
the code, it gets something reasonable. I try to use a semantic
versioning method for my flash version numbers, so that code can decide
if the parameter values are usable. If the major revision number is
bigger than what it knows, it will refuse to use the parameter table
(and either throw an error or revert to defaults). If the major revision
matches, and the minor is greater than or equal to what it has, it just
uses the parameters as is. If the major is equal and the minor is lower,
after copying the parameters, it adds appropriate defaults (maybe
computed from other items) to any items that have been added since the
indicated revision. If the major revision is lower, then the code needs
a full conversion routine as shown above (or throw the error/go to
defaults for too old like too new).

The changing of the type of a parameter is one of the cases where you
are forced to bump the major revision, or you create new parameters in
with the new type, and the code does its best to give the old, now
obsolete, values as reasonable a value as possible in case of a down rev.

Reply by pozz ●June 16, 20182018-06-16

Il 16/06/2018 15:20, Richard Damon ha scritto:
> On 6/16/18 2:26 AM, pozz wrote:
>> Il 16/06/2018 00:25, Richard Damon ha scritto:
>>> On 6/15/18 4:54 PM, pozz wrote:
>>>> Il 15/06/2018 19:11, Richard Damon ha scritto:
>>>>> I typically also do something like this. The data structure is a union
>>>>> of the basic data structure with a preamble that includes (in very
>> fixed
>>>>> locations) a data structure version, checksum/crc, and if a versioning
>>>>> store a timestamp/data generation number. A 'Magic Number' isn't often
>>>>> needed unless it is a removable media as it will either be or not the
>>>>> expected data, nothing else could be there (if the unit might have
>>>>> different sorts of programs, then a piece of the data version would
>> be a
>>>>> program ID.)
>>>>>
>>>>> Often I will have TWO copies of the data packet.
>>>>
>>>> TWO copies in EEPROM or in RAM?
>>>
>>> As I described, normally both.
>>
>> So you are so lucky that you have abundant RAM.
> 
> If you can't spare the space for the two buffers, then you become forced
> to 'lock' the system (or at least the parameter table) for the entire
> time of the flash write, otherwise you end up with corrupted (CRC error)
> data blocks. 

Why?  You have a single datablock in RAM that must be transfered to the 
serial memory when changes occur.  You start an activity (a task) that 
performs the copy in background.
If new changes occur during writing, the task can be restarted without 
problems, even on the same memory area.  I hope the changes aren't so 
frequent so finally you will have a valid datablock (with correct CRC) 
written in memory, without blocking the main application.

Why do you need an additional datablock to avoid blocking task?


> It also makes writing the format uprev code much easier.
> You comment about how to update 'in place' becomes a REAL question which
> needs to be answered, and that answer will generally require detailed
> understanding of how the elements are physically stored and making sure
> that you do things in the right order (which may require limiting the
> optimizations the compiler can do for that code, as you are likely to do
> actions the Standard defines as Undefined Behavior).

Yes, solving the problem of converting an old layout in a new completely 
different layout is very complex without an additional structure.  The 
in-place conversion would be very difficult and depends on the changes.

I think you can avoid all this stuff, as I said, changing the datablock 
layout with care (adding parameters at the end, without changing the 
first/old layout).


>>>>> Often organized something like this:
>>>>>
>>>>> struct SystemParameters {
>>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; // List of data parameters that need to be saved
>>>>> } parms;
>>>>>
>>>>> union {
>>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; uint8_t&#4294967295;&#4294967295;&#4294967295; raw_data[DATA_SIZE];
>>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct {
>>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct StandardHeader header;
>>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParameters parms;
>>>>>
>>>>> } flash_parms;
>>>>>
>>>>> /* Check that sizeof(flash_parms) == DATA_SIZE, and that DATA_SIZE is
>>>>> a multiple of the flash sector size */
>>>>
>>>> Do you need raw_data[] array only to set a well-known size to
>> flash_parms?
>>>> Do you alloc an entire (or a multiple of) flash sector in RAM?&#4294967295; Even if
>>>> your params in flash take only 1/5 or 1/2 or 1/10 of a sector?
>>>>
>>>
>>> One purpose of the raw_data array is to fix the size of the flash data.
>>
>> Sincerely I can't understand why you need a fixed data size in RAM.
>> Suppose release 1.0 needs 2kB, but I'm so smart to provide a double
>> space for new params in future releases. So I decide to use 4kB blocks
>> *on the serial memory*.&#4294967295; Maybe this big space will never be used in all
>> the future releases.
>>
>> Why the hell I should load all 4kB datablock in RAM at startup?&#4294967295; IMHO it
>> is sufficient to load only 2kB.
>>
>> The only reason I see is to simplify the checksum calculation.
>> CRC must be calculated over the entire fixed-size 4kB block, otherwise
>> how to know the datablock size if it could change from one release to
>> the other?
>> However you can calc the checksum of 4kB block by reading small blocks
>> multiple times (for example, 16 sections of 256 bytes).
>> After CRC validation passed, you can read in RAM *only* 2kB of data that
>> the application 1.0 needs.
>>
>> I know, maybe this process is slower, but it happens only at startup.
>> During writing of 4kB datablock, the checksum can be calculated on-the-fly.
>>
> A big reason for having a copy in RAM being an exact copy of the flash
> memory is design isolation. It allows me to define a checksum routine to
> compute the value of an arbitrary buffer, and flash routines that
> transfer arbitrary data without it needing full understanding of the
> parameter storage.

Yes, your earn some cents on implementation of some functions.


> As to the use of extra memory, If I anticipate that I am going to
> possibly need a 4k parameter block in the future, but don't have the ram
> memory now to read in the 4k block, most assuredly in the future when I
> do need that bigger parameter block, I am not going to have more ram on
> THIS processor. Unless the flash memory is a removable media (which
> presents a totally different set of issues) there is no need to reserve
> the extra space for this processor.

When you reserve 4k block for future uses, you don't know if it will be 
REALLY used in the future.  You are right when you say "no space now, no 
space in the future", but are we sure we can know if we have space or not?

I don't know if my approach is correct, I usually don't use heap memory, 
so I have only static data and stack.  It's very difficult to calculate 
how much space you need for stack to avoid stack overflow in ANY 
condition.  So I end up to reserve all the free RAM space to stack, 
where the free space is total RAM minus static allocation.

So I try being parsimonius when allocate global static variables, mostly 
if they are big.
Reserving some space for extra not used memory will sacrify stack space.

Maybe the stack space is sufficient even with extra padding in RAM, who 
knows.  At first I try to do all my best to avoid useless big static 
allocation in RAM.

At the contrary, what happens if you notice that your stack is critical 
when padding is in RAM?  Do you prefer to reduce the datablock *padding* 
size or to recode some functions to avoid having the padding at all in 
the RAM?


I think there is another approach to keep checksum functions simple and 
avoid having padding space in RAM.
The problem here is the space where CRC is calculated.  At startup we 
don't know the size of data, because it could belong to an old or new 
layout, so we can't calculate CRC on the right area.  We should know the 
data layout version before calculating CRC, but version code is *in* the 
CRC area.

We could think to pull the layout code outside the CRC area and protect 
it by duplicating.

typedef struct {
   uint8_t layout_version;
   uint8_t layout_version_check;
   uint16_t crc;
   /* The CRC is calculated starting from the following byte */
   uint16_t counter;
   union {
     nvmdatav1_t datav1;
     nvmdatav2_t datav2;
}

At startup the layout version is read and checked.  Then CRC is read and 
checked against the correct size that depends on the layout version. 
The CRC checksum is simple.

if (datablock.layout_version == datablock.layout_version_check) {
   size_t s = datablock.layout_version == 1 ?
              sizeof(nvmdatav1_t) :
              sizeof(nvmdatav2_t);
   if (datablock.crc ==
         calc_crc(&((const unsigned char *)&datablock)[5]),
              s) {
     if (datablock.counter > prev_counter) {
       /* New valid datablock found */
     }
   }
}

No padding at all (in RAM and in serial memory), simple functions.

Of course padding must be taken into account during writing to separate 
the blocks in memory for redundancy and wear-leveling.


>>> Another purpose is to provide an easy way to access to pre-'zero' the
>> data
>>> Another purpose is to provide an easy way to compute the CRC/Checksum.
>>>
>>> Yes, the last two could be done via type casting via void* parameters.
>>
>> Yes, indeed.
>>
>>
>>>>> And a request to save the data will fill flash_parms with a 'NULL'
>>>>> fill (0 or all ones, or whatever I want unused flash locations to be),
>>>>
>>>> Is it really necessary?&#4294967295; What happens if you don't care about unused
>>>> bytes in flash_parms?&#4294967295; I think... nothing.
>>>>
>>>>
>>>
>>> Setting them to what an erased byte is can sometimes save you power and
>>> time for programming. Having them be a know value can also make it
>>> easier to read the contents, and
>>>
>>>>> then
>>>>> copy the parms into flash_parms.parms, then fill in the header, and
>> then
>>>>> program the flash. Only the part that copys parms to flash_parms.parms
>>>>> needs to be protected from other operation in the system updating the
>>>>> main parms structure.
>>>>>
>>>>> The flash_parms structure can be further modified to be a union of the
>>>>> current data structure of the SystemParameters structure and any
>>>>> previous 'incompatible' version of it. (A version from just prior to
>>>>> adding elements at the end doesn't need to be listed). This way at
>>>>> startup I can read the values into that same structure, and if it was
>>>>> old, parse to old version in flash_parms to get the new version in
>>>>> parms, defaulting new parameters as needed.
>>>>
>>>> It seems difficult, if the changes from one version to the other are
>>>> "important".
>>>>
>>>> struct {
>>>>  &#4294967295;&#4294967295;&#4294967295; uint16_t foo;
>>>>  &#4294967295;&#4294967295;&#4294967295; uint16_t bar;
>>>>  &#4294967295;&#4294967295;&#4294967295; uint16_t dummy;
>>>> } SystemParameters;
>>>>
>>> should be struct SystemParameters { or typedef struct { you are defining
>>> a type here, not a varaiable.
>>
>> Yes, you're right.
>>
>>
>>>> struct {
>>>>  &#4294967295;&#4294967295;&#4294967295; uint8_t foo;
>>>>  &#4294967295;&#4294967295;&#4294967295; uint8_t bar;
>>>>  &#4294967295;&#4294967295;&#4294967295; uint8_t dummy;
>>>> } SystemParametersOld;
>>>
>>> ditto.
>>
>> Of course.
>>
>>
>>>> union {
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295; uint8_t&#4294967295;&#4294967295;&#4294967295; raw_data[DATA_SIZE];
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295; struct {
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct StandardHeader header;
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParameters parms;
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParametersOld parms_old;
>>>> } flash_parms;
>>>>
>>> should be:
>>>
>>> union {
>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295; uint8_t raw_data[DATA_SIZE];
>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295; struct {
>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct StandardHeader header;
>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; union {
>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParameters parms_new;
>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParametersOld parms_old;
>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; } parms;
>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295; } data;
>>> } flash_parms;
>>
>> Again you're right.
>>
>>
>>> I actually don't use labels like Old but like Rev1, Rev2 etc.
>>
>> Yes, it was only for the example here.
>>
>>
>>> For some compilers you can use an 'anonymous union' an omit the parms
>>> name here and in the code.
>>>>
>>>> for (uint8_t sector = 0; sector < SECTORS_NUM; sector++) {
>>>>  &#4294967295;&#4294967295;&#4294967295; uint16_t memory_address = OFFSET + sector * SECTOR_SIZE;
>>>>  &#4294967295;&#4294967295;&#4294967295; sermem_read(&flash_params, memory_address, sizeof(flash_params));
>>>>  &#4294967295;&#4294967295;&#4294967295; uint16_t chk_read = flash_parms.header.crc;
>>>>  &#4294967295;&#4294967295;&#4294967295; flash_parms.header.crc = 0xFFFF;
>>>>  &#4294967295;&#4294967295;&#4294967295; if (checksum_calc(&flash_parms, sizeof(flash_parms)) == chk_read) {
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* Datablock is valid */
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (flash_parms.header.counter > prev_counter) {
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* Datablock is newer */
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; prev_counter = flash_parms.header.counter;
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (flash_parms.header.version == FLASH_PARMS_OLD) {
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* How to convert old version in new version in place? */
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; } else {
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* Ok, new version. No convertion is needed. */
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; }
>>>>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; }
>>>>  &#4294967295;&#4294967295;&#4294967295; }
>>>> }
>>>>
>>>> As I wrote in the comment, how to convert old version to new version,
>>>> using the same union in RAM?&#4294967295; Do I need another union?
>>>
>>> You do NOT convert 'in place', but in transfering from the flash_parms
>>> block to the main usage block. Only the flash parm save/load routine has
>>> access to flash_parms, and they are used just for save and load. If you
>>> are tight on memory, it could be allocated on the heap only for load
>>> and/or save and freed after, but you then you do need to be sure the
>>> memory IS available when you need it. maybe something else turns off for
>>> a bit when you need to do it.
>>>
>>> sample code:
>>>
>>> parms.foo = flash_parms.data.parms.parms_old.foo;
>>> parms.bar = flash_parms.data.parms.parms_old.bar;
>>> parms.dummy = flash_parms.data.parms.parms_old.dummy;
>>>
>>> and for the new code, you do
>>>
>>> parms = flash_parms.data.parms.parms_new;
>>>
>>> you can do a single line and let the compiler move it all.
>>
>> Now I understand your process.&#4294967295; You have an image in RAM of datablock
>> saved in serial memory *and* you have another "operative" datablock in
>> RAM (maybe identical, maybe differently organized) that is actually used
>> by the code.
>>
>> I don't like to waste so much space in valuable RAM memory, maybe
>> because I started with very limited 8-bit MCUs where the RAM was very
>> small.
>>
>> One good compromise could be to change datablock layout with great care,
>> only:
>> * adding variables at the end
>> * nullifying no-more used parameters by simply changing their name with
>> "_deprecated" (or "_old") suffix (in order to be sure the code will
>> never use them again).
>> Immediately after an upgrade, the "_old" params will be converted in new
>> params, if needed (for example if they change from uint8_t in uint16_t).
> 
> Yes, for VERY small processors this doesn't work. But programs on very
> small processors tend to have small parameter blocks and flashes with
> small sectors (at the very least you are going to probably need at least
> one memory block reserved the size of a flash sector, and the
> flash_parms structure can fit in that.
> 
> And yes, most of the time when you revise your needs in the parameter
> block, you add new items at the end, and if you stop using an item, it
> gets renamed with an distinctive suffix. When possible I try to maintain
> it having a reasonable representation of its value, so if you 'downrev'
> the code, it gets something reasonable. I try to use a semantic
> versioning method for my flash version numbers, so that code can decide
> if the parameter values are usable. If the major revision number is
> bigger than what it knows, it will refuse to use the parameter table
> (and either throw an error or revert to defaults). If the major revision
> matches, and the minor is greater than or equal to what it has, it just
> uses the parameters as is. If the major is equal and the minor is lower,
> after copying the parameters, it adds appropriate defaults (maybe
> computed from other items) to any items that have been added since the
> indicated revision. If the major revision is lower, then the code needs
> a full conversion routine as shown above (or throw the error/go to
> defaults for too old like too new).

Good suggestion.


> The changing of the type of a parameter is one of the cases where you
> are forced to bump the major revision, or you create new parameters in
> with the new type, and the code does its best to give the old, now
> obsolete, values as reasonable a value as possible in case of a down rev.

Reply by pozz ●June 16, 20182018-06-16

Il 15/06/2018 16:21, David Brown ha scritto:

 > static enum { idle, writing, checking, erasing } writerState;
 >
 > void doWriter(void) {
 >     switch (writerState) {
 >         case idle :
 >             if (writeTriggered) {
 >                 writeTriggered = false;
 >                 startWriting();
 >                 writerState = writing;
 >             }
 >             break;
 >         case writing :
 >             bool stillWorking = pollNVMdevice();
 >             if (stillWorking) return;
 >             startChecking();
 >             writerState = checking;
 >             break;
 >         case checking :
 >             bool stillWorking = pollNVMdevice();
 >             if (stillWorking) return;
 >             startErasingNextBlock();
 >             writerState = erasing;
 >             break;
 >         case erasing :
 >             bool stillWorking = pollNVMdevice();
 >             if (stillWorking) return;
 >             writerState = idle;
 >             break;
 >     }
 > }
 >
 > There - half your program is done

I was thinking on this "long-running" task that writes non-volatile data 
to serial memory in the background, during normal execution of the  main 
application.

First question. If I understood well your words, the checking state has 
the goal of comparing data written (by reading them) against original 
data in RAM.  If they differ, we should go back in writing state, maybe 
changing the destination sector (because the write could be failed again 
for physical damage of that sector).

Another question is: what happens if saving is triggered again during 
writing?  I think there isn't any big problem.  The writing task can be 
started again from the beginning, even on the same destination sector.
So during writing and checking states I should check for writeTriggered 
flags again and prematurely stop the writing and start again from the 
beginning.

Third question, more complex (for me).
Suppose I decided to split non-volatile data in two blocks, for example 
calibration data and user settings.
What happens if user settings change when calibration settings are being 
written?  I think I should convert your writeTriggered in two updated 
flags: calibration_updated and settings_updated.
In idle state I should check both flags and start writing the relative 
block.


Again another scenario.  Until now we talked about settings, a structure 
filled with parameters that can change when the user wants at any time.

How to manage a log, a list of events with timestamp and some data?
Suppose one entry takes 8 bytes.  I reserve 4kB of memory for around 500 
entries organized in a FIFO.

Log isn't so critical as the settings, so I think I could avoid 
redundancy in non-volatile memory.  Maybe only the CRC that, when not 
valid, clears all the log.  It should be acceptable.

As usual we can talk about the opportunity to read the full log and put 
it in RAM, or read the entries when needed (because the user wants to 
read some entries, mostly the more recent).
Reading 10 entries (80 bytes) from 10MHz SPI memory doesn't take too 
much (no more than 100usec). But here the problem is that reading can be 
needed during writing of settings.  And this is a big problem.

As usual, simplest solution is to have the full log in RAM... sigh!

What about writing of one or a few new entries in the log?  A writing 
operation (for example, settings) should be in process.  I should 
schedule and postpone the log update after writing of settings is finished.

Because you have much more experience than me (and you are so kind to 
share it with me and other lurkers), could you suggest a smart approach?

Reply by Richard Damon ●June 16, 20182018-06-16

On 6/16/18 1:38 PM, pozz wrote:
> Il 16/06/2018 15:20, Richard Damon ha scritto:
>> On 6/16/18 2:26 AM, pozz wrote:
>>> Il 16/06/2018 00:25, Richard Damon ha scritto:
>>>> On 6/15/18 4:54 PM, pozz wrote:
>>>>> Il 15/06/2018 19:11, Richard Damon ha scritto:
>>>>>> I typically also do something like this. The data structure is a
>>>>>> union
>>>>>> of the basic data structure with a preamble that includes (in very
>>> fixed
>>>>>> locations) a data structure version, checksum/crc, and if a
>>>>>> versioning
>>>>>> store a timestamp/data generation number. A 'Magic Number' isn't
>>>>>> often
>>>>>> needed unless it is a removable media as it will either be or not the
>>>>>> expected data, nothing else could be there (if the unit might have
>>>>>> different sorts of programs, then a piece of the data version would
>>> be a
>>>>>> program ID.)
>>>>>>
>>>>>> Often I will have TWO copies of the data packet.
>>>>>
>>>>> TWO copies in EEPROM or in RAM?
>>>>
>>>> As I described, normally both.
>>>
>>> So you are so lucky that you have abundant RAM.
>>
>> If you can't spare the space for the two buffers, then you become forced
>> to 'lock' the system (or at least the parameter table) for the entire
>> time of the flash write, otherwise you end up with corrupted (CRC error)
>> data blocks. 
> 
> Why?&#4294967295; You have a single datablock in RAM that must be transfered to the
> serial memory when changes occur.&#4294967295; You start an activity (a task) that
> performs the copy in background.
> If new changes occur during writing, the task can be restarted without
> problems, even on the same memory area.&#4294967295; I hope the changes aren't so
> frequent so finally you will have a valid datablock (with correct CRC)
> written in memory, without blocking the main application.
> 
> Why do you need an additional datablock to avoid blocking task?
> 

First, often in the parameter block for me are cumulative usage stats,
so the parameter block will be automatically saved on the controlled
shutdown (like the use flipping the off switch) and maybe automatically
after sufficient or significant usage. Yes, I can lose some usage on an
unexpected turn off (pull the plug), but some units may even have enough
power reserves to save in this situation.
> 
>> It also makes writing the format uprev code much easier.
>> You comment about how to update 'in place' becomes a REAL question which
>> needs to be answered, and that answer will generally require detailed
>> understanding of how the elements are physically stored and making sure
>> that you do things in the right order (which may require limiting the
>> optimizations the compiler can do for that code, as you are likely to do
>> actions the Standard defines as Undefined Behavior).
> 
> Yes, solving the problem of converting an old layout in a new completely
> different layout is very complex without an additional structure.&#4294967295; The
> in-place conversion would be very difficult and depends on the changes.
> 
> I think you can avoid all this stuff, as I said, changing the datablock
> layout with care (adding parameters at the end, without changing the
> first/old layout).

Yes, you can put off the need with controlled changes, but sometimes you
just build up enough cruft, that a re-layout makes sense. Maybe on the
really small processors, your tasks are simpler and more stable. I find
I want to allow for the possibility that at some point the structure may
need to be rearranged, and often I want a unit to be able to be upgraded
and preserve its settings.
> 
> 
>>>>>> Often organized something like this:
>>>>>>
>>>>>> struct SystemParameters {
>>>>>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; // List of data parameters that need to be saved
>>>>>> } parms;
>>>>>>
>>>>>> union {
>>>>>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; uint8_t&#4294967295;&#4294967295;&#4294967295; raw_data[DATA_SIZE];
>>>>>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct {
>>>>>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct StandardHeader header;
>>>>>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; struct SystemParameters parms;
>>>>>>
>>>>>> } flash_parms;
>>>>>>
>>>>>> /* Check that sizeof(flash_parms) == DATA_SIZE, and that DATA_SIZE is
>>>>>> a multiple of the flash sector size */
>>>>>
>>>>> Do you need raw_data[] array only to set a well-known size to
>>> flash_parms?
>>>>> Do you alloc an entire (or a multiple of) flash sector in RAM?&#4294967295;
>>>>> Even if
>>>>> your params in flash take only 1/5 or 1/2 or 1/10 of a sector?
>>>>>
>>>>
>>>> One purpose of the raw_data array is to fix the size of the flash data.
>>>
>>> Sincerely I can't understand why you need a fixed data size in RAM.
>>> Suppose release 1.0 needs 2kB, but I'm so smart to provide a double
>>> space for new params in future releases. So I decide to use 4kB blocks
>>> *on the serial memory*.&#4294967295; Maybe this big space will never be used in all
>>> the future releases.
>>>
>>> Why the hell I should load all 4kB datablock in RAM at startup?&#4294967295; IMHO it
>>> is sufficient to load only 2kB.
>>>
>>> The only reason I see is to simplify the checksum calculation.
>>> CRC must be calculated over the entire fixed-size 4kB block, otherwise
>>> how to know the datablock size if it could change from one release to
>>> the other?
>>> However you can calc the checksum of 4kB block by reading small blocks
>>> multiple times (for example, 16 sections of 256 bytes).
>>> After CRC validation passed, you can read in RAM *only* 2kB of data that
>>> the application 1.0 needs.
>>>
>>> I know, maybe this process is slower, but it happens only at startup.
>>> During writing of 4kB datablock, the checksum can be calculated
>>> on-the-fly.
>>>
>> A big reason for having a copy in RAM being an exact copy of the flash
>> memory is design isolation. It allows me to define a checksum routine to
>> compute the value of an arbitrary buffer, and flash routines that
>> transfer arbitrary data without it needing full understanding of the
>> parameter storage.
> 
> Yes, your earn some cents on implementation of some functions.
> 
> 
>> As to the use of extra memory, If I anticipate that I am going to
>> possibly need a 4k parameter block in the future, but don't have the ram
>> memory now to read in the 4k block, most assuredly in the future when I
>> do need that bigger parameter block, I am not going to have more ram on
>> THIS processor. Unless the flash memory is a removable media (which
>> presents a totally different set of issues) there is no need to reserve
>> the extra space for this processor.
> 
> When you reserve 4k block for future uses, you don't know if it will be
> REALLY used in the future.&#4294967295; You are right when you say "no space now, no
> space in the future", but are we sure we can know if we have space or not?
> 
> I don't know if my approach is correct, I usually don't use heap memory,
> so I have only static data and stack.&#4294967295; It's very difficult to calculate
> how much space you need for stack to avoid stack overflow in ANY
> condition.&#4294967295; So I end up to reserve all the free RAM space to stack,
> where the free space is total RAM minus static allocation.

I follow roughly similar guidelines. The one exception is that I allow
the use of dynamic memory that is allocated during 'startup'. Also, I
find it useful to use a lightweight RTOS, so I don't have a single
'system stack', but several task stacks, so I do need to figure out the
space requirements and can't just say it gets everything. You do run
tests and check how much of the allocated space is used (stack space is
prefilled with a funny pattern, and you see how much remains with that
pattern.
> 
> So I try being parsimonius when allocate global static variables, mostly
> if they are big.
> Reserving some space for extra not used memory will sacrify stack space.
> 
> Maybe the stack space is sufficient even with extra padding in RAM, who
> knows.&#4294967295; At first I try to do all my best to avoid useless big static
> allocation in RAM.
> 
> At the contrary, what happens if you notice that your stack is critical
> when padding is in RAM?&#4294967295; Do you prefer to reduce the datablock *padding*
> size or to recode some functions to avoid having the padding at all in
> the RAM?
> 
When RAM space gets tight, you need to sit down and look at ALL RAM
utilization and compare it to your initial estimates. You need to do
this sort of estimate at the very beginning of the project so you know
it should be feasible to begin with. You need to be able to flag that
there is a serious risk in implementation before the project gets too
far, and perhaps re-target to a better processor.
> 
> I think there is another approach to keep checksum functions simple and
> avoid having padding space in RAM.
> The problem here is the space where CRC is calculated.&#4294967295; At startup we
> don't know the size of data, because it could belong to an old or new
> layout, so we can't calculate CRC on the right area.&#4294967295; We should know the
> data layout version before calculating CRC, but version code is *in* the
> CRC area.
> 
> We could think to pull the layout code outside the CRC area and protect
> it by duplicating.
> 
> typedef struct {
> &#4294967295; uint8_t layout_version;
> &#4294967295; uint8_t layout_version_check;
> &#4294967295; uint16_t crc;
> &#4294967295; /* The CRC is calculated starting from the following byte */
> &#4294967295; uint16_t counter;
> &#4294967295; union {
> &#4294967295;&#4294967295;&#4294967295; nvmdatav1_t datav1;
> &#4294967295;&#4294967295;&#4294967295; nvmdatav2_t datav2;
> }
> 
> At startup the layout version is read and checked.&#4294967295; Then CRC is read and
> checked against the correct size that depends on the layout version. The
> CRC checksum is simple.
> 
> if (datablock.layout_version == datablock.layout_version_check) {
> &#4294967295; size_t s = datablock.layout_version == 1 ?
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; sizeof(nvmdatav1_t) :
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; sizeof(nvmdatav2_t);
> &#4294967295; if (datablock.crc ==
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; calc_crc(&((const unsigned char *)&datablock)[5]),
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; s) {
> &#4294967295;&#4294967295;&#4294967295; if (datablock.counter > prev_counter) {
> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; /* New valid datablock found */
> &#4294967295;&#4294967295;&#4294967295; }
> &#4294967295; }
> }
> 
> No padding at all (in RAM and in serial memory), simple functions.
> 
> Of course padding must be taken into account during writing to separate
> the blocks in memory for redundancy and wear-leveling.
> 
> 

There is no problem with the version code being inside the CRCed area.
In fact you really want it checked as if something corrupted the version
code, you really want the block invalidated. The version should be in
the early header (so parm changes can't affect it, one reason I define a
separate struct for it that is put in the flash_parms structure. It also
says that the flash parameter saving can be a canned routine that
defines that structure and uses the parameter definition from the
application code.

I suppose this is one reason I don't want the size to change, at that
point the 'canned' code has to be taken out of the can, as it need more
application knowledge to know the parameter size. The way I do it, I can
have a parms.h that defines the ram parameter structure, and a #define
for the size of the block to use in flash for it, and the canned routine
does most of the work to find the best saved block and to save the
parameters to flash on command. It does need a callback into the
application layer with the block to use to provide the application layer
up-rev of the parameter data.

Reply by pozz ●June 16, 20182018-06-16

Il 16/06/2018 22:41, Richard Damon ha scritto:
 > On 6/16/18 1:38 PM, pozz wrote:
 >> Il 16/06/2018 15:20, Richard Damon ha scritto:
 >>> On 6/16/18 2:26 AM, pozz wrote:
 >>>> Il 16/06/2018 00:25, Richard Damon ha scritto:
 >>>>> On 6/15/18 4:54 PM, pozz wrote:
 >>>>>> Il 15/06/2018 19:11, Richard Damon ha scritto:
 >>>>>>> I typically also do something like this. The data structure is a
 >>>>>>> union
 >>>>>>> of the basic data structure with a preamble that includes (in very
 >>>> fixed
 >>>>>>> locations) a data structure version, checksum/crc, and if a
 >>>>>>> versioning
 >>>>>>> store a timestamp/data generation number. A 'Magic Number' isn't
 >>>>>>> often
 >>>>>>> needed unless it is a removable media as it will either be or 
not the
 >>>>>>> expected data, nothing else could be there (if the unit might have
 >>>>>>> different sorts of programs, then a piece of the data version would
 >>>> be a
 >>>>>>> program ID.)
 >>>>>>>
 >>>>>>> Often I will have TWO copies of the data packet.
 >>>>>>
 >>>>>> TWO copies in EEPROM or in RAM?
 >>>>>
 >>>>> As I described, normally both.
 >>>>
 >>>> So you are so lucky that you have abundant RAM.
 >>>
 >>> If you can't spare the space for the two buffers, then you become 
forced
 >>> to 'lock' the system (or at least the parameter table) for the entire
 >>> time of the flash write, otherwise you end up with corrupted (CRC 
error)
 >>> data blocks.
 >>
 >> Why?  You have a single datablock in RAM that must be transfered to the
 >> serial memory when changes occur.  You start an activity (a task) that
 >> performs the copy in background.
 >> If new changes occur during writing, the task can be restarted without
 >> problems, even on the same memory area.  I hope the changes aren't so
 >> frequent so finally you will have a valid datablock (with correct CRC)
 >> written in memory, without blocking the main application.
 >>
 >> Why do you need an additional datablock to avoid blocking task?
 >>
 >
 > First, often in the parameter block for me are cumulative usage stats,
 > so the parameter block will be automatically saved on the controlled
 > shutdown (like the use flipping the off switch) and maybe automatically
 > after sufficient or significant usage. Yes, I can lose some usage on an
 > unexpected turn off (pull the plug), but some units may even have enough
 > power reserves to save in this situation.
I see, so in your situation the usage stats parameters changes more 
frequently than the writing time of datablock.  So you need to freeze a 
datablock for saving, but allowing the application to continue updating it.

As usuale, different requirements lead to different solutions.


 >>> It also makes writing the format uprev code much easier.
 >>> You comment about how to update 'in place' becomes a REAL question 
which
 >>> needs to be answered, and that answer will generally require detailed
 >>> understanding of how the elements are physically stored and making sure
 >>> that you do things in the right order (which may require limiting the
 >>> optimizations the compiler can do for that code, as you are likely 
to do
 >>> actions the Standard defines as Undefined Behavior).
 >>
 >> Yes, solving the problem of converting an old layout in a new completely
 >> different layout is very complex without an additional structure.  The
 >> in-place conversion would be very difficult and depends on the changes.
 >>
 >> I think you can avoid all this stuff, as I said, changing the datablock
 >> layout with care (adding parameters at the end, without changing the
 >> first/old layout).
 >
 > Yes, you can put off the need with controlled changes, but sometimes you
 > just build up enough cruft, that a re-layout makes sense. Maybe on the
 > really small processors, your tasks are simpler and more stable. I find
 > I want to allow for the possibility that at some point the structure may
 > need to be rearranged, and often I want a unit to be able to be upgraded
 > and preserve its settings.
In my case such big redesign of the memory layout can be managed with a 
reset to default settings.  Instead of implementing a complex conversion 
function.


 >>>>>>> Often organized something like this:
 >>>>>>>
 >>>>>>> struct SystemParameters {
 >>>>>>>          // List of data parameters that need to be saved
 >>>>>>> } parms;
 >>>>>>>
 >>>>>>> union {
 >>>>>>>         uint8_t    raw_data[DATA_SIZE];
 >>>>>>>         struct {
 >>>>>>>             struct StandardHeader header;
 >>>>>>>             struct SystemParameters parms;
 >>>>>>>
 >>>>>>> } flash_parms;
 >>>>>>>
 >>>>>>> /* Check that sizeof(flash_parms) == DATA_SIZE, and that 
DATA_SIZE is
 >>>>>>> a multiple of the flash sector size */
 >>>>>>
 >>>>>> Do you need raw_data[] array only to set a well-known size to
 >>>> flash_parms?
 >>>>>> Do you alloc an entire (or a multiple of) flash sector in RAM?
 >>>>>> Even if
 >>>>>> your params in flash take only 1/5 or 1/2 or 1/10 of a sector?
 >>>>>>
 >>>>>
 >>>>> One purpose of the raw_data array is to fix the size of the flash 
data.
 >>>>
 >>>> Sincerely I can't understand why you need a fixed data size in RAM.
 >>>> Suppose release 1.0 needs 2kB, but I'm so smart to provide a double
 >>>> space for new params in future releases. So I decide to use 4kB blocks
 >>>> *on the serial memory*.  Maybe this big space will never be used 
in all
 >>>> the future releases.
 >>>>
 >>>> Why the hell I should load all 4kB datablock in RAM at startup? 
IMHO it
 >>>> is sufficient to load only 2kB.
 >>>>
 >>>> The only reason I see is to simplify the checksum calculation.
 >>>> CRC must be calculated over the entire fixed-size 4kB block, otherwise
 >>>> how to know the datablock size if it could change from one release to
 >>>> the other?
 >>>> However you can calc the checksum of 4kB block by reading small blocks
 >>>> multiple times (for example, 16 sections of 256 bytes).
 >>>> After CRC validation passed, you can read in RAM *only* 2kB of 
data that
 >>>> the application 1.0 needs.
 >>>>
 >>>> I know, maybe this process is slower, but it happens only at startup.
 >>>> During writing of 4kB datablock, the checksum can be calculated
 >>>> on-the-fly.
 >>>>
 >>> A big reason for having a copy in RAM being an exact copy of the flash
 >>> memory is design isolation. It allows me to define a checksum 
routine to
 >>> compute the value of an arbitrary buffer, and flash routines that
 >>> transfer arbitrary data without it needing full understanding of the
 >>> parameter storage.
 >>
 >> Yes, your earn some cents on implementation of some functions.
 >>
 >>
 >>> As to the use of extra memory, If I anticipate that I am going to
 >>> possibly need a 4k parameter block in the future, but don't have 
the ram
 >>> memory now to read in the 4k block, most assuredly in the future when I
 >>> do need that bigger parameter block, I am not going to have more ram on
 >>> THIS processor. Unless the flash memory is a removable media (which
 >>> presents a totally different set of issues) there is no need to reserve
 >>> the extra space for this processor.
 >>
 >> When you reserve 4k block for future uses, you don't know if it will be
 >> REALLY used in the future.  You are right when you say "no space now, no
 >> space in the future", but are we sure we can know if we have space 
or not?
 >>
 >> I don't know if my approach is correct, I usually don't use heap memory,
 >> so I have only static data and stack.  It's very difficult to calculate
 >> how much space you need for stack to avoid stack overflow in ANY
 >> condition.  So I end up to reserve all the free RAM space to stack,
 >> where the free space is total RAM minus static allocation.
 >
 > I follow roughly similar guidelines. The one exception is that I allow
 > the use of dynamic memory that is allocated during 'startup'. Also, I
 > find it useful to use a lightweight RTOS, so I don't have a single
 > 'system stack', but several task stacks, so I do need to figure out the
 > space requirements and can't just say it gets everything. You do run
 > tests and check how much of the allocated space is used (stack space is
 > prefilled with a funny pattern, and you see how much remains with that
 > pattern.
 >>
 >> So I try being parsimonius when allocate global static variables, mostly
 >> if they are big.
 >> Reserving some space for extra not used memory will sacrify stack space.
 >>
 >> Maybe the stack space is sufficient even with extra padding in RAM, who
 >> knows.  At first I try to do all my best to avoid useless big static
 >> allocation in RAM.
 >>
 >> At the contrary, what happens if you notice that your stack is critical
 >> when padding is in RAM?  Do you prefer to reduce the datablock *padding*
 >> size or to recode some functions to avoid having the padding at all in
 >> the RAM?
 >>
 > When RAM space gets tight, you need to sit down and look at ALL RAM
 > utilization and compare it to your initial estimates. You need to do
 > this sort of estimate at the very beginning of the project so you know
 > it should be feasible to begin with. You need to be able to flag that
 > there is a serious risk in implementation before the project gets too
 > far, and perhaps re-target to a better processor.
 >>
 >> I think there is another approach to keep checksum functions simple and
 >> avoid having padding space in RAM.
 >> The problem here is the space where CRC is calculated.  At startup we
 >> don't know the size of data, because it could belong to an old or new
 >> layout, so we can't calculate CRC on the right area.  We should know the
 >> data layout version before calculating CRC, but version code is *in* the
 >> CRC area.
 >>
 >> We could think to pull the layout code outside the CRC area and protect
 >> it by duplicating.
 >>
 >> typedef struct {
 >>    uint8_t layout_version;
 >>    uint8_t layout_version_check;
 >>    uint16_t crc;
 >>    /* The CRC is calculated starting from the following byte */
 >>    uint16_t counter;
 >>    union {
 >>      nvmdatav1_t datav1;
 >>      nvmdatav2_t datav2;
 >> }
 >>
 >> At startup the layout version is read and checked.  Then CRC is read and
 >> checked against the correct size that depends on the layout version. The
 >> CRC checksum is simple.
 >>
 >> if (datablock.layout_version == datablock.layout_version_check) {
 >>    size_t s = datablock.layout_version == 1 ?
 >>               sizeof(nvmdatav1_t) :
 >>               sizeof(nvmdatav2_t);
 >>    if (datablock.crc ==
 >>          calc_crc(&((const unsigned char *)&datablock)[5]),
 >>               s) {
 >>      if (datablock.counter > prev_counter) {
 >>        /* New valid datablock found */
 >>      }
 >>    }
 >> }
 >>
 >> No padding at all (in RAM and in serial memory), simple functions.
 >>
 >> Of course padding must be taken into account during writing to separate
 >> the blocks in memory for redundancy and wear-leveling.
 >>
 >>
 >
 > There is no problem with the version code being inside the CRCed area.
 > In fact you really want it checked as if something corrupted the version
 > code, you really want the block invalidated. The version should be in
 > the early header (so parm changes can't affect it, one reason I define a
 > separate struct for it that is put in the flash_parms structure. It also
 > says that the flash parameter saving can be a canned routine that
 > defines that structure and uses the parameter definition from the
 > application code.
I suggested to move version code outside CRCed area because you need to 
read version code *before* check CRC, because you need to know the size 
of CRCed area and this depends on the version code.

Before looking inside the CRCed area, I think it would be better to 
check the CRC itsel.


 > I suppose this is one reason I don't want the size to change, at that
 > point the 'canned' code has to be taken out of the can, as it need more
 > application knowledge to know the parameter size. The way I do it, I can
 > have a parms.h that defines the ram parameter structure, and a #define
 > for the size of the block to use in flash for it, and the canned routine
 > does most of the work to find the best saved block and to save the
 > parameters to flash on command. It does need a callback into the
 > application layer with the block to use to provide the application layer
 > up-rev of the parameter data.
If this is your problem, you can save the size of CRCed area, besides 
version code and so on.  The 'canned' code will know the size when the 
application calls the save() function, specifying the size as a parameter.

At startup the 'canned' code reads the size and check the CRC, without 
having to know much more details from the application.

Reply by David Brown ●June 17, 20182018-06-17

On 15/06/18 17:20, pozz wrote:
> Il 15/06/2018 16:21, David Brown ha scritto:
>> On 15/06/18 15:10, pozz wrote:
>>> Il 15/06/2018 11:25, David Brown ha scritto:
>>>> On 15/06/18 09:38, pozz wrote:
>>>>> Il 14/06/2018 12:49, David Brown ha scritto:
>>>>>> On 14/06/18 12:20, pozz wrote:
>>>>>>> I need to save some data on a non-volatile memory. They are some
>>>>>>> parameters that the user could change infrequently, for example 10
>>>>>>> times
>>>>>>> per day at a maximum.&#4294967295; In the range 100-2kB.
>>>>>>>
>> <snip>
>>>>>>
>>>>>> Multiple copies (at least 2) are key, along with timestamps or 
>>>>>> counters
>>>>>
>>>>> Two should be sufficient to prevent corruption caused by interruption
>>>>> during the writing of a block of data.
>>>>> Maybe three (or more) are needed to face memory *physical* corruption
>>>>> (maybe for many writings).
>>>>> I think I can ignore this event in my actual project.
>>>>>
>>>>
>>>> You use more for wear levelling.&#4294967295; Often your flash chip is /way/ bigger
>>>> than you need - perhaps by a factor of 1000 simply because that's what
>>>> you have on stock, or that's the cheapest device in the package you
>>>> want.&#4294967295; Spread your writes over 1000 blocks instead of 2, and you have
>>>> 500 times the endurance.&#4294967295; Use a good checksum and accept that sometimes
>>>> a block will be worn out and move onto the next one, and you have
>>>> perhaps a million times the endurance (because most blocks last a lot
>>>> longer than the guaranteed minimum).
>>>
>>> So the writing process should be:
>>>
>>> 1. write the data at block i+1 (where i is the block of the current
>>> &#4294967295;&#4294967295; data in RAM)
>>> 2. read back the block i+1 and check if checksum is ok
>>> 3. if ok, writing process is finished
>>> 4. if not, go to block i+2 and start again from 1.
>>>
>>
>> Yes.
>>
>> Then comes step 5 (for flash with separate erasing) :
>>
>> 5. If you have written block x, check if block x+1 (modulo the size of
>> the device) is erased.&#4294967295; If not, then erase it ready for the next write.
>>
>> Note that it does not matter if the erase block size is bigger than the
>> program block size - if it is, then your "erase block x+1" command will
>> cover the next few program blocks.
>>
>>>>>> and checksums.
>>>>>
>>>>> This is interesting.&#4294967295; Why do I need a checksum?&#4294967295; My approach is to use
>>>>> only a magic number plus a counter... and two memory areas.
>>>>> At first startup magic number isn't found on any areas, so the device
>>>>> starts with default and write data on Area1 (magic/0, where 0 is the
>>>>> counter).
>>>>
>>>> You use checksums to ensure that you haven't had a power-out or 
>>>> reset in
>>>> the middle of writing,
>>>
>>> Only for this thing, you can write the counter as the last byte. If the
>>> writing is interrupted in the middle, counter hasn't written yet, so the
>>> block is not valid (because considered too old or empty).
>>
>> Nope.&#4294967295; You can't rely on that, unless you are absolutely sure that you
>> have a simple, old-fashioned E&#4294967295;PROM device.&#4294967295; Many devices are /not/ like
>> that, even if they provide an interface that matches it logically.
>>
>> A common structure for a modern device is to have 32-byte pages as a
>> compromise between density, cost, and flexibility.&#4294967295; (Bigger pages are
>> more efficient in device area and cost.)&#4294967295; When you send a command to
>> write a byte, the device reads the old 32-byte page into ram, erases the
>> old page, updates the ram with the new data, the writes the whole 32
>> byte page back in.
>>
>> The write process is done by a loop that writes all the data, reads it
>> back at a low voltage to see if it has stuck, and writes again as needed
>> until the data looks good.&#4294967295; Then it writes again a few times for safety
>> - either a fixed number, or a percentage of the number of writes taken.
>>
>> So it is /entirely/ possible for an interrupted write to give you a
>> valid counter, but invalid data.&#4294967295; It is also entirely possible to get
>> some bits of the counter as valid while others are still erased (giving
>> ones on most devices).
>>
>> And that is just for simple devices that don't do any fancy wear
>> levelling, packing, garbage collection, etc.
>>
>>
>>>
>>>> and that the flash has not worn out.
>>>>
>>>>>
>>>>> When the configuration is changed, Area2 is written with magic/1, 
>>>>> being
>>>>> careful to save magic/1 only at the end of area writing.
>>>>>
>>>>> At startup magics and counters from both area are loaded, and one area
>>>>> is chosen (magic should be valid and counter should be the maximum).
>>>>>
>>>>> I think this approach works, even when the area writing is interrupted
>>>>> at the middle.
>>>>>
>>>>> Why do I need checksum?&#4294967295; The only thing that comes in mind is to 
>>>>> prevent
>>>>> writing errors: for example, I want to write 0x00 but the value really
>>>>> written is 0x01, maybe for a noise on the serial bus.
>>>>>
>>>>> To solve this situation, I need checksum... but also I need to re-read
>>>>> and re-calculate the checksum at *every* area writing... and start 
>>>>> a new
>>>>> writing if something was wrong.
>>>>>
>>>>> Do you have a better strategy?
>>>>
>>>> You calculate the checksum for a block before writing it, and you check
>>>> it when reading it.&#4294967295; Simple.
>>>
>>> Do you calc the checksum of all the data block in RAM, including padding
>>> bytes?
>>
>> Yes, of course.&#4294967295; The trick is not to have unknown padding bytes.&#4294967295; I make
>> a point of /never/ having compiler-generated padding in my structs.
>>
>> So you have something like this:
>>
>> #define sizeOfRawBlock 32
>> #define noOfRawBlocks 4
>> #define magicNumber 0x9185be91
>> #define dataStructVersionExpected 0x0001
>>
>> typedef union {
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;uint8_t raw8[sizeOfRawBlock * noOfRawBlocks];
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;uint16_t raw16[sizeOfRawBlock * noOfRawBlocks / 2];
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;struct {
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; uint32_t magic;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; uint16_t dataStructVersion;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; uint16_t crc;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; uint32_t count;
>>
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; // real data
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;}
>> } nvmData_t;
>>
>> static_assert(sizeof(nvmData_t) == (sizeofRawBlock * noOfRawBlocks),
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295; "Check size of nvnData!");
> 
> Why raw8[]?
> 
> I think you can avoid raw16[] too. If you have the function:
> 
>  &#4294967295;&#4294967295; uint16_t calc_crc(const void *data, size_t size);
> 
> you can simply call:
> 
>  &#4294967295;&#4294967295; nvmData.crc =
>  &#4294967295;&#4294967295;&#4294967295;&#4294967295; calc_crc( ((unsigned char *)&nvmData) + 8, sizeof(nvmData) );
> 

No, you can't necessarily do that.  C has rules about what pointers can 
alias what data in what ways - so the compiler knows the data inside the 
struct is independent of a pointer to an uint16_t (such as you might use 
in the crc function), unless they are part of a union.  In many cases, 
the compiler can't actually make use of such information for 
optimisation - so code messing about with pointers of different types 
will work.  But sometimes it won't, especially with higher optimisation 
choices and link-time optimisation.  It is much better to get in the 
habit of doing things correctly - if you want to access the data as 
16-bit or 32-bit blocks (for checksums, for copying data, for passing in 
bulk to external memory, etc.) then use a union here.  It might not be 
necessary, depending on the rest of your code (and by C's alias rules, 
the 8-bit raw is not needed), but I'm showing the principle here.

> 
>> When you want to store the data, run your crc calculation over the
>> raw16[] from index 4 upwards, and set it in the crc field.&#4294967295; Set the
>> magic field with a different fixed value per program.&#4294967295; Use the
>> dataStructVersion to track versions of the structure of the data - so
>> that updated firmware can recognise old data and quietly update it. 
> 
> This is interesting.&#4294967295; Suppose you have a new struct version and you are 
> reading an old version from the memory.&#4294967295; How do you read it?&#4294967295; Do you put 
> it completely in RAM and after copy it in the final struct making the 
> changes from one version to the other?&#4294967295; In this case you need a double 
> space in RAM, for old and new version.
> 

That depends on the application.  I'm not doing /all/ your work for you :-)

> Or you declare a "unsigned char reserved_for_future[1024]" big array in 
> the struct, so the CRC is calculated over the full area?
> 

Keeping some extra space for the future is a good idea.  How much you 
use is a matter of taste.

> 
>> Set the count to a new number each time.
>>
>> Leave plenty of room to add new fields to your nvm structure.
>>
>>
>>>
>>>
>>>>>>> This means I need to copy&paste an entire block everytime, even 
>>>>>>> for a
>>>>>>> single byte change.&#4294967295; And this is similar to Flash approach, where I
>>>>>>> *need* a sector erase before changing a single byte.
>>>>>>
>>>>>> Yes.
>>>>>>
>>>>>> Of course, it is possible to have multiple smaller blocks rather than
>>>>>> just one big one.
>>>>>
>>>>> This is good to reduce writing time of a block of data.
>>>>> In my case I have 2kB of data, around 62 32-bytes EEPROM pages, 
>>>>> that is
>>>>> 5*62=300ms.&#4294967295; I don't like to block a task (in a cooperative OS) for so
>>>>> long time.
>>>>
>>>> Your writing should be done in the background.&#4294967295; The background writer
>>>> task is blocked - the system is not.
>>>
>>> Are you thinking of a preemption OS?&#4294967295; I'm using a cooperative OS.
>>>
>>
>> It doesn't matter.&#4294967295; In a pre-emptive OS, your background writer is:
>  &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; ^^^^^^^^^^^
> You mean cooperative.

Yes.

> 
>>
>> static enum { idle, writing, checking, erasing } writerState;
>>
>> void doWriter(void) {
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;switch (writerState) {
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; case idle :
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (writeTriggered) {
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; writeTriggered = false;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; startWriting();
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; writerState = writing;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; }
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; break;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; case writing :
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; bool stillWorking = pollNVMdevice();
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (stillWorking) return;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; startChecking();
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; writerState = checking;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; break;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; case checking :
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; bool stillWorking = pollNVMdevice();
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (stillWorking) return;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; startErasingNextBlock();
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; writerState = erasing;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; break;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; case erasing :
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; bool stillWorking = pollNVMdevice();
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (stillWorking) return;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; writerState = idle;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; break;
>> &#4294967295;&#4294967295;&#4294967295;&#4294967295;}
>> }
>>
>> There - half your program is done :-)
>>
>> Cooperative multitasking does not mean busy waiting for long-running
>> tasks.&#4294967295; It means using state machines, interrupt-driven SPI/I&#4294967295;C drivers,
>> polling, etc., so that you can mix fast and slow tasks.
> 
> Yes, ok.
>

Reply by David Brown ●June 17, 20182018-06-17

(You emailed me a copy of this post too.  I guess that was a slip of the 
mouse - we all do that from time to time.  Anyway, I'm replying the 
newsgroup and ignoring the email version.  That way everyone can see it, 
and more people can join in the fun.  It's good to see an interesting 
thread here in c.a.e. - it's been a bit idle recently.)

On 16/06/18 20:12, pozz wrote:
> 
> Il 15/06/2018 16:21, David Brown ha scritto:
> 
>  > static enum { idle, writing, checking, erasing } writerState;
>  >
>  > void doWriter(void) {
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295; switch (writerState) {
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; case idle :
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (writeTriggered) {
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; writeTriggered = false;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; startWriting();
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; writerState = writing;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; }
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; break;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; case writing :
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; bool stillWorking = pollNVMdevice();
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (stillWorking) return;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; startChecking();
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; writerState = checking;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; break;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; case checking :
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; bool stillWorking = pollNVMdevice();
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (stillWorking) return;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; startErasingNextBlock();
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; writerState = erasing;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; break;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; case erasing :
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; bool stillWorking = pollNVMdevice();
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; if (stillWorking) return;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; writerState = idle;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295; break;
>  >&#4294967295;&#4294967295;&#4294967295;&#4294967295; }
>  > }
>  >
>  > There - half your program is done
> 
> I was thinking on this "long-running" task that writes non-volatile data 
> to serial memory in the background, during normal execution of the&#4294967295; main 
> application.
> 
> First question. If I understood well your words, the checking state has 
> the goal of comparing data written (by reading them) against original 
> data in RAM.&#4294967295; If they differ, we should go back in writing state, maybe 
> changing the destination sector (because the write could be failed again 
> for physical damage of that sector).

Yes, basically.  It is up to you how you handle things if the ram copy 
can be changed underway.  You might use a second copy in ram for a 
check, you might ban changes during the write/check period, or you might 
simply run a checksum on the written data and check that it matches.

> 
> Another question is: what happens if saving is triggered again during 
> writing?&#4294967295; I think there isn't any big problem.&#4294967295; The writing task can be 
> started again from the beginning, even on the same destination sector.
> So during writing and checking states I should check for writeTriggered 
> flags again and prematurely stop the writing and start again from the 
> beginning.

Usually the most important thing is that the data stored is a consistent 
snapshot of the data structure, rather than the most recent version.  So 
you should continue saving what you are doing before triggering a new 
write.  But be very careful if you allow writing to the ram copy of the 
data while a write is in progress.

> 
> Third question, more complex (for me).
> Suppose I decided to split non-volatile data in two blocks, for example 
> calibration data and user settings.
> What happens if user settings change when calibration settings are being 
> written?&#4294967295; I think I should convert your writeTriggered in two updated 
> flags: calibration_updated and settings_updated.
> In idle state I should check both flags and start writing the relative 
> block.
> 

Sure, have as many blocks as you like.

> 
> Again another scenario.&#4294967295; Until now we talked about settings, a structure 
> filled with parameters that can change when the user wants at any time.
> 
> How to manage a log, a list of events with timestamp and some data?
> Suppose one entry takes 8 bytes.&#4294967295; I reserve 4kB of memory for around 500 
> entries organized in a FIFO.
> 
> Log isn't so critical as the settings, so I think I could avoid 
> redundancy in non-volatile memory.&#4294967295; Maybe only the CRC that, when not 
> valid, clears all the log.&#4294967295; It should be acceptable.

Have part of your NVM chip reserved for the logs.  Log in blocks, with a 
crc on each.  Don't bother holding more than two log blocks in ram (one 
being stored to NVM, and one for updating at the moment).

> 
> As usual we can talk about the opportunity to read the full log and put 
> it in RAM, or read the entries when needed (because the user wants to 
> read some entries, mostly the more recent).
> Reading 10 entries (80 bytes) from 10MHz SPI memory doesn't take too 
> much (no more than 100usec). But here the problem is that reading can be 
> needed during writing of settings.&#4294967295; And this is a big problem.
> 
> As usual, simplest solution is to have the full log in RAM... sigh!
> 
> What about writing of one or a few new entries in the log?&#4294967295; A writing 
> operation (for example, settings) should be in process.&#4294967295; I should 
> schedule and postpone the log update after writing of settings is finished.
> 
> Because you have much more experience than me (and you are so kind to 
> share it with me and other lurkers), could you suggest a smart approach?
>