4 Hash Value Calculations

A hash value is a small fixed-size value that is calculated from and used to represent all the values in an arbitrary-sized block of data. If that data block is copied, a hash recalculated from the new block can be compared to the original hash. Agreement between the two hashes provides a high level of certainty that the copy is valid. There are many hash algorithms. More complex algorithms provide a more robust verification but can sometimes be too computationally demanding when used in an embedded environment, particularly for smaller devices.

Hexmate implements several hash algorithms, such as checksums and cyclic redundancy checks, which can be selected to calculate a hash value of a program image that is contained in a HEX file. This value can be embedded into that same HEX file and burned into the target device along with the program image. At runtime, the target device can run a similar hash algorithm over the program image, now stored in its memory. If the stored and calculated hashes are the same, the embedded program can assume that it has a valid program image to execute.

Hexmate's -ck option requests that a hash be calculated, as described in Ck Hexmate Option.

Some consideration is required when a hash value is being calculated over memory that contains unused memory locations. Consider using Hexmate's -fill option (see Fill Hexmate Option) to have these locations programmed with a known value. Avoid filling the locations where the hash value will be stored, as memory is filled before the hash is calculated and this can result in an error.

Hexmate can produce a hash value from any Intel HEX file, regardless of which compiler produced the file and which device that file is intended to program. However, the architecture of the target device may restrict which memory locations can be read at runtime, thus requiring modification to the way in which Hexmate should perform hash calculations, so that the two hashes are calculated similarly and agree. In addition, some compilers might insert padding or phantom bytes into the HEX file that are not present in the device memory. These bytes might need to be ignored by Hexmate when it calculates a hash value and the following discussion indicates possible solutions.

Not all devices can read the entire width of their program memory. For example, Baseline and Mid-range PIC devices can only read the lower byte of each program memory location. The HEX file, however, will contain two bytes for each program memory word and both these bytes will normally be processed by Hexmate when calculating a hash value. Use the s2 suboption to Hexmate's -ck option to have the MSB of each 2-byte word skipped. Note, however, that this sort of verification process will not detect corruption in the MSB of each program word.

To accommodate the 24-bit (3 byte) program memory word size on 24-bit instruction set dsPIC and PIC24 devices, the compiler inserts a 0x00 phantom byte after each 3-byte instruction to make up a 4-byte word. Hexmate will normally see and process these phantom bytes when calculating a hash value, whereas code running on the device to perform the same calculation might not. When executing Hexmate explicitly, use the s4 suboption to the -ck option to have the MSB of each 4-byte word skipped for these devices.

Some devices have hardware CRC modules which can calculate a CRC hash value. If desired, program memory data can be streamed to this module using the Scanner module to automate the calculation. As the Scanner module reads the MSB of each program memory word first, you need to have Hexmate also process HEX file bytes within an instruction word in the reverse order. Use the r2 suboption to Hexmate's -ck option to have Hexmate process the bytes in a 2-byte word in reverse order.

Some consideration must also be given to how the Hexmate hash value encoded in the HEX file can be read at runtime.

Baseline and Mid-range PIC devices must store data in program memory using retlw instructions. Thus they need one instruction to store each byte of the hash value calculated by Hexmate. Use the t34 suboption to Hexmate's -ck option to have Hexmate store each byte of the hash value in a retlw instruction. The retlw instruction is encoded as 0x34nn, where nn is the 8-bit data value to be loaded to WREG when executed.

If you are targeting a 24-bit PIC device, where the 24-bits of program memory associated with an instruction appear as 3 bytes of data and 1 phantom byte in the HEX file and, for example, you wanted to have a 32-bit hash stored in only the least significant 16-bit word of each location, use Hemxate's t0000.2 suboption to the -ck option to store two bytes of the hash value in the lower half of each 4-bytes of the HEX file, with the upper bytes set to zero.