When replying to the thread comparing FTM and Legacy I wondered why using a well-documented standard like GEDCOM can lead to data corruption. I knew from past internet searching that the problem was connected with the CONC tag used to divide long text and then reconstitute it so I read the standrad and here is what it says:
GEDCOM 5.5
CONC implies concatenation to the previous line without a new line. This is used so that a text note or description can be processed (word wrapped) in a text window without fixed carriage returns.
GEDCOM 5.5.1
The CONC tag assumes that the accompanying subordinate value is concatenated to the previous line value without saving the carriage return prior to the line terminator. If a concatenated line is broken at a space, then the space must be carried over to the next line. Otherwise most operating systems will strip off the trailing space and the space is lost in the reconstitution of the note.
The distinction in the handling of spaces between a word that ends one line and the word that starts the next. In GEDCOM 5.5.1 this must be placed at the beginning of the following line, in GEDCOM 5.5 it can be placed at the end of the previous line. The version of GEDCOM being used is contained in the header of the GEDCOM so this really should not be a problem, but it clearly is a problem.
David
GEDCOM 5.5
CONC implies concatenation to the previous line without a new line. This is used so that a text note or description can be processed (word wrapped) in a text window without fixed carriage returns.
GEDCOM 5.5.1
The CONC tag assumes that the accompanying subordinate value is concatenated to the previous line value without saving the carriage return prior to the line terminator. If a concatenated line is broken at a space, then the space must be carried over to the next line. Otherwise most operating systems will strip off the trailing space and the space is lost in the reconstitution of the note.
The distinction in the handling of spaces between a word that ends one line and the word that starts the next. In GEDCOM 5.5.1 this must be placed at the beginning of the following line, in GEDCOM 5.5 it can be placed at the end of the previous line. The version of GEDCOM being used is contained in the header of the GEDCOM so this really should not be a problem, but it clearly is a problem.
David
Comment