The second phase is analyzing the SFM file. There are 5 steps to this phase:
Line ends
Remove trailing blanks and look for tabs that should be spaces
Join lines
Determine if file is already in Unicode, or if encoding conversion is needed (if so, seek expert help)
Ensure linguist has a Unicode font and keyboard
Learn what sort orders are used
Be prepared to add these to the empty database
Which fields have list content
Is it consistent?
Are custom lists or list items needed?
Does it exist?
Is it applied correctly?
Should any be applied that isn't there?
Link to script for numbering examples
Script for applying formatting to headword in examples?