Quality Assessment and Quality Control (QA/QC) - a new method was developed by Dr. Vimal and Geothara and Dr. Stein Kokin, and it is a work in progress
Errors of Omission (~5% or less)
Ma'abara: These absorption camps have thus far not been included as they were designed for temporary residence. We hope to add the ma’abarot at a later stage.
Non-Jewish communities in the Sinai Peninsula: These have not been included as this region no longer encompasses the region of Israeli settlement.
Pre-1967 Arab villages on the Golan Heights: These have thus far not been included due to difficulty of obtaining the necessary data, but we plan to add them as the Golan Heights remains an integral part of the region of Israeli settlement.
Errors of Commission
Data points in the database:
Are there any blank lines in the map database?
Are all the key columns (latitude, longitude, time of settlement, settlement classification) fully populated?
Are the points (colors) in the correct regions?
Are there any issues with the coordinate system?
Are there any points outside the Israel domain? (this happens when the lat and lon values are swapped or other errors have occurred).
Is the link to Wikipedia correct?
Map features:
Does the time animation bar appear correctly?
Do legends and labels appear as expected?
Are the links to the legends from Google Drive broken?
Is public sharing activated for all features of the map? (Ccheck the web link of the final map with incognito window).
Is the base map set to the right black background?
Are all the individual layers working without flashing? I.e. are all the enable editing options turned off for all layers that needed manual editing?
Update errors: file comparison (version control)
Is there information that has been lost by chance?
Are there data entries in the previous version of the database that have been lost?
Are all the points (including metadata) preserved?
Non-systematic errors: point by point checking (with expert knowledge, references and sources)
Routine checks: Select some (say 50) critical points across categories: several critical points for each large category, at least a few critical points for each small category. Do the critical points appear in the right place and are the data represented correctly?
Random spot checks: Do the points (say 50 random points) correspond with the community visible from the base map? Are the years of appearance and disappearance correct for the categories?
Validation across time: As we move further back in time, the greater the uncertainty with our data.
Errors will have to be quantified in terms of time (say decade by decade) to report a rough statistic on data accuracy.
Data quality level is high in these cases: Date of founding of Israel communities is generally accurate, though uncertainty again increases prior to the founding of the State of Israel.
Data quality in moderate in these cases: Prior to 1948.
Data quality level is not properly scrutinized in these cases: the early history of Arab villages (for example, the CBS does not collect or include data concerning their date of origin).
Metadata and citation of sources:
Is the metadata complete to track all changes made to the database?
Can we trace the metadata for each point?
Who added the point and when? From what source?
Is the citation to the source provided in meta-data of each point?