Hackathon Challenge: Difference between revisions

Line 113: Line 113:
The Gold Parsed file NY01075791_lg.csv converts the "u" in "Mull" to an umlaut, yielding "Müll". This matches the original label but not the Gold OCR file NY01075791_lg.txt, which has "Mull". The same applies to NY01075792_lg.csv and several others in the series.
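
A mismatch of this kind could be flagged automatically. The following is a minimal sketch, not part of the hackathon tooling: the function names and the sample inputs are hypothetical, and it assumes that words can be compared after simple whitespace splitting, which will miss words with attached punctuation.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Return text with combining marks removed (e.g. 'Müll' -> 'Mull')."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def diacritic_mismatches(parsed_values, ocr_text):
    """Find words in the parsed values that occur in the OCR text
    only in their diacritic-free form."""
    ocr_words = set(ocr_text.split())
    mismatches = []
    for value in parsed_values:
        for word in value.split():
            plain = strip_diacritics(word)
            if word != plain and word not in ocr_words and plain in ocr_words:
                mismatches.append((word, plain))
    return mismatches


# Hypothetical inputs standing in for one parsed CSV value and its OCR text.
parsed = ["Müll"]
ocr = "Coll. Mull 1998"
print(diacritic_mismatches(parsed, ocr))
```

Each reported pair gives the parsed form and the OCR form, so a reviewer can decide which of the two gold files should be corrected.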


===========================================================
There are more errors in the gold CSV files. (Qianjin)


NY01075759_lg verbatimEventDate (1998-04-19); it should be 19 April 1998
Line 194: Line 193:
WIS-L-0012074_lg county (null)
WIS-L-0012077_lg verbatimLocality contains verbatimCoordinates
==========================================================
'''Gold OCR Errors'''