Hackathon Challenge: Difference between revisions

Jump to navigation Jump to search
Line 110: Line 110:


Gold Parsed WIS-L-0011732_lg.csv (and many other lichen gold parsed labels) removes a space from verbatimLatitude and from verbatimLongitude, changing this: 60° 33.579'N into this: 60°33.579'N.  The space removal is inconsistent, on some labels, not on others.
Gold Parsed WIS-L-0011732_lg.csv (and many other lichen gold parsed labels) removes a space from verbatimLatitude and from verbatimLongitude, changing this: 60° 33.579'N into this: 60°33.579'N.  The space removal is inconsistent, on some labels, not on others.
Gold Parsed NY01075791_lg.csv converts the "u" in "Mull" to an umlaut yielding "Müll".  This actually reflects the original label, but not the Gold OCR NY01075791_lg.txt file, which has "Mull".


'''Gold OCR Errors'''
'''Gold OCR Errors'''
Line 118: Line 120:


TENN-L-0000029_lg.txt adds a "1" to the scientificName ("Actinogyra muhlenbergii 1 (Ach.) Schol.").
TENN-L-0000029_lg.txt adds a "1" to the scientificName ("Actinogyra muhlenbergii 1 (Ach.) Schol.").
NY01075791_lg.txt converted "Müll" on the original label NY01075791_lg.jpg to "Mull" (converted umlaut "ü" to "u".  We may want to do this, but if we do it should be standardized and consistent across all the labels.


== Parameters ==
== Parameters ==

Navigation menu