Dataset Errata: Difference between revisions

Line 96: Line 96:
----


=== Gold Parsed CSV Files ===
=== Gold Parsed CSV File Issues ===
==== Lichen NY ====
==== Lichen NY CSV ====
There are more errors in gold csv files. (Qianjin) <br>
'''(Bryan: I agree with Qianjin's edits except as noted below)'''  
Line 202: Line 202:
:::FIXED in ocr txt files and parsed csv files. --[[User:Dpaul|Dpaul]] 15:21, 1 July 2013 (EDT)


==== Lichen TENN ====
==== Lichen TENN CSV ====


TENN-L-0000001_lg verbatimLocality mixed with verbatimElevation
Line 324: Line 324:
:::FIXED, put AK in stateProvince. --[[User:Dpaul|Dpaul]] 17:37, 1 July 2013 (EDT)


==== Lichen WIS ====
==== Lichen WIS CSV ====
WIS-L-0011728_lg stateProvince (AK) in the text file; but it is (ALASKA) in the csv file.
:::image has '''FLORA OF ALASKA''' so seems okay to me to put ALASKA. The image also shows '''U.S. Fish & Wildlife Service, Anchorage, AK'''...neither of these is in the '''Location:''' string that reads: '''Funny River road; Kenai NWR'''
Line 382: Line 382:
:::NOT FIXED, this is okay, they are embedded. --[[User:Dpaul|Dpaul]] 14:26, 3 July 2013 (EDT)
   
   
<br> '''Gold OCR Errors'''
=== Gold OCR (TXT file) Errors ===


NY01075761_lg.txt has catalogNumber as 0107576, omitting the 1 at the end.  
Line 395: Line 395:
<br> '''Silver Parsed CSV Files''' '''(Bryan: I do not get most of these. There should be OCR errors in silver. We do need to stay true to the OCR output.)'''


=== Silver Parsed CSV File Issues ===
"Silver Parsed CSV Files" There were some errors in the Silver CSV dataset. (Steven C.)  
"Silver Parsed CSV Files" There were some errors in the Silver CSV dataset. (Steven C.)