Data Ingestion Guidance: Difference between revisions

Line 141: Line 141:
** If you have a UUID GUID for your image records, then assign it to the optional ac:providerManagedID field.
** If you have a UUID GUID for your image records, then assign it to the optional ac:providerManagedID field.
*** sample
*** sample
****id = identifier of the specimen record (the coreId field in the Audubon Core extension file) <pre>urn:catalog:institutionCode:collectionCode:catalogNumber</pre>
****id = (this is the coreid field in the Audubon Core extension file), it matches one identifier of the specimen record.<pre>urn:catalog:institutionCode:collectionCode:catalogNumber</pre>
****identifier = id of the media record - needs to be unique within Audubon Core file, is the equivalent of the occurrenceID in the occurrence file<pre>urn:catalog:institutionCode:collectionCode:Image:catalogNumber</pre>
****identifier = id of the media record - needs to be unique within Audubon Core file, is the equivalent of the occurrenceID in the occurrence file (dcterms:identifier)<pre>urn:catalog:institutionCode:collectionCode:Image:catalogNumber</pre>
****accessURI = link to the media file <pre>http://bgbasesrvr.univ.edu/DATABASEIMAGES/LONN00000001.JPG</pre>
****accessURI = link to the media file <pre>http://bgbasesrvr.univ.edu/DATABASEIMAGES/LONN00000001.JPG</pre>
****providerManagedID = <pre>urn:uuid:32e5da5d-c747-435c-a368-07d989259bf4  (optional)</pre>
****providerManagedID = <pre>urn:uuid:32e5da5d-c747-435c-a368-07d989259bf4  (optional)</pre>
*** If you are not using IPT, generate a meta.xml file by hand and package up the files in a DwC A-like format. (No eml.xml required).
*In the case where the data are coming from an aggregator, an additional recordId field is required (idigbio:recordId)
In the case where the data are coming from an aggregator, then is an additional recordId field is required
*Use Audubon Core terms, http://terms.tdwg.org/wiki/Audubon_Core_Term_List, with one record to go with each media record. The more you can flesh out the details of the image, the more likely it will be to be highly retrievable. The best practice is to use the taxonomic and geographic fields to capture as much information as possible when only media are given to iDigBio.
*Use Audubon Core terms, http://terms.tdwg.org/wiki/Audubon_Core_Term_List, with one record to go with each media record. The more you can flesh out the details of the image, the more likely it will be to be highly retrievable. The best practice is to use the taxonomic and geographic fields to capture as much information as possible when only media are given to iDigBio.
*Just like permission of catalog records, the media records need to be provided freely and with permission, and each record should have at least Creative Commons permission = be in the public domain (e.g., CC0). Any combination of BY, NC, and SA of CC media license you wish to apply is fine with
*Just like permission of catalog records, the media records need to be provided freely and with permission, and each record should have at least Creative Commons permission = be in the public domain (e.g., CC0). Any combination of BY, NC, and SA of CC media license you wish to apply is fine with
Line 152: Line 151:
*a sample of an [https://www.idigbio.org/sites/default/files/sites/default/files/DarwinCoreExamples/AudubonCoreExample.xlsx Audubon Core file]
*a sample of an [https://www.idigbio.org/sites/default/files/sites/default/files/DarwinCoreExamples/AudubonCoreExample.xlsx Audubon Core file]
*The media records represent a one-to-one relationship between the media object (the fit-for-display best quality JPG, in the case of images, for example) and the specimen record. There is no need to include links to any other forms of the media, like an enclosing webpage. Below is some guidance on handling special cases. If none of these media attachment rules make sense to you, please get in touch with us for further assistance.
*The media records represent a one-to-one relationship between the media object (the fit-for-display best quality JPG, in the case of images, for example) and the specimen record. There is no need to include links to any other forms of the media, like an enclosing webpage. Below is some guidance on handling special cases. If none of these media attachment rules make sense to you, please get in touch with us for further assistance.
If you are not using IPT, generate a meta.xml file by hand and package up the files in a DwC A-like format. (No eml.xml required).


==Best practice for getting Audubon Core images linked to specimen records - special cases==
==Best practice for getting Audubon Core images linked to specimen records - special cases==
5,887

edits