Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adapt DWC Importer to add Field Occurrences #4216

Open
mabecabrera opened this issue Feb 19, 2025 · 2 comments
Open

Adapt DWC Importer to add Field Occurrences #4216

mabecabrera opened this issue Feb 19, 2025 · 2 comments
Assignees
Labels
enhancement Suggest an improvement to an existing function.

Comments

@mabecabrera
Copy link

Feature or enhancement

I have a dataset with Taxon Names with their authors, Locality and Coordinates only.
I guess I should add the column for basisOfRecord, and I do not know if there is another info required.

I also want to add the source for all this records... but maybe I could add it from the field occurrence filter by selecting all of them and adding the source?

Location

DwC Import

Screenshot, napkin sketch of interface, or conceptual description

Image
This is how it looks like the info.

Your role

Curator

@mabecabrera mabecabrera added the enhancement Suggest an improvement to an existing function. label Feb 19, 2025
@mabecabrera
Copy link
Author

The publication with this info, just to keep track is Bao, B., Wang, X., Peng, Z., Zhu, Q., Li, X. & Zhang, D. (2025) Pattern of Diversity and Prediction of Suitable Areas of Grasshoppers from the Qinghai–Tibet Plateau in China (Orthoptera: Acridoidea). Insects, 16(2), 191. https://doi.org/10.3390/insects16020191

@LocoDelAssembly
Copy link
Contributor

My implementation idea would be as follows:

When basisOfRecord is occurrence create FO in place of CO. All the rest is the same as preservedSpecimen (CE included) except for catalogNumber, institutionCode and collectionCode that would be ignored.

As for mapping sources, DwC has references, bibliographicCitation and associatedReferences. None of them are currently supported. Without using extensions it seems it is not possible to import reliably on anything other than Source::Verbatim, and perhaps parse the text to detect identifiers and use them (specially DOI). If a source can be located by its identifier then use it instead of create anew (should be scoped to in-project sources?). For Source::Verbatim we likely need to allow an ImportDataset to be an origin like we did for Person::Unvetted as a way to avoid creating the same source multiple times (unless it is OK to assume that existing matching source is the same no matter its origin). I guess we could start by supporting bibilographicCitation initially.

The dataset should be adapted to follow DwC column names and semantics. The data to be imported is in Appendix F @mabecabrera?

In principle columns have straightforward mappings to DwC except for Collection Site that would need a bit of handwork to split into country, stateProvince, county (maybe) and verbatimLocality (or no handwork at all and just use verbatimLocality with exact text and let TW figure out the rest by Lat/Lng).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Suggest an improvement to an existing function.
Projects
None yet
Development

No branches or pull requests

2 participants