Mathew Biddle
@cperaltab you should have permissions now. https://ipt-obis.gbif.us/manage/resource.do?r=sfmbon_zooplankton Please check
I like all those resources; however, they require a full [Darwin Core Archive](https://dwc.tdwg.org/text/) package (including eml.xml and meta.xml) or something loaded into an IPT. That seems burdensome to me when...
From @sbeaulieu: see https://github.com/EMODnet/EMODnetBiocheck for the under-the-hood code in the LifeWatch tool.
xref: https://cioos-siooc.github.io/pyobistools/index.html We might be able to put that checker in a GH Action that runs on CSV files found in `data/processed/` (see [example GH Action running python script](https://github.com/MathewBiddle/ioos_by_the_numbers/blob/main/.github/workflows/metrics.yml)). Could also...
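To make the "checker on CSV files in `data/processed/`" idea concrete, here is a minimal home-grown sketch. It does not use pyobistools or EMODnetBiocheck; it only verifies that each CSV header contains a set of Darwin Core terms. The `REQUIRED_TERMS` list and the function names `missing_terms` and `check_directory` are assumptions for illustration, so adjust them to whatever the real checker requires.

```python
import csv
from pathlib import Path

# Assumption: the Darwin Core terms we want every occurrence file to carry.
# Edit this list to match the requirements of the actual checker.
REQUIRED_TERMS = [
    "occurrenceID",
    "eventDate",
    "decimalLatitude",
    "decimalLongitude",
    "scientificName",
    "scientificNameID",
    "occurrenceStatus",
    "basisOfRecord",
]


def missing_terms(csv_path, required=REQUIRED_TERMS):
    """Return the required column names that are absent from the CSV header."""
    with open(csv_path, newline="", encoding="utf-8") as f:
        header = next(csv.reader(f), [])
    present = {name.strip() for name in header}
    return [term for term in required if term not in present]


def check_directory(directory="data/processed"):
    """Map each CSV file name in `directory` to its list of missing terms."""
    return {
        path.name: missing_terms(path)
        for path in sorted(Path(directory).glob("*.csv"))
    }
```

A workflow step could call `check_directory()` and exit with a non-zero status when any file reports missing terms, which would mark the GH Action run as failed.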
Testing Gemini to build it: https://gist.github.com/MathewBiddle/624afde54c5055efdc812602a5e56e63
See the section "Matt's home grown code for checking" in [this Gist](https://gist.github.com/MathewBiddle/65d6a75331df9856a5ad5bb6cddfe896).
486k records!! Holy smokes. This is fantastic. Since the data are already accessible on ERDDAP, we can develop a small script to do the reformatting. First and foremost, I would...
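As a rough starting point for that kind of reformatting script, the sketch below renames columns from an ERDDAP tabledap `.csv` response to Darwin Core terms and drops the units row that ERDDAP places on the second line. The `COLUMN_MAP` entries and the function name `erddap_csv_to_dwc` are hypothetical; the real variable names depend on the dataset.

```python
import csv
import io

# Hypothetical mapping from ERDDAP variable names to Darwin Core terms.
COLUMN_MAP = {
    "time": "eventDate",
    "latitude": "decimalLatitude",
    "longitude": "decimalLongitude",
    "species": "scientificName",
}


def erddap_csv_to_dwc(text, column_map=COLUMN_MAP):
    """Keep only mapped columns, rename them, and drop the ERDDAP units row."""
    rows = list(csv.reader(io.StringIO(text)))
    if len(rows) < 2:
        raise ValueError("expected a header row and a units row")
    # rows[0] is the header; rows[1] holds units in ERDDAP's .csv output.
    header, data = rows[0], rows[2:]
    keep = [i for i, name in enumerate(header) if name in column_map]
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow([column_map[header[i]] for i in keep])
    for row in data:
        writer.writerow([row[i] for i in keep])
    return out.getvalue()
```

Columns that are not in the mapping are dropped, so anything that should survive into the Darwin Core file needs an entry in `COLUMN_MAP`.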
Also, take a look at the Wilkinson Basin Zooplankton Timeseries dataset mobilization process: https://github.com/ioos/bio_data_guide/tree/main/datasets/WBTS_MBON There are probably a lot of similarities between the two.
The eml.xml and meta.xml should be skipped when **updating** a dataset in the IPT (i.e., updating the data files only). If you **start** a dataset with a DwCA .zip package...