Gregory Wiedeman

Results 29 issues of Gregory Wiedeman

WARC files can have [metadata records](https://iipc.github.io/warc-specifications/specifications/warc-format/warc-1.1/#metadata). It seems relatively common for these metadata records to be arbitrary JSON key-value pairs. As a consumer of WARC files, I would like to...

I'm not sure if this wasn't included for some reason I'm not seeing, but this lets you use multiple cores when validating with the `is_valid()` method as a library. You...

Bagit-python currently assumes that a user owns the directory it is creating a bag in. We have a use case where a user needs to create a bag on an...

"As a small repository contributing to a regional aggregator that uses Arclight, I don't have a local discovery system. Thus I would like to use the arclight landing page for...

Updated the README to simplify the quickstart index command. It now suggests using `rails arclight:index` as the least error-prone way of getting an EAD in the index and then links...

It looks to me like the [Indexing EAD in ArcLight](https://github.com/projectblacklight/arclight/wiki/Indexing-EAD-in-ArcLight) wiki was mostly written before the Traject workflow. It does document how to [index with Traject](https://github.com/projectblacklight/arclight/wiki/Indexing-EAD-in-ArcLight#using-traject), but it's a bit...

documentation

This simplifies how Arclight handles `<unitdate>`s for both collections and components per the approach in #1028 comments. Arclight now stores a list of unitdates in `unitdates_ssm` and a corresponding list...

By default, ArchivesSpace permits three date types: inclusive, bulk, and single (and you can add more local types). Only inclusive and bulk are valid in EAD2002, so single dates export...

data model

When indexing a single EAD 2002 file with traject, there is a lot of text that is sent to `stdout`. Much of this seems to be because the traject log...

data harvesting pipeline

**Describe the bug** Some dependencies that get built locally, like typed-ast and libpff-python, have trouble installing on Python 3.11. This is probably mostly an issue for Windows and slim docker...

bug