pkg.gl

The span-amsl-discovery tool will create a discovery (now defunkt) like API response from available AMSL endpoints, refs #14456, #14415.

span-compare

span-compare renders a table with ISIL/SID counts of two indices side by side.

span-crossref-members

span-crossref-members fetches crossref members api.

span-crossref-snapshot

Given as single file with crossref works API messages, create a potentially smaller file, which contains only the most recent version of each document.

span-crossref-sync

span-crossref-sync downloads and caches raw crossref messages from the crossref works API: https://www.crossref.org/documentation/retrieve-metadata/rest-api/ Example usage: $ span-crossref-sync \ -p zstd \ # compress program -P feed-1- \ # file prefix (to separate different runs) -i d \ # interval (daily) -verbose \ # verbose -t 30m \ # timeout -s 2022-01-01 \ # start -e 2023-05-01 \ # end (leave out for default: yesterday) -c /data/finc/crossref/ # cache dir Space requirements: One day yields about 1M update docs, or a ~2GB compressed file.

span-crossref-table

Create a tabular representation of crossref data.

span-doisniffer

Sniff out DOI from a VuFind SOLR JSON document, optionally update docs with found DOI, cf.

span-export

span-export creates various destination formats, mostly for SOLR.

span-folio

WIP: span-folio talks to FOLIO API to fetch ISIL, collections and other information relevant to attachments.

span-freeze

Freeze file containing urls along with the content of all urls.

span-hcov

The span-hcov tool will generate a simple coverage report given a holding file in KBART format.

span-import

span-reshape is a dumbed down span-import.

span-join-assets

span-join-assets combines a directory of json or single column TSV configurations into a single file.

span-local-data

The span-local-data extracts data from a JSON file - something `jq` can do just as well, albeit a bit slower.

span-oa-filter

span-oa-filter will set x.oa to true, if the given KBART file validates a record.

span-redact

Redact intermediate schema, that is set fulltext field to the empty string.

span-report

span-report creates data subsets from an index for reporting.

span-tag

span-tag takes an intermediate schema file and a configuration forest of filters for various tags and runs all filters on every record of the input to produce a stream of tagged records.

span-update-labels

span-update-labels takes a TSV of IDs and ISILs and updates an intermediate schema record x.labels field accordingly.

# Packages