directory
0.2.4
Repository: https://github.com/miku/span.git
Documentation: pkg.go.dev

# Packages

The span-amsl-discovery tool will create a discovery (now defunkt) like API response from available AMSL endpoints, refs #14456, #14415.
span-compare renders a table with ISIL/SID counts of two indices side by side.
span-crossref-members fetches crossref members api.
Given as single file with crossref works API messages, create a potentially smaller file, which contains only the most recent version of each document.
span-crossref-sync downloads and caches raw crossref messages from the crossref works API: https://www.crossref.org/documentation/retrieve-metadata/rest-api/ Example usage: $ span-crossref-sync \ -p zstd \ # compress program -P feed-1- \ # file prefix (to separate different runs) -i d \ # interval (daily) -verbose \ # verbose -t 30m \ # timeout -s 2022-01-01 \ # start -e 2023-05-01 \ # end (leave out for default: yesterday) -c /data/finc/crossref/ # cache dir Space requirements: One day yields about 1M update docs, or a ~2GB compressed file.
Create a tabular representation of crossref data.
Sniff out DOI from a VuFind SOLR JSON document, optionally update docs with found DOI, cf.
span-export creates various destination formats, mostly for SOLR.
WIP: span-folio talks to FOLIO API to fetch ISIL, collections and other information relevant to attachments.
Freeze file containing urls along with the content of all urls.
The span-hcov tool will generate a simple coverage report given a holding file in KBART format.
span-reshape is a dumbed down span-import.
span-join-assets combines a directory of json or single column TSV configurations into a single file.
The span-local-data extracts data from a JSON file - something `jq` can do just as well, albeit a bit slower.
span-oa-filter will set x.oa to true, if the given KBART file validates a record.
Redact intermediate schema, that is set fulltext field to the empty string.
span-report creates data subsets from an index for reporting.
span-tag takes an intermediate schema file and a configuration forest of filters for various tags and runs all filters on every record of the input to produce a stream of tagged records.
span-update-labels takes a TSV of IDs and ISILs and updates an intermediate schema record x.labels field accordingly.