Categorygithub.com/bserdar/jsonstream
modulepackage
0.0.0-20190428032403-9f1769267072
Repository: https://github.com/bserdar/jsonstream.git
Documentation: pkg.go.dev

# README

JSON Streams

GoDoc

This library supports streaming JSON streaming conventions described in this Wikipedia page.

This is for the concatenated JSON streams, where each JSON document is concatenated one after the other:

rd:=jsonstream.NewConcatReader(reader)
var entry MyEntry
for {
  err:=rd.Unmarshal(&entry)
  if err==io.EOF {
    break
  }
  if err!=nil {
    return err
  }
  processEntry(entry)
}

This is for JSON streams with a separate JSON document in each line (NDJSON):

ndLinesReader:=jsonstream.NewLineReader(reader)
ndLinesWriter:=jsonstream.NewLineWriter(writer)

This is for JSON streams separated with record separator delimiter:

seqReader:=jsonstream.NewSeqReader(reader) // 0x1e record separator
seqWriter:=jsonstream.NewSeqWriter(writer) 
seqReader:=jsonstream.NewSeqReaderWithSep(reader,'\n') // Line-separated JSON
seqWriter:=jsonstream.NewSeqWriterWithSep(writer,'\n')

This is for JSON streams with length prefixed JSON stream, where each JSON document is prefixed by its byte length:

lpReader:=jsonstream.NewLenPrefixedReader(reader)
lpWriter:=jsonstream.NewLenPrefixesWriter(writer)

APIs

All four stream readers/writers use the same APIs.

Readers

data, err:=reader.ReadRaw()

ReadRaw reads the next JSON document. Only the ConcatReader validates that the JSON document is a valid document, the remaining readers simply read until the next delimiter. The returned byte array is a newly allocated copy of the underlying read buffer. Some of the readers use buffered input, so the state of the underlying reader is unknown.

var data myStruct
err:=reader.Unmarshal(&data)

Unmarshals the next entry from the input. For concatenated JSON, errors invalidate the rest of the stream. For others stream processing can continue.

Writers

err:=writer.WriteRaw(data)

WriteRaw simply writes the []byte data to the output, with the correct delimiter. For NDJSON, WriteRaw removes the newline characters from data.

err:=writer.Marshal(data)

Marshal first encodes data to JSON, and then writes it to the output.

# Functions

NewConcatReader returns a new stream reader.
NewConcatWriter returns a new writer.
NewLenPrefixedReader returns a new reader.
NewLenPrefixedWriter returns a new writer.
NewLineReader returns a new lines reader.
NewLineWriter returns a new writer.
NewSeqReader returns a new JSON sequence reader.
NewSeqReaderWithSep returns a new JSON sequence reader with the given separator.
NewSeqWriter returns a writer with 0x1e as separator.
NewSeqWriterSep returns a writer with a custom separator.
ReadAll reads all lines of the stream.
UnmarshalAll unmarshals all JSON documents from the input to the pointer to slice 'out'.

# Structs

ConcatReader reads concatenated documents.
ConcatWriter streams JSON documents by concatenating one after the other.
LenPrefixedReader reads documents delimited with the length of the next record.
LenPrefixedWriter streams JSON documents by adding the byte length of the JSON document before it.
LineReader reads one complete JSON document from each line.
LineWriter streams one JSON document every line.
SeqReader reads JSON documents delimited with a record separator, by default 0x1e.
SeqWriter streams JSON documents delimited with a separator byte.

# Interfaces

Reader reads raw JSON data or unmarshals data from a JSON stream.
Writer writes raw JSON data or marshals data to a JSON stream.