package
0.0.80
Repository: https://github.com/alexamies/cnreader.git
Documentation: pkg.go.dev

# Functions

Builds a flat index of document titles from the hierarchical corpus.
Reads word frequencies data from files into memory and builds the keyword index.
* Filters a slice of sorted words by domain label if any one of the word * senses matches the label.
Retrieves documents with title for a single keyword.
Retrieves raw results for a single keyword.
Gets the dictionary definition of a slice of strings Parameters terms: The Chinese (simplified or traditional) text of the words Return hws: an array of word senses.
Initializes a DocumentFrequency struct.
ReadDocumentFrequency a document frequency object from a CSV file.
Orders the keyword with given frequency in a document by tf-idf weight Param: vocab - word frequencies for a particular document.
* Sorts Word struct's based on frequency */.
UpdateDictIndex writes a list of dicitonary words with subtring array.
UpdateDocTitleIndex writes a list of document titles from the hierarchical corpus with subtring arrays.
Append document analysis to a plain text file in the index directory.
Write corpus analysis to plain text files in the index directory.

# Constants

Bigram frequencies for each file.
No description provided by the author
File name for document index.
Word frequencies for each document.
File name for keyword index.
ngram frequencies for corpus.
Unknown characters file.
Word frequencies for corpus.
Word frequencies for each document.

# Structs

A word with corpus entry label.
A word frequency with corpus entry label.
Records the document length for each document in the corpus.
Map from term to number of documents referencing the term.
IndexConfig encapsulates parameters for index configuration.
A word frequency entry record.
Storage for the keyword index.
A keyword in a document.
A document-specific word frequency entry record.
Sorted list of word frequencies.
An entry in a sorted word array.
Remembers the word frequency for each term for each document in the corpus.
A document-specific word frequency entry record.
A word frequency entry record.
Storage for word frequency data.

# Interfaces

FsClient defines Firestore interfaces needed.

# Type aliases

No description provided by the author
No description provided by the author
Remembers the word frequency for each term for each document in the corpus.