# Functions
getWordFrequencies compute word doc frequencies for corpus.
getWordFrequencies compute word frequencies, collocations, and usage for corpus.
Makes a glossary by filtering by the domain label and sorting by Chinese pinyin.
Constructor for empty CollectionAResults.
ParseText tokenizes a Chinese text corpus document into terms Parameters:
text: the string to parse ColTitle: Optional parameter used for tracing collocation usage document: Optional parameter used for tracing collocation usage
Returns:
tokens: the tokens for the parsed text results: vocabulary analysis results.
Subtract the items in the second list from the first.
WriteCorpus write all the collections in the given corpus collections: The set of collections to write to HTML baseDir: The base directory to use to write the files.
WriteCorpusAll write all the collections in the default corpus (collections.csv file).
WriteCorpusCol writes a corpus document collection to HTML, including all the entries contained in the collection collectionFile: the name of the collection file.
Writes dictionary headword entries func WriteHwFiles(loader library.LibraryLoader,
dictTokenizer tokenizer.Tokenizer, outputConfig generator.HTMLOutPutConfig, indexState index.IndexState, wdict map[string]dicttypes.Word, vocabAnalysis VocabAnalysis, hww HeadwordWriter) error {.
WriteLibraryFile writes a HTML files describing the corpora in the library.
# Structs
A struct to hold the analysis results for the collection.
DictEntry holds content used for writing a dictionary entry to HTML.
The content for a corpus entry.
No description provided by the author
VocabAnalysis bundles up vocabulary analysis.
# Interfaces
hwWriter manages files for writing headwords to HTML.