Module: github.com/liuzl/ling
Version: 0.0.0-20200509031100-522aef269e3c
Repository: https://github.com/liuzl/ling.git
Documentation: pkg.go.dev

# README

ling is a Go toolkit for natural language processing.


Implementation references

Similar NLP tools

Multilingual text tokenization

Text normalization

Lemmatization

Stemming and lemmatization
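
To make the distinction concrete, here is a toy sketch, not ling's Lemmatizer: stemming strips suffixes by rule, while lemmatization looks up a dictionary form. The `stem` and `lemmatize` functions and the three-entry `lemmas` map are hypothetical and exist only for illustration.

```go
package main

import (
	"fmt"
	"strings"
)

// lemmas is a tiny hand-written dictionary used only for this illustration.
var lemmas = map[string]string{"went": "go", "mice": "mouse", "better": "good"}

// stem crudely strips a few English suffixes. Real stemmers apply many more rules.
func stem(word string) string {
	for _, suffix := range []string{"ing", "ed", "s"} {
		if strings.HasSuffix(word, suffix) && len(word) > len(suffix)+2 {
			return strings.TrimSuffix(word, suffix)
		}
	}
	return word
}

// lemmatize returns the dictionary form if known, otherwise the word itself.
func lemmatize(word string) string {
	if lemma, ok := lemmas[word]; ok {
		return lemma
	}
	return word
}

func main() {
	for _, w := range []string{"jumped", "went", "mice"} {
		fmt.Printf("%s: stem=%s lemma=%s\n", w, stem(w), lemmatize(w))
	}
}
```

Rule-based stemming handles regular forms such as "jumped" but leaves irregular forms such as "went" untouched, which is where a dictionary lookup helps.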

Tagging

  • Regex tagger
    • commonregex, a collection of common regular expressions for Go.
    • xurls, a Go package of regex for urls.
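
As a rough illustration of regex-based tagging, the sketch below runs a small set of named patterns over a text and records each match with its byte offsets. It uses only the standard `regexp` package. The `Span` type, the `tagText` function, and the two simplified patterns are assumptions made for this example; they are not ling's RegexTagger, nor the expressions shipped by commonregex or xurls.

```go
package main

import (
	"fmt"
	"regexp"
)

// Span is a hypothetical tagged match; it is not ling's Entity type.
type Span struct {
	Type  string
	Text  string
	Start int // byte offset, inclusive
	End   int // byte offset, exclusive
}

// patterns holds simplified, illustrative expressions.
var patterns = []struct {
	name string
	re   *regexp.Regexp
}{
	{"email", regexp.MustCompile(`[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}`)},
	{"url", regexp.MustCompile(`https?://[^\s]+`)},
}

// tagText applies every pattern in order and collects all matches.
func tagText(text string) []Span {
	var spans []Span
	for _, p := range patterns {
		for _, loc := range p.re.FindAllStringIndex(text, -1) {
			spans = append(spans, Span{
				Type:  p.name,
				Text:  text[loc[0]:loc[1]],
				Start: loc[0],
				End:   loc[1],
			})
		}
	}
	return spans
}

func main() {
	for _, s := range tagText("Mail bob@example.com or see https://example.com/docs") {
		fmt.Printf("%-5s [%d,%d) %s\n", s.Type, s.Start, s.End, s.Text)
	}
}
```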

Natural language Detection


getlang is much slower than franco

# Packages

No description provided by the author
No description provided by the author
No description provided by the author
No description provided by the author
No description provided by the author

# Functions

DefaultNLP returns a ling handler with the norm, lemma, unidecode and regex annotators.
MustNLP is like NLP but panics if the annotators are not correct.
NewAPITagger returns a new tagger.
No description provided by the author
No description provided by the author
NLP returns a ling handler with the given annotators.
No description provided by the author
No description provided by the author
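
The NLP / MustNLP pair follows a common Go convention: one constructor returns an error, and a `Must` variant panics instead (compare `regexp.Compile` and `regexp.MustCompile`). The sketch below shows that convention with hypothetical stand-ins. The names `pipeline`, `newPipeline`, and `mustPipeline` and the validation logic are assumptions, not ling's actual signatures or implementation; only the annotator names come from the DefaultNLP description.

```go
package main

import (
	"fmt"
)

// known lists the annotator names mentioned for DefaultNLP (illustrative set).
var known = map[string]bool{"norm": true, "lemma": true, "unidecode": true, "regex": true}

// pipeline is a hypothetical stand-in, not ling's Pipeline struct.
type pipeline struct {
	annotators []string
}

// newPipeline validates the annotator names and reports the first unknown one.
func newPipeline(annotators ...string) (*pipeline, error) {
	for _, a := range annotators {
		if !known[a] {
			return nil, fmt.Errorf("unknown annotator %q", a)
		}
	}
	return &pipeline{annotators: annotators}, nil
}

// mustPipeline is like newPipeline but panics on invalid input.
func mustPipeline(annotators ...string) *pipeline {
	p, err := newPipeline(annotators...)
	if err != nil {
		panic(err)
	}
	return p
}

func main() {
	p := mustPipeline("norm", "lemma")
	fmt.Println(p.annotators)

	if _, err := newPipeline("bogus"); err != nil {
		fmt.Println("error:", err)
	}
}
```

The `Must` form suits package-level initialization with fixed arguments; the error-returning form suits annotator lists that come from configuration or user input.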

# Constants

https://github.com/mingrammer/commonregex Regular expression patterns.
https://github.com/mingrammer/commonregex Regular expression patterns.
https://github.com/mingrammer/commonregex Regular expression patterns.
https://github.com/mingrammer/commonregex Regular expression patterns.
https://github.com/mingrammer/commonregex Regular expression patterns.
go:generate jsonenums -type=TokenType; go:generate stringer -type=TokenType.
https://github.com/mingrammer/commonregex Regular expression patterns.
https://github.com/mingrammer/commonregex Regular expression patterns.
https://github.com/mingrammer/commonregex Regular expression patterns.
https://github.com/mingrammer/commonregex Regular expression patterns.
https://github.com/mingrammer/commonregex Regular expression patterns.
Lemma processor name.
go:generate jsonenums -type=TokenType; go:generate stringer -type=TokenType.
https://github.com/mingrammer/commonregex Regular expression patterns.
No description provided by the author
https://github.com/mingrammer/commonregex Regular expression patterns.
https://github.com/mingrammer/commonregex Regular expression patterns.
https://github.com/mingrammer/commonregex Regular expression patterns.
No description provided by the author
go:generate jsonenums -type=TokenType; go:generate stringer -type=TokenType.
https://github.com/mingrammer/commonregex Regular expression patterns.
TimePattern = `(?is)((?:0?|[12])\d\s*:+\s*[0-5]\d(?:\s*:+\s*[0-5]\d)?(?:\s*[,:.]*\s*(?:am|pm))?|(?:0?|[12])\d\s*[.\s]+\s*[0-5]\d(?:\s*[,:.]*\s*(?:am|pm))+)`.
https://github.com/mingrammer/commonregex Regular expression patterns.
go:generate jsonenums -type=TokenType; go:generate stringer -type=TokenType.
https://github.com/mingrammer/commonregex Regular expression patterns.
https://github.com/mingrammer/commonregex Regular expression patterns.
go:generate jsonenums -type=TokenType; go:generate stringer -type=TokenType.
https://github.com/mingrammer/commonregex Regular expression patterns.
go:generate jsonenums -type=TokenType; go:generate stringer -type=TokenType.
https://github.com/mingrammer/commonregex Regular expression patterns.
No description provided by the author
https://github.com/mingrammer/commonregex Regular expression patterns.
go:generate jsonenums -type=TokenType; go:generate stringer -type=TokenType.
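
The pattern constants above are plain strings, so they can be compiled with the standard `regexp` package. The sketch below copies the TimePattern value from the listing into a local constant and extracts time expressions from a sentence. The names `timePattern`, `timeRegex`, and `findTimes` are hypothetical; the package's own precompiled expressions live in the `Regexes` variable, whose exact shape is not shown here.

```go
package main

import (
	"fmt"
	"regexp"
)

// timePattern is copied verbatim from the TimePattern constant in the listing.
const timePattern = `(?is)((?:0?|[12])\d\s*:+\s*[0-5]\d(?:\s*:+\s*[0-5]\d)?(?:\s*[,:.]*\s*(?:am|pm))?|(?:0?|[12])\d\s*[.\s]+\s*[0-5]\d(?:\s*[,:.]*\s*(?:am|pm))+)`

// timeRegex is compiled once at startup; MustCompile panics on an invalid pattern.
var timeRegex = regexp.MustCompile(timePattern)

// findTimes returns every substring of text that matches the time pattern.
func findTimes(text string) []string {
	return timeRegex.FindAllString(text, -1)
}

func main() {
	fmt.Println(findTimes("The call starts at 10:30 am"))
}
```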

# Variables

No description provided by the author
Regexes is the compiled regular expressions.

# Structs

APITagger tags text via an HTTP interface.
No description provided by the author
No description provided by the author
Entity stores the NER entity.
Lemmatizer is the processor for lemmatization.
Normalizer is the processor for token normalization.
A Pipeline contains configured annotators and taggers for natural language processing.
RegexTagger is the processor that uses regular expressions.
No description provided by the author
No description provided by the author
No description provided by the author
No description provided by the author

# Interfaces

No description provided by the author

# Type aliases

No description provided by the author