Version: 0.0.0-20200509031100-522aef269e3c
Repository: https://github.com/liuzl/ling.git
Documentation: pkg.go.dev
# README
ling is a Go toolkit for natural language processing.
Implementation references
Similar NLP tools
- Stanford CoreNLP (Java)
- spaCy (Python)
- lingo (Go)
Multilingual text tokenization
Text normalization
Lemmatization
Stemming (词干提取) and lemmatization (词形还原)
- Stemming and lemmatization
- Lemmatization Lists, datasets by MBM
- The UniMorph Project
- Traditional/Simplified Chinese conversion
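
Lemmatization from word lists, as in the resources above, can be sketched as a plain dictionary lookup. This is a minimal illustration, not ling's implementation: the tab-separated "lemma, then inflected form" layout is an assumption about the list format, and `loadLemmas` and `lemmatize` are hypothetical names.

```go
package main

import (
	"bufio"
	"fmt"
	"strings"
)

// loadLemmas parses "lemma<TAB>form" lines into a form -> lemma map.
// The tab-separated layout is an assumption about the word-list format.
func loadLemmas(data string) map[string]string {
	table := make(map[string]string)
	sc := bufio.NewScanner(strings.NewReader(data))
	for sc.Scan() {
		parts := strings.Split(strings.TrimSpace(sc.Text()), "\t")
		if len(parts) != 2 {
			continue // skip blank or malformed lines
		}
		table[strings.ToLower(parts[1])] = strings.ToLower(parts[0])
	}
	return table
}

// lemmatize returns the lemma for a word, or the lowercased word
// itself when the word is not in the table.
func lemmatize(table map[string]string, word string) string {
	w := strings.ToLower(word)
	if lemma, ok := table[w]; ok {
		return lemma
	}
	return w
}

func main() {
	table := loadLemmas("be\twas\nbe\tis\ngo\twent\n")
	for _, w := range []string{"Was", "went", "toolkit"} {
		fmt.Printf("%s -> %s\n", w, lemmatize(table, w))
	}
}
```

Falling back to the lowercased input keeps unknown words usable downstream instead of dropping them.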
Tagging
- Regex tagger
- commonregex, a collection of common regular expressions for Go.
- xurls, a Go package of regex for urls.
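
A regex tagger of the kind listed above can be sketched with the standard `regexp` package alone. The `Span` type, the `tag` function, and the two patterns below are hypothetical and deliberately simplified; they are not taken from commonregex, xurls, or ling.

```go
package main

import (
	"fmt"
	"regexp"
	"sort"
)

// Span is a hypothetical tagged region: a label plus byte offsets into the input.
type Span struct {
	Type  string
	Text  string
	Start int
	End   int
}

// patterns are simplified illustrations, not production-grade expressions.
var patterns = map[string]*regexp.Regexp{
	"email": regexp.MustCompile(`[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}`),
	"url":   regexp.MustCompile(`https?://[^\s]+`),
}

// tag runs every pattern over the text and returns the matches
// ordered by their start offset.
func tag(text string) []Span {
	var out []Span
	for name, re := range patterns {
		for _, loc := range re.FindAllStringIndex(text, -1) {
			out = append(out, Span{
				Type:  name,
				Text:  text[loc[0]:loc[1]],
				Start: loc[0],
				End:   loc[1],
			})
		}
	}
	sort.Slice(out, func(i, j int) bool { return out[i].Start < out[j].Start })
	return out
}

func main() {
	for _, s := range tag("Mail bob@example.com or visit https://example.com/docs today") {
		fmt.Printf("%-5s [%d:%d] %s\n", s.Type, s.Start, s.End, s.Text)
	}
}
```

Sorting by offset makes the result independent of Go's randomized map iteration order.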
Natural language detection
getlang is much slower than franco:
- getlang
- franco
- test scripts
- franco: Duration: 5.12s, 26.93%
- getlang: Duration: 11.58s, 59.54%
# Functions
DefaultNLP returns ling handler with norm, lemma, unidecode and regex.
MustNLP is like NLP but panics if the annotators are not correct.
NewAPITagger returns a new tagger.
No description provided by the author
No description provided by the author
NLP returns ling handler with the annotators.
No description provided by the author
No description provided by the author
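
The NLP and MustNLP pair described above follows the common Go "Must" convention: one constructor returns an error, and a wrapper panics instead. The sketch below shows that convention with hypothetical stand-ins (`pipeline`, `newPipeline`, `mustPipeline`). The annotator names are taken from the DefaultNLP description, but treating them as plain strings is an assumption, and none of this reproduces ling's actual signatures.

```go
package main

import (
	"errors"
	"fmt"
)

// pipeline is a hypothetical stand-in for a configured handler.
type pipeline struct {
	annotators []string
}

// known lists the annotator names this sketch accepts (assumed values).
var known = map[string]bool{
	"norm": true, "lemma": true, "unidecode": true, "regex": true,
}

// newPipeline validates the annotator names and reports problems as an error.
func newPipeline(annotators ...string) (*pipeline, error) {
	if len(annotators) == 0 {
		return nil, errors.New("no annotators given")
	}
	for _, a := range annotators {
		if !known[a] {
			return nil, fmt.Errorf("unknown annotator %q", a)
		}
	}
	return &pipeline{annotators: annotators}, nil
}

// mustPipeline is like newPipeline but panics on invalid input,
// which suits package-level initialization with fixed arguments.
func mustPipeline(annotators ...string) *pipeline {
	p, err := newPipeline(annotators...)
	if err != nil {
		panic(err)
	}
	return p
}

func main() {
	p := mustPipeline("norm", "lemma")
	fmt.Println(p.annotators)

	if _, err := newPipeline("bogus"); err != nil {
		fmt.Println("error:", err)
	}
}
```

The panicking form is usually reserved for configuration that is fixed at compile time; input that comes from users or files is better served by the error-returning form.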
# Constants
- Regular expression patterns; see https://github.com/mingrammer/commonregex. (This description applies to each of the pattern constants.)
- TimePattern = `(?is)((?:0?|[12])\d\s*:+\s*[0-5]\d(?:\s*:+\s*[0-5]\d)?(?:\s*[,:.]*\s*(?:am|pm))?|(?:0?|[12])\d\s*[.\s]+\s*[0-5]\d(?:\s*[,:.]*\s*(?:am|pm))+)`.
- TokenType constants. Generator directives: go:generate jsonenums -type=TokenType and go:generate stringer -type=TokenType.
- Lemma processor name.
- No description provided by the author (three constants).
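
The TimePattern expression listed among the constants compiles unchanged with Go's standard `regexp` package. The sketch below copies it verbatim and extracts clock times from a sentence; `findTimes` is a hypothetical helper, not part of ling.

```go
package main

import (
	"fmt"
	"regexp"
)

// timePattern is copied verbatim from the TimePattern entry in the constants listing.
const timePattern = `(?is)((?:0?|[12])\d\s*:+\s*[0-5]\d(?:\s*:+\s*[0-5]\d)?(?:\s*[,:.]*\s*(?:am|pm))?|(?:0?|[12])\d\s*[.\s]+\s*[0-5]\d(?:\s*[,:.]*\s*(?:am|pm))+)`

// timeRe is compiled once at startup; MustCompile panics if the pattern is invalid.
var timeRe = regexp.MustCompile(timePattern)

// findTimes returns every non-overlapping substring that matches the time pattern.
func findTimes(text string) []string {
	return timeRe.FindAllString(text, -1)
}

func main() {
	for _, t := range findTimes("Standup at 9:30 am, lunch at 12:15.") {
		fmt.Printf("%q\n", t)
	}
}
```

The `(?i)` flag makes the am/pm suffix case-insensitive, so "10:45 PM" is matched as well.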
# Structs
APITagger is a tagger that works via an HTTP interface.
No description provided by the author
No description provided by the author
Entity stores the NER entity.
Lemmatizer is the processor for lemmatization.
Normalizer is the processor for token normalization.
A Pipeline contains configured annotators and taggers for natural language processing.
RegexTagger is the processor that uses regular expressions.
No description provided by the author
No description provided by the author
No description provided by the author
No description provided by the author
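
The idea of a pipeline that holds configured annotators and applies them in order can be sketched as follows. All names here (`document`, `annotator`, `tokenizer`, `lowercaser`, `pipeline`) are hypothetical; the real Pipeline, Normalizer, and Lemmatizer types may be shaped quite differently.

```go
package main

import (
	"fmt"
	"strings"
)

// document is a hypothetical container for the input text and its tokens.
type document struct {
	Text   string
	Tokens []string
}

// annotator is a hypothetical processing step that enriches a document in place.
type annotator interface {
	Annotate(d *document) error
}

// tokenizer splits the text on whitespace.
type tokenizer struct{}

func (tokenizer) Annotate(d *document) error {
	d.Tokens = strings.Fields(d.Text)
	return nil
}

// lowercaser normalizes every token to lower case.
type lowercaser struct{}

func (lowercaser) Annotate(d *document) error {
	for i, t := range d.Tokens {
		d.Tokens[i] = strings.ToLower(t)
	}
	return nil
}

// pipeline applies its steps in the order they were configured.
type pipeline struct {
	steps []annotator
}

// Run builds a document from text and stops at the first failing step.
func (p *pipeline) Run(text string) (*document, error) {
	d := &document{Text: text}
	for _, s := range p.steps {
		if err := s.Annotate(d); err != nil {
			return nil, err
		}
	}
	return d, nil
}

func main() {
	p := &pipeline{steps: []annotator{tokenizer{}, lowercaser{}}}
	d, err := p.Run("Ling Is A Toolkit")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(d.Tokens)
}
```

Because each step sees the output of the previous one, order matters: the lowercaser here relies on the tokenizer having run first.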
# Interfaces
No description provided by the author
# Type aliases
No description provided by the author