# Functions
Compare calculates the Hamming distance between two 64-bit integers
Currently, this is calculated using the Kernighan method [1].
Fingerprint returns a 64-bit fingerprint of the given vector.
Returns a new feature representing the given byte slice, using a weight of 1.
Returns a new feature representing the given byte slice with the given weight.
NewSliceScanner creates a scanner that returns the byte slices in tokens.
No description provided by the author
No description provided by the author
No description provided by the author
No description provided by the author
No description provided by the author
Shingle returns the w-shingling of the given set of bytes.
Returns a 64-bit simhash of the given feature set.
Returns a 64-bit simhash of the given bytes.
No description provided by the author
Hash returns a simhash value for the document returned by the scanner.
Vectorize generates 64 dimension vectors given a set of features.
VectorizeBytes generates 64 dimension vectors given a set of [][]byte, where each []byte is a feature with even weight.
# Structs
No description provided by the author
UnicodeWordFeatureSet is a feature set in which each word is a feature, all equal weight.
WordFeatureSet is a feature set in which each word is a feature, all equal weight.
# Interfaces
Feature consists of a 64-bit hash and a weight.
Return features one-at-a-time to be considered by SimHash.
FeatureSet represents a set of features in a given document.
# Type aliases
No description provided by the author
No description provided by the author
No description provided by the author
No description provided by the author