package
0.0.0-20211212194252-0d40728414ff
Repository: https://github.com/uncharted-distil/distil.git
Documentation: pkg.go.dev
# Packages
No description provided by the author
# Functions
Classify will classify the dataset using a primitive.
Clean will clean bad data for further processing.
CloneDataset clones a dataset in metadata storage, data storage and on disk.
Cluster will cluster the dataset fields using a primitive.
ClusterExplainOutput clusters the explained output from a model.
CopyDiskDataset copies an existing dataset on disk to a new location, updating the ID and the storage name.
CreateComposedVariable creates a new variable to use as group id.
CreateDataset structures a raw csv file into a valid D3M dataset.
CreateDatasetFromResult creates a new dataset based on a result set & the input to the model.
CreateFeaturizedDatasetID creates a dataset id for a learning dataset.
DeleteDataset deletes a dataset from metadata and, if not a soft delete, from the database.
DeleteQueryCache deletes the query cache folder if it exists.
ExportDataset extracts a dataset from the database and metadata storage, writing it to disk in D3M dataset format.
Featurize provides a separate step for featurzing data so that it can be called independently of the ingest step.
FeaturizeDataset creates a featurized output of the data that can be used in simplified pipelines.
Format will format a dataset to have the required structures for D3M.
GeocodeForward will geocode a column into lat & lon values.
GeocodeForwardDataset geocodes fields that are types of locations.
GetUniqueOutputFolder produces a unique name for a dataset in a folder.
ImportPredictionDataset imports a dataset to be used for predictions.
Ingest the metadata to ES and the data to Postgres.
IngestDataset executes the complete ingest process for the specified dataset.
IngestMetadata ingests the data to ES.
IngestPostgres ingests a dataset to PG storage.
IngestPredictionDataset ingests a dataset to be used for predictions.
JoinDatamart will make all your dreams come true.
JoinDistil will bring misery.
LoadFittedSolution loads a fitted solution via TA2TA3 API.
LoadSolution loads an unfitted solution via TA2TA3 API.
Merge will merge data resources into a single data resource.
NewConfig creates an ingest config based on a distil config.
NewDefaultClient creates a new client to use when submitting pipelines.
NewPredictionTimeseriesDataset creates prediction timeseries dataset.
OutlierDetection finds outliers in either tabular or remote sensing data.
Predict processes input data to generate predictions.
PrepExistingPredictionDataset sets up an existing dataset to be usable for predictions.
Query uses a query pipeline to rank data by nearness to a target.
Rank will rank the dataset using a primitive.
Sample takes a sample of the dataset since larger datasets can lead to broken user experience through long lasting TA2 processing.
SaveFittedSolution saves a fitted solution to disk via TA2TA3 API.
SaveSolution saves a solution to disk via TA2TA3 API.
SetClient sets the compute client to use when invoking primitives.
SetGroups updates the dataset metadata (as stored) to capture group information.
Summarize will summarize the dataset using a primitive.
TargetRank will rank the dataset relative to a target variable using a primitive.
UpdateExtremas will update every field's extremas in the specified dataset.
VerifySuggestedTypes checks expands the suggested types to include all valid types the database storage can support.
VerticalConcat will bring mastery.
# Constants
DefaultSeparator is the default separator to use when dealing with groupings.
OutlierAnomaly is the category name used for anomalies discovered by outlier detection.
OutlierRegular is the category name used for regular values as discovered by outlier detection.
# Structs
ClusterPoint contains data that has been clustered.
FeatureRequest captures the properties of a request to a primitive.
GeocodedPoint contains data that has been geocoded.
ImportanceResult is the result from a ranking operation.
IngestParams contains the parameters needed to ingest a dataset.
IngestResult captures the result of a dataset ingest process.
IngestSteps is a collection of parameters that specify ingest behaviour.
IngestTaskConfig captures the necessary configuration for an data ingest.
JoinPair captures the information required for a single join relationship.
JoinSpec stores information for one side of a join operation.
OutlierPoint contains whether or not a datapoint is an outlier or not.
PredictionTimeseriesDataset has the paramaters necessary to create a timeseries dataset from minimal information.
PredictParams contains all parameters passed to the predict function.
QueryParams helper struct to simplify query task calling.
SummaryResult represents a summary result.
# Interfaces
DatasetConstructor is used to build a dataset.