pkg.gl

BuildLayerWiseInferenceModel returns a function that builds the OGBN-MAG GNN inference model, which expects to run inference on the whole dataset in one go.

Download

Download downloads the dataset and prepares the tensors with the data in `baseDir`.

Eval

No description provided by the author

ExcludeOgbnMagVariablesFromSave

ExcludeOgbnMagVariablesFromSave marks the OGBN-MAG variables as not to be saved by the given `checkpoint`.

ExtractLabelsFromInput

ExtractLabelsFromInput creates the labels from the input seed indices.
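As a rough illustration of building labels from seed indices, the sketch below gathers one label per seed from a per-paper label table. It uses plain Go slices rather than the package's tensors, and `gatherLabels` is a hypothetical helper, not part of this package; the actual implementation may differ.

```go
package main

// gatherLabels is a hypothetical helper (not part of this package).
// For each seed index it returns the label stored at that position
// in the per-paper labels table.
func gatherLabels(labels []int32, seeds []int32) []int32 {
	out := make([]int32, len(seeds))
	for i, seed := range seeds {
		out[i] = labels[seed]
	}
	return out
}
```

For example, `gatherLabels([]int32{7, 3, 348, 0}, []int32{2, 0, 2})` picks the labels of papers 2, 0 and 2, in that order.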

FeaturePreprocessing

FeaturePreprocessing converts the `spec` and `inputs` given by the dataset into a map of node type name to its initial embeddings.

InitTrainingSchedule

InitTrainingSchedule initializes custom scheduled training.

LayerWiseEvaluation

LayerWiseEvaluation returns the train, validation and test accuracy of the model, using layer-wise inference.

MagModelGraph

MagModelGraph builds an OGBN-MAG GNN model that sends [ParamNumGraphUpdates] graph updates along its sampling strategy, and then adds a final layer on top of the seeds.

MakeDatasets

MakeDatasets takes a directory in which to store the downloaded data and returns 4 datasets: "train", "trainEval", "validEval", "testEval".

NewSampler

NewSampler will create a [sampler.Sampler] and configure it with the OGBN-MAG graph definition.

NewSamplerStrategy

NewSamplerStrategy creates a sampling strategy given the sampler, batch size and seed candidates to sample from.

PapersSeedDatasets

PapersSeedDatasets returns the train, validation and test datasets (`data.InMemoryDataset`) with only the papers seed nodes, to be used with FNN (Feedforward Neural Networks).

Train

Train trains the GNN model based on the configuration in `ctx`.

TrainingSchedule

TrainingSchedule is used to control hyperparameters during training.

UploadOgbnMagVariables

UploadOgbnMagVariables creates frozen variables with the various static tables of the OGBN-MAG dataset, so they can be used by models.

# Variables

BatchSize

BatchSize used for the sampler: the value was taken from the TF-GNN OGBN-MAG demo colab, and it was the best found with some hyperparameter tuning.

CountAuthorsAffiliations

Counts for the various edge types.

CountAuthorsPapers

No description provided by the author

CountFieldsOfStudyPapers

No description provided by the author

CountInstitutionsAffiliations

Counts for the various edge types.

CountPapersAuthors

No description provided by the author

CountPapersCites

No description provided by the author

CountPapersFieldsOfStudy

No description provided by the author

CountPapersIsCited

No description provided by the author

DownloadSubdir

No description provided by the author

EdgesAffiliatedWith

EdgesAffiliatedWith `(Int32)[1043998, 2]`, pairs with (author_id, institution_id).

EdgesCites

EdgesCites `(Int32)[5416271, 2]`, pairs with (paper_id, paper_id).

EdgesHasTopic

EdgesHasTopic `(Int32)[7505078, 2]`, pairs with (paper_id, topic_id).

EdgesWrites

EdgesWrites `(Int32)[7145660, 2]`, pairs with (author_id, paper_id).
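The four edge tables above share one layout: `[numEdges, 2]`, one (source, target) pair per row. As a minimal sketch of how such a table can be consumed, the function below counts outgoing edges per source node. It assumes the table has been flattened row-major into a plain `[]int32`; `outDegrees` is a hypothetical helper, and this is not necessarily how the package derives its `Count*` variables.

```go
package main

// outDegrees is a hypothetical helper (not part of this package).
// edges is assumed to be a row-major flattening of a [numEdges, 2]
// table, so edges[2*i] is the source and edges[2*i+1] the target of
// edge i. It returns the number of edges leaving each source node.
func outDegrees(edges []int32, numSources int) []int32 {
	degrees := make([]int32, numSources)
	for i := 0; i+1 < len(edges); i += 2 {
		degrees[edges[i]]++
	}
	return degrees
}
```

Counting by target instead of source only requires indexing with `edges[i+1]`.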

IdentitySubSeeds

IdentitySubSeeds controls whether to use an IdentitySubSeed, to allow more sharing of the kernel.

KeepDegrees

KeepDegrees will also make the sampler keep the degrees of the edges as separate tensors.

NanLogger

No description provided by the author

NumAuthors

No description provided by the author

NumFieldsOfStudy

No description provided by the author

NumInstitutions

No description provided by the author

NumLabels

NumLabels is the number of labels for the papers.

NumPapers

No description provided by the author

OgbnMagVariablesRef

OgbnMagVariablesRef maps variable names to a reference to their values.

OgbnMagVariablesScope

OgbnMagVariablesScope is the absolute scope where the dataset variables are stored.

PaperEmbeddingsSize

PaperEmbeddingsSize is the size of the node features given.

PapersEmbeddings

PapersEmbeddings contains the embeddings, shaped `(Float32)[NumPapers, PaperEmbeddingsSize]`.
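To make the `[NumPapers, PaperEmbeddingsSize]` shape concrete, the sketch below returns the feature vector of one paper from a flat buffer. It assumes row-major storage in a plain `[]float32`, which is an illustration only; `embeddingRow` is a hypothetical helper and does not reflect how the package exposes tensor data.

```go
package main

// embeddingRow is a hypothetical helper (not part of this package).
// flat is assumed to hold a [numPapers, size] table in row-major
// order; the result is the slice of `size` features for `paper`.
// The returned slice aliases flat and is not a copy.
func embeddingRow(flat []float32, paper, size int) []float32 {
	start := paper * size
	return flat[start : start+size]
}
```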

PapersLabels

PapersLabels for each paper, values from 0 to 348 (so 349 in total).

PapersYears

PapersYears for each paper, where year starts in 2000 (so 10 corresponds to 2010).
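The year encoding described above is a plain offset from 2000. A minimal sketch of the conversion, with `baseYear` and `calendarYear` as hypothetical names that are not part of this package:

```go
package main

// baseYear is the year that offset 0 stands for, per the description
// of PapersYears. Hypothetical constant, not part of this package.
const baseYear = 2000

// calendarYear converts a stored year offset into a calendar year.
func calendarYear(offset int32) int {
	return baseYear + int(offset)
}
```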

ParamDType

ParamDType controls the dtype to be used: either "float32" or "float16".

ParamEmbedDropoutRate

ParamEmbedDropoutRate adds an extra dropout to the learned embeddings.

ParamIdentitySubSeeds

ParamIdentitySubSeeds controls whether to use an IdentitySubSeed, to allow more sharing of the kernel.

ParamNumCheckpoints

ParamNumCheckpoints is the number of past checkpoints to keep.

ParamReuseKernels

ParamReuseKernels is the context parameter that configures whether the kernels for similar sampling rules will be reused.

ParamSplitEmbedTablesSize

ParamSplitEmbedTablesSize makes the embedding tables share entries across this many entries.

ReuseShareableKernels

ReuseShareableKernels will share the kernels across similar messages in the strategy tree.

TestSplit

TrainSplit, ValidSplit and TestSplit are the splits of the data.

TrainSplit

TrainSplit, ValidSplit and TestSplit are the splits of the data.

ValidSplit

TrainSplit, ValidSplit and TestSplit are the splits of the data.

WithReplacement

WithReplacement indicates whether the training dataset is created with replacement.
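For readers unfamiliar with the distinction, the sketch below contrasts the two ways of drawing seed indices. It is a generic illustration built on the standard `math/rand` package, not the package's sampler; all function names here are hypothetical.

```go
package main

import "math/rand"

// newRNG returns a seeded random source so results are reproducible.
func newRNG(seed int64) *rand.Rand {
	return rand.New(rand.NewSource(seed))
}

// sampleWithReplacement draws n indices uniformly from [0, size).
// The same index may appear more than once.
func sampleWithReplacement(rng *rand.Rand, size, n int) []int {
	out := make([]int, n)
	for i := range out {
		out[i] = rng.Intn(size)
	}
	return out
}

// sampleWithoutReplacement returns the first n entries of a random
// permutation of [0, size), so every index appears at most once.
// n is clamped to size.
func sampleWithoutReplacement(rng *rand.Rand, size, n int) []int {
	if n > size {
		n = size
	}
	return rng.Perm(size)[:n]
}
```

Without replacement, one pass over `size` draws visits every index exactly once, which matches the usual notion of an epoch; with replacement, some indices repeat and others may be skipped.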

ZipChecksum

No description provided by the author

ZipFile

No description provided by the author

ZipURL

No description provided by the author