Categorygithub.com/oriyarde/spec
module
0.1.0
Repository: https://github.com/oriyarde/spec.git
Documentation: pkg.go.dev

# README

spec

Terminology

TermDefinition
VolumeIDThe identifier of the volume generated by the plugin.
COContainer Orchestration system that communicates with plugins using CSI service RPCs.
SPStorage Provider, the vendor of a CSI plugin implementation.
DRDisaster Recovery.
RPCRemote Procedure Call.

Objective

Define a standard that will enable storage vendors (SP) to develop controllers/plugins for DR or to talk to the different CO systems.

Goals in MVP

The new standard will

  • Provide API at volume level granularity.
  • Enable SP authors to write one replication compliant plugin that “just works” across all COs that implement RPC.
  • Define API (RPCs) that enable:
    • Enable/Disable volume mirroring.
    • Promote/Demote volume.
    • Resync volume to solve the issue before using the volume.

Non-Goals in MVP

  • Replication at different granular levels
  • Replication of volume snapshots.

Solution Overview

This specification defines an interface along with the minimum operational and packaging recommendations for a storage provider (SP) to implement a Replication compatible plugin. The interface declares the RPCs that a plugin MUST expose.

Architecture

arch

RPC Interface

  • Controller Service: The Controller plugin MUST implement these sets of RPCs.
// Controller holds the RPC methods for replication and all the methods it
// exposes should be idempotent.
service Controller {
  // EnableVolumeReplication RPC call to enable the volume replication.
  rpc EnableVolumeReplication (EnableVolumeReplicationRequest)
  returns (EnableVolumeReplicationResponse) {}
  // DisableVolumeReplication RPC call to disable the volume replication.
  rpc DisableVolumeReplication (DisableVolumeReplicationRequest)
  returns (DisableVolumeReplicationResponse) {}
  // PromoteVolume RPC call to promote the volume.
  rpc PromoteVolume (PromoteVolumeRequest)
  returns (PromoteVolumeResponse) {}
  // DemoteVolume RPC call to demote the volume.
  rpc DemoteVolume (DemoteVolumeRequest)
  returns (DemoteVolumeResponse) {}
  // ResyncVolume RPC call to resync the volume.
  rpc ResyncVolume (ResyncVolumeRequest)
  returns (ResyncVolumeResponse) {}
}

EnableVolumeReplication

// EnableVolumeReplicationRequest holds the required information to enable
// replication on a volume.
message EnableVolumeReplicationRequest {
  // The identifier for this volume, generated by the plugin during
  // CreateVolume CSI RPC call.
  // This field is REQUIRED.
  // This field MUST contain enough information to uniquely identify
  // this specific volume vs all other volumes supported by this plugin.
  // This field SHALL be used by the CO in subsequent calls to refer to
  // this volume.
  string volume_id = 1;
  // Plugin specific parameters passed in as opaque key-value pairs.
  map<string, string> parameters = 2;
  // Secrets required by the plugin to complete the request.
  map<string, string> secrets = 3 [(replication_secret) = true];
}

// EnableVolumeReplicationResponse holds the information to send when
// replication is successfully enabled on a volume.
message EnableVolumeReplicationResponse {
}

Error Scheme

ConditiongRPC CodeDescriptionRecovery Behavior
Missing required field3 INVALID_ARGUMENTIndicates that a required field is missing from the request.Caller MUST fix the request by adding the missing required field before retrying.
Volume does not exist5 NOT_FOUNDIndicates that a volume corresponding to the specified volume_id does not exist.Caller MUST verify that the volume_id is correct and that the volume is accessible and has not been deleted before retrying with exponential back off.
Operation pending for volume10 ABORTEDIndicates that there is already an operation pending for the specified volume_id. In general the Cluster Orchestrator (CO) is responsible for ensuring that there is no more than one call "in-flight" per volume_id at a given time. However, in some circumstances, the CO MAY lose state (for example when the CO crashes and restarts), and MAY issue multiple calls simultaneously for the same volume_id. The Plugin, SHOULD handle this as gracefully as possible, and MAY return this error code to reject secondary calls.Caller SHOULD ensure that there are no other calls pending for the specified volume_id, and then retry with exponential back off.
Not authenticated16 UNAUTHENTICATEDThe invoked RPC does not carry secrets that are valid for authentication.Caller SHALL either fix the secrets provided in the RPC, or otherwise regalvanize said secrets such that they will pass authentication by the Plugin for the attempted RPC, after which point the caller MAY retry the attempted RPC.
Error is Unknown2 UNKNOWNIndicates that a unknown error is generatedCaller MUST study the logs before retrying

DisableVolumeReplication

// DisableVolumeReplicationRequest holds the required information to disable
// replication on a volume.
message DisableVolumeReplicationRequest {
  // The identifier for this volume, generated by the plugin during
  // CreateVolume CSI RPC call.
  // This field is REQUIRED.
  // This field MUST contain enough information to uniquely identify
  // this specific volume vs all other volumes supported by this plugin.
  // This field SHALL be used by the CO in subsequent calls to refer to
  // this volume.
  string volume_id = 1;
  // Plugin specific parameters passed in as opaque key-value pairs.
  map<string, string> parameters = 2;
  // Secrets required by the plugin to complete the request.
  map<string, string> secrets = 3 [(replication_secret) = true];
}

// DisableVolumeReplicationResponse holds the information to send when
// replication is successfully disabled on a volume.
message DisableVolumeReplicationResponse {
}

Error Scheme

ConditiongRPC CodeDescriptionRecovery Behavior
Missing required field3 INVALID_ARGUMENTIndicates that a required field is missing from the request.Caller MUST fix the request by adding the missing required field before retrying.
Volume does not exist5 NOT_FOUNDIndicates that a volume corresponding to the specified volume_id does not exist.Caller MUST verify that the volume_id is correct and that the volume is accessible and has not been deleted before retrying with exponential back off.
Operation pending for volume10 ABORTEDIndicates that there is already an operation pending for the specified volume_id. In general the Cluster Orchestrator (CO) is responsible for ensuring that there is no more than one call "in-flight" per volume_id at a given time. However, in some circumstances, the CO MAY lose state (for example when the CO crashes and restarts), and MAY issue multiple calls simultaneously for the same volume_id. The Plugin, SHOULD handle this as gracefully as possible, and MAY return this error code to reject secondary calls.Caller SHOULD ensure that there are no other calls pending for the specified volume_id, and then retry with exponential back off.
Not authenticated16 UNAUTHENTICATEDThe invoked RPC does not carry secrets that are valid for authentication.Caller SHALL either fix the secrets provided in the RPC, or otherwise regalvanize said secrets such that they will pass authentication by the Plugin for the attempted RPC, after which point the caller MAY retry the attempted RPC.
Error is Unknown2 UNKNOWNIndicates that a unknown error is generatedCaller MUST study the logs before retrying

PromoteVolume

// PromoteVolumeRequest holds the required information to promote volume as a
// primary on local cluster.
message PromoteVolumeRequest {
  // The identifier for this volume, generated by the plugin during
  // CreateVolume CSI RPC call.
  // This field is REQUIRED.
  // This field MUST contain enough information to uniquely identify
  // this specific volume vs all other volumes supported by this plugin.
  // This field SHALL be used by the CO in subsequent calls to refer to
  // this volume.
  string volume_id = 1;
  // This field is optional.
  // Default value is false, force option to Promote the volume.
  bool force = 2;
  // Plugin specific parameters passed in as opaque key-value pairs.
  map<string, string> parameters = 3;
  // Secrets required by the plugin to complete the request.
  map<string, string> secrets = 4 [(replication_secret) = true];
}

// PromoteVolumeResponse holds the information to send when
// volume is successfully promoted.
message PromoteVolumeResponse{
}

Error Scheme

ConditiongRPC CodeDescriptionRecovery Behavior
Missing required field3 INVALID_ARGUMENTIndicates that a required field is missing from the request.Caller MUST fix the request by adding the missing required field before retrying.
Volume does not exist5 NOT_FOUNDIndicates that a volume corresponding to the specified volume_id does not exist.Caller MUST verify that the volume_id is correct and that the volume is accessible and has not been deleted before retrying with exponential back off.
Volume is not replicated9 FAILED_PRECONDITIONIndicates that the volume corresponding to the specified volume_id could not be promoted due to failed precondition (for example mirroring is not enabled).Caller SHOULD ensure that mirroring is enabled.
Operation pending for volume10 ABORTEDIndicates that there is already an operation pending for the specified volume_id. In general the Cluster Orchestrator (CO) is responsible for ensuring that there is no more than one call "in-flight" per volume_id at a given time. However, in some circumstances, the CO MAY lose state (for example when the CO crashes and restarts), and MAY issue multiple calls simultaneously for the same volume_id. The Plugin, SHOULD handle this as gracefully as possible, and MAY return this error code to reject secondary calls.Caller SHOULD ensure that there are no other calls pending for the specified volume_id, and then retry with exponential back off.
Call not implemented12 UNIMPLEMENTEDThe invoked RPC is not implemented by the Plugin or disabled in the Plugin's current mode of operation.Caller MUST NOT retry.
Not authenticated16 UNAUTHENTICATEDThe invoked RPC does not carry secrets that are valid for authentication.Caller SHALL either fix the secrets provided in the RPC, or otherwise regalvanize said secrets such that they will pass authentication by the Plugin for the attempted RPC, after which point the caller MAY retry the attempted RPC.
Error is Unknown2 UNKNOWNIndicates that a unknown error is generatedCaller MUST study the logs before retrying

DemoteVolume

// DemoteVolumeRequest holds the required information to demote volume on local
// cluster.
message DemoteVolumeRequest {
  // The identifier for this volume, generated by the plugin during
  // CreateVolume CSI RPC call.
  // This field is REQUIRED.
  // This field MUST contain enough information to uniquely identify
  // this specific volume vs all other volumes supported by this plugin.
  // This field SHALL be used by the CO in subsequent calls to refer to
  // this volume.
  string volume_id = 1;
  // This field is optional.
  // Default value is false, force option to Demote the volume.
  bool force = 2;
  // Plugin specific parameters passed in as opaque key-value pairs.
  map<string, string> parameters = 3;
  // Secrets required by the plugin to complete the request.
  map<string, string> secrets = 4 [(replication_secret) = true];
}

// DemoteVolumeResponse holds the information to send when
// volume is successfully demoted.
message DemoteVolumeResponse{
}

Error Scheme

ConditiongRPC CodeDescriptionRecovery Behavior
Missing required field3 INVALID_ARGUMENTIndicates that a required field is missing from the request.Caller MUST fix the request by adding the missing required field before retrying.
Volume does not exist5 NOT_FOUNDIndicates that a volume corresponding to the specified volume_id does not exist.Caller MUST verify that the volume_id is correct and that the volume is accessible and has not been deleted before retrying with exponential back off.
Volume in not replicated9 FAILED_PRECONDITIONIndicates that the volume corresponding to the specified volume_id could not be demoted due to failed precondition (for example mirroring is not enabled).Caller SHOULD ensure that mirroring is enabled.
Operation pending for volume10 ABORTEDIndicates that there is already an operation pending for the specified volume_id. In general the Cluster Orchestrator (CO) is responsible for ensuring that there is no more than one call "in-flight" per volume_id at a given time. However, in some circumstances, the CO MAY lose state (for example when the CO crashes and restarts), and MAY issue multiple calls simultaneously for the same volume_id. The Plugin, SHOULD handle this as gracefully as possible, and MAY return this error code to reject secondary calls.Caller SHOULD ensure that there are no other calls pending for the specified volume_id, and then retry with exponential back off.
Call not implemented12 UNIMPLEMENTEDThe invoked RPC is not implemented by the Plugin or disabled in the Plugin's current mode of operation.Caller MUST NOT retry.
Not authenticated16 UNAUTHENTICATEDThe invoked RPC does not carry secrets that are valid for authentication.Caller SHALL either fix the secrets provided in the RPC, or otherwise regalvanize said secrets such that they will pass authentication by the Plugin for the attempted RPC, after which point the caller MAY retry the attempted RPC.
Error is Unknown2 UNKNOWNIndicates that a unknown error is generatedCaller MUST study the logs before retrying

ResyncVolume

// ResyncVolumeRequest holds the required information to resync volume.
message ResyncVolumeRequest {
  // The identifier for this volume, generated by the plugin during
  // CreateVolume CSI RPC call.
  // This field is REQUIRED.
  // This field MUST contain enough information to uniquely identify
  // this specific volume vs all other volumes supported by this plugin.
  // This field SHALL be used by the CO in subsequent calls to refer to
  // this volume.
  string volume_id = 1;
  // This field is optional.
  // Default value is false, force option to Resync the volume.
  bool force = 2;
  // Plugin specific parameters passed in as opaque key-value pairs.
  map<string, string> parameters = 3;
  // Secrets required by the plugin to complete the request.
  map<string, string> secrets = 4 [(replication_secret) = true];
}

// ResyncVolumeResponse holds the information to send when
// volume is successfully resynced.
message ResyncVolumeResponse{
  // Indicates that the volume is ready to use.
  // The default value is false.
  // This field is REQUIRED.
  bool ready = 1;
}

Error Scheme

ConditiongRPC CodeDescriptionRecovery Behavior
Missing required field3 INVALID_ARGUMENTIndicates that a required field is missing from the request.Caller MUST fix the request by adding the missing required field before retrying.
Volume does not exist5 NOT_FOUNDIndicates that a volume corresponding to the specified volume_id does not exist.Caller MUST verify that the volume_id is correct and that the volume is accessible and has not been deleted before retrying with exponential back off.
Volume is not replicated or image is not demoted9 FAILED_PRECONDITIONIndicates that the volume corresponding to the specified volume_id could not be resynced due to failed precondition (for example mirroring is not enabled or the image is not in the demoted state).Caller SHOULD ensure that mirroring is enabled and the image is demoted.
Operation pending for volume10 ABORTEDIndicates that there is already an operation pending for the specified volume_id. In general the Cluster Orchestrator (CO) is responsible for ensuring that there is no more than one call "in-flight" per volume_id at a given time. However, in some circumstances, the CO MAY lose state (for example when the CO crashes and restarts), and MAY issue multiple calls simultaneously for the same volume_id. The Plugin, SHOULD handle this as gracefully as possible, and MAY return this error code to reject secondary calls.Caller SHOULD ensure that there are no other calls pending for the specified volume_id, and then retry with exponential back off.
Call not implemented12 UNIMPLEMENTEDThe invoked RPC is not implemented by the Plugin or disabled in the Plugin's current mode of operation.Caller MUST NOT retry.
Not authenticated16 UNAUTHENTICATEDThe invoked RPC does not carry secrets that are valid for authentication.Caller SHALL either fix the secrets provided in the RPC, or otherwise regalvanize said secrets such that they will pass authentication by the Plugin for the attempted RPC, after which point the caller MAY retry the attempted RPC.
Error is Unknown2 UNKNOWNIndicates that a unknown error is generatedCaller MUST study the logs before retrying

# Packages

No description provided by the author