Skip to content
This repository has been archived by the owner on Aug 14, 2022. It is now read-only.

Latest commit

 

History

History
213 lines (160 loc) · 9.19 KB

README.markdown

File metadata and controls

213 lines (160 loc) · 9.19 KB

Mezeo Vendor Storage Module Developer Information

Vendor Storage Modules provide a means to extend the Mezeo system to provide connectivity with unique types of storage. All Vendor Storage Modules are written in Python (targeting Python 2.6) and must extend Mezeo's StorageModule base class. Once implemented, the module needs only to be added to the system's Python Path to be made available.

Vendor Storage Modules may also be used to extend existing Storage Modules with additional capabilities like encryption or deduplication. By implementing the same Storage Module interface, data streams may be intercepted and pre/post-processed.

Support Materials

A setuptools build file (setup.py and associated setup.cfg) is included with this package which provides an example of building a Vendor Storage Module. For deployment, Mezeo uses setuptools' egg files containing only compiled code. Obviously, this requires a recent version of setuptools to be installed. A suitable egg file may be generated executing the following:

$> python setup.py bdist_egg

for more information on setuptools please see: http://pypi.python.org/pypi/setuptools/

Any egg files produced for deployment should contain only the Vendor Storage Module and related resources. Please do not include the included modules module nor the sample simple module. The setup.py file included has been pre-configured to exclude these modules.

Example tests covering the Simple Storage Example (described below) are also included. Typically Mezeo executes these tests using nose. If nose has been installed, you can run the tests with the following command:

$> python setup.py nosetests

or by running nose directly when located in the root directory of the project:

$> nosetests

Note that if coverage has not also been installed, you may see warnings related to this in the test output.

For more information on coverage please see: http://pypi.python.org/pypi/coverage/

Simple Storage Example

Included with this document is a sample Vendor Storage Module called simple. It may be used as a reference when implementing a module.

There are several things to note:

  1. Each Vendor Storage Module should be implemented in its own Python module. Related python modules may be included into a single module.

  2. Each module needs to import the StorageModule base class (described below), this module is imported as:

    from modules import StorageModule

  3. Any other modules required by the Vendor Storage Module should also be imported.

  4. Once a Vendor Storage Module has been loaded in the system, an instance of the module is instantiated once per server process (of which there may be many). These processes tend to have a rather long lifespan. Care should be taken in the Vendor Storage Module implementation to be respectful of system resources as they are unlikely to be freed (ever).

The StorageModule base class

Each Vendor Storage Module is required to extend the modules.StorageModule base class. The StorageModule class defines seven methods that must be implemented for the module to work with the system. Several methods defined by this interface accept a context parameter. This value is used to provide a set of function pointers for utility functions that may be used by the Vendor Storage Module to interact with other aspects of the Mezeo System. A Vendor Storage Module is not required to use this value. The set of functions provided through this parameter is documented in the head documentation of the StorageModule class.

Module Initialization:

__init__(self, **kwargs)

A Vendor Storage Module may take any number of parameters on initialization. These parameters may be used by the module implementation to initialize the instance of the module with instance specific information such as IP address for the storage, storage path, etc. These arguments are specified by an administrator when the Vendor Storage Module is configured in the system.

Determining if Data already exists:

available(self, context, module_data)

The Mezeo System provides a way to optimize handling of data that may be replicated by some other sub-system. To determine if the data associated with the specified module_data exists at a location, this method is called. If the method returns True then the Mezeo System will not attempt to replicate the object data to the location where the available call was made. If the Vendor Storage Module returns False the Mezeo System will use this as a trigger to do data replication internally.

Associating Data with an Object:

put(self, context, stream)

The put method is used to store a data stream using the Vendor Storage Module. The Vendor Storage Module is free to do whatever it likes with the data stream.

The put method MUST return module_data which will be passed to the other methods in the module_data parameter. This allows the Vendor Storage Module to provide context information for an object. Typical use of this value by a Vendor Storage Module is to store an Object ID or path information usually encoded in a JSON structure. The value returned from this method must be converted to a string.

Retrieving Data from an Object:

get_read_stream(self, context, module_data)

The get_read_stream method provides a means by which the Mezeo System may read the contents of a stored object. The stream returned by this method must implement read, close and seek methods.

Destroying Object Data:

delete(self, context, module_data)

The delete method is called by the Mezeo System when an object is to be destroyed and no longer accessed. If successful, the module_data provided to this method will never again be used.

Storage Information:

statistics(self, context)

The statistics method is used by the Mezeo System to provide system administrators with information about the storage associated with the instance of the Vendor Storage Module. The Vendor Storage Module is free to return any JSON formatted object as the response to this method call. For conformance with other Vendor Storage Modules, a capacity value is defined (and documented in the StorageModule class) that is leveraged by various system UIs. Additional values may be added to the return value if they are significant to the Vendor Storage Module.

Storage Capabilities:

capabilities(self, context)

The capabilities method is used by the Mezeo System to provide system administrators and the system itself with information about the capabilities of storage associated with the instance of the Vendor Storage Module. The Vendor Storage Module is free to return any JSON formatted object as the response to this method call.

The values returned from this method form the basis of a CDMI Capability object which will be created for the module.

For example a vendor module that supports AES 256 CBC encryption would return a capability of:

"cdmi_encryption": ["AES_CBC_256"]

Installation and Configuration of the Storage Module:

In this example, a new storage module named "newstore" will be created.

Create a source directory ($SRC) for development of the new module, and copy the setup.py and setup.cfg files supplied by the SDK to $SRC. Also create a README file with any useful information concerning the new module.

Create the python package for the new module by creating a new directory in $SRC called "newstore", and place the python source code for the module in this directory, along with an "init.py" file (which may be empty). In the $SRC directory, run:

python2.6 setup.py bdist_egg

The new egg file containing "newstore" will be found in $SRC/dist - copy this file to each of the nodes in the MezeoCloud cluster. The egg file may be placed in any convenient location on the node. Add the full path name of the egg file (path and file name) to the 'python-path' argument in the WSGI-Daemon line in each of the configuration files in /opt/mezeo/conf/apache.d, and then restart apache on each of the nodes.

The MezeoCloud Provisioning Guide contains examples of adding, updating and deleting storage module ("vendor module") information to the system. A curl example for adding "newstore":

curl -u administrator -v -X PUT --data-binary
'{"module": "newstore.newstore.NewStorage",
"label: "New Storage Module",
"args": {"arg1": "arg1 value", "arg2": "arg2 value", "arg3": "arg3 value"} }'
https://cloud.example.com/cdmi/system_configuration/vendor_modules/newstore

Note: the "args" dictionary contains the keyword and values of the arguments used to initialize the new storage module - see 'Module Initialization' above.

This curl command should return a '204 No Content' response.

After this step is completed, new users may be provisioned with "policy" set to "newstore". Data for these users will then be stored in and accessed from "newstore".