
Introduction to OpenVINO™ Model Server

This notebook demonstrates how to deploy a model server and request predictions from a client application.

OpenVINO Model Server (OVMS) is a high-performance system for serving models. Implemented in C++ for scalability and optimized for deployment on Intel architectures, the model server uses the same architecture and API as TensorFlow Serving and KServe while applying OpenVINO for inference execution. The inference service is provided via gRPC or a REST API, making it easy to deploy new algorithms and run AI experiments.

(Figure: OVMS high-level architecture)
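As a taste of the REST API, here is a minimal sketch of a prediction request using OVMS's TensorFlow Serving-compatible endpoint. The model name `resnet`, the port `8000`, and the input values are assumptions for illustration; the actual names and input shapes depend on the model you serve.

```python
# A minimal sketch of a REST prediction request, assuming a model named
# "resnet" (hypothetical) is served with --rest_port 8000 on localhost.
# OVMS exposes the TensorFlow Serving-compatible REST API under /v1/models/.
import requests

# One input instance; the shape and values depend on the actual model.
payload = {"instances": [[0.0, 1.0, 2.0, 3.0]]}

response = requests.post(
    "http://localhost:8000/v1/models/resnet:predict",
    json=payload,
    timeout=10,
)
response.raise_for_status()
print(response.json()["predictions"])
```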

Notebook Contents

The notebook covers the following steps (a combined sketch of the repository layout and container startup follows this list):

  • Prepare Docker
  • Prepare a Model Repository
  • Start the Model Server Container
  • Prepare the Example Client Components
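The sketch below combines the model-repository and container-startup steps. OVMS expects each model version in a numbered subdirectory containing the OpenVINO IR files (`model.xml` and `model.bin`); the model name `resnet` and the port numbers are assumptions for illustration.

```python
# A minimal sketch, assuming Docker is installed and a model repository at
# ./models/resnet/1/ holding model.xml and model.bin (hypothetical model name).
import subprocess
from pathlib import Path

model_dir = Path("models/resnet/1").resolve()
assert (model_dir / "model.xml").exists(), "place the OpenVINO IR files here"

subprocess.run(
    [
        "docker", "run", "-d", "--rm",
        "-v", f"{model_dir.parents[1]}:/models",  # mount the repository root
        "-p", "9000:9000", "-p", "8000:8000",     # gRPC and REST ports
        "openvino/model_server:latest",
        "--model_path", "/models/resnet",
        "--model_name", "resnet",
        "--port", "9000",
        "--rest_port", "8000",
    ],
    check=True,
)
```

Once the container is running, the client components only need the model name and the gRPC or REST port to send inference requests, as in the request example above.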

Installation Instructions

If you have not installed all required dependencies, follow the Installation Guide.