Xilinx® Vitis™ AI is a development stack for AI inference on Xilinx hardware platforms, including both edge devices and Alveo cards.
It consists of optimized IP, tools, libraries, models, and example designs. It is designed with high efficiency and ease of use in mind, unleashing the full potential of AI acceleration on Xilinx FPGA and ACAP.
Vitis AI is composed of the following key components:
- AI Model Zoo - A comprehensive set of pre-optimized models that are ready to deploy on Xilinx devices.
- AI Optimizer - An optional model optimizer that can prune a model by up to 90%. It is separately available with commercial licenses.
- AI Quantizer - A powerful quantizer that supports model quantization, calibration, and fine tuning.
- AI Compiler - Compiles the quantized model to a high-efficient instruction set and data flow.
- AI Profiler - Perform an in-depth analysis of the efficiency and utilization of AI inference implementation.
- AI Library - Offers high-level yet optimized C++ APIs for AI applications from edge to cloud.
- DPU - Efficient and scalable IP cores can be customized to meet the needs for many different applications.
- For more details on the different DPUs available, refer to DPU Naming.
Learn More: Vitis AI Overview
- Release Notes
- AI Model Zoo added 14 new models, including BERT-based NLP, Vision Transformer (ViT), Optical Character Recognition (OCR), Simultaneous Localization and Mapping (SLAM), and more Once-for-All (OFA) models
- Added 38 base & optimized models for AMD EPYC server processors
- AI Quantizer added model inspector, now supports TensorFlow 2.8 and Pytorch 1.10
- Whole Graph Optimizer (WeGO) supports Pytorch 1.x and TensorFlow 2.x
- Deep Learning Processing Unit (DPU) for Versal® ACAP supports multiple compute units (CU), new Arithmetic Logic Unit (ALU) engine, Depthwise convolution and more operators supported by the DPUs on VCK5000 and Alveo™ data center accelerator cards
- Inference server supports ZenDNN as backend on AMD EPYC™ server processors
- New examples added to Whole Application Acceleration (WAA) for VCK5000 Versal development card and Zynq® UltraScale+™ evaluation kits
Two options are available for installing the containers with the Vitis AI tools and resources.
- Pre-built containers on Docker Hub: xilinx/vitis-ai
- Build containers locally with Docker recipes: Docker Recipes
-
Install Docker - if Docker not installed on your machine yet
-
At least 100GB of disk space for the disk partition running Docker
-
Clone the Vitis-AI repository to obtain the examples, reference code, and scripts.
git clone --recurse-submodules https://github.com/Xilinx/Vitis-AI cd Vitis-AI
Note: The following commands are for the latest version of Vitis AI. For details and history click Run Docker Container
Download the latest Vitis AI Docker with the following command. This container runs on CPU.
docker pull xilinx/vitis-ai-cpu:latest
To run the docker, use command:
./docker_run.sh xilinx/vitis-ai-cpu:latest
There are two types of docker recipes provided - CPU recipe and GPU recipe. If you have a compatible nVidia graphics card with CUDA support, you could use GPU recipe; otherwise you could use CPU recipe.
CPU Docker
Use below commands to build the CPU docker:
cd docker
./docker_build_cpu.sh
To run the CPU docker, use command:
../docker_run.sh xilinx/vitis-ai-cpu:latest
GPU Docker
Use below commands to build the GPU docker:
cd docker
./docker_build_gpu.sh
To run the GPU docker, use command:
../docker_run.sh xilinx/vitis-ai-gpu:latest
Please use the file ./docker_run.sh as a reference for the docker launching scripts, you could make necessary modification to it according to your needs.
You can install Anaconda packages in a conda environment this way:
Vitis-AI /workspace > sudo conda install -n vitis-ai-caffe https://www.xilinx.com/bin/public/openDownload?filename=unilog-1.3.2-h7b12538_35.tar.bz2
For a downloaded file:
sudo conda install -n vitis-ai-caffe ./<conda_package>.tar.bz2
X11 Support for Running Vitis AI Docker with Alveo
If you are running Vitis AI docker with Alveo card and want to use X11 support for graphics (for example, some demo applications in VART and Vitis-AI-Library for Alveo need to display images or video), please add following line into the docker_run_params variable definition in docker_run.sh script:
-e DISPLAY=$DISPLAY -v /tmp/.X11-unix:/tmp/.X11-unix -v $HOME/.Xauthority:/tmp/.Xauthority \
And after the docker starts up, run following command lines:
cp /tmp/.Xauthority ~/
sudo chown vitis-ai-user:vitis-ai-group ~/.Xauthority
Please note before running this script, please make sure either you have local X11 server running if you are using Windows based ssh terminal to connect to remote server, or you have run xhost + command at a command terminal if you are using Linux with Desktop. Also if you are using ssh to connect to the remote server, remember to enable X11 Forwarding option either with Windows ssh tools setting or with -X options in ssh command line.
Previous releases of Vitis AI are marked with a git tag.
Previously released Vitis AI CPU Docker images are available from Docker Hub.
Version | Github Link | Checkout Git Version | Docker Command |
---|---|---|---|
1.4.1 | https://github.com/Xilinx/Vitis-AI/releases/tag/v1.4.1 | git clone https://github.com/Xilinx/Vitis-AI.git git checkout tags/v1.4.1 -b v1.4.1 |
docker pull xilinx/vitis-ai-cpu:1.4.1.978 |
Vitis AI offers a unified set of high-level C++/Python programming APIs to run AI applications across edge-to-cloud platforms, including DPU for Alveo, and DPU for Zynq Ultrascale+ MPSoC and Zynq-7000. It brings the benefits to easily port AI applications from cloud to edge and vice versa. 10 samples in VART Samples are available to help you get familiar with the unfied programming APIs.
ID | Example Name | Models | Framework | Notes |
---|---|---|---|---|
1 | resnet50 | ResNet50 | Caffe | Image classification with VART C++ APIs. |
2 | resnet50_pt | ResNet50 | Pytorch | Image classification with VART extension C++ APIs. |
3 | resnet50_ext | ResNet50 | Caffe | Image classification with VART extension C++ APIs. |
4 | resnet50_mt_py | ResNet50 | TensorFlow | Multi-threading image classification with VART Python APIs. |
5 | inception_v1_mt_py | Inception-v1 | TensorFlow | Multi-threading image classification with VART Python APIs. |
6 | pose_detection | SSD, Pose detection | Caffe | Pose detection with VART C++ APIs. |
7 | video_analysis | SSD | Caffe | Traffic detection with VART C++ APIs. |
8 | adas_detection | YOLO-v3 | Caffe | ADAS detection with VART C++ APIs. |
9 | segmentation | FPN | Caffe | Semantic segmentation with VART C++ APIs. |
10 | squeezenet_pytorch | Squeezenet | Pytorch | Image classification with VART C++ APIs. |
For more information, please refer to Vitis AI User Guide
The AMD/Xilinx Inference Server offers an easy-to-use inference deployment solution for Vitis-AI and other models targeting CPUs and FPGAs. It can be deployed as a server to accept client requests over REST, gRPC or websockets. Alternatively, its native C++ API can be used to create custom compiled applications. The inference server is also compatible with KServe for deployment on a Kubernetes cluster. The server is under active development to add new features like GPU support and improving integration with KServe.