OpenVINO™ Notebooks at GitHub Pages
- Video generation with ZeroScope and OpenVINO
- Convert and Optimize YOLOv9 with OpenVINO™
- Convert and Optimize YOLOv8 real-time object detection with OpenVINO™
- YOLOv8 Oriented Bounding Boxes Object Detection with OpenVINO™
- Convert and Optimize YOLOv8 keypoint detection model with OpenVINO™
- Convert and Optimize YOLOv8 instance segmentation model with OpenVINO™
- Convert and Optimize YOLOv11 real-time object detection with OpenVINO™
- Convert and Optimize YOLOv11 keypoint detection model with OpenVINO™
- Convert and Optimize YOLOv11 instance segmentation model with OpenVINO™
- Convert and Optimize YOLOv10 with OpenVINO
- Video Subtitle Generation using Whisper and OpenVINO™
- Automatic speech recognition using Whisper and OpenVINO with Generate API
- Wav2Lip: Accurately Lip-syncing Videos and OpenVINO
- Image Generation with Tiny-SD and OpenVINO™
- Text to Image pipeline and OpenVINO with Generate API
- Line-level text detection with Surya
- Image to Video Generation with Stable Video Diffusion
- Stable Fast 3D Mesh Reconstruction and OpenVINO
- Image generation with Stable Diffusion XL and OpenVINO
- Image generation with Stable Diffusion v3 and OpenVINO
- Image generation with Torch.FX Stable Diffusion v3 and OpenVINO
- Text-to-Image Generation with Stable Diffusion v2 and OpenVINO™
- Stable Diffusion Text-to-Image Demo
- Stable Diffusion v2.1 using Optimum-Intel OpenVINO and multiple Intel Hardware
- Infinite Zoom Stable Diffusion v2 and OpenVINO™
- Stable Diffusion with KerasCV and OpenVINO
- Image Generation with Stable Diffusion and IP-Adapter
- Image generation with Stable Cascade and OpenVINO
- Sound Generation with Stable Audio Open and OpenVINO™
- Sound Generation with AudioLDM2 and OpenVINO™
- SoftVC VITS Singing Voice Conversion and OpenVINO™
- Object masks from prompts with SAM and OpenVINO
- Single step image generation using SDXL-turbo and OpenVINO
- Object masks from prompts with SAM2 and OpenVINO
- Object masks from prompts with SAM2 and OpenVINO for Images
- Visual-language assistant with Qwen2VL and OpenVINO
- Audio-language assistant with Qwen2Audio and OpenVINO
- Generate creative QR codes with ControlNet QR Code Monster and OpenVINO™
- Visual-language assistant with Pixtral and OpenVINO
- Text-to-image generation using PhotoMaker and OpenVINO
- Visual-language assistant with Phi3-Vision and OpenVINO
- Voice tone cloning with OpenVoice and OpenVINO
- Screen Parsing with OmniParser and OpenVINO
- Structure Extraction with NuExtract and OpenVINO
- Visual-language assistant with nanoLLaVA and OpenVINO
- Controllable Music Generation with MusicGen and OpenVINO
- Multi LoRA Image Generation
- Visual Content Search using MobileCLIP and OpenVINO
- Visual-language assistant with Llama-3.2-11B-Vision and OpenVINO
- Visual-language assistant with MiniCPM-V2 and OpenVINO
- Magika: AI powered fast and efficient file type identification using OpenVINO
- Create a RAG system using OpenVINO and LlamaIndex
- Create a RAG system using OpenVINO and LangChain
- LLM Instruction-following pipeline with OpenVINO
- Create an LLM-powered Chatbot using OpenVINO
- Create an LLM-powered Chatbot using OpenVINO Generate API
- Create a native Agent with OpenVINO
- Create ReAct Agent using OpenVINO and LangChain
- Create an Agentic RAG using OpenVINO and LlamaIndex
- Create Function-calling Agent using OpenVINO and Qwen-Agent
- Visual-language assistant with LLaVA Next and OpenVINO
- Visual-language assistant with LLaVA and Optimum Intel OpenVINO integration
- Visual-language assistant with LLaVA and OpenVINO Generative API
- Text-to-Image Generation with LCM LoRA and ControlNet Conditioning
- Image generation with Latent Consistency Model and OpenVINO
- Kosmos-2: Multimodal Large Language Model and OpenVINO
- Multimodal understanding and generation with Janus and OpenVINO
- Visual-language assistant with InternVL2 and OpenVINO
- Image Editing with InstructPix2Pix and OpenVINO
- InstantID: Zero-shot Identity-Preserving Generation using OpenVINO
- Image generation with HunyuanDIT and OpenVINO
- Object detection and masking from prompts with GroundedSAM (GroundingDINO + SAM) and OpenVINO
- Image generation with Flux.1 and OpenVINO
- Florence-2: Open Source Vision Foundation Model
- Frame interpolation using FILM and OpenVINO
- Object segmentations with FastSAM and OpenVINO
- Object segmentations with EfficientSAM and OpenVINO
- Animating Open-domain Images with DynamiCrafter and OpenVINO
- Automatic speech recognition using Distil-Whisper and OpenVINO
- Depth estimation with DepthAnything and OpenVINO
- Depth estimation with DepthAnythingV2 and OpenVINO
- Text-to-Image Generation with ControlNet Conditioning
- Zero-shot Image Classification with OpenAI CLIP and OpenVINO™
- Virtual Try-On with CatVTON and OpenVINO
- Visual Question Answering and Image Captioning using BLIP and OpenVINO
- Text-to-speech generation using Bark and OpenVINO
- Image-to-Video synthesis with AnimateAnyone and OpenVINO
- Quantize Speech Recognition Models with accuracy control using NNCF PTQ API
- Post-Training Quantization of PyTorch models with NNCF
- Optimize Preprocessing
- OpenVINO Tokenizers: Incorporate Text Processing Into OpenVINO Pipelines
- Convert models from ModelScope to OpenVINO
- Hello Model Server
- Quantize NLP models with Post-Training Quantization in NNCF
- Convert a JAX Model to OpenVINO™ IR
- Quantization of Image Classification Models
- 🤗 Hugging Face Model Hub with OpenVINO™
- Hello NPU
- Working with GPUs in OpenVINO™
- OpenVINO™ Model conversion
- Big Transfer Image Classification Model Quantization pipeline with NNCF
- Automatic Device Selection with OpenVINO™
- Asynchronous Inference with OpenVINO™
- Classification with ConvNeXt and OpenVINO
- Convert a Tensorflow Lite Model to OpenVINO™
- Convert a TensorFlow Object Detection Model to OpenVINO™
- Convert a TensorFlow Instance Segmentation Model to OpenVINO™
- Convert of TensorFlow Hub models to OpenVINO Intermediate Representation (IR)
- Convert a TensorFlow Model to OpenVINO™
- Line-level text detection with Surya
- Convert and Optimize YOLOv11 with OpenVINO™
- Convert a PyTorch Model to OpenVINO™ IR
- Convert a PyTorch Model to ONNX and OpenVINO™ IR
- Convert a PaddlePaddle Model to OpenVINO™ IR
- Voice tone cloning with OpenVoice and OpenVINO
- OpenVINO Tokenizers: Incorporate Text Processing Into OpenVINO Pipelines
- Object detection and masking from prompts with GroundedSAM (GroundingDINO + SAM) and OpenVINO
- Convert Detectron2 Models to OpenVINO™
- Quantize a Segmentation Model and Show Live Inference
- OpenVINO™ Model conversion
- OpenVINO™ Explainable AI Toolkit (3/3): Saliency map interpretation
- OpenVINO™ Explainable AI Toolkit (2/3): Deep Dive
- OpenVINO™ Explainable AI Toolkit (1/3): Basic
- Language-Visual Saliency with CLIP and OpenVINO™
- OpenVINO™ Runtime API Tutorial
- Hello Image Classification
- Hello Image Segmentation
- Hello Object Detection
- OpenVINO™ Explainable AI Toolkit (1/3): Basic
- Style Transfer with OpenVINO™
- Live Human Pose Estimation with OpenVINO™
- Person Tracking with OpenVINO™
- Person Counting System using YOLOV8 and OpenVINO™
- PaddleOCR with OpenVINO™
- Voice tone cloning with OpenVoice and OpenVINO
- Live Object Detection with OpenVINO™
- CLIP model with Jina CLIP and OpenVINO
- Object detection and masking from prompts with GroundedSAM (GroundingDINO + SAM) and OpenVINO
- Quantize a Segmentation Model and Show Live Inference
- Human Action Recognition with OpenVINO™
- Live 3D Human Pose Estimation with OpenVINO
- Video generation with ZeroScope and OpenVINO
- Convert and Optimize YOLOv9 with OpenVINO™
- Convert and Optimize YOLOv8 real-time object detection with OpenVINO™
- YOLOv8 Oriented Bounding Boxes Object Detection with OpenVINO™
- Convert and Optimize YOLOv8 keypoint detection model with OpenVINO™
- Convert and Optimize YOLOv8 instance segmentation model with OpenVINO™
- Convert and Optimize YOLOv11 real-time object detection with OpenVINO™
- Convert and Optimize YOLOv11 keypoint detection model with OpenVINO™
- Convert and Optimize YOLOv11 instance segmentation model with OpenVINO™
- Convert and Optimize YOLOv10 with OpenVINO
- Video Subtitle Generation using Whisper and OpenVINO™
- Automatic speech recognition using Whisper and OpenVINO with Generate API
- Wav2Lip: Accurately Lip-syncing Videos and OpenVINO
- Monodepth Estimation with OpenVINO
- Image Background Removal with U^2-Net and OpenVINO™
- Vehicle Detection And Recognition with OpenVINO™
- Image Generation with Tiny-SD and OpenVINO™
- Selfie Segmentation using TFLite and OpenVINO
- Text to Image pipeline and OpenVINO with Generate API
- Table Question Answering using TAPAS and OpenVINO™
- Line-level text detection with Surya
- Image to Video Generation with Stable Video Diffusion
- Stable Fast 3D Mesh Reconstruction and OpenVINO
- Image generation with Stable Diffusion XL and OpenVINO
- Image generation with Stable Diffusion v3 and OpenVINO
- Image generation with Torch.FX Stable Diffusion v3 and OpenVINO
- Text-to-Image Generation with Stable Diffusion v2 and OpenVINO™
- Stable Diffusion Text-to-Image Demo
- Stable Diffusion v2.1 using Optimum-Intel OpenVINO and multiple Intel Hardware
- Infinite Zoom Stable Diffusion v2 and OpenVINO™
- Stable Diffusion v2.1 using OpenVINO TorchDynamo backend
- Text-to-Image Generation with Stable Diffusion and OpenVINO™
- Stable Diffusion with KerasCV and OpenVINO
- Image Generation with Stable Diffusion and IP-Adapter
- Image generation with Stable Cascade and OpenVINO
- Sound Generation with Stable Audio Open and OpenVINO™
- Text Generation via Speculative Decoding using FastDraft and OpenVINO™
- Sound Generation with AudioLDM2 and OpenVINO™
- SoftVC VITS Singing Voice Conversion and OpenVINO™
- One Step Sketch to Image translation with pix2pix-turbo and OpenVINO
- Zero-shot Image Classification with SigLIP
- Object masks from prompts with SAM and OpenVINO
- Single step image generation using SDXL-turbo and OpenVINO
- Object masks from prompts with SAM2 and OpenVINO
- Object masks from prompts with SAM2 and OpenVINO for Images
- Text-to-Video retrieval with S3D MIL-NCE and OpenVINO
- Background removal with RMBG v1.4 and OpenVINO
- Text-to-Music generation using Riffusion and OpenVINO
- Visual-language assistant with Qwen2VL and OpenVINO
- Audio-language assistant with Qwen2Audio and OpenVINO
- Generate creative QR codes with ControlNet QR Code Monster and OpenVINO™
- Visual-language assistant with Pixtral and OpenVINO
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis with OpenVINO
- Document Visual Question Answering Using Pix2Struct and OpenVINO™
- Text-to-image generation using PhotoMaker and OpenVINO
- Visual-language assistant with Phi3-Vision and OpenVINO
- Text-to-speech (TTS) with Parler-TTS and OpenVINO
- Text-to-Speech synthesis using OuteTTS and OpenVINO
- Optical Character Recognition (OCR) with OpenVINO™
- Voice tone cloning with OpenVoice and OpenVINO
- Universal Segmentation with OneFormer and OpenVINO
- Screen Parsing with OmniParser and OpenVINO
- Structure Extraction with NuExtract and OpenVINO
- Visual-language assistant with nanoLLaVA and OpenVINO
- Named entity recognition with OpenVINO™
- Controllable Music Generation with MusicGen and OpenVINO
- Multi LoRA Image Generation
- Visual Content Search using MobileCLIP and OpenVINO
- MMS: Scaling Speech Technology to 1000+ languages with OpenVINO™
- Visual-language assistant with Llama-3.2-11B-Vision and OpenVINO
- Visual-language assistant with MiniCPM-V2 and OpenVINO
- Industrial Meter Reader
- Magika: AI powered fast and efficient file type identification using OpenVINO
- Create a RAG system using OpenVINO and LlamaIndex
- Create a RAG system using OpenVINO and LangChain
- LLM Instruction-following pipeline with OpenVINO
- Create an LLM-powered Chatbot using OpenVINO
- Create an LLM-powered Chatbot using OpenVINO Generate API
- Create a native Agent with OpenVINO
- Create ReAct Agent using OpenVINO and LangChain
- Create an Agentic RAG using OpenVINO and LlamaIndex
- Create Function-calling Agent using OpenVINO and Qwen-Agent
- Visual-language assistant with LLaVA Next and OpenVINO
- Visual-language assistant with LLaVA and Optimum Intel OpenVINO integration
- Visual-language assistant with LLaVA and OpenVINO Generative API
- Text-to-Image Generation with LCM LoRA and ControlNet Conditioning
- Image generation with Latent Consistency Model and OpenVINO
- Kosmos-2: Multimodal Large Language Model and OpenVINO
- OpenVINO optimizations for Knowledge graphs
- Multimodal understanding and generation with Janus and OpenVINO
- Visual-language assistant with InternVL2 and OpenVINO
- Image Editing with InstructPix2Pix and OpenVINO
- InstantID: Zero-shot Identity-Preserving Generation using OpenVINO
- Image generation with HunyuanDIT and OpenVINO
- Handwritten Chinese and Japanese OCR with OpenVINO™
- Object detection and masking from prompts with GroundedSAM (GroundingDINO + SAM) and OpenVINO
- Grammatical Error Correction with OpenVINO
- High-Quality Text-Free One-Shot Voice Conversion with FreeVC and OpenVINO™
- Image generation with Flux.1 and OpenVINO
- Florence-2: Open Source Vision Foundation Model
- Frame interpolation using FILM and OpenVINO
- Object segmentations with FastSAM and OpenVINO
- Audio compression with EnCodec and OpenVINO
- Object segmentations with EfficientSAM and OpenVINO
- Animating Open-domain Images with DynamiCrafter and OpenVINO
- Automatic speech recognition using Distil-Whisper and OpenVINO
- Depth estimation with DepthAnything and OpenVINO
- Depth estimation with DepthAnythingV2 and OpenVINO
- Colorize grayscale images using 🎨 DDColor and OpenVINO
- Cross-lingual Books Alignment with Transformers and OpenVINO™
- Text-to-Image Generation with ControlNet Conditioning
- Zero-shot Image Classification with OpenAI CLIP and OpenVINO™
- Language-Visual Saliency with CLIP and OpenVINO™
- Virtual Try-On with CatVTON and OpenVINO
- Visual Question Answering and Image Captioning using BLIP and OpenVINO
- Text-to-speech generation using Bark and OpenVINO
- Image-to-Video synthesis with AnimateAnyone and OpenVINO
- Part Segmentation of 3D Point Clouds with OpenVINO™
- Quantization Aware Training with NNCF, using TensorFlow Framework
- Quantization-Sparsity Aware Training with NNCF, using PyTorch framework
- Quantization Aware Training with NNCF, using PyTorch framework
- Quantization Aware Training with NNCF, using TensorFlow Framework
- SpeechBrain Emotion Recognition with OpenVINO
- Quantize Wav2Vec Speech Recognition Model using NNCF PTQ API
- Accelerate Inference of Sparse Transformer Models with OpenVINO™ and 4th Gen Intel® Xeon® Scalable Processors
- Convert and Optimize YOLOv11 with OpenVINO™
- Quantize Speech Recognition Models with accuracy control using NNCF PTQ API
- Quantization-Sparsity Aware Training with NNCF, using PyTorch framework
- Quantization Aware Training with NNCF, using PyTorch framework
- Post-Training Quantization of PyTorch models with NNCF
- Optimize Preprocessing
- Voice tone cloning with OpenVoice and OpenVINO
- OpenVINO Tokenizers: Incorporate Text Processing Into OpenVINO Pipelines
- Quantize NLP models with Post-Training Quantization in NNCF
- OpenVINO optimizations for Knowledge graphs
- Quantization of Image Classification Models
- Object detection and masking from prompts with GroundedSAM (GroundingDINO + SAM) and OpenVINO
- Quantize a Segmentation Model and Show Live Inference
- Big Transfer Image Classification Model Quantization pipeline with NNCF