Add image-to-image task w/ Swin2SR (for super-resolution) #381

xenova · 2023-11-08T04:23:37Z

This PR adds support for image-to-image translation, starting with the Swin2SR family of models for super-resolution. See here for the list of already-converted models, including 2x and 4x upscalers.

Closes #138

Example usage

Pipeline API

Example code adapted from here.

import { pipeline } from '@xenova/transformers';

let url = 'https://huggingface.co/datasets/Xenova/transformers.js-docs/resolve/main/butterfly.jpg';
let upscaler = await pipeline('image-to-image', 'Xenova/swin2SR-classical-sr-x2-64');
let output = await upscaler(url);
// RawImage {
//   data: Uint8Array(786432) [ 41, 31, 24,  43, ... ],
//   width: 512,
//   height: 512,
//   channels: 3
// }

AutoClasses

Example code adapted from here.

import { AutoProcessor, Swin2SRForImageSuperResolution, RawImage } from '@xenova/transformers';

// Load processor and model
const model_id = 'Xenova/swin2SR-classical-sr-x2-64';
const processor = await AutoProcessor.from_pretrained(model_id);
const model = await Swin2SRForImageSuperResolution.from_pretrained(model_id);

// Prepare model inputs
const url = 'https://huggingface.co/datasets/Xenova/transformers.js-docs/resolve/main/butterfly.jpg';
const image = await RawImage.fromURL(url);
const inputs = await processor(image);

// Run model
const outputs = await model(inputs);

// Convert Tensor to RawImage
const output = outputs.reconstruction.squeeze().clamp_(0, 1).mul_(255).round_().to('uint8');
const outputImage = RawImage.fromTensor(output);
// RawImage {
//   data: Uint8Array(786432) [ 41, 31, 24, ...],
//   width: 512,
//   height: 512,
//   channels: 3
// }

Example output

input (256x256):
output w/ unquantized model (512x512):

note: produces the exact same output as the python implementation (within floating-point precision errors of course).
output w/ quantized model (512x512):
side-by-side (input vs. unquantized output):

xenova · 2023-11-08T04:26:04Z

cc @josephrocca :)

I also intend to replicate/showcase the results from their README.

xenova · 2023-11-08T04:52:00Z

Example using https://huggingface.co/Xenova/swin2SR-compressed-sr-x4-48:

import { pipeline } from '@xenova/transformers';

let url = 'https://huggingface.co/spaces/jjourney1125/swin2sr/resolve/main/testsets/real-inputs/shanghai.jpg';
let upscaler = await pipeline('image-to-image', 'Xenova/swin2SR-compressed-sr-x4-48');
let output = await upscaler(url);

Input:

Output:

josephrocca · 2023-11-08T09:37:50Z

Awesome!! Seems to take quite a while to load the model - about 40 seconds, not including the download. I'm guessing it's a similar problem to this: microsoft/onnxruntime#11217 since Netron also complains that there are lots of nodes, and takes a very long time to load.

The actual inference is about 40 seconds on 8 threads - not bad! WebGPU will get this to a very usable inference time. Exciting!

xenova added 17 commits November 7, 2023 17:52

Add Swin2SRImageProcessor

b140fc0

Add RawImage.fromTensor helper function

ab40fd3

Add clamp tensor function

9ac7313

Add support for .to data type conversion

da512ab

Add round tensor function

3716515

Add support for mul tensor function

e352d45

Fix image padding

935c459

Only perform padding if it will affect size

39f6d28

Create basic processors unit test suite

6d32e81

Add SamProcessor test case

a9adb6a

Move CONTENT_TYPE_MAP outside RawImage class

fe4f51b

Perform reflective padding for swin2sr models

8852eac

Add swin2sr models for image super-resolution

ea87f1f

Add listed support for Swin2SR models

8921e84

Add image-to-image pipeline

a7ba047

Add listed support for image-to-image task

eb25c5f

Add image-to-image unit tests

6d80ba2

xenova mentioned this pull request Nov 8, 2023

Swin2sr onnx huggingface/optimum#1492

Merged

xenova added 4 commits November 9, 2023 03:37

Add add tensor functions

739879c

Generalize pad_image helper function

dbf970f

Add more unit tests for image processors

26b7802

Fix typo

3c93f8e

xenova merged commit 73a99ba into main Nov 9, 2023
3 of 4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add image-to-image task w/ Swin2SR (for super-resolution) #381

Add image-to-image task w/ Swin2SR (for super-resolution) #381

xenova commented Nov 8, 2023 •

edited

Loading

xenova commented Nov 8, 2023

xenova commented Nov 8, 2023

josephrocca commented Nov 8, 2023

Add image-to-image task w/ Swin2SR (for super-resolution) #381

Add image-to-image task w/ Swin2SR (for super-resolution) #381

Conversation

xenova commented Nov 8, 2023 • edited Loading

Example usage

Pipeline API

AutoClasses

Example output

xenova commented Nov 8, 2023

xenova commented Nov 8, 2023

josephrocca commented Nov 8, 2023

xenova commented Nov 8, 2023 •

edited

Loading