
Add depth estimation pipeline #389

Merged: 20 commits merged into main on Nov 20, 2023

Conversation

xenova (Collaborator) commented Nov 13, 2023

This PR adds support for the depth-estimation pipeline with the DPT and GLPN models.

Closes #350

Example usage:

import { pipeline } from '@xenova/transformers';

let url = 'https://huggingface.co/datasets/Xenova/transformers.js-docs/resolve/main/cats.jpg';
let depth_estimator = await pipeline('depth-estimation', 'Xenova/dpt-hybrid-midas');
let out = await depth_estimator(url);
// {
//   predicted_depth: Tensor {
//     dims: [ 384, 384 ],
//     type: 'float32',
//     data: Float32Array(147456) [ 542.859130859375, 545.2833862304688, 546.1649169921875, ... ],
//     size: 147456
//   },
//   depth: RawImage {
//     data: Uint8Array(307200) [ 86, 86, 86, ... ],
//     width: 640,
//     height: 480,
//     channels: 1
//   }
// }
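
For browser use, the single-channel depth map in out.depth can be drawn to a canvas by expanding it to RGBA ImageData. Below is a minimal sketch (not part of this PR), assuming the output structure shown above and a <canvas id="output"> element on the page; only standard Canvas APIs are used.

// Minimal sketch: render the single-channel depth map onto a canvas.
// Assumes `out` is the pipeline result shown above and the page contains
// a <canvas id="output"> element (both assumptions, for illustration only).
const { data, width, height } = out.depth;   // Uint8Array, 1 channel
const canvas = document.getElementById('output');
canvas.width = width;
canvas.height = height;
const ctx = canvas.getContext('2d');
const imageData = ctx.createImageData(width, height);
for (let i = 0; i < width * height; ++i) {
  const v = data[i];                         // depth value in [0, 255]
  imageData.data[4 * i + 0] = v;             // R
  imageData.data[4 * i + 1] = v;             // G
  imageData.data[4 * i + 2] = v;             // B
  imageData.data[4 * i + 3] = 255;           // A (fully opaque)
}
ctx.putImageData(imageData, 0, 0);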

Input:

[input image: cats.jpg from the example above]

Output:

  1. Python (baseline): [image: depth map]
     (Code used: https://huggingface.co/Intel/dpt-hybrid-midas#how-to-use)
  2. JavaScript (quantized): [image: depth_quantized]
  3. JavaScript (unquantized): [image: depth_unquantized]

xenova merged commit 83dfa47 into main on Nov 20, 2023 (4 checks passed)
josephrocca (Contributor) commented:
Xenova you need to STOP. enough is Enough. "i'm xenova and along with the ORT Web and Optimum teams i'm going to get literally all the <1gb sota ML models quantized and running in the browser, and make it really easy for web devs to use"-- NO. This is not Natural. Web devs are meant to struggle with op support on a tflite model for 2 weeks before realising that they really can't get around the TF Select op requirements, and then steel themselves for creating a custom tflite build, but then find out that the build tooling required for adding TF Select ops is Google-internal, and then try to port the model to tfjs instead, but end up with weird errors, which they eventually fix by breaking the model apart into multiple models and implementing some operations in JS to glue the model back together, with one component actually being executed via the tflite runtime because it has support for a particular op that tfjs didn't have, but then the results for one part of the tfjs model are different from the Python model, so they post an issue with a Colab that isolates and minimally replicates it, but that isn't replied to until 6 months later and that reply is actually just the stale bot.

josephrocca (Contributor) commented Nov 20, 2023

I should probably post this as a separate issue, but is there a way to get a trimmed-down version of transformers.js that only has the depth estimation pipeline? I.e. a "tree-shaking"-type feature to create the leanest possible build for cases where you only need certain parts of the library. (This isn't critical for me right now, to be clear, just curious, but it would be good if it were possible in the long term, since I imagine the library could get quite big as more models/features are added.)

xenova (Collaborator, Author) commented Nov 20, 2023

@josephrocca ❤️ and we're just getting started 😏

is there a way to get a trimmed down version of transformers.js that only has the depth estimation pipeline?

This has actually been requested quite a lot by others, for example for use cases where only the tokenizers are used (so onnxruntime-web doesn't need to be bundled). Please feel free to open a feature request; others might be able to contribute to the discussion.

For the most part, I think this responsibility should be handed to build tools like webpack or rollup, as they have much better and more advanced support for this. In the worst case, you can just fork the repo and remove everything you don't need :)
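
As an illustration of the build-tool approach, here is a minimal Rollup config sketch (the file name and entry point are hypothetical, not part of transformers.js): if the app imports only the pieces it uses, Rollup's tree-shaking can drop unused exports from the bundle.

// rollup.config.mjs -- illustrative sketch only; not part of this repo.
import { nodeResolve } from '@rollup/plugin-node-resolve';

export default {
  input: 'src/app.js',                 // hypothetical entry point that imports only what it needs
  output: { file: 'dist/bundle.js', format: 'esm' },
  plugins: [nodeResolve()],            // resolve bare imports like '@xenova/transformers'
  treeshake: 'recommended',            // Rollup's standard dead-code elimination preset
};

How much a bundler can actually remove depends on how the library's internal imports are structured, which is what the feature request mentioned above would address.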

Linked issue: [Feature request] Depth Estimation pipeline (#350)