Add support for conv_transpose2d operation #1540

Open

jserbedzijaTT wants to merge 1 commit into main from jserbedzija/add_conv_transpose2d_operation

+1,163 −1

Contributor

jserbedzijaTT commented Dec 9, 2024

closes (#1084)

jserbedzijaTT requested review from jnie-TT, kmabeeTT, AleksKnezevic, pilkicTT, sdjordjevicTT, svuckovicTT, mtopalovicTT, rpavlovicTT, nobradovictt, nsmithtt, mrakitaTT and tapspatel as code owners

December 9, 2024 16:38

jserbedzijaTT force-pushed the jserbedzija/add_conv_transpose2d_operation branch 2 times, most recently from 2b4f3e6 to b000588 Compare

December 9, 2024 16:39

nsmithtt requested a review from LPanosTT

December 9, 2024 17:18

Contributor

nsmithtt commented Dec 9, 2024

Adding @LPanosTT

github-actions bot reviewed


Contributor

github-actions bot left a comment

⚠️ Clang-Tidy found issue(s) with the introduced code (1/1)

lib/Dialect/TTNN/Transforms/TTNNLayout.cpp (outdated)

Contributor

LPanosTT commented Dec 9, 2024

Hey, thanks for adding this. I have one concern about this op, though. Some frontends reverse the order of the data in the kernel window for this op and some do not: PyTorch does (and thus TTNN does), while JAX does not. Note that ttir.convolution has a window_reversal boolean attribute as well. To model every frontend, we need either this attribute on conv_transpose2d in ttnn, or a ttir.reverse op so that we can consteval the window reversal away.

There is an issue to add window_reversal to ttnn: tenstorrent/tt-metal#15342
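To make the window reversal discussed above concrete, here is a minimal sketch of what reversing one kernel window means. The helper `reverseWindow` is hypothetical and not part of tt-mlir or TTNN; it assumes a single kH x kW window stored row-major and flips it along both spatial dimensions.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not a tt-mlir API): flips one kH x kW kernel window
// along both spatial dimensions. The window is assumed to be row-major.
std::vector<float> reverseWindow(const std::vector<float> &window,
                                 std::size_t kH, std::size_t kW) {
  assert(window.size() == kH * kW && "window size must equal kH * kW");
  std::vector<float> flipped(window.size());
  for (std::size_t i = 0; i < kH; ++i) {
    for (std::size_t j = 0; j < kW; ++j) {
      // Element (i, j) moves to the mirrored position (kH-1-i, kW-1-j).
      flipped[(kH - 1 - i) * kW + (kW - 1 - j)] = window[i * kW + j];
    }
  }
  return flipped;
}
```

Applying the flip twice returns the original window, which is why a constant kernel could in principle be reversed once ahead of time instead of carrying an attribute on the op.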

Contributor

LPanosTT commented Dec 9, 2024

Also, if you could add a pattern to lower ttir.convolution to ttir.conv_transpose2d, that would be great. Check out the StableHLO spec for convolution, which ttir.convolution is meant to mimic, to see how you can tell whether a given convolution is a transposed convolution.
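One possible starting point for such a check, sketched under assumptions: in XLA and StableHLO, dilating the input (the `lhs_dilation` attribute) is what expresses a transposed convolution, so an input dilation greater than 1 in any spatial dimension is a reasonable first signal. The predicate name below is hypothetical, the corresponding attribute name on ttir.convolution is not confirmed by this thread, and a real lowering pattern would likely need to check padding and window reversal as well.

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Hypothetical predicate (not taken from the PR): treats a convolution as a
// candidate transposed convolution when any spatial input dilation
// (StableHLO's lhs_dilation) is greater than 1. This is a heuristic only.
bool looksLikeTransposedConv(const std::vector<int64_t> &inputDilation) {
  return std::any_of(inputDilation.begin(), inputDilation.end(),
                     [](int64_t d) { return d > 1; });
}
```

For example, an input dilation of {2, 2} would be flagged, while {1, 1} (an ordinary convolution) would not.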

jserbedzijaTT force-pushed the jserbedzija/add_conv_transpose2d_operation branch from b000588 to 7b36217 Compare

December 20, 2024 12:58

jserbedzijaTT requested a review from azecevicTT as a code owner

December 20, 2024 12:58

jserbedzijaTT force-pushed the jserbedzija/add_conv_transpose2d_operation branch 2 times, most recently from 683fb3b to 4ddde58 Compare

December 23, 2024 11:06

jserbedzijaTT mentioned this pull request

[TTIR, TTNN] Decompose ttir.convolution to ttir.conv_transpose2d #1662

Open

Contributor Author

jserbedzijaTT commented Dec 24, 2024 (edited)

> Also, if you could add a pattern to lower ttir.convolution to ttir.conv_transpose2d, that would be great. Check out the StableHLO spec for convolution, which ttir.convolution is meant to mimic, to see how you can tell whether a given convolution is a transposed convolution.

I will merge this PR as is, but I have opened an issue to track the things you mentioned: #1662

jserbedzijaTT force-pushed the jserbedzija/add_conv_transpose2d_operation branch 2 times, most recently from 62b9199 to 2837812 Compare

December 24, 2024 10:32

mtopalovicTT requested a review from vladimirjovanovicTT

December 25, 2024 12:19

Contributor

mtopalovicTT commented Dec 25, 2024

@vladimirjovanovicTT to check

sdjordjevicTT reviewed


Contributor

sdjordjevicTT left a comment

Great change, Joco, thanks. A couple of comments inline.

lib/Conversion/TTIRToTTNN/TTIRToTTNN.cpp

    
    @@ -32,6 +33,8 @@
    using namespace mlir;
    using namespace mlir::tt;
    #include <iostream>

Contributor

sdjordjevicTT Dec 25, 2024

This is probably not needed?

lib/Conversion/TTIRToTTNN/TTIRToTTNN.cpp

    
    llvm::ArrayRef<std::int64_t> output_shape = outputTy.getShape();
    auto getLastDim = [](const RankedTensorType &ty, int offset = 1) {

Contributor

sdjordjevicTT Dec 25, 2024

Can you please substitute a real type for auto here?

lib/Conversion/TTIRToTTNN/TTIRToTTNN.cpp

Comment on lines +898 to +909

    
    auto inChannels = rewriter.getI32IntegerAttr(getLastDim(inputTy));
    auto outChannels = rewriter.getI32IntegerAttr(getLastDim(outputTy));
    auto batchSize = rewriter.getI32IntegerAttr(getLastDim(inputTy, 4));
    auto inputHeight = rewriter.getI32IntegerAttr(getLastDim(inputTy, 3));
    auto inputWidth = rewriter.getI32IntegerAttr(getLastDim(inputTy, 2));
    auto kernelSize = rewriter.getDenseI32ArrayAttr(
        {static_cast<int32_t>(getLastDim(kernelTy, 2)),
         static_cast<int32_t>(getLastDim(kernelTy, 1))});
    auto stride = rewriter.getDenseI32ArrayAttr(
        ttmlir::utils::parseAttrToTwoElementVector(adaptor.getStride()));
    auto padding = rewriter.getDenseI32ArrayAttr(
        ttmlir::utils::parseAttrToTwoElementVector(adaptor.getPaddingAttr()));
    auto outputPadding = rewriter.getDenseI32ArrayAttr(
        ttmlir::utils::parseAttrToTwoElementVector(
            adaptor.getOutputPaddingAttr()));
    auto dilation = rewriter.getDenseI32ArrayAttr(
        ttmlir::utils::parseAttrToTwoElementVector(adaptor.getDilationAttr()));
    auto groups = rewriter.getI32IntegerAttr(adaptor.getGroups());

Contributor

sdjordjevicTT Dec 25, 2024

Please substitute a real type for auto here as well.

Contributor Author

jserbedzijaTT Dec 25, 2024 (edited)

So, should we refrain from using auto completely? In this case I thought it made sense because on the RHS we already know which type we will get.

Contributor

sdjordjevicTT Dec 25, 2024

Yeah, you are right. I just saw a lot of autos, so I got it wrong :D

Instead of replacing auto in this example, can you rename the variables, for example inChannelsAttr, outChannelsAttr, etc., and likewise kernelSizeArrayAttr, strideArrayAttr, etc.?
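For context on the attributes gathered in this thread (kernel size, stride, padding, output padding, dilation), the usual output-size relation for a transposed 2D convolution, as documented for PyTorch's ConvTranspose2d, can be sketched per spatial dimension as below. The function name is hypothetical and this is not the verifier from the PR, whose exact checks are not shown here; it also assumes symmetric padding.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not from the PR): output extent of a transposed
// convolution along one spatial dimension, following the relation given in
// the PyTorch ConvTranspose2d documentation. Assumes symmetric padding.
int64_t convTranspose2dOutDim(int64_t in, int64_t kernel, int64_t stride,
                              int64_t padding, int64_t outputPadding,
                              int64_t dilation) {
  assert(in > 0 && kernel > 0 && stride > 0 && dilation > 0);
  assert(padding >= 0 && outputPadding >= 0);
  return (in - 1) * stride - 2 * padding + dilation * (kernel - 1) +
         outputPadding + 1;
}
```

With an input extent of 4, a 3-wide kernel, stride 2, padding 1, output padding 1, and dilation 1, this gives 8, i.e. the extent is doubled.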

lib/Conversion/TTIRToTTNN/TTIRToTTNN.cpp

Comment on lines +925 to +931

    
    // Using a tensor::EmptyOp so that the rewriter for EmptyOp can handle the
    // attribute determination
    auto convDPSOutput = rewriter.replaceOpWithNewOp<tensor::EmptyOp>(
        adaptor.getOutput().getDefiningOp(), flattenedOutputShape,
        outputTy.getElementType());
    // Must set the type to the output type to maintain the layout attributes
    convDPSOutput.getResult().setType(outputTy);
    ttnn::ConvTranspose2dOp new_conv = rewriter.create<ttnn::ConvTranspose2dOp>(
        op.getLoc(), outputTy, adaptor.getInput(), adaptor.getWeight(),
        adaptor.getBias(), convDPSOutput, device, inChannels, outChannels,
        batchSize, inputHeight, inputWidth, kernelSize, stride, padding,
        outputPadding, dilation, groups);

Contributor

sdjordjevicTT Dec 25, 2024

Please sync with @azecevicTT; he had in mind an API for creating a DPS op. Not sure if it is applicable here. :)

lib/Conversion/TTIRToTTNN/TTIRToTTNN.cpp

    
    @@ -871,6 +874,77 @@ class Conv2dOpConversionPattern : public OpConversionPattern<ttir::Conv2dOp> {
      }
    };
    class ConvTranspose2dOpConversionPattern

Contributor

sdjordjevicTT Dec 25, 2024

Please add a brief description of the conversion, as it isn't a 1-1 mapping to ttnn conv2d.

lib/Dialect/TTIR/IR/TTIROps.cpp

    
      return emitOpError("Batch size of input and output tensors must match");
    }
    auto checkBiggerThan = [&](llvm::SmallVector<int32_t, 2> &values,

Contributor

sdjordjevicTT Dec 25, 2024

Substitute auto here as well.

lib/Dialect/TTNN/IR/TTNNOps.cpp

+                  }
+                }
+                auto checkBiggerThan = [&](llvm::ArrayRef<int32_t> &values, const char *name,

Contributor

sdjordjevicTT Dec 25, 2024

Substitute auto here as well.

lib/Dialect/TTNN/Transforms/TTNNLayout.cpp

    
    @@ -287,7 +287,9 @@ class TTNNLayoutDPSOperandsRewriter
    // TTNN Conv2d moves input, weight, and bias from host to device
    // itself. Inserting the ToLayoutOp on these operands is thus problematic.
    if (mlir::isa<ttir::Conv2dOp>(op.getOperation()) && !isResult) {
    if (!isResult &&

Contributor

sdjordjevicTT Dec 25, 2024

Can you please rename isResult to isDPSResult? :)

lib/Target/TTNN/TTNNToFlatbuffer.cpp

Comment on lines +494 to +489

+                auto kernelSize = toFlatbuffer(cache, op.getKernelSize());
+                auto stride = toFlatbuffer(cache, op.getStride());
+                auto padding = toFlatbuffer(cache, op.getPadding());
+                auto outputPadding = toFlatbuffer(cache, op.getOutputPadding());
+                auto dilation = toFlatbuffer(cache, op.getDilation());

Contributor

sdjordjevicTT Dec 25, 2024

At least substitute real types for these autos.


          Add support for conv_transpose2d operation

b219b1f

jserbedzijaTT force-pushed the jserbedzija/add_conv_transpose2d_operation branch from 2837812 to b219b1f Compare

December 27, 2024 10:48


Reviewers

sdjordjevicTT left review comments

github-actions[bot] left review comments

jnie-TT: awaiting requested review (code owner)

kmabeeTT: awaiting requested review (code owner)

AleksKnezevic: awaiting requested review (code owner)

pilkicTT: awaiting requested review (code owner)

svuckovicTT: awaiting requested review (code owner)

mtopalovicTT: awaiting requested review (code owner)

rpavlovicTT: awaiting requested review

nobradovictt: awaiting requested review (code owner)

nsmithtt: awaiting requested review (code owner)

mrakitaTT: awaiting requested review (code owner)

tapspatel: awaiting requested review (code owner)

LPanosTT: awaiting requested review

azecevicTT: awaiting requested review (code owner)

vladimirjovanovicTT: awaiting requested review

At least 1 approving review is required to merge this pull request.

Labels

None yet