Tests for Embedding op (ABANDONED) #902

kmilanovicTT · 2024-12-12T11:43:59Z

No description provided.

Part of the changes for initial version of executing training e2e on device. Changes to follow after this one will focus on the optimizer. With this change, the loss module and the model can be compiled separately and executed on the TT device (both forward and backward passes). Example: ```python tt_model = forge.compile(framework_model, sample_inputs=[inputs], training=True) loss_fn = CrossEntropyLoss(name="cross_entropy_loss") tt_loss = forge.compile(loss_fn, sample_inputs=loss_inputs, attach_to=tt_model, training=True) # Forward pass is executed as before # # The following will execute the whole backward pass from the loss outputs down to the model backward pass. tt_loss.backward() ``` Note: change to the API - main compile function is modified to accept `training` parameter. To indicate wheter to compile the module for training. - also, loss module is removed as an argument to the compile function For some reason, the gradient inputs in the previous stack were represented as `InputNodeType::Loss`. I have added a new type of input `Gradient`. The removal of the `Loss` input type is to be done as a follow up change since there are some uses of it spread around in the code base. Issue #829 To tie the gradients from the `loss.backward()` to the `module.backward()` we need to "attach" the model to the loss module when compiling the loss. This is done by passing the module to be attached into the compile function (`attach_to` parameter). Note: this doesn't work in general case, when there are multiple gradients being passed between modules - because currently we don't have a mechanism to know which gradient output to tie to which gradient input. Closes #177

- remove xfail mark for tests that are now passing - for flatten op tests change verification to pcc only

- pow - clamp - log - log1p

* Add test without gradient accumulation * Remove unused import and comments * Fix formatting * Switch to new forge compile API * Add bf16 instructions * Remove num of batches limit and increase batch size

The tt-forge-fe workflows accept a tt-mlir SHA override input, ensuring the specified SHA is used for building Docker images, compiling, and testing, instead of the committed version of tt-mlir. This will be used as part of integration testing to ensure that integrating a newer version of tt-metal doesn't break downstream projects. Users can manually trigger the workflow to run with a specific version of tt-mlir, and in CI tt-mlir will trigger this workflow to run with the uplift branch. Relates to #214

Move FailingRulesConverter to shared utils Extend FailingRulesConverter with kwargs support Specify list of params for failing rule

- Package was recently deleted gives error 404 - Update build-and-test.yml and model-analysis-weekly.yml, and need an apt-get update too.

kmilanovicTT · 2024-12-12T12:57:54Z

Abandoned - wrong branches selected

pilkicTT and others added 20 commits December 5, 2024 13:38

Fix bugs in opt model in sequence classification task (#831)

2ebb2d1

rearange imports (#838)

57ce5de

uplift mlir @ fa326aa (#845)

229cb46

- remove xfail mark for tests that are now passing - for flatten op tests change verification to pcc only

Add new unary operators (#770)

6082c61

- pow - clamp - log - log1p

Uplift third_party/tt-mlir to origin/main 2024-12-07 (#844)

582801f

added tanhOp to the init_lowering_handler_map (#851)

6bae375

Uplift third_party/tt-mlir to origin/main 2024-12-08 (#854)

65a879d

Uplift third_party/tt-mlir to origin/main 2024-12-09 (#857)

bb07c68

test for tanh op (#856)

1ffcec9

Increase the job execution timeout for model analysis weekly job (#855)

44cde6b

Fix conv2dtranspose asymmetric padding cases (#826)

40c14ad

Restructure unique op test generation (#843)

a09a541

Add test without gradient accumulation (#841)

10b90b4

* Add test without gradient accumulation * Remove unused import and comments * Fix formatting * Switch to new forge compile API * Add bf16 instructions * Remove num of batches limit and increase batch size

Uplift third_party/tt-mlir to origin/main 2024-12-11 (#861)

e382a1b

Add CPU test for Detr (object detection and segmentation) Model (#799)

7f1cbad

Share FailingRulesConverter (#860)

227eff0

Move FailingRulesConverter to shared utils Extend FailingRulesConverter with kwargs support Specify list of params for failing rule

Share binary test models (#864)

495fe84

CI Fix - Replace libgl1-mesa-glx with libgl1 libglx-mesa0 (#880)

648d399

- Package was recently deleted gives error 404 - Update build-and-test.yml and model-analysis-weekly.yml, and need an apt-get update too.

kmilanovicTT self-assigned this Dec 12, 2024

kmilanovicTT requested review from vbrkicTT and vobojevicTT December 12, 2024 11:44

kmilanovicTT added the Ops Support new op in tt-forge and tt-mlir label Dec 12, 2024

kmilanovicTT added this to the [FFE - E2E] Op testing (sweeps) milestone Dec 12, 2024

kmilanovicTT closed this Dec 12, 2024

kmilanovicTT reopened this Dec 12, 2024

kmilanovicTT changed the title ~~Tests for Embedding op~~ Tests for Embedding op (OBSOLETE) Dec 12, 2024

kmilanovicTT removed request for vbrkicTT and vobojevicTT December 12, 2024 12:56

kmilanovicTT changed the title ~~Tests for Embedding op (OBSOLETE)~~ Tests for Embedding op (ABANDONED) Dec 12, 2024

kmilanovicTT closed this Dec 12, 2024

kmilanovicTT added the invalid This doesn't seem right label Dec 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tests for Embedding op (ABANDONED) #902

Tests for Embedding op (ABANDONED) #902

kmilanovicTT commented Dec 12, 2024 •

edited

Loading

kmilanovicTT commented Dec 12, 2024

Tests for Embedding op (ABANDONED) #902

Tests for Embedding op (ABANDONED) #902

Conversation

kmilanovicTT commented Dec 12, 2024 • edited Loading

kmilanovicTT commented Dec 12, 2024

kmilanovicTT commented Dec 12, 2024 •

edited

Loading