Migrate GPT-2 to new tracer #875
Conversation
for i in range(1, gpt2.config.n_layer // decoders_per_rank):
    annotate_split_points(
        gpt2,
        {f'transformer.h.{i * decoders_per_rank}': PipeSplitWrapper.SplitPoint.BEGINNING},
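The loop above annotates a split point before every decoders_per_rank-th decoder block. As a toy sketch (plain Python, not PiPPy code; the function name and the example values are hypothetical), the FQNs it annotates would be:

```python
def split_point_fqns(n_layer, decoders_per_rank):
    """Mirror the loop in the diff: list the fully-qualified names of the
    submodules that receive a SplitPoint.BEGINNING annotation."""
    return [
        f"transformer.h.{i * decoders_per_rank}"
        for i in range(1, n_layer // decoders_per_rank)
    ]

# GPT-2 small has 12 decoder layers; with 3 decoders per rank the splits
# land before layers 3, 6 and 9, yielding 4 pipeline stages.
print(split_point_fqns(12, 3))
# → ['transformer.h.3', 'transformer.h.6', 'transformer.h.9']
```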
Does this mean the split point is right before/after this submodule (specified by FQN)? It's also not super clear to me what SplitPoint.BEGINNING means. Also curious: tracing will "flatten" the submodule into a bunch of aten ops, so there is no longer a concept of a submodule. So does this API find the first or last node of the submodule?
BEGINNING means right before; END would mean right after.
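To illustrate the "right before" semantics, here is a toy partitioner (plain Python, not PiPPy internals; the function and layer names are made up for illustration) that cuts an ordered list of layers right before each annotated name:

```python
def stages_from_splits(layers, split_before):
    """Partition an ordered list of layer names into pipeline stages,
    cutting right before each name in split_before (the BEGINNING
    semantics; END would cut right after instead)."""
    stages, current = [], []
    for name in layers:
        if name in split_before and current:
            stages.append(current)
            current = []
        current.append(name)
    stages.append(current)
    return stages

layers = [f"h.{i}" for i in range(6)]
print(stages_from_splits(layers, {"h.2", "h.4"}))
# → [['h.0', 'h.1'], ['h.2', 'h.3'], ['h.4', 'h.5']]
```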
Annotation occurs before tracing, so the module structure is still intact at that point.
How come there is no corresponding END?
Force-pushed from 3960481 to acafdfd
# Input configs
example_inputs = generate_inputs_for_model(
    model_class, gpt2, model_name, args.batch_size, args.device)
input_ids = example_inputs["input_ids"]
Is input_ids in this case just a single microbatch, or is it the entire minibatch?
Entire.
When PipelineStage actually runs, it splits the batch internally before feeding it to the scheduler.
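The internal splitting described above amounts to chunking the minibatch along the batch dimension. A minimal sketch of that idea (plain Python lists standing in for tensors; the function name is hypothetical, and the real splitting happens inside PipelineStage):

```python
def split_minibatch(minibatch, chunks):
    """Split a whole minibatch into `chunks` microbatches along the
    batch dimension, the way a pipeline runtime might before handing
    them to its scheduler."""
    per_chunk = (len(minibatch) + chunks - 1) // chunks
    return [minibatch[i:i + per_chunk]
            for i in range(0, len(minibatch), per_chunk)]

# A batch of 8 samples fed as 4 microbatches of 2:
print(split_minibatch(list(range(8)), 4))
# → [[0, 1], [2, 3], [4, 5], [6, 7]]
```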
Looks good, just a few questions.
Description
Migrated the GPT-2 example to work with the new tracer-based PiPPy.
examples/hf/hf_utils.py contains a utility to generate inputs for HuggingFace models.
Model architecture:
Run
Output
https://gist.github.com/kwen2501/b9ed6158d8d0dc90b16824aa6abd8d72