Adapted megatronlm server implementation for interacting with lm eval… #8

KlaudiaTH · 2023-08-03T08:56:12Z

… harness client

KlaudiaTH · 2023-08-03T09:29:53Z

megatron/tokenizer/tokenizer.py

@@ -43,7 +43,8 @@ def build_tokenizer(args):
    elif args.tokenizer_type == "OpenGPTX-PretrainedHFTokenizer":
        tokenizer = PretrainedHFTokenizer.instantiate_from_file_or_name(model_file_or_name=args.tokenizer_model)
    elif args.tokenizer_type == "OpenGPTX-SPTokenizer":
-        tokenizer = SPTokenizer.instantiate_from_file_or_name(model_file_or_name=args.tokenizer_model)
+        #tokenizer = SPTokenizer.instantiate_from_file_or_name(model_file_or_name=args.tokenizer_model)
+        tokenizer = _GPTXSentencePieceTokenizer(args.tokenizer_model)


ToDo: The _GPTXSentencePieceTokenizer is used for inference.

KlaudiaTH · 2023-08-04T12:49:25Z

megatron/text_generation_server.py

@@ -164,48 +179,60 @@ def put(self):
            if len(prompts) > 1:
                return "When doing beam_search, batch size must be 1"

-        stop_token=50256
+        stop_token = 3


Setting this to a constant 3 is wrong. I haven't tested generation yet, but I guess this should be the ID of the EOD token, so we need to get it from the tokenizer.

…on_lmeval_server

github-actions · 2024-01-05T18:20:55Z

Marking as stale. No activity in 60 days.

KlaudiaTH and others added 3 commits August 3, 2023 10:39

Adapted megatronlm server implementation for interacting with lm eval…

3527e6e

… harness client

Merge branch 'main' into megatron_lmeval_server

62c0def

Removed some comments from text_generation_server.py

630a38f

KlaudiaTH commented Aug 3, 2023

View reviewed changes

Minor correction

0d7f8fd

KlaudiaTH commented Aug 4, 2023

View reviewed changes

Michael Fromm and others added 24 commits August 4, 2023 15:57

integrated first methods of hf tokenizer

7a536b8

added tokenizer

db0c036

bugfix

685be55

retrieve eod id from tokenizer

12948f7

bugfix 2

44d1bbb

bugfix 3

e6e6b75

bugfix 4

31ebade

bugfix 4

5773be8

_HFTokenizer typo

1cfe037

added functions

2dd938d

integrated pretrained hf tokenizer

abe9d7a

Add metadata query

c14fefe

bugfix PretrainedHFTokenizer

fa6c3fb

bugfix

0530610

Merge remote-tracking branch 'origin/add-gptx-tokenizers' into megatr…

0c8461d

…on_lmeval_server

MegatronLM server API adaption. Example sh files.

f42ded1

Adaptations for greedy until generation; minor fixes

341f53a

API and SP tokenizer adaptions for handling continuations

5bba337

Server: Don't return padding tokens

10d7fe8

Corrected is_max_logprobs slicing

ac09fe4

Added option for padding to seq_len during tokenization and generation

526ec2a

Minor fix

22aa758

Corrected monolingual bpe sp 32k example

d704b30

Server: Add argument for specifying HTTP port

ab63e91

KlaudiaTH added 2 commits November 3, 2023 21:40

Merge branch 'main' into megatron_lmeval_server

c0cb866

training.py: Import vision modules only when needed

5b214f9

github-actions bot added the stale label Jan 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adapted megatronlm server implementation for interacting with lm eval… #8

Adapted megatronlm server implementation for interacting with lm eval… #8

KlaudiaTH commented Aug 3, 2023

KlaudiaTH Aug 3, 2023 •

edited

Loading

KlaudiaTH Aug 4, 2023

github-actions bot commented Jan 5, 2024

Adapted megatronlm server implementation for interacting with lm eval… #8

Are you sure you want to change the base?

Adapted megatronlm server implementation for interacting with lm eval… #8

Conversation

KlaudiaTH commented Aug 3, 2023

KlaudiaTH Aug 3, 2023 • edited Loading

Choose a reason for hiding this comment

KlaudiaTH Aug 4, 2023

Choose a reason for hiding this comment

github-actions bot commented Jan 5, 2024

KlaudiaTH Aug 3, 2023 •

edited

Loading