fix inference_step #38
base: master
Conversation
) -> ModelOutput:
    if inference is True:
        input_embeddings = self.image_prefix(images)
        asks = [self.tokenizer.encode('Describe the painting:')] * len(images)
I don't like the hardcoded instruction here, since not all images are paintings. For the purpose of this codepath I would prefer not to use any instruction.
I only wrote it this way because of the example :)
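For reference, a minimal sketch of the instruction-free variant being suggested. This is hypothetical: it assumes `self.image_prefix` and `self.generate` keep the signatures shown in the diff, and the `max_steps` value is illustrative.

```python
# Hypothetical sketch: the same inference branch without a hardcoded instruction.
# `self.image_prefix` and `self.generate` are the methods used in the diff above.
if inference is True:
    input_embeddings = self.image_prefix(images)
    # No `asks` tokens are prepended; generation is conditioned on the
    # image prefix alone.
    return self.generate(embeddings=input_embeddings, max_steps=30)
```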
        )
        return self.generate(
            embeddings=input_embeddings,
            max_steps=6,
I think 6 sampling steps might be a bit short in general.
I would suggest putting the hardcoded `asks`, `max_steps`, `temperature`, and `top_k` into `config.py` as a dictionary, as in the sketch below. Do you think that would be better?
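For instance, something along these lines. This is a sketch only; the dictionary name, keys, and values are illustrative and not part of the project's existing config schema.

```python
# config.py -- hypothetical dictionary of inference defaults, replacing the
# values hardcoded in forward(). Names and values are illustrative.
INFERENCE_DEFAULTS = {
    "ask": None,       # or e.g. 'Describe the painting:' to prepend an instruction
    "max_steps": 30,   # 6 is likely too short, per the comment above
    "temperature": 0.7,
    "top_k": 0,
}
```

`forward()` (or a separate inference helper) could then read these values instead of embedding them in the code.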
Hi, thanks for your efforts to contribute to the open-source MAGMA code. The codepath for inference during training should indeed be fixed, and your help is appreciated :-) In addition to the comments I added, I would in general prefer not to overload the `forward` function of the model too much. Maybe you could try to just change Line 85 in 4d01e51.

Thanks again and let me know what you think.

Best, Constantin
Hi Constantin, thank you for your advice. I was going to do the same. But in practice, I found that deepspeed's … For the above reasons, I had to add the code into …

Best, Changxu
Ok, I think you can just access the model by …

Best, Constantin
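For context: DeepSpeed's engine does expose the wrapped model as its `module` attribute, so an `inference_step` could call `generate` directly without routing `inference=True` through `forward`. A rough sketch under that assumption; everything here except `model_engine.module` (which is real DeepSpeed API) is hypothetical.

```python
import torch

# Hypothetical inference_step that bypasses the engine's forward() entirely.
def inference_step(model_engine, images):
    # DeepSpeedEngine wraps the model; .module is the underlying nn.Module.
    model = model_engine.module
    model.eval()
    with torch.no_grad():
        input_embeddings = model.image_prefix(images)
        captions = model.generate(embeddings=input_embeddings, max_steps=30)
    model.train()
    return captions
```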
`inference_step` passes `inference=True` to `model_engine`. However, the `__forward__` of the Magma model does not accept this parameter, which will cause an error during training. I fixed it by simply copying the inference code from `example_inference.py`.
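A sketch of the failure mode being described, assuming `forward` originally took no `inference` parameter; the argument names are illustrative.

```python
# DeepSpeed's engine passes positional and keyword arguments straight
# through to the wrapped module's forward(), so the unexpected kwarg raises:
outputs = model_engine(images, captions, inference=True)
# TypeError: forward() got an unexpected keyword argument 'inference'
```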