Remove small labels: Unclear prompt #111

tischi · 2024-09-04T06:11:14Z

Here the prompt is:

Takes a label_image and removes all objects that are smaller than a given size_threshold.

While this may be just precise enough, I am not sure whether we should be a bit more specific.

I just tried the following with ChatGPT:

Q:
write a python function that takes an image as an input and computes the sum of all pixels
^D

A:

  Certainly! Below is a Python function that takes an image as input and
  computes the sum of all its pixel values. This function uses the PIL
  (Pillow) library to handle image operations.

    from PIL import Image
    import numpy as np

    def sum_of_pixels(image_path):
        # Open the image using PIL
        image = Image.open(image_path)

        # Convert the image to a numpy array
        image_array = np.array(image)

        # Compute the sum of all pixel values
        total_sum = np.sum(image_array)

        return total_sum

    # Example usage
    image_path = 'path_to_your_image.jpg'
    total_sum = sum_of_pixels(image_path)
    print("Sum of all pixels:", total_sum)

So, saying that the "input is an image" could be interpreted either as a path to an image or as a numpy array (and maybe even something else).

I am wondering whether we should make our prompts more specific, like:

This function takes a numpy array image as an input and returns ...
This function takes a path to an image as an input and returns ...


haesleinhuepf · 2024-09-04T06:19:54Z

Interesting! That's an issue I barely hit in practice, but I can confirm your observation. ChatGPT produced pretty much the same code on my side.

That means we have to update many test cases...

How about doing this on a separate branch, and comparing the performance on a single model (old versus new docstrings) just for documentation purposes?
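A minimal sketch of what such a comparison might boil down to, assuming each run yields one pass/fail flag per test case. The function name `compare_runs` and the dictionary layout are hypothetical and not taken from the benchmark's code; how the flags are produced is left out entirely.

```python
def compare_runs(old_results, new_results):
    """Compare two runs of the same model on the same test cases.

    Both arguments are assumed to map a test-case name to True (passed)
    or False (failed): one run with the old docstrings, one with the new.
    """
    # Only compare test cases that exist in both runs.
    shared = sorted(set(old_results) & set(new_results))
    if not shared:
        raise ValueError("The two runs have no test cases in common.")

    old_rate = sum(old_results[name] for name in shared) / len(shared)
    new_rate = sum(new_results[name] for name in shared) / len(shared)

    # Cases whose outcome changed between the two docstring versions.
    fixed = [n for n in shared if not old_results[n] and new_results[n]]
    broken = [n for n in shared if old_results[n] and not new_results[n]]

    return {
        "old_pass_rate": old_rate,
        "new_pass_rate": new_rate,
        "fixed": fixed,
        "broken": broken,
    }
```

Listing the changed cases next to the two pass rates makes it easier to see whether a rewording helped the cases it was meant to help or merely shifted failures elsewhere.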

tischi · 2024-09-04T06:50:53Z

> How about doing this on a separate branch, and comparing the performance on a single model (old versus new docstrings) just for documentation purposes?

Sounds good to me! But, who does it? Do you think one of the other contributors could be motivated?

And the devil's in the detail here:

Q:
Write a python function that takes an intensity_image as an input and returns the sum of all pixels.
^D

A:

  Certainly! Below is a Python function that takes an intensity image
  (assumed to be in the form of a 2D NumPy array) as an input and returns the
  sum of all its pixel values:

    import numpy as np

    def sum_of_pixels(intensity_image):
        """
        Calculate the sum of all pixel values in the intensity image.

        Parameters:
        intensity_image (numpy.ndarray): A 2D array representing the intensity image.

        Returns:
        int/float: The sum of all pixel values.
        """
        return np.sum(intensity_image)

Instead of image I wrote intensity_image, which apparently, and I think rightly so, suggests to the LLM that this is a variable that already contains image data.

This could also be one way forward: recommend using prompts with meaningful variable names. For example:

Takes an image_path as an input and returns image_width, image_height.

Or

Takes an image_numpy_array as an input and returns image_width, image_height.

Basically, whenever needed specify the type in the variable name.
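As a rough illustration of why the distinction matters, here is a sketch of the two functions those prompts might lead to. The function names are hypothetical, the array version assumes the first two axes are rows and columns, and the path version assumes Pillow is installed.

```python
import numpy as np


def size_from_image_numpy_array(image_numpy_array):
    """Return (image_width, image_height) of an image given as a numpy array.

    Assumes the array is laid out as (rows, columns[, channels]),
    so shape[0] is the height and shape[1] is the width.
    """
    height, width = np.asarray(image_numpy_array).shape[:2]
    return width, height


def size_from_image_path(image_path):
    """Return (image_width, image_height) of an image stored on disk."""
    # Imported here so the array-based function works without Pillow.
    from PIL import Image

    with Image.open(image_path) as image:
        # PIL reports size as (width, height), the reverse of numpy's shape.
        return image.size
```

The two versions differ not only in their input type: the axis order is reversed between a numpy shape and a PIL size, which is one more thing a generated solution can get wrong when the prompt leaves the input type open.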

I am not sure what's best practice here...

haesleinhuepf · 2024-09-04T07:46:05Z

> Basically, whenever needed specify the type in the variable name.

I would still try to keep the text close to how humans speak. "A numpy image" or "A numpy-array image" seems ok. I'm also using label_image in quite a few cases where I thought it was ok too. This seems a bit too extreme to me: "Takes an image_numpy_array as an input and returns image_width, image_height."

haesleinhuepf · 2024-09-04T07:46:41Z

> Sounds good to me! But, who does it?

I can do it. It takes longer to explain this to someone than to do it.

tischi · 2024-09-04T09:40:15Z

All right, so would you then, for consistency, change label_image to "numpy label image"?

haesleinhuepf · 2024-09-04T12:01:56Z

Yes, or something like "label image as numpy array" to be a bit more human-readable
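For illustration, a docstring reworded along these lines, together with one possible implementation. This is only a sketch under stated assumptions: the exact wording is not the one adopted in the repository, labels are assumed to be non-negative integers with 0 as background, and "smaller than" is read as strictly fewer pixels than size_threshold.

```python
import numpy as np


def remove_small_labels(label_image, size_threshold):
    """
    Takes a label image as numpy array and removes all objects that are
    smaller than a given size_threshold (in pixels). Returns the filtered
    label image as numpy array.
    """
    label_image = np.asarray(label_image)

    # Pixel count per label value; index 0 is the background.
    sizes = np.bincount(label_image.ravel())

    # Mark labels below the threshold, but never remove the background.
    too_small = sizes < size_threshold
    too_small[0] = False

    # Work on a copy so the caller's array stays untouched.
    result = label_image.copy()
    result[too_small[label_image]] = 0
    return result
```

With the input type spelled out in the docstring, the remaining open points (background value, strict versus inclusive threshold, whether labels are renumbered) are the ones a test case would still have to pin down.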

tischi assigned haesleinhuepf Sep 4, 2024

tischi mentioned this issue Sep 4, 2024

extend explanation for prompts in new test cases #107

Open

9 tasks

This was referenced Sep 7, 2024

Prompt refinement (discussion about if this should be merged or not) #118

Merged

Should we represent images using default numpy.asarray()? #115

Open
