
Fixes “The model produced invalid content” error when calling functions #3429

Draft · wants to merge 10 commits into base 0.2
Conversation

@davorrunje (Collaborator) commented Aug 27, 2024

Why are these changes needed?

This fixes an issue caused by stricter JSON parameter checking in function calling, as described here:

https://community.openai.com/t/error-the-model-produced-invalid-content/747511

This PR removes the 'name' parameter from messages in the OpenAI client. It also adds a parameter to the tool JSON specification that was previously missing.
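The name-removal described above could look roughly like this. This is an illustrative sketch only, not the PR's actual diff; `strip_name_field` is a hypothetical helper:

```python
# Illustrative sketch (not the PR's actual diff): remove the optional
# "name" field from each message before it is sent to the OpenAI API,
# since some models reject otherwise-valid requests when it is present.
def strip_name_field(messages):
    """Return copies of each message dict without the 'name' key."""
    return [{k: v for k, v in m.items() if k != "name"} for m in messages]

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "name": "Gregory", "content": "Call the weather tool."},
]
cleaned = strip_name_field(messages)
print(cleaned[1])  # same message, but without the 'name' key
```

Building new dicts rather than mutating in place keeps the caller's message history intact, which matters if the same list is reused for later turns.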

Related issue number

Closes #3247

Checks

@codecov-commenter commented Aug 27, 2024

Codecov Report

Attention: Patch coverage is 80.00000% with 2 lines in your changes missing coverage. Please review.

Please upload a report for BASE (0.2@5ad2677).

Files with missing lines | Patch % | Lines
autogen/oai/client.py    | 77.77%  | 2 Missing ⚠️
Additional details and impacted files
@@          Coverage Diff           @@
##             0.2    #3429   +/-   ##
======================================
  Coverage       ?   29.61%           
======================================
  Files          ?      117           
  Lines          ?    13022           
  Branches       ?     2469           
======================================
  Hits           ?     3856           
  Misses         ?     8819           
  Partials       ?      347           
Flag      | Coverage Δ
unittests | 29.59% <80.00%> (?)

Flags with carried forward coverage won't be shown.

@davorrunje changed the title from "Fixes “The model produced invalid content” error with function calling" to "Fixes “The model produced invalid content” error when calling functions" on Aug 27, 2024
@davorrunje davorrunje requested review from yenif and Hk669 August 27, 2024 12:03
@davorrunje davorrunje marked this pull request as draft August 27, 2024 12:19
@marklysze (Collaborator) commented:
@davorrunje, thanks for creating this - interesting that OpenAI are also removing name. This is likely to affect group chat with speaker selection. May need to incorporate the recent name transforms as a simpler integration for that (but that's for another discussion/time :) ).

I'll give it a test...

@davorrunje davorrunje marked this pull request as draft August 28, 2024 05:54
@marklysze (Collaborator) commented:

@davorrunje, thanks again for working on a fix for the exception.

Would you have an example that I could use to replicate the exception?



```python
@pytest.mark.skipif(skip, reason="openai>=1 not installed")
def test_chat_completion_after_tool_call():
```
@davorrunje (Collaborator, Author) replied:

@marklysze here is an example of a failing completion call.

@marklysze (Collaborator) replied Aug 28, 2024:

Thanks @davorrunje, is it possible to get the AutoGen code that would have generated this?

As an update: I ran the params through OpenAI's API (response = completions.create(**params)) and it ran through okay and returned a function call.

With my agent's termination check, it failed while checking the content for the termination keyword because content is None:

```python
is_termination_msg=lambda x: True if "FINISH" in x["content"] else False
```

The exception: argument of type 'NoneType' is not iterable

Updating my termination expression corrected that:

```python
is_termination_msg=lambda x: True if x["content"] and "FINISH" in x["content"] else False
```
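The None-guard can also be written as a small named predicate. A minimal sketch (the function name is illustrative, not AutoGen API):

```python
# Minimal sketch of the None-safe termination check discussed above.
# A pure tool-call response can arrive with content=None, and the `in`
# operator raises TypeError on None, so guard before checking.
def is_termination_msg(msg):
    content = msg.get("content")
    return bool(content) and "FINISH" in content

print(is_termination_msg({"content": "All done. FINISH"}))  # True
print(is_termination_msg({"content": None}))                # False
print(is_termination_msg({}))                               # False
```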

@davorrunje (Collaborator, Author) replied:

@marklysze I marked the PR as a draft because I am not sure yet which workaround is best. The issue seems to be on the OpenAI side, and it is more common with GPT-4o-mini than with older models. Maybe we should remove names only if we get an exception? Even changing the system message slightly sometimes helps. The list of possible workarounds (https://community.openai.com/t/bizarre-issue-preventing-response-from-gpt-4o-mini-the-model-produces-invalid-content/875432):

  • Remove tools and tool_choice args.
  • Slightly reduce the length of the prompt, even by a couple of words.
  • Add at least a second user message or an assistant message.
  • Change the name or remove the name parameter (but that didn’t always work, depending on the prompt). Some names work, others don’t. ‘Gregory’ worked, for example.
  • Change the message. Some work and some don’t. ‘Hello.’ didn’t work, but ‘Hello, friend.’ worked.
  • Change to gpt-4o or gpt-3.5-turbo.

@marklysze (Collaborator) replied:

Thanks @davorrunje, appreciate the detail and the note on the draft.

It is definitely tricky because removing the name is a considerable change that could affect people's existing code. At the very least, I'd suggest making it an option that defaults to the existing behaviour of leaving the name in.

From the list you provided, I think the third point would be the safest way to minimise the impact on the validity of the messages. In the LinkedIn post linked below, they added a message "DO NOT PRODUCE INVALID CONTENT" and that seemed to fix it! :)

LinkedIn Post on this being intermittent behaviour.

I would love to be able to replicate it, have you got any other examples that you can get to throw an exception?

I wonder if, rather than changing the messages up front, we run inference and, if it throws that specific exception, adjust the messages (perhaps with the third option above) and retry up to x times.
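The retry idea above could be sketched roughly as follows. This is a hypothetical illustration, not AutoGen's actual API; `create_fn` stands in for the real completions call, and the nudge message uses workaround 3 from the list:

```python
# Hypothetical sketch of the retry-on-exception idea. If the call fails
# with the "invalid content" error, append an extra user message
# (workaround 3 above) and retry a limited number of times.
def create_with_retries(create_fn, params, max_retries=3):
    last_exc = None
    for _ in range(max_retries):
        try:
            return create_fn(**params)
        except Exception as exc:
            if "invalid content" not in str(exc).lower():
                raise  # unrelated error: do not retry
            last_exc = exc
            # Adjust the messages before the next attempt.
            params = {
                **params,
                "messages": params["messages"]
                + [{"role": "user", "content": "Please continue."}],
            }
    raise last_exc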

@davorrunje (Collaborator, Author) replied:

Yes, I think we need to add a simple hack first and see how it goes. I see this error quite often when working with function calls and GPT-4o and GPT-4o-mini. I'll try adding a message as suggested first.

@marklysze (Collaborator) replied:

Okay, not sure if it's possible, but if you do get the exception and are able to capture params["messages"], it would be interesting to see whether it can be replicated.

See how you go with the additional message.

@lmcmahi commented Aug 29, 2024

Hi, I have the same "The model produced invalid content" error when calling functions. How can I apply this fix? Any ideas?

@marklysze (Collaborator) commented:

> Hi, I have the same "The model produced invalid content" error when calling functions. How can I apply this fix? Any ideas?

Hi @lmcmahi, would you be able to provide a code sample that produces this error? It would help in testing out viable fixes.

@zhwuwuwu commented:

Hi, I've encountered exactly the same error. Have you found a good solution?

@ekzhu (Collaborator) commented Oct 24, 2024

@davorrunje, would you say the proposed fix is still necessary after gpt-4o-2024-08-06?

Labels: 0.2 (issues related to the pre-0.4 codebase), awaiting-op-response (triaged or responded to; awaiting a reply from the original poster)
Projects: none yet
10 participants