
Setup model routing config and plan routing to o1 #6189

Open
wants to merge 13 commits into base: main

Conversation

ryanhoangt
Contributor

@ryanhoangt ryanhoangt commented Jan 10, 2025

End-user friendly description of the problem this fixes or functionality that this introduces

  • Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

Give a summary of what the PR does, explaining any non-trivial design decisions

This PR:

  • Sets up config for model routing-related features.
  • Implements a prototype that routes to a reasoning model when appropriate. The criteria are based on this paper.
[Screenshot: 2025-01-10 at 17 22 40]
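The routing idea described above can be sketched roughly as follows. This is an illustrative sketch, not the PR's actual code: `judge_is_complex` and `pick_model` are hypothetical names, and the judge call is stubbed with a trivial heuristic so the sketch is runnable.

```python
# Hypothetical sketch of prompt-complexity routing. In the PR, the
# classification is done by sending the message to a judge model;
# here it is stubbed so the example runs without an API call.

JUDGE_MODEL = "gpt-4o"          # model used to classify the prompt
REASONING_MODEL = "o1-preview"  # model routed to when a plan is needed


def judge_is_complex(message: str) -> bool:
    """Decide whether the prompt needs a detailed step-by-step plan.

    Stubbed with a trivial heuristic; the real prototype asks
    JUDGE_MODEL with an analysis prompt instead.
    """
    return len(message.split()) > 50 or "plan" in message.lower()


def pick_model(message: str) -> str:
    """Route complex prompts to the reasoning model, others to the default."""
    return REASONING_MODEL if judge_is_complex(message) else "gpt-4o"
```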

Link of any specific issues this addresses

Collaborator

@xingyaoww xingyaoww left a comment


Awesome! This is a great start for model routing and LGTM!

Router that routes prompts judged by an LLM to be complex and to require a step-by-step plan.
"""

JUDGE_MODEL = 'gpt-4o'
Collaborator


It would be interesting to see if we can experiment with a cheaper model for that 🤔

* Translating high-level requirements into detailed implementation steps and ensuring consistency.

=== BEGIN USER MESSAGE ===
{message}
Collaborator


We could also experiment with sending o1 the last 5-10 action/observation pairs 🤔 in case some deep reasoning is required to figure out the error, etc.
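The suggestion above could be sketched as a small history-trimming helper. This is a sketch under assumptions: `Event` is a stand-in type (the project has its own event classes), and `last_n_pairs` is a hypothetical name.

```python
# Sketch of trimming the event history before routing to the reasoning
# model, keeping only the last n action/observation pairs.
from typing import NamedTuple


class Event(NamedTuple):
    kind: str     # "action" or "observation" (stand-in for real event types)
    content: str


def last_n_pairs(history: list[Event], n: int = 5) -> list[Event]:
    """Keep only the last n action/observation pairs (2*n events)."""
    return history[-2 * n:]
```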

)

# Replace the model with the reasoning model
kwargs['model'] = self.model_routing_config.reasoning_model
Collaborator


Is model enough, or also: custom provider, base URL?

Collaborator


Could we design the reasoning model not as part of an LLM instance, but as a second LLM instance in the agent?

[model_routing]

# The reasoning model to use for plan generation
reasoning_model = "o1-preview-2024-09-12"
Collaborator


Suggested change:
- reasoning_model = "o1-preview-2024-09-12"
+ [llm.reasoning_model]
+ model = "o1-preview-2024-09-12"
+ ...

Collaborator

@enyst enyst left a comment


I'm so happy to see this, thank you! I do think we are missing some minimal framework to experiment with reasoning models.

About the way to choose another model:
We already have the ability to choose, configure, and use an arbitrary model, for example in evals: we can write the model configuration in toml, in a custom named LLM config section, [llm.o1], load it with a utility function, and instantiate an LLM from it.

We can use that here. Names are user-defined, and we can, if we want, set in stone a particular name for the reasoning model, e.g. [llm.reasoning_model], or [llm.oh_reasoning_model], or [llm.blueberry] (or strawberry for that matter), whatever name.

@@ -0,0 +1,42 @@
ANALYZE_PROMPT = """Analyze this prompt to see if it requires a detailed plan generation.
Collaborator


I'm a bit curious: is this prompt handmade, is it LLM-generated, or what is its source?
