405 abstract ModelType - inline ModelType #477
Conversation
```kotlin
(contextLength as? MaxIoContextLength.Combined)?.total
  ?: error(
    "accessing maxContextLength requires model's context length to be of type MaxIoContextLength.Combined"
  )
```
As a side note, again - this is supposed to be an intermediary solution. Usages of this field on an OAI model will still work as before. If this field is called on an instance of a (Google) model that doesn't use the combined context length, an exception is thrown. I found this to be the best way to handle it, in favor of not changing too much code in one PR.
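For reference, the split context-length type this comment refers to could be shaped roughly as below. This is a minimal sketch: the `Separate` variant and its field names are assumptions for illustration, not necessarily the PR's exact definitions.

```kotlin
// Sketch: a model's context window is either one combined budget for
// input and output tokens (OpenAI style), or two separate budgets
// (as on some Google models).
sealed interface MaxIoContextLength {
  data class Combined(val total: Int) : MaxIoContextLength
  data class Separate(val input: Int, val output: Int) : MaxIoContextLength
}
```

With a sealed hierarchy like this, the `as? MaxIoContextLength.Combined` cast in the snippet above is the narrowing step, and `error(...)` is the fail-fast path for models whose limits are not combined.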
Hey everyone 👋, I think this PR is ready for a first review. In the very infancy of this PR I discussed my ideas with @raulraja. This is what I came up with. Open for discussion :)
Hi Ron, we are getting rid of the GCP integrations in favor of having a single server compatible with the OpenAI YAML spec, based on the branch work in
Okay… 🥺 I was fighting so hard to get GCP to the same support level as OAI. 😪
The idea of this PR is outlined in #405.

In short, ModelType is currently designed too much around OpenAI. By inlining the properties and functionality of ModelType into the right interfaces of the LLM hierarchy, the hierarchy can be made more generic and type-safe.
What I did here:

- Introduced the `ModelID` value class (kinda as a replacement for `ModelType`), as this is the only feature all LLMs share (see the sketch after this list).
- Introduced `OpenAIModel` and `MaxIoContextLength`: OpenAI's models all have a shared limit for input and output tokens combined, while some of Google's models have separate limits (one for input and one for output).
- `tokensFromMessages` had to be moved out of `LLM`; additionally, `countTokens` and `truncateText` have moved to the same subclass of `LLM` as `modelType`.
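Taken together, the refactoring described in the list could look roughly like this in code. The exact declarations are assumptions based on the description above, not copied from the PR:

```kotlin
// Sketch: ModelID is a lightweight wrapper around the provider's model
// name - the one property every LLM shares.
@JvmInline
value class ModelID(val value: String)

interface LLM {
  val modelID: ModelID
}

// OpenAI-specific capabilities (combined context length, token counting,
// truncation) live in a subinterface instead of on the generic LLM.
interface OpenAIModel : LLM {
  val contextLength: MaxIoContextLength
  fun countTokens(text: String): Int
  fun truncateText(text: String, maxTokens: Int): String
}
```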
Besides the internal refactoring, tests and integrations had to be adapted. `TokenTextSplitter` now takes `EncodingType` rather than `ModelType` as a parameter (as it is still specific to OAI); see the sketch at the end of this description.

If this is approved, a subsequent PR building on these changes may be considered to abstract the encoding part for estimating the tokens. Part of this job is adapting internal APIs and integrations to the new, more abstract `contextLength` (of type `MaxIoContextLength`).
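To make the splitter change concrete, a call site would switch from naming a model to naming its tokenizer encoding, roughly as below. The exact signature, parameter names, and the `CL100K_BASE` choice are assumptions for illustration:

```kotlin
import com.xebia.functional.tokenizer.EncodingType

// Before (hypothetical): splitting was tied to an OpenAI model type.
// val splitter = TokenTextSplitter(ModelType.GPT_3_5_TURBO, chunkSize = 512, chunkOverlap = 64)

// After: only the tokenizer encoding matters for counting tokens.
val splitter = TokenTextSplitter(EncodingType.CL100K_BASE, chunkSize = 512, chunkOverlap = 64)
```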