Generation never starts: "context is null" #1

Open
sharpy66 opened this issue Mar 28, 2023 · 42 comments

@sharpy66

sharpy66 commented Mar 28, 2023

Load model
Type hello
Press send
Open log
Log says:
[isolate 08:13:02] llama loaded
[isolate 08:13:02] main found: true
[isolate 08:13:02] trying main
[isolate 08:13:02] trying main DONE Instance of 'llama_context_params'
[isolate 08:13:02] context is null

Generation never starts

Samsung Z Flip4
8 GB of RAM
Snapdragon 8+ Gen 1
The demo shows a OnePlus device, so it's probably an issue with Samsung phones. I'll check on a different device later.

@andsofine

Same here.

@GeorvityLabs

Same issue here. @maxime-guerin-biprep @ThibautLEAUX, any solution to this problem?
I'm using a device with a MediaTek Dimensity 1200 and 8 GB of RAM.

@maxime-guerin-biprep
Contributor

maxime-guerin-biprep commented Mar 28, 2023

Did you use release 1.1 or 2.0?
Did you get the permission pop-up?
Which version of Android do you have?
Which model did you use?

@sharpy66
Author

Newest release, no permission pop-up, Android 13, One UI 5.1, LLaMA 7B.

@GeorvityLabs

Did you use release 1.1 or 2.0?
Did you get the permission pop-up?
Which version of Android do you have?
Which model did you use?

Release 2.0.
The permission pop-up did appear; I clicked "Open file".
I used a 4-bit quantized llama7b-ggml model.

PSX_20230329_082233.jpg

@sharpy66
Author

Did you use release 1.1 or 2.0?
Did you get the permission pop-up?
Which version of Android do you have?
Which model did you use?

Release 2.0.
The permission pop-up did appear; I clicked "Open file".
I used a 4-bit quantized llama7b-ggml model.

PSX_20230329_082233.jpg

Same problem, same steps.

@maxime-guerin-biprep
Contributor

Okay, we will check it today.

@maxime-guerin-biprep
Contributor

Hello, we changed the targetSdkVersion to 31; it should work now.
We also added more logging around model loading.

https://github.com/Bip-Rep/sherpa/releases/tag/2.0.1
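
If you want to double-check which targetSdkVersion an APK was actually built with, something like the following should show it (assuming the Android build-tools' aapt is on your PATH; the APK filename is just a placeholder):

# prints a line like targetSdkVersion:'31'
aapt dump badging sherpa-release.apk | grep targetSdkVersion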

@andsofine

https://github.com/Bip-Rep/sherpa/releases/tag/2.0.1

Now it crashes after some time when I enter something (maybe because of the 6 GB of RAM).

Also, opening the model is very slow. It would be great to have a loading progress indicator.

Device: Pixel 4 xl
OS: GrapheneOS A13 (TP1A.221005.002.B2)

@maxime-guerin-biprep
Contributor

We only succeeded in running it on an 8 GB device, so that might be why it crashes for you.
Do you have another device with 8 GB?

@GeorvityLabs

We only succeeded in running it on an 8 GB device, so that might be why it crashes for you.
Do you have another device with 8 GB?

@maxime-guerin-biprep
I'm getting the following error:

Screenshot_2023-03-30-01-00-56-22_44e2a0c996a41454cdb53ed5692ca44d.jpg

@maxime-guerin-biprep
Contributor

We got this error when we loaded an Alpaca model instead of a LLaMA one.
Are you using a LLaMA or Alpaca model?

@GeorvityLabs

GeorvityLabs commented Mar 29, 2023

@maxime-guerin-biprep
I tried LLaMA models from here and here, then I tried Alpaca, and now I've tried a GPT4All-trained model.
All gave similar errors.

Could you explain how you went about converting and obtaining the model, or link the exact model you have been using? If you provide a temporary link, I'll download it and try it out.
Is there any way this could also get Alpaca support?

@maxime-guerin-biprep
Contributor

We used Dalai, a Node.js implementation; here is a link.

@maxime-guerin-biprep
Contributor

It seems there are two model formats. You can go to the llama repo; under "Using GPT4All" there is a Python script to convert models.

@GeorvityLabs

@maxime-guerin-biprep can you link the ggml-model.bin file here temporarily, if possible?
Dalai is having some issues on my local machine.

@GeorvityLabs

It seems there are two model formats. You can go to the llama repo; under "Using GPT4All" there is a Python script to convert models.

Should using the GPT4All-to-ggml conversion script to generate the .bin file work?

@maxime-guerin-biprep
Contributor

I'm downloading GPT4All right now; I will try to convert it later.

@maxime-guerin-biprep
Contributor

So I just tried the script convert-unversioned-ggml-to-ggml.py from the llama repo and succeeded in using an old Alpaca model, using this file to convert it with this command:
python3 convert-unversioned-ggml-to-ggml.py /path/to/bin tokenizer.model

@maxime-guerin-biprep
Contributor

I just tried the script convert-gpt4all-to-ggml.py and it worked.

@GeorvityLabs

@maxime-guerin-biprep could you upload the model .bin files to Hugging Face temporarily so that I can test them at my end?

@GeorvityLabs

@maxime-guerin-biprep I wrote this Colab for conversion, but it doesn't work as intended. Could you take a look and see if there are any issues: https://colab.research.google.com/drive/1F7ITFw7MAqEsYUN7ce7sG6rN-eAd8mnd?usp=sharing

@maxime-guerin-biprep
Contributor

You need to put the .bin in a folder and pass the folder as the first argument. Afterwards you will have a .bin and a .orig; use the .bin.
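
For example, something along these lines (the folder and file names are only placeholders, and tokenizer.model is the LLaMA tokenizer file):

# put the old .bin alone in its own folder and pass that folder as the first argument
mkdir old-model
mv ggml-alpaca-7b-q4.bin old-model/
python3 convert-unversioned-ggml-to-ggml.py old-model tokenizer.model
# afterwards the folder should contain the converted .bin plus the .orig backup; use the .bin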

@sharpy66
Author

sharpy66 commented Mar 29, 2023

After updating my models to get rid of the bothersome "invalid model files" error, I now seem to get close, but the app just crashes after a moment. The last log line is "trying main DONE Instance of 'llama_context_params'".

I'm going to try using a different script to update my models; I'll let you know if that works.

@GeorvityLabs

You need to put the .bin in a folder and pass the folder as the first argument. Afterwards you will have a .bin and a .orig; use the .bin.

@maxime-guerin-biprep the Alpaca model now works.
But it hallucinates, and the generation keeps going and never stops.

Can you add a "Stop generation" button to the UI?

Do you have any idea why the output keeps going, @maxime-guerin-biprep? It keeps going and then crashes.

PSX_20230330_090229.jpg

@ThibautLEAUX
Collaborator

@GeorvityLabs the default preprompt is far from perfect; it was just a simple one. We hope people will write better ones so that it hallucinates less.
We can try to add a stop-generation button.
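
For anyone experimenting with better preprompts, here is a rough sketch of what a preprompt plus reverse prompt looks like in plain llama.cpp terms (the model path and wording are only placeholders):

# -p sets the preprompt; -r makes generation pause and hand control back
# to the user whenever the model emits "User:", instead of rambling on
./main -m ./models/ggml-alpaca-7b-q4.bin -i \
  -p "Transcript of a chat between a curious user and a concise, factual assistant.
User:" \
  -r "User:"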

@GeorvityLabs

GeorvityLabs commented Mar 30, 2023

@ThibautLEAUX yeah, it would be great to add a stop-generation button so that we could manually stop it when it starts to hallucinate.
Is there any way to algorithmically detect when the model starts hallucinating, so we could cut off the generation at that point?
@maxime-guerin-biprep

@GeorvityLabs

@ThibautLEAUX @maxime-guerin-biprep I was also curious whether GPT4All is working for you.
I tried converting the .bin model from the GPT4All repo using the llama.cpp conversion script, but it doesn't seem to run; the app sort of crashes.
Does it work on your end?
Also, it would be great to have a reset option, as I mentioned before.

@maxime-guerin-biprep
Contributor

Did you use convert-gpt4all-to-ggml.py, or the same script as for Alpaca?
With convert-gpt4all-to-ggml.py you have to pass the file itself, not the folder.
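
In other words, roughly (the filenames are only placeholders):

# the model file itself is the first argument, not its parent folder
python3 convert-gpt4all-to-ggml.py ./gpt4all-lora-quantized.bin ./tokenizer.model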

@GeorvityLabs

GeorvityLabs commented Mar 31, 2023

@maxime-guerin-biprep
It works fine now; I had used the other script first.
I hope you add a reset button so that we can stop the model when it starts hallucinating.

@maxime-guerin-biprep
Contributor

We will also add an option to do the same as the instruct mode in llama.cpp's main.
With it there will be no need for a preprompt or reverse prompt for the Alpaca and GPT4All models.
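
Roughly the command-line equivalent in llama.cpp (the model path is only a placeholder):

# --instruct wraps each user message in the Alpaca-style instruction template,
# so no hand-written preprompt or reverse prompt is needed
./main -m ./models/ggml-alpaca-7b-q4.bin --instruct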

@GeorvityLabs

@maxime-guerin-biprep that is good to hear! Looking forward to the update.

@GeorvityLabs

GeorvityLabs commented Apr 1, 2023

@maxime-guerin-biprep
I just noticed this with the GPT4All model.
When I ask the question:
Who is Obama's wife?
It gives me the right answer, but then it appends another question about Michelle Obama afterwards.

Do you know why the model doesn't stop right after giving the answer?

Any ideas on how to fix this issue?
I only asked one question, but it appends another question and keeps generating content related to the topic.

PSX_20230401_220925.jpg

@Wingie

Wingie commented Apr 2, 2023

Screenshot_20230402-012418
Hmm, which model versions are supported?

@GeorvityLabs

@maxime-guerin-biprep @ThibautLEAUX any updates on the stop-generation button, reset-chat button, etc.?

@maxime-guerin-biprep
Contributor

We will try to do it this week.

@GeorvityLabs

@maxime-guerin-biprep great, looking forward to testing it out

@ThibautLEAUX
Collaborator

Screenshot_20230402-012418 Hmm, which model versions are supported?

We just released an update, so it now works with recent models.

@maxime-guerin-biprep
Contributor

@GeorvityLabs we released the stop button

@GeorvityLabs

@GeorvityLabs we released the stop button

Cool, I'll do some testing and see how it works.

danemadsen referenced this issue in Mobile-Artificial-Intelligence/maid Oct 17, 2023