Welcome to simple-local-rag Discussions! #1
Replies: 3 comments 1 reply
-
Thanks Bourke, that's an excellent video tutorial on RAG: comprehensive and informative. Just a couple of things. I tried it on my laptop, which doesn't have a GPU. The embedding model downloaded to the HF cache on the local machine and worked fine. However, the LLM (I changed it to gemma-2b-it) downloaded to the cache, but the code that runs the model did not work for me. It threw a "GemmaTokenizer not available" error: [ValueError: Tokenizer class GemmaTokenizer does not exist or is not currently imported.]
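A note on the GemmaTokenizer error: one common cause (an assumption here, since the comment doesn't state the installed version) is a transformers release that predates Gemma support, which as far as I know arrived in 4.38.0. A minimal sketch to check this is below. The helper names (`parse_version`, `supports_gemma`, `check_transformers`) are made up for illustration, and the version threshold is my assumption, not something confirmed in this thread.

```python
from importlib.metadata import PackageNotFoundError, version

# Assumption: Gemma classes first shipped in transformers 4.38.0.
MIN_GEMMA_VERSION = (4, 38, 0)


def parse_version(text):
    """Turn a version string like '4.38.0.dev0' into a tuple like (4, 38, 0)."""
    parts = []
    for piece in text.split(".")[:3]:
        digits = ""
        for ch in piece:
            if ch.isdigit():
                digits += ch
            else:
                break
        parts.append(int(digits or 0))
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def supports_gemma(installed):
    """Return True if the given transformers version is at least the assumed minimum."""
    return parse_version(installed) >= MIN_GEMMA_VERSION


def check_transformers():
    """Report whether the locally installed transformers looks new enough."""
    try:
        installed = version("transformers")
    except PackageNotFoundError:
        return "transformers is not installed"
    if supports_gemma(installed):
        return f"transformers {installed} should include GemmaTokenizer"
    return f"transformers {installed} looks too old; try: pip install -U transformers"


if __name__ == "__main__":
    print(check_transformers())
```

If the version turns out to be recent enough, the cause is something else (for example a missing tokenizer dependency), so treat this only as a first check.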
-
This is great, thanks! For my application, I also need to retrieve metadata with each retrieved vector (effectively a pointer to where that chunk came from, so that perplexity.ai-style results can be compiled). Any tips in that direction? Thanks again!
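One way to approach the metadata question is to store each embedding in the same record as its source information, so whatever the similarity search returns already carries the pointer. The sketch below uses plain Python and cosine similarity. The field names (`embedding`, `text`, `source`, `page`) and the toy vectors are hypothetical, not taken from the tutorial's code.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve(query_vec, records, k=2):
    """Return the top-k records by similarity, each with its metadata attached."""
    scored = [(cosine(query_vec, r["embedding"]), r) for r in records]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [
        {"score": s, "text": r["text"], "source": r["source"], "page": r["page"]}
        for s, r in scored[:k]
    ]


# Hypothetical toy data: each chunk keeps its embedding next to its metadata.
records = [
    {"embedding": [1.0, 0.0], "text": "chunk A", "source": "doc.pdf", "page": 3},
    {"embedding": [0.0, 1.0], "text": "chunk B", "source": "doc.pdf", "page": 7},
    {"embedding": [0.7, 0.7], "text": "chunk C", "source": "notes.pdf", "page": 1},
]

hits = retrieve([1.0, 0.1], records, k=2)
for hit in hits:
    print(f'{hit["text"]} ({hit["source"]}, p. {hit["page"]})')
```

With a tensor-based search, the same idea applies: keep the metadata list in the same order as the embedding rows and use the returned indices to look it up.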
-
I get this error when trying to load the google/gemma-2b-it model from Huggingface.co: OSError: You are trying to access a gated repo. Cannot access gated repo for url https://huggingface.co/google/gemma-2b-it/resolve/main/config.json. Does this mean that they have added a level of authorization since you published the video? I am logged into Hugging Face and the model card says "Gated model". How do I fix it?
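On the gated repo error: being logged in on the website is usually not enough on its own. As far as I know, a gated model needs two things: accepting the terms on the model page, and giving the Python process an access token. The sketch below assumes the token is stored in the `HF_TOKEN` environment variable and passed through the `token` argument of `from_pretrained`. The helper names are made up for illustration, and the loader is not called here because it needs network access and an approved account.

```python
import os


def hf_auth_kwargs(env_var="HF_TOKEN"):
    """Build the keyword arguments carrying the access token, read from the environment."""
    token = os.environ.get(env_var)
    if not token:
        raise RuntimeError(
            f"{env_var} is not set; create an access token in your Hugging Face "
            "account settings and export it before loading a gated model"
        )
    return {"token": token}


def load_gated_tokenizer(model_id="google/gemma-2b-it"):
    """Load a tokenizer from a gated repo (requires transformers and network access)."""
    from transformers import AutoTokenizer  # imported lazily so the helper above works without it

    return AutoTokenizer.from_pretrained(model_id, **hf_auth_kwargs())
```

Running `huggingface-cli login` once in a terminal is an alternative that stores the token locally, so it doesn't have to be passed explicitly.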
-
👋 Welcome!
We’re using Discussions as a place to connect with other members of our community. We hope that you:
build together 💪.
To get started, comment below with an introduction of yourself and tell us about what you do with this community.