SemaDB Firebase Firestore Vector Search

The SemaDB Firebase extension is a thin wrapper around the public SemaDB API. It attempts to:

Sync Firestore documents that have a valid vector to SemaDB. Only the document ID and the vector are stored; no other document data is sent.
Provide a callable function to perform vector search within a Firebase application.
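
To make the sync behaviour concrete, here is a minimal sketch of the filtering step described above. The helper names (isValidVector, toPoint), the default field name vector, and the returned payload shape are assumptions for illustration only; the actual logic lives in semadb.js and may differ.

```javascript
// Hypothetical helper: treat a value as a valid vector when it is a
// non-empty array of finite numbers. The extension's real check may differ.
function isValidVector(value) {
  return (
    Array.isArray(value) &&
    value.length > 0 &&
    value.every((x) => typeof x === "number" && Number.isFinite(x))
  );
}

// Hypothetical helper: keep only the document ID and the vector, dropping
// every other field so that no other document data is sent.
// The payload field names (documentId, vector) are assumed, not SemaDB's API.
function toPoint(docId, data, vectorField = "vector") {
  const vector = data ? data[vectorField] : undefined;
  if (!isValidVector(vector)) {
    return null; // nothing to sync for this document
  }
  return { documentId: docId, vector: [...vector] };
}
```

Documents without a usable vector yield null in this sketch, which mirrors the idea that only documents with a valid vector are synced.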

To avoid potentially high costs, the extension does not backfill existing documents. It only indexes documents that are created or updated after the extension is installed.

Please refer to PREINSTALL.md for more information on how to install and use the extension.

Contributing

Thank you for considering contributing! This repo is hopefully structured in an easy-to-understand manner so that you can get started quickly.

Please:

Create an issue to track the contribution.
Follow the fork and pull request approach.
Add documentation and, ideally, tests, and run npm run lint.
Be respectful of others 🚀

Thanks again!

Getting started

Most of the action happens inside the functions folder. We start by installing the dependencies:

cd functions && npm install

and then running the emulators (please install the Firebase tools first if they are not already installed):

cd integration-tests && firebase emulators:start --project=demo-test

You can now navigate to the emulator UI to see the emulators running.

Repo structure

The structure is mostly dictated by the firebase tool that generates the skeleton. The files you are interested in are:

extension.yaml is the declaration of the extension, parameters and functions.
semadb.js is where all the work happens.
integration-test.spec.js contains the test cases.

Manual live testing

If you edit firestore-semadb-search.secret.local with your actual SemaDB API key, running the emulators will make requests to the public live instance.

Create a collection on the public API using the interactive playground named mycollection.
In the Firestore emulator, create a collection called mycollection and a document with a vector field containing 2 numbers. Here 2 is the default vector size in the public API; the length must match the vector size of the SemaDB collection.
After you save the document, it should have a point ID, which indicates that the document was indexed successfully.
You can verify that the SemaDB collection has points by getting the collection (GetCollection endpoint) and searching for points (SearchPoint endpoint) in the interactive playground.
Finally, you can make a local search request by using the command in the tasks.json file.
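
A mismatch between the document's vector length and the collection's vector size is an easy mistake to make in the steps above. The following is a small sketch of a check you could run on a test document before saving it. The function name checkVectorSize is hypothetical, and the default of 2 is simply the default vector size mentioned above.

```javascript
// Hypothetical helper: verify that a vector has the length the SemaDB
// collection expects. expectedSize defaults to 2, the default vector size
// mentioned for the public API; pass your collection's size otherwise.
function checkVectorSize(vector, expectedSize = 2) {
  if (!Array.isArray(vector)) {
    return { ok: false, reason: "vector must be an array" };
  }
  if (vector.length !== expectedSize) {
    return {
      ok: false,
      reason: `expected ${expectedSize} numbers, got ${vector.length}`,
    };
  }
  return { ok: true, reason: "" };
}
```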

Automated testing

From the functions folder you can run:

npm run test

which runs the tests.

Help needed

We are open to all contributions but here are some ideas to get started:

More tests for syncing documents. We want to verify, using the mock adapter, when the extension makes requests to SemaDB.
Documentation for end users. The documentation lacks examples for users, such as how to use the search function in an application.
Live working example of the extension. If anyone has a public repo using the extension, it would be nice to give reference and credit.
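
As a starting point for the end-user documentation idea above, here is a rough sketch of how an application might wrap the search callable. Everything specific in it is an assumption: the wrapper name searchSimilar, the request fields vector and limit, and the shape of the returned data are not confirmed by this README and should be checked against the extension's documentation.

```javascript
// Hypothetical wrapper around the extension's search callable.
// `callable` is any async function that accepts a request object and
// resolves to an object with a `data` property, which is the shape of
// functions created with httpsCallable in the Firebase client SDK.
async function searchSimilar(callable, vector, limit = 10) {
  if (!Array.isArray(vector) || vector.length === 0) {
    throw new TypeError("vector must be a non-empty array of numbers");
  }
  // ASSUMPTION: the request field names below are illustrative only.
  const result = await callable({ vector, limit });
  return result.data;
}
```

In an application, callable would typically be created with httpsCallable(getFunctions(app), name) from firebase/functions, where name is the callable function's name for your installed extension instance. Passing the callable in as a parameter also makes the wrapper easy to exercise with a stub in tests.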
