rankwise

The best way to evaluate your embedding models!

IMPORTANT: rankwise is in an early stage of development; the interface may change without notice.

Description

rankwise is a benchmark CLI tool for evaluating the quality of embedding models with your own dataset.

It also facilitates the generation of such datasets using LLMs.

Getting Started

Pre-requisites

  • Poetry installed (rankwise uses Poetry to manage its dependencies and virtual environment).

Installation

  • Install dependencies:
poetry install

Executing program

  • Activate the Poetry Shell:
poetry shell
  • Run the generate command to create a JSONL dataset of questions and their related documents from a data.jsonl file that contains one document (a JSON-encoded string) per line.
API_KEY=xxx rankwise generate --model "azure_openai.AzureOpenAI(model='gpt-4o',deployment_name='gpt-4o',api_version='2023-07-01-preview',azure_endpoint='https://your-azure-endpoint',api_key=ENVVAR('API_KEY'))" --questions-count 3 --input data.jsonl > dataset.jsonl

This command uses the given LLM to generate the specified number of questions for every document in the input file.
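For reference, a minimal data.jsonl could look like the lines below. This is only an illustration based on the description above (one JSON-encoded string per line); the document texts themselves are made up.

"rankwise is a benchmark CLI tool for evaluating the quality of embedding models."
"Poetry is a dependency management and packaging tool for Python."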

  • Run the classify command to classify the generated questions as good or bad using a given classification model.

    • If you want to use a cross-encoder model, you can use the following command:
    API_KEY=xxx rankwise classify cross-encoder --input dataset.jsonl --cross-encoder-model "cross_encoder.CrossEncoder('BAAI/bge-reranker-v2-m3', default_activation_function=torch.nn.Sigmoid())" --output-file classified_dataset.jsonl
    • If you want to use an embedding model, you can use the following command:
    API_KEY=xxx rankwise classify cosine-similarity --input dataset.jsonl --embedding-model "azure_openai.AzureOpenAIEmbedding(model='text-embedding-3-large',deployment_name='azure-text-embedding-ada-002',api_version='2023-07-01-preview',azure_endpoint='https://your-azure-endpoint',api_key=ENVVAR('API_KEY'))" --output-file classified_dataset.jsonl
  • Run the evaluate classification command to evaluate the classification produced by the classify command against a ground-truth dataset.

rankwise evaluate classification --ground-truth classify_ground_truth.jsonl --classification classified_dataset.jsonl
  • Run the evaluate embedding-model command to assess the specified embedding model against your dataset and obtain quality metrics.
API_KEY=xxx rankwise evaluate embedding-model -E "azure_openai.AzureOpenAIEmbedding(model='text-embedding-3-large',deployment_name='azure-text-embedding-ada-002',api_version='2023-07-01-preview',azure_endpoint='https://your-azure-endpoint',api_key=ENVVAR('API_KEY'))" -m hit_rate -m mrr --input dataset.jsonl

This command uses the given embedding model to evaluate the input dataset and compute the requested quality metrics (here, hit rate and MRR); the metrics themselves are sketched below.
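For reference, hit rate and MRR (Mean Reciprocal Rank) are standard retrieval metrics. The Python sketch below shows how they are commonly defined; it is illustrative only and is not rankwise's implementation. The function names and the top-k cut-off are assumptions made for the example.

# Illustrative sketch of the two metrics requested above; not rankwise's code.
# `ranks` holds, for each question, the 1-based rank at which its related
# document was retrieved, or None if it was not retrieved at all.

def hit_rate(ranks, k=10):
    # Fraction of questions whose related document appears in the top-k results.
    return sum(1 for r in ranks if r is not None and r <= k) / len(ranks)

def mrr(ranks):
    # Mean Reciprocal Rank: average of 1/rank of the first relevant result,
    # counting 0 when the related document was not retrieved.
    return sum(1.0 / r if r is not None else 0.0 for r in ranks) / len(ranks)

# Example: three questions whose documents were ranked 1st, 3rd, and not retrieved.
print(hit_rate([1, 3, None], k=2))  # 0.333...
print(mrr([1, 3, None]))            # (1 + 1/3 + 0) / 3 = 0.444...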

Contributing

Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

Please read the CONTRIBUTING file for details on our code of conduct and the process for submitting pull requests.

Authors

  • Roberto Abdelkader Martínez Pérez
  • Pedro Ruiz Pareja

Version History

  • 0.1.0
    • Initial Release

License

This project is licensed under the Apache License 2.0; see the LICENSE file for details.

Acknowledgments

  • Thanks to the open-source community for continuous inspiration.
  • Special mentions to contributors and collaborators.

Contact

If you have any questions or suggestions, feel free to contact the authors.
