Title Similarity Engine

For the containerized version, please switch to branch: dockerized

Deployed Cloud Swagger Demo: https://title-similarity-engine.azurewebsites.net/docs

HuggingFace Space (Built with Gradio): https://huggingface.co/spaces/agolli/title-similarity-model

This project is a backend API built using FastAPI, which finds the most similar title from a list of titles based on a reference title. It uses a pre-trained sentence embedding model from HuggingFace (sentence-transformers) to compute vector representations of titles and calculates the similarity using cosine similarity.

Features

FastAPI: A modern, fast (high-performance) web framework for building APIs with Python 3.7+ based on standard Python type hints.
HuggingFace Transformers: Pre-trained NLP models to compute vector embeddings for text.
Cosine Similarity: A method to find the most similar title by measuring the cosine similarity between embeddings.

Project Structure

.
├── app/                  Core application package containing the FastAPI app and logic.
│   ├── __init__.py       Marks the app/ directory as a package.
│   ├── main.py           The main entry point of the FastAPI app, defining routes and endpoints
│   ├── models.py         Contains Pydantic models for request validation.
│   ├── utils.py          Utility functions, including the function to compute title similarity using a pre-trained HuggingFace model.
├── .gitignore            Prevents specified files from being tracked by Git
├── poetry.lock           Ensures that all developers and environments use the exact same versions of dependencies.
├── pyproject.toml        Defines the project configuration and dependencies managed by Poetry.
└── README.md             Provides project information, setup instructions, and usage guidelines.

Requirements

Python 3.7+
Poetry (for dependency management)

Installation

Clone the repository:

 git clone https://github.com/selaudin/Title-Similarity-Engine.git
 cd title-similarity-engine

Install Poetry and ensure that Poetry is added to your PATH:

curl -sSL https://install.python-poetry.org | python3 -
export PATH="$HOME/.local/bin:$PATH"

Initialize the Project which installs the project dependencies:
```
poetry install
```
Activate the virtual environment:
```
poetry shell
```

Run the FastAPI server:

poetry run uvicorn app.main:app --reload

Usage

The API will be available at
```
http://127.0.0.1:8000
```
Use SwaggerUI with interactive exploration, call and test the API directly from the browser.
```
http://127.0.0.1:8000/docs
```

Or the alternative Redoc API documentation
```
http://127.0.0.1:8000/redoc
```

Endpoints

GET

/: A welcome message to the API.

POST

/find-title-similarit: This endpoint takes a JSON payload with a reference title and a list of other titles, and returns the most similar title.

Request Example:

{
  "reference_title": "Higgs boson in particle physics",
  "other_titles": [
      "Best soup recipes",
      "Basel activities",
      "Particle physics at CERN"
  ]
}

Response Example:

{
  "top_result": "Particle physics at CERN"
}

Run example in terminal using curl

curl -X POST "http://127.0.0.1:8000/find-title-similarity" \
-H "Content-Type: application/json" \
-d '{"reference_title": "Higgs boson in particle physics", "other_titles": ["Best soup recipes", "Basel activities", "Particle physics at CERN"]}'

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
app		app
media		media
.gitignore		.gitignore
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Title Similarity Engine

Features

Project Structure

Requirements

Installation

Usage

Endpoints

GET

POST

About

Releases

Packages

Languages

selaudin/Title-Similarity-Engine

Folders and files

Latest commit

History

Repository files navigation

Title Similarity Engine

Features

Project Structure

Requirements

Installation

Usage

Endpoints

GET

POST

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages