imvladikon/README.md

Hi

I'm Vladimir Gurevich, an ML/NLP engineer working on information retrieval (e.g. semantic search) and information extraction (e.g. NER, relation extraction).

I am also interested in Speech Recognition and in LLMs.

Works:

  • jupyter-notebook-viewer - a viewer for local *.ipynb files in the browser, with no Jupyter Notebook installation required.
  • wav2vec2-hebrew - a package for Hebrew speech recognition using wav2vec2 models trained on Hebrew datasets (check out the datasets below).
  • distiller - distillation of TextClassification and TokenClassification models using the transformers library, with several distillation methods.
  • spacy-trankit - a spaCy wrapper for Trankit (an NLP pipeline for tokenization, dependency parsing, lemmatization, etc.).

Models:

Speech Recognition:

Datasets:

Contacts

LinkedIn

Pinned

  1. jupyter-notebook-viewer (Public)

    Chrome extension for viewing Jupyter Notebooks in the browser without a Jupyter server

    JavaScript 27 5

  2. huawei-nlpcourse-project (Public)

    Topic modeling and classification of Hebrew news with a neural text summarizer model

    Python 1

  3. distiller (Public)

    Knowledge distillation for BERT (classification and token classification models)

    Python 1

  4. wav2vec2-hebrew (Public)

    Speech Recognition for Hebrew (using wav2vec2 models)

    Python 4 1

  5. duckdb + huggingface datasets
    #!/usr/bin/env python3
    # -*- coding: utf-8 -*-
    import duckdb
    import pyarrow as pa
    from datasets import Dataset
  6. fuzzy_grouper.py
    #!/usr/bin/env python3
    # -*- coding: utf-8 -*-
    """
    Simple fuzzy grouping of a list of dictionaries using any string field and string similarity functions
    Dependencies: