From 8255fdfd5b9bb4eb8740fc2c9e7fdcd2f74b1378 Mon Sep 17 00:00:00 2001
From: Hadrien Mary
Date: Sat, 28 Oct 2023 08:55:39 -0400
Subject: [PATCH] add hf links to readme
---
README.md | 11 ++++++++---
docs/index.md | 11 ++++++++---
2 files changed, 16 insertions(+), 6 deletions(-)
diff --git a/README.md b/README.md
index 850c56e..4569f22 100644
--- a/README.md
+++ b/README.md
@@ -16,6 +16,9 @@
|
🤗 Model
+ |
+
+ 🤗 Training Dataset
@@ -73,9 +76,11 @@ mamba install -c conda-forge safe-mol
### Datasets and Models
-We provided a pretained GPT2 model (XX M parameters) using the SAFE molecular representation that has been trained on 1.1 billion molecules from Unichem (0.1B) + Zinc (1B):
-
-- _Safe-XXM_ TODO
+| Type | Name | Infos | Size | Comment |
+| ------- | --------------------------------------------------------------------- | ---------- | ----- | -------------------- |
+| Model | [datamol-io/safe-gpt](https://huggingface.co/datamol-io/safe-gpt) | 87M params | 350M | Default model |
+| Dataset | [datamol-io/safe-gpt](https://huggingface.co/datamol-io/safe-gpt) | 1.1B rows | 250GB | Training dataset |
+| Dataset | [datamol-io/safe-drugs](https://huggingface.co/datamol-io/safe-drugs) | 26 rows | 20 kB | Benchmarking dataset |
## Usage
diff --git a/docs/index.md b/docs/index.md
index 54b7478..617c596 100644
--- a/docs/index.md
+++ b/docs/index.md
@@ -16,6 +16,9 @@
|
🤗 Model
+ |
+
+ 🤗 Training Dataset
@@ -73,9 +76,11 @@ mamba install -c conda-forge safe-mol
### Datasets and Models
-We provided a pretained GPT2 model (XX M parameters) using the SAFE molecular representation that has been trained on 1.1 billion molecules from Unichem (0.1B) + Zinc (1B):
-
-- _Safe-XXM_ TODO
+| Type | Name | Infos | Size | Comment |
+| ------- | --------------------------------------------------------------------- | ---------- | ----- | -------------------- |
+| Model | [datamol-io/safe-gpt](https://huggingface.co/datamol-io/safe-gpt) | 87M params | 350M | Default model |
+| Dataset | [datamol-io/safe-gpt](https://huggingface.co/datamol-io/safe-gpt) | 1.1B rows | 250GB | Training dataset |
+| Dataset | [datamol-io/safe-drugs](https://huggingface.co/datamol-io/safe-drugs) | 26 rows | 20 kB | Benchmarking dataset |
## Usage