-
Notifications
You must be signed in to change notification settings - Fork 8
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Rebranch open-parser to any-parser to avoid confusion.
- Loading branch information
1 parent
fc1b9fe
commit af4704b
Showing
18 changed files
with
78 additions
and
78 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,10 +1,10 @@ | ||
# 🌊 OpenParse | ||
# 🌊 AnyParser | ||
|
||
OpenParse provides an API to accurately extract your unstructured data (e.g. PDF, images, charts) into structured format. | ||
AnyParser provides an API to accurately extract your unstructured data (e.g. PDF, images, charts) into structured format. | ||
|
||
## :seedling: Set up your OpenParser API key | ||
## :seedling: Set up your AnyParser API key | ||
|
||
OpenParse is still in private beta. If you are interested in testing our document models, please reach out at [email protected] for a FREE API key. | ||
AnyParser is still in private beta. If you are interested in testing our document models, please reach out at [email protected] for a FREE API key. | ||
|
||
|
||
To set up your API key `CAMBIO_API_KEY`, you will need to : | ||
|
@@ -18,13 +18,13 @@ To set up your API key `CAMBIO_API_KEY`, you will need to : | |
## :computer: Installation | ||
|
||
``` | ||
conda create -n openparse python=3.10 -y | ||
conda activate openparse | ||
pip3 install open_parser | ||
conda create -n any-parse python=3.10 -y | ||
conda activate any-parse | ||
pip3 install any-parser | ||
``` | ||
|
||
## :bashfile usage | ||
To use OpenParse via `curl` requests, you can run the following bash command from the root folder of this repository: | ||
To use AnyParser via `curl` requests, you can run the following bash command from the root folder of this repository: | ||
``` | ||
bash parse.sh <your apiKey> <file path> <prompt for parse (optional, default="")> | ||
``` | ||
|
@@ -36,10 +36,10 @@ bash parse.sh gl************************************** /path/to/your/file.pdf " | |
|
||
## :scroll: Examples | ||
|
||
OpenParse can extract text, numbers and symbols from PDF, images, etc. Check out each notebook below to run OpenParse within 10 lines of code! | ||
AnyParser can extract text, numbers and symbols from PDF, images, etc. Check out each notebook below to run AnyParser within 10 lines of code! | ||
|
||
### [Prompt to Extract Key-values into JSON from W2 (PDF)](https://github.com/CambioML/open-parser/blob/main/examples/prompt_to_extract_table_from_pdf_to_json.ipynb) | ||
### [Prompt to Extract Key-values into JSON from W2 (PDF)](https://github.com/CambioML/any-parser/blob/main/examples/prompt_to_extract_table_from_pdf_to_json.ipynb) | ||
Do you want to extract key-values from a W2 PDF into JSON format? Check out this notebook (3-min read)! | ||
|
||
### [Extract a Table from an Image into Markdown Format](https://github.com/CambioML/open-parser/blob/main/examples/extract_table_from_image_to_markdown.ipynb) | ||
### [Extract a Table from an Image into Markdown Format](https://github.com/CambioML/any-parser/blob/main/examples/extract_table_from_image_to_markdown.ipynb) | ||
Are you a financial analyst who need to extract ACCURATE number from a table in an image or a PDF. Check out this notebook (3-min read)! |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
from any_parser.base import AnyParser | ||
|
||
__all__ = ["AnyParser"] | ||
|
||
__version__ = "0.0.1" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.