-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Change ocr prompt to instead enhance tesseract hOCR
- Loading branch information
Showing
4 changed files
with
32 additions
and
7 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,8 @@ | ||
Transcribe the text in the provided image. | ||
Place the transcribed text into the appropriate ocrx_word span in the hOCR template provided. Ensure the following: | ||
Never use tesseract to aid you. | ||
Include all text in the transcription. | ||
Never return an incomplete document. Always finish the complete job, with every word wrapped in a span, and the words accurately transcribed from the image. | ||
Never edit the hOCR dimensions provided. Only replace the text within <span>'s. | ||
Do not wrap your response in backticks. Your response should only be the hOCR document provided below with your transcribed text placed in the appropriate spots in the hOCR document. | ||
Do not add any additional spans or attributes. You only should be adding plain text within the hocr HTML document. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -3,3 +3,5 @@ allowedMimeTypes: | |
cmdByMimeType: | ||
default: | ||
cmd: /app/cmd.sh | ||
args: | ||
- "%source-uri" |