Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for translating PDF files #85

Merged
merged 12 commits into from
Oct 5, 2023

Conversation

dietrichson
Copy link
Contributor

First go at [#84]

@MarkEdmondson1234
Copy link
Collaborator

Looking good so far!

@MarkEdmondson1234
Copy link
Collaborator

Is it ready for review?

@dietrichson
Copy link
Contributor Author

dietrichson commented Jun 2, 2023 via email

@MarkEdmondson1234
Copy link
Collaborator

Looking good! May I suggest a test is added using the test PDF you have added?

@dietrichson
Copy link
Contributor Author

@MarkEdmondson1234, I added a test. It runs (and passes) manually on my local system, however, there may be an issue regarding v2 and v3 of the API that I am not accounting for. Please check/test on your side as well to verify.

@dietrichson
Copy link
Contributor Author

Two commits for pdf-based testing

@LukasWallrich
Copy link

Thanks for working on this! I wanted to try it - three comments:

  • I needed to manually load library(googleAuthR), otherwise I get Error in gar_token() : could not find function "gar_token"
  • It would be helpful if the defaults for source and target match gl_translate()
  • Also, I ran into the 20-page limit because by default IS_NATIVE is false, so that the PDF is treated as a scanned PDF. That is probably a sensible default, but it might be worth it to expose that option, so that 'real' PDFs can be translated even if they are longer.

@LukasWallrich
Copy link

Also, some files require shadow text removal for the translation to be of any use - that would be a good option to expose as well (or set as a default, based on my understanding it shouldn't do any harm)

@MarkEdmondson1234 MarkEdmondson1234 merged commit 5152257 into ropensci:master Oct 5, 2023
@MarkEdmondson1234
Copy link
Collaborator

I'm sorry I missed the pull, merged it

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants