Support pdf extraction #67

Open
liana313 opened this issue Dec 26, 2024 · 0 comments
Labels
feature request (New feature or request)

Comments

@liana313
Collaborator

Is your feature request related to a problem? Please describe.
Currently, sem_extract only takes a column of text and extracts structured fields from it, so other document types have to be converted to text first.

Describe the solution you'd like
Ideally, sem_extract would support extraction over a broader set of document types, such as PDFs, to make it easy to convert unstructured documents into DataFrames.
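
As a rough illustration of the pre-processing step this request would fold into the library, here is a minimal sketch that turns a set of PDFs into one text record per file. The helper names `read_pdf_text` and `pdfs_to_records` are hypothetical, and using pypdf as the text extractor is an assumption, not something the project has committed to.

```python
from pathlib import Path
from typing import Callable, Iterable


def read_pdf_text(path: Path) -> str:
    """Return the concatenated text of every page (assumes pypdf is installed)."""
    # Imported lazily so the rest of the sketch works without pypdf.
    from pypdf import PdfReader

    reader = PdfReader(str(path))
    # extract_text() may return an empty result for image-only pages.
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def pdfs_to_records(
    paths: Iterable[str],
    read_text: Callable[[Path], str] = read_pdf_text,
) -> list[dict[str, str]]:
    """Build one {"path", "text"} record per PDF (hypothetical helper)."""
    return [{"path": str(p), "text": read_text(Path(p))} for p in paths]


# Example usage (file names are placeholders):
# records = pdfs_to_records(["report1.pdf", "report2.pdf"])
# df = pandas.DataFrame(records)  # the "text" column is what sem_extract consumes today
```

Passing the reader in as a parameter keeps the choice of PDF library swappable, which matters because scanned PDFs would need OCR rather than plain text extraction.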

liana313 added the feature request label on Dec 26, 2024