Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add zip handling #22

Merged
merged 8 commits into from
Dec 16, 2024
Merged

Add zip handling #22

merged 8 commits into from
Dec 16, 2024

Conversation

Josh-XT
Copy link
Contributor

@Josh-XT Josh-XT commented Dec 14, 2024

This pull request adds support for converting ZIP files to markdown by extracting and converting all contained files. The most important changes include updating the documentation, importing necessary modules, implementing the ZipConverter class, registering the new converter, and adding tests for ZIP file processing. This uses the same logic that we use for handling zip files in AGiXT and is very useful.

ZIP file conversion support:

Implementation of ZipConverter:

Testing:

@Josh-XT
Copy link
Contributor Author

Josh-XT commented Dec 14, 2024

@microsoft-github-policy-service agree

src/markitdown/_markitdown.py Outdated Show resolved Hide resolved
@gagb gagb self-requested a review December 14, 2024 23:01
@gagb gagb merged commit 6dad1cc into microsoft:main Dec 16, 2024
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants