Writing CASes to a zip archive #135
Comments
@daxenberger this one can be closed as |
This is independent of the latest changes to CV mode. The idea here was to write all CASes into a zip archive rather than individual files. Or why did you think it is obsolete? |
Oh ok, I misunderstood it then. Sorry. |
@reckart Is this feature available now? What exactly is the benefit of writing a single .zip instead of N bin-cas files? Neither is human-readable, but naming each bin-cas by document name allows some visual confirmation that the reader read what it was supposed to read. It helps to |
Should be available. |
I don't remember the rationale. Might be to avoid using subfolders in an execution context... or to reduce the number of files which can at times become very large... maybe @daxenberger remembers more. |
This was certainly to reduce the number of files produced by TC, which can become quite large for bigger datasets. The "visual confirmation" issue could be avoided by writing some sort of log(?) file that records the names of the files written to the archive. |
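For illustration, here is a minimal sketch of the log-file idea using only `java.util.zip` from the JDK. It is not the DKPro TC or DKPro Core implementation: the class `CasZipSketch`, the entry name `index.txt`, and the assumption that each CAS is already available as a serialized `byte[]` keyed by document name are all hypothetical. It needs Java 9+ because of `readAllBytes()`.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper for illustration; not part of DKPro TC or DKPro Core.
class CasZipSketch {
    // Name of the log entry that lists everything written (an assumption).
    static final String INDEX_ENTRY = "index.txt";

    /** Writes each (document name, bytes) pair as one zip entry, then a plain-text index. */
    static void writeArchive(Path zipFile, Map<String, byte[]> serializedCases) {
        StringBuilder index = new StringBuilder();
        try (OutputStream out = Files.newOutputStream(zipFile);
             ZipOutputStream zip = new ZipOutputStream(out)) {
            for (Map.Entry<String, byte[]> e : serializedCases.entrySet()) {
                zip.putNextEntry(new ZipEntry(e.getKey()));
                zip.write(e.getValue());
                zip.closeEntry();
                // Record the entry name so the archive content can be checked later.
                index.append(e.getKey()).append('\n');
            }
            zip.putNextEntry(new ZipEntry(INDEX_ENTRY));
            zip.write(index.toString().getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
    }

    /** Returns the document names recorded in the index entry (empty if there is none). */
    static List<String> readIndex(Path zipFile) {
        try (InputStream in = Files.newInputStream(zipFile);
             ZipInputStream zip = new ZipInputStream(in)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (INDEX_ENTRY.equals(entry.getName())) {
                    // readAllBytes() stops at the end of the current zip entry.
                    String text = new String(zip.readAllBytes(), StandardCharsets.UTF_8);
                    List<String> names = new ArrayList<>();
                    for (String line : text.split("\n")) {
                        if (!line.isEmpty()) {
                            names.add(line);
                        }
                    }
                    return names;
                }
            }
            return new ArrayList<>();
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
    }
}
```

Passing a `LinkedHashMap` keeps the index in insertion order. Reading `index.txt` back gives the "visual confirmation" without having to unpack the whole archive.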
@reckart Do you have a code-example that writes to |
Actually, it's even in the documentation: https://dkpro.github.io/dkpro-core/releases/1.9.0/docs/user-guide.html#_working_with_zip_archives |
Hm, when adapting this for the BinaryCasWriter and BinaryCasReader I get a
|
Looks like during reading, you are missing the |
... and mind that these are "zip" files, not "gz" files. |
Originally reported on Google Code with ID 135
Reported by daxenberger.j on 2014-05-28 12:41:02