You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
To be able to deal with different Typesystems, BinCasWritable needs to store
the Typesystem alongside the data.
This can be either as header to each document (highly renundant) or as a
seperate file alongside each sequencefile on hdfs.
Original issue reported on code.google.com by [email protected] on 13 Sep 2013 at 4:50
The text was updated successfully, but these errors were encountered:
Fixed by introducing BinCasWithTypesystemWritable, which serializes a
compressed Typesystem with each CAS.
Those files are considerable larger than XMI compressed with Snappy.
The user can always choose to use the old format by setting
job.setOutPutValueClass(CASWritable.class)
TODO: store the typesystem seperately, this will be the optimal solution.
Original issue reported on code.google.com by
[email protected]
on 13 Sep 2013 at 4:50The text was updated successfully, but these errors were encountered: