Preserving record order with spark-excel #704
james-miles-ccy
started this conversation in
General
Replies: 1 comment
-
There was some discussion about adding a row number to each row, but as far as I recall it was never implemented.
-
I am wondering whether there is a way to preserve the order of the records when reading an Excel file with spark-excel. As expected, rows get shuffled during the read, and I would normally work around this by first creating an RDD and calling zipWithIndex() on it. Is there any other way to work around this behaviour for Excel files without editing the source?
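For reference, the zipWithIndex() workaround mentioned above might look roughly like the sketch below when applied to a DataFrame. This is only an illustration under assumptions: `withRowIndex`, the `row_idx` column name, and the file path are hypothetical and not part of spark-excel. It numbers rows in the order Spark yields them (by partition, then by position within each partition), so it only reproduces the sheet order if the reader emits rows in that order, which this thread does not confirm.

```scala
import org.apache.spark.sql.{DataFrame, Row, SparkSession}
import org.apache.spark.sql.types.{LongType, StructField, StructType}

object RowIndexSketch {
  // Hypothetical helper (not part of spark-excel): appends a 0-based Long
  // index column reflecting the order in which Spark currently yields rows.
  def withRowIndex(df: DataFrame, colName: String = "row_idx"): DataFrame = {
    // zipWithIndex pairs each row with its position across all partitions.
    val indexed = df.rdd.zipWithIndex().map { case (row, idx) =>
      Row.fromSeq(row.toSeq :+ idx)
    }
    // Extend the original schema with the new index column.
    val schema = StructType(
      df.schema.fields :+ StructField(colName, LongType, nullable = false)
    )
    df.sparkSession.createDataFrame(indexed, schema)
  }

  // Example usage; the path is a placeholder supplied by the caller.
  def readWithIndex(spark: SparkSession, path: String): DataFrame = {
    val df = spark.read
      .format("com.crealytics.spark.excel")
      .option("header", "true")
      .load(path)
    // Index immediately after the read, before any transformation that
    // could repartition or reorder the rows.
    withRowIndex(df)
  }
}
```

Calling `RowIndexSketch.readWithIndex(spark, "/path/to/file.xlsx").orderBy("row_idx")` would then restore the indexed order after later transformations. The index has to be attached before any shuffle, since it records whatever order exists at that point.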