read_excel's parameter - mangle_dupe_cols is used to handle duplicate columns but fails if the duplicate columns are case sensitive. #2213

Open
saikrishnapujari102087 opened this issue Jan 17, 2022 · 2 comments
Labels
docs Document-related

Comments


saikrishnapujari102087 commented Jan 17, 2022

mangle_dupe_cols defaults to True, so ideally it should handle duplicate columns. However, when the column names differ only in case, it fails as shown below.

AnalysisException: Reference 'Sheet.col' is ambiguous, could be: Sheet.col, Sheet.col.

Here the two columns are named Col and cOL.

The best practices guide mentions not using column names that differ only in case - https://koalas.readthedocs.io/en/latest/user_guide/best_practices.html#do-not-use-duplicated-column-names

Either the docs for read_excel/mangle_dupe_cols should be updated to mention this, or the case should be handled.
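
As a rough illustration of what "handled" could look like, here is a minimal sketch of case-insensitive mangling in plain Python. The function name mangle_case_insensitive is hypothetical and is not part of Koalas or pandas; the ".1", ".2" suffix style is only meant to resemble how duplicate names are usually mangled, and this is not how read_excel is actually implemented.

```python
from collections import Counter


def mangle_case_insensitive(names):
    """Rename columns that collide when compared case-insensitively.

    Hypothetical helper: the first occurrence keeps its name, later
    collisions get a numeric suffix (".1", ".2", ...).
    """
    seen = set()        # lower-cased names already emitted
    counts = Counter()  # suffix counter per lower-cased original name
    result = []
    for name in names:
        key = name.lower()
        new_name = name
        # Keep bumping the suffix until the name is unique ignoring case.
        while new_name.lower() in seen:
            counts[key] += 1
            new_name = f"{name}.{counts[key]}"
        seen.add(new_name.lower())
        result.append(new_name)
    return result


print(mangle_case_insensitive(["Col", "cOL", "other"]))
# ['Col', 'cOL.1', 'other']
```

Until something like this is handled by the library, a workaround along these lines might be to rename the columns in the source sheet, or on an intermediate pandas DataFrame, before the data reaches Spark.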

saikrishnapujari102087 changed the title from "read_excel has one parameter - mangle_dupe_cols to handle duplicate columns but fails if the duplicate columns are case sensitive." to "read_excel's parameter - mangle_dupe_cols is used to handle duplicate columns but fails if the duplicate columns are case sensitive." on Jan 17, 2022
Contributor

itholic commented Jan 25, 2022

Yeah, we should address this.

Would you mind filing the issue in the Apache Spark JIRA?

This repository is in maintenance mode, as Koalas has been moved into PySpark (pandas API on Spark).

@itholic itholic added the docs Document-related label Jan 25, 2022
@saikrishnapujari102087
Author

@itholic Yes, created - https://issues.apache.org/jira/browse/SPARK-38004
