Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Cannot read files into dataframe in Databricks 13.3 LTS Runtime 3.3.0 Spark #853

Closed
1 task done
dinesh1512 opened this issue Apr 23, 2024 · 3 comments
Closed
1 task done

Comments

@dinesh1512
Copy link

Is there an existing issue for this?

  • I have searched the existing issues

Current Behavior

Current Behavior
When running v2 excel pySpark code below in Databricks 13.3 LTS Runtime:

df = spark.read.format("excel")
.option("header", True)
.option("inferSchema", True)
.load(fr"{folderpath}//.xlsx")
display(df)

I receive the following error upon attempting to display or use the resulting dataframe:

AbstractMethodError: org.apache.spark.sql.execution.datasources.v2.FilePartitionReaderFactory.options()Lorg/apache/spark/sql/catalyst/FileSourceOptions;

org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 0.0 failed 4 times, most recent failure: Lost task 0.3 in stage 0.0 (TID 3) (10.178.42.202 executor 0): java.lang.AbstractMethodError: org.apache.spark.sql.execution.datasources.v2.FilePartitionReaderFactory.options()Lorg/apache/spark/sql/catalyst/FileSourceOptions;

This issue is same as #682 that was addressed for older versions.

Expected Behavior

The resulting dataframe should display the data.

Steps To Reproduce

set the folderpath variable to a location containing excel files, and run the below python code in latest runtime of Databricks:

df = spark.read.format("excel")
.option("header", True)
.option("inferSchema", True)
.load(fr"{folderpath}//.xlsx")
display(df)

Environment

- Spark version: 3.4.1
- Spark-Excel version: 0.18.7
- OS: N/A
- Cluster environment

Anything else?

No response

Copy link

Please check these potential duplicates:

@dinesh1512 dinesh1512 changed the title [BUG] <Cannot read files into dataframe in Databricks 13.3 LTS Runtime 3.3.0 Spark> [BUG] Cannot read files into dataframe in Databricks 13.3 LTS Runtime 3.3.0 Spark Apr 23, 2024
Copy link

Please check these potential duplicates:

@nightscape
Copy link
Owner

Please always try the newest version before creating issues.
Closing this until the issue is reproduced with the newest version.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants