You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The documentation reads like this is only supported for a few specific file formats: https://docs.databricks.com/ingestion/auto-loader/options.html#file-format-options
Not sure if they are hard-coded somewhere, or one would need to implement a special API.
I don't have time to look into this, but if you're willing to give it a try yourself I can give you some guidance.
We have gotten this to work for other custom file formats with fixed schema. I wonder if we can apply a similar approach here while supporting provided schemas or inferred schemas.
Is there an existing issue for this?
Current Behavior
I am trying to read via V2 in streaming way, with no success. I was wondering if there is anything I can do to get this working?
the code is below:
df = spark.readStream.format("cloudFiles")
.option("cloudFiles.format", "excel")
.option("maxRowsInMemory", 20)
.schema(schema)
.load(file_path)
display(df)
the exception error is given below:
java.lang.UnsupportedOperationException: ExcelFileFormat as fallback format for V2 supports writing only
Expected Behavior
I was hoping it would generate a dataframe.
Steps To Reproduce
df = spark.readStream.format("cloudFiles")
.option("cloudFiles.format", "excel")
.option("maxRowsInMemory", 20)
.schema(schema)
.load(file_path)
display(df)
Environment
Anything else?
No response
The text was updated successfully, but these errors were encountered: