[QUERY] Azure Schema registry + Spark Structured Streaming + Kafka/EventHub compatibility #29134
Labels
Client
This issue points to a problem in the data-plane of the library.
customer-reported
Issues that are reported by GitHub users external to the Azure organization.
question
The issue doesn't require a change to the product in order to be resolved. Most issues start as that
Schema Registry
Is there a way to use AZURE Schema Registry with Spark Structured Streaming? (not the opensource schema registry).
Ideally I would use some high level library like Abris (https://github.com/AbsaOSS/ABRiS), to read and write avro from/to Kafka with automatically registering/ reading the df schema in the schema registry. However, I have not managed to find anything at all that could enable spark to do to_avro or from_avro against an Azure Schema Registry.
There is an open PR that could add the functionality for some column types Azure/azure-event-hubs-spark#615
There seems to be an unmainained repo that at least tried to do the from_avro part (https://github.com/arerlend/azure-schemaregistry-spark)
Anybody knows about a good way how to interact with Azure Schema Registry with Spark Structured Streaming?
The text was updated successfully, but these errors were encountered: