Replies: 1 comment 9 replies
-
If you have a repro outside databricks, I am happy to check this out. cc @suhsteve |
Beta Was this translation helpful? Give feedback.
9 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
When my .net application is running in an "all-purpose" databricks cluster under load, I get lots of rejected futures:
There is not any more detail in the log4j on the scala side.
Does anyone know what this part of the message means ?
Terminated, pool size = 0, active threads = 0, queued tasks = 0, completed tasks = 0
On the .Net side it seems that the program is trying to do something simple ( Microsoft.Spark.Sql.DataFrame.ParseConnectionInfo ):
I don't get these issues when I'm running ten concurrent jobs in my cluster but as I slowly increase the number of concurrency jobs (to 30, 40, 50), then I start seeing lots of these errors. There is plenty of RAM and plenty of CPU, and the driver process isn't spending too much time on GC.
Any help would be appreciated. I'm several days behind schedule now. I thought that my spark cluster size was sufficient to avoid problems as I scaled up, but these issues don't seem to be resource related. They seem to be subtle timing bugs.
I may be able to create a repro. Is there a common place to upload that sort of thing?
I also have a support case open with databricks but they have very little familiarity with spark.net and will probably not want to support this as soon as they see the "Microsoft.Spark" referenced in the stack frames.
Beta Was this translation helpful? Give feedback.
All reactions