-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Jobs failing #12
Comments
Pinging @rlwastro :D |
In case either of you can spot anything in my batch queue script it's at http://www-wfau.roe.ac.uk/www-data/mar/q.txt Currently it's not crashing but just going really slowly as it's taking some 40 minutes to download each 300MB output file. My network elsewhere seems OK and anyhow the slow network is a separate issue I suspect. Looking at the web UI the download file is generated quickly but then I tried a direct wget of the file and that confirmed that it's the transfer itself that's slow. |
It stayed working for a few hours then then both scripts (long and short) started throwing the same sort of errors. I can submit queries via the web UI. Do the python packages call different endpoints then the web UI? SubmitJob failed with status: 500 |
MAST is generally having a lot of issues with multiple services (not just Casjobs), which may be contributing to the instability you are seeing. Something is hammering our databases and is slowing down all services. I'll let Bernie know about the issue (if he does not already know about it). But until the general problem is fixed, you may continue to see these issues. Sorry for the trouble! |
Hi Rick |
Hi Dan & Rick
I'm still trying to download large chunks of MAST's PanStarrs_DR2.
I'm running two scripts one using casjobs.py and the batch queue and the other using mastcasjobs as the quick queue.
I seem to be getting more and more errors of the type
casjobs:
SubmitJob failed with status: 500
System.Data.SqlClient.SqlException: Could not find prepared statement with handle -1.
at System.Data.SqlClient.SqlConnection.OnError(SqlException exception, Boolean breakConnection, Action`1 wrapCloseInAction)
mastcasjobs
ExecuteQuickJob failed with status: 500
System.Data.SqlClient.SqlException: Could not find prepared statement with handle -1.
Bernie Shiao keeps giving things a kick and then things work for a bit (at the start stuff would work for several days but recently it's failing pretty much every day.
Within my script I'm basically running a query and downloading the results and looping through large tables. My "understanding" is that my python calls to *casjobs is just resulting in http requests and there's nothing I need to do in terms of closing out / finalizing before issuing the next interation. The error maybe suggest of running out of resources.
Again this isn't really any issue with the *casjobs.py but just hoping you might have some insight into how I can get things running more stable.
thanks
Mike
The text was updated successfully, but these errors were encountered: