Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

With a large number ec2 instances ec2 discovery times out #275

Open
dat-vikash opened this issue Jun 25, 2016 · 4 comments
Open

With a large number ec2 instances ec2 discovery times out #275

dat-vikash opened this issue Jun 25, 2016 · 4 comments

Comments

@dat-vikash
Copy link

It seems with a large number of ec2 instances within a region, the ec2 discovery plugin will timeout

[2016-06-25 03:48:09,987][INFO ][discovery.ec2            ] [Zartra] Exception while retrieving instance list from AWS API: Unable to unmarshall response (ParseError at [row,col]:[174,26]
Message: Read timed out). Response Code: 200, Response Text: OK
[2016-06-25 03:49:21,390][INFO ][discovery.ec2            ] [Zartra] Exception while retrieving instance list from AWS API: Unable to unmarshall response (java.net.SocketTimeoutException: Read timed out). Response Code: 200, Response Text: OK

It would be great if we can configure this timeout either through environmental variables of system properties.

@dadoonet
Copy link
Member

Thanks for reporting.
Could you open this in Elasticsearch repo?

What is your version?
How many Elasticsearch nodes you have?
Do you use filtering to only get back Elasticsearch instances related to your project?

@dat-vikash
Copy link
Author

I can open it in the elastic search repo, but I do think the issue lies with the aws ec2 discovery plugin. When I run the command:

aws ec2 describe-instances --region=us-east-1

on one of the ec2 instances within the cluster I do see a very large response time, on the order of 15-20 seconds. I'm guessing that with large node counts, the response time from this command goes up, so it would be great if we could increase the timeout via a setting.

ES Version : 2.3.3
I have 2 clusters : dev and qa each with 20 nodes. I launched dev on thursday evening with no problems and only seem to be having the problems with QA. Both are configured the same except for security groups.

I have not used filtering yet. Would filtering apply the the actual describe-instances command or would it only be used after the command has run? If it is applied to the describe-instances command, it could definitely be a workable solution.

Thanks!

@dadoonet
Copy link
Member

This plugin don't live here anymore from 2.0.

@dadoonet
Copy link
Member

IIRC filtering applies on AWS side so my guess is that you'll get back less instances.
20 nodes is a small number. Wondering why you have so bad response time.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants