Group pods under aerogear-metrics application and use deploy instead of deploymentConfig #34

StevenTobin · 2018-03-02T10:22:22Z

This PR groups all the metrics pods under the application label aerogear-metrics and switches from OpenShift DeploymentConfigs to Kubernetes Deployment resources.

Related issues: #30 & #31
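
For readers unfamiliar with the change, here is a minimal sketch of what one of the resulting resources could look like. It is an illustration, not the manifest from this PR: the `service` label key, the image reference, and the `apps/v1` API version are all assumptions (older clusters may only serve Deployments under a beta API group).

```yaml
# Illustrative sketch only; not copied from the APB.
apiVersion: apps/v1              # assumption: the cluster serves apps/v1
kind: Deployment                 # replaces an OpenShift DeploymentConfig
metadata:
  name: grafana
  labels:
    app: aerogear-metrics        # shared label that groups all metrics pods
spec:
  replicas: 1
  selector:
    matchLabels:
      app: aerogear-metrics
      service: grafana           # hypothetical extra label so selectors do not overlap
  template:
    metadata:
      labels:
        app: aerogear-metrics
        service: grafana
    spec:
      containers:
        - name: grafana
          image: grafana/grafana # assumption: placeholder image reference
```

Because every Deployment would share `app: aerogear-metrics`, each selector in this sketch carries a second, distinguishing label; otherwise the selectors would match each other's pods.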

david-martin · 2018-03-05T12:02:47Z

@matzew would appreciate your review on this given you're also working on https://issues.jboss.org/browse/AEROGEAR-2012

matzew · 2018-03-05T13:47:12Z

👀 at it now

matzew · 2018-03-05T14:10:47Z

While I see that the deployment part works, I am unable to get the entire APB up and running.

Here is the change from dc to deployment, which I could successfully verify:

➜  metrics-apb git:(3d6ef5e) oc get dc
No resources found.
➜  metrics-apb git:(3d6ef5e) oc get deployment
NAME                   DESIRED   CURRENT   UP-TO-DATE   AVAILABLE   AGE
aerogear-app-metrics   1         1         1            1           5m
grafana                1         1         1            1           6m
postgres               1         1         1            1           5m
prometheus             1         1         1            1           7m

However, the APB fails here.

See the log:


FAILED - RETRYING: Wait for app-metrics health endpoint to report healthy (4 retries left).
FAILED - RETRYING: Wait for app-metrics health endpoint to report healthy (3 retries left).
FAILED - RETRYING: Wait for app-metrics health endpoint to report healthy (2 retries left).
FAILED - RETRYING: Wait for app-metrics health endpoint to report healthy (1 retries left).
fatal: [localhost]: FAILED! => {"attempts": 30, "changed": false, "content": "", "msg": "Status code was not [200]: Request failed: <urlopen error [Errno 113] No route to host>", "redirected": false, "status": -1, "url": "http://aerogear-app-metrics-foo.192.168.37.1.nip.io/healthz "}
to retry, use: --limit @/opt/apb/actions/provision.retry
PLAY RECAP *********************************************************************
localhost                  : ok=38   changed=35   unreachable=0    failed=1
+ EXIT_CODE=2
+ set +ex
+ '[' -f /var/tmp/test-result ']'
+ exit 2

This brings me to the question: why are we still using openshift_v1_route entries? We can use k8s_v1_endpoints instead, right?

StevenTobin · 2018-03-05T15:17:51Z

That check in the APB for the pods to be ready depends heavily on your system and how quickly the pods come up. If your system takes a while to pull the image, or is under heavy load, the check can time out, even though the pod may be fine when you look at the project in OpenShift seconds after the APB provision has 'failed'.

I've been investigating the CI failures on this APB, and the cause usually seems to be these container checks timing out. I also can't find checks like these in any other APBs, and my instinct is to remove them. Without them the APB won't wait for each container to become ready, so the whole APB should provision faster. This is what I'd like to do:

Remove the container ready checks for grafana and metrics-api server as the functionality can be replaced with readinessProbes.
The metrics-api server already has a readinessProbe that hits the healthz endpoint.
Grafana has an /api/health endpoint so add a readinessProbe that hits that endpoint.
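
As a rough sketch of the Grafana probe described in the last item, assuming Grafana listens on its default port 3000. The image reference and all timing values are illustrative assumptions, not settings taken from the APB:

```yaml
# Container fragment for the Grafana Deployment (sketch under stated assumptions).
containers:
  - name: grafana
    image: grafana/grafana        # assumption: placeholder image reference
    ports:
      - containerPort: 3000       # Grafana's default HTTP port
    readinessProbe:
      httpGet:
        path: /api/health         # health endpoint mentioned above
        port: 3000
      initialDelaySeconds: 10     # illustrative value
      periodSeconds: 5            # illustrative value
      timeoutSeconds: 2           # illustrative value
```

With a probe like this, the pod is only marked ready once the endpoint answers successfully, so the readiness state comes from Kubernetes rather than from a retry loop inside the APB.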

I can't find a similar health endpoint for Prometheus, however.

@philbrookes @david-martin @matzew does this seem reasonable?

I'll create an issue as it's not really in the scope of this PR.

david-martin · 2018-03-05T15:31:41Z

@StevenTobin Readiness probes for health endpoints sound reasonable.
Checks at the end of the APB provision task to ensure every service is running OK (has at least 1 pod in a ready state) would ensure the APB only finishes if everything is running OK.
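
A final check along those lines might look roughly like the task below. This is a sketch, not code from the APB: it assumes the `oc` client is available in the APB image, that a `namespace` variable is passed to the playbook, and that a missing `availableReplicas` field prints as an empty string. The retry and delay values are illustrative.

```yaml
# Hypothetical final provision task: wait until each Deployment has an available pod.
- name: Wait until every Deployment reports at least one available replica
  command: >
    oc get deployment {{ item }}
    -n {{ namespace }}
    -o jsonpath={.status.availableReplicas}
  register: available
  # An empty result converts to 0, so the task keeps retrying until a replica is available.
  until: (available.stdout | int) >= 1
  retries: 20                    # illustrative value
  delay: 15                      # illustrative value, in seconds
  changed_when: false            # read-only check, never reports a change
  with_items:
    - aerogear-app-metrics
    - grafana
    - postgres
    - prometheus
```

Putting one such check at the end, instead of after each service, lets all four Deployments start in parallel while the provision still only succeeds once everything is available.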

matzew · 2018-03-06T07:46:35Z

@StevenTobin I also see that the deprovision still speaks about deploymentconfig.

Perhaps we wanna use something like this:
https://github.com/ansibleplaybookbundle/mediawiki-apb/blob/master/roles/mediawiki/tasks/deprovision.yml#L22-L28
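
One possible shape for such a deprovision task, sketched with the plain `oc` client rather than whatever module the linked playbook uses (that file is not reproduced here). The `namespace` variable and the label selector are assumptions based on the grouping this PR introduces:

```yaml
# Hypothetical deprovision task; adjust to the APB's actual variables.
- name: Delete all Deployments grouped under aerogear-metrics
  command: >
    oc delete deployment
    -l app=aerogear-metrics
    -n {{ namespace }}
    --ignore-not-found
```

Selecting by the shared `app` label means the task does not need to list each Deployment by name.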

matzew · 2018-03-06T07:47:48Z

> I'll create an issue as it's not really in the scope of this PR.

+1 on moving this larger discussion to a new issue/thread

david-martin

All grouped into a single application.

All Deployments as expected (no DeploymentConfigs)

$ oc get deployment
NAME                   DESIRED   CURRENT   UP-TO-DATE   AVAILABLE   AGE
aerogear-app-metrics   1         1         1            1           2m
grafana                1         1         1            1           3m
postgres               1         1         1            1           2m
prometheus             1         1         1            1           4m
$ oc get dc
No resources found.

All running.

Approved

secondsun · 2018-03-06T22:57:02Z

I don't know if this is related, but after deploying metrics with this PR I don't see it in my list of services after running mobile get clientconfig myapp-android --namespace=myproject -o json

david-martin · 2018-03-08T17:17:27Z

@secondsun I think this will address it disappearing from the cli
https://issues.jboss.org/browse/AEROGEAR-2267 (cc @pb82)

pb82 · 2018-03-08T17:33:34Z

@secondsun did this work for you before? Because I think it shouldn't work.

grdryn · 2018-12-04T20:38:06Z

Bump!

I just noticed that DeploymentConfigs were being used, and was considering changing to Deployments. Thankfully I noticed this PR first!

StevenTobin added 2 commits March 1, 2018 15:20

group pods under aerogear-metrics application

ed35005

use kubernetes deploy instead of openshift deploymentconfig

3d6ef5e

StevenTobin changed the title ~~group pods under aerogear-metrics application~~ group pods under aerogear-metrics application and use deploy instead of deploymentCong Mar 5, 2018

StevenTobin changed the title ~~group pods under aerogear-metrics application and use deploy instead of deploymentCong~~ group pods under aerogear-metrics application and use deploy instead of deploymentConfig Mar 5, 2018

StevenTobin changed the title ~~group pods under aerogear-metrics application and use deploy instead of deploymentConfig~~ Group pods under aerogear-metrics application and use deploy instead of deploymentConfig Mar 5, 2018

This was referenced Mar 5, 2018

Switch to Kubernetes resources (Deployments instead of DeploymentConfigs) #31

Open

Use the same 'app' label for all Deployments in the APB #30

Open

david-martin requested review from david-martin and matzew March 5, 2018 12:02

david-martin approved these changes Mar 6, 2018


secondsun mentioned this pull request Mar 6, 2018

failed to provision keycloak after fetching the latest version of keycloak-apb aerogearcatalog/keycloak-apb#48

Open

remove k8s deploys on deprovision

060148d
