Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Docs] Kubernetes quickstart flow troubleshooting #186382

Closed
mykolaharmash opened this issue Jun 18, 2024 · 15 comments · Fixed by elastic/ingest-docs#1409
Closed

[Docs] Kubernetes quickstart flow troubleshooting #186382

mykolaharmash opened this issue Jun 18, 2024 · 15 comments · Fixed by elastic/ingest-docs#1409
Assignees
Labels
Feature: Observability Onboarding Team:obs-ux-logs Observability Logs User Experience Team

Comments

@mykolaharmash
Copy link
Contributor

mykolaharmash commented Jun 18, 2024

We need a place in our docs to link users for troubleshooting issues with the kubectl kustomize command.

This could be a new section inside Scaling Elastic Agent on Kubernetes or a new dedicated page.

There is a Figma mockup with some potential troubleshooting steps that might be used as a starting point for the docs page.

Image

@botelastic botelastic bot added the needs-team Issues missing a team label label Jun 18, 2024
@akhileshpok
Copy link

@bturquet, @gizas - Is there any existing doc for troubleshooting of the agent in K8s that we can refer to from within the new onboarding flow? Thanks

@gizas
Copy link
Contributor

gizas commented Jun 18, 2024

We dont have something specific!

For start, we can refer to below general ones:

We have this one as part of the current PR
https://github.com/elastic/elastic-agent/blob/d1a28379019332e27395259912e3675871720a0a/deploy/kubernetes/elastic-agent-kustomize/default/README.md

I would also suggest to include:

  • Command to check the output of kustomize rendering:
    kubectl kustomize elastic-agent-kustomize/default/elastic-agent-standalone/ >> kustomize.yaml
  • Command to get cluster status:
    kubectl get pods -n kube-system | grep elastic
  • Command to describe agent:
    kubectl describe -n kube-system <name_of_elastic_agent_pod>
  • Check logs of elastic-agent
    kubectl logs -n kube-system <name_of_elastic_agent_pod>
    kubectl logs -n kube-system <name_of_elastic_agent_pod> | grep -i error

This could be a new section inside Scaling Elastic Agent on Kubernetes or a new dedicated page.

I would advise not to change the above page as it has specific goal.

My advise is to link the new page/ guide possibly to:

This is the repo where rest of elastic-agent pages for k8s leave

@akhileshpok
Copy link

Thanks @gizas. @mykolaharmash - Probably best to link the current 'Troubleshooting Common Problems' doc to the new onboarding flow. cc-@bmorelli25 - We should try to improve the current 'Troubleshooting Common Problems' doc based on the suggestions provided by @gizas.

@mbondyra mbondyra added the Team:obs-ux-infra_services Observability Infrastructure & Services User Experience Team label Jun 21, 2024
@elasticmachine
Copy link
Contributor

Pinging @elastic/obs-ux-infra_services-team (Team:obs-ux-infra_services)

@botelastic botelastic bot removed the needs-team Issues missing a team label label Jun 21, 2024
@flash1293
Copy link
Contributor

@mykolaharmash @akhileshpok can we close this?

@roshan-elastic roshan-elastic removed the Team:obs-ux-infra_services Observability Infrastructure & Services User Experience Team label Aug 15, 2024
@botelastic botelastic bot added the needs-team Issues missing a team label label Aug 15, 2024
@roshan-elastic roshan-elastic added the Team:obs-ux-infra_services Observability Infrastructure & Services User Experience Team label Aug 15, 2024
@botelastic botelastic bot removed the needs-team Issues missing a team label label Aug 15, 2024
@mykolaharmash
Copy link
Contributor Author

@flash1293 probably not, we still don't have anything about kustomize flow troubleshooting in the docs (like commands that @gizas provided)

@gbamparop gbamparop added Team:obs-ux-logs Observability Logs User Experience Team Feature: Observability Onboarding and removed Team:obs-ux-infra_services Observability Infrastructure & Services User Experience Team labels Sep 12, 2024
@elasticmachine
Copy link
Contributor

Pinging @elastic/obs-ux-logs-team (Team:obs-ux-logs)

@elasticmachine
Copy link
Contributor

Pinging @elastic/obs-ux-onboarding-team (Feature: Observability Onboarding)

@bmorelli25
Copy link
Member

@eedugon is working on a proposal for improving our Kubernetes docs across Elastic (https://github.com/elastic/ingest-dev/issues/3633#issuecomment-2342936676).

Edu, can you take a look at this issue and make sure the gaps in our documentation explained here are included in your plan?

@eedugon eedugon self-assigned this Sep 24, 2024
@eedugon
Copy link

eedugon commented Sep 25, 2024

@gizas , @mykolaharmash , @akhileshpok : I have a quick question before moving forward with this:

Why the new onboarding workflow in the UX is suggesting a manifest creation method (kustomize) that doesn't appear (or at least i can't find it) anywhere in the docs?

We are telling the users in the UI to use kustomize while all our existing docs still consist of downloading a manifest and updating it before being applied. I'm talking about Running Elastic Agent on Kubernetes and similar docs.

Shouldn't the UX be aligned with the rest of the docs? Are we planning to update also other docs in the near future and start using kustomize related manifests to prepare the users manifests?

Anyway, coming back to this issue:

  • I will start by adding the troubleshooting commands and suggestions in a new section of the Troubleshooting Common Problems doc, called Kustomize based installations of Elastic Agent, and in case of needing to move them somewhere else in the future that will be easy.

  • Then we can update the link of the onboarding UX to point to that new section of the troubleshooting doc (@mykolaharmash , would you change that or should I make that change too? ).

Let us know if that looks good and aligned with the expectations.

@gizas
Copy link
Contributor

gizas commented Sep 25, 2024

Heyyyy @eedugon (nice seeing you on the dark-side)

This is the reference: https://github.com/elastic/elastic-agent/tree/main/deploy/kubernetes#kustomize-templates
In more details see the updated readme

There was a new initiative/ project where we tried to ease the onboarding of users (and mainly install along with the manifest additional resources/assets needed like dashboards and kube-state-metrics with one command). I guess @akhileshpok and @flash1293 might have more documentation for the project goal.

@eedugon
Copy link

eedugon commented Sep 26, 2024

Thanks @gizas for sharing the initiative, and I think it's really great to use kustomize for this (with some caveats like maybe kube-state-metrics could be already installed, and other possible issues).

I'm going to proceed and create a troubleshooting section for possible issues related with this approach (as requested in the issue).

However let me add some comments about the lack of alignment between this new approach (which is great) and our official docs. @bmorelli25 , I'd like you and the team to analyze the following to see if we should create other follow-up issues:

Anyway I will start working on the troubleshooting details and propose something.

@eedugon
Copy link

eedugon commented Oct 24, 2024

@mykolaharmash : where is the Open documentation link of the screenshot you shared pointing?

I'm preparing a PR for ingest-docs to make troubleshooting content available at: /fleet-troubleshooting.html#agent-kubernetes-kustomize, so, something like:

https://www.elastic.co/guide/en/fleet/current/fleet-troubleshooting.html#agent-kubernetes-kustomize.

But if you just point to https://www.elastic.co/guide/en/fleet/current/fleet-troubleshooting.html that might be also ok.

@eedugon
Copy link

eedugon commented Oct 25, 2024

@gizas : I've created a PR elastic/ingest-docs#1409 that should add the content you mentioned at #186382 (comment), and with extra details. I'd like to get your review on it when possible.

@akhileshpok , @bmorelli25 , the PR is not very long, but considering Kubernetes is a complete platform and that we might want to add content to the troubleshooting agent on Kubernetes section in the future, it might be worthy to separate it to a new document.

For the moment I've just added the content to 'Troubleshooting Common Problems' as agreed at #186382 (comment)

@mykolaharmash
Copy link
Contributor Author

@eedugon currently the link point to https://www.elastic.co/guide/en/fleet/current/fleet-troubleshooting.html but I think we should update it to point specifically to the troubleshooting section once the doc is live.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Feature: Observability Onboarding Team:obs-ux-logs Observability Logs User Experience Team
Projects
None yet
Development

Successfully merging a pull request may close this issue.

10 participants