Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DOC]: how do you pre-parse apache logs? With regex or NLP? #1249

Closed
2 tasks done
nyck33 opened this issue Oct 6, 2023 · 2 comments
Closed
2 tasks done

[DOC]: how do you pre-parse apache logs? With regex or NLP? #1249

nyck33 opened this issue Oct 6, 2023 · 2 comments
Labels
doc Improvements or additions to documentation external This issue was filed by someone outside of the Morpheus team

Comments

@nyck33
Copy link

nyck33 commented Oct 6, 2023

How would you describe the priority of this documentation request

Medium

Describe the future/missing documentation

...'BERT language model using a toy dataset of just 1000 previously parsed apache server logs as a labeled data' it says here: https://github.com/nv-morpheus/Morpheus/blob/branch-23.11/models/training-tuning-scripts/log-parsing-models/log-parsing-training.ipynb so I googled parsing apache logs and see things like: https://www.loggly.com/ultimate-guide/parsing-apache-logs/ but that seems low-tech and maybe not in line with all the NLP being used in Morpheus so can you tell me how you did it, and if you didn't use NLP, suggest some NLP ways to do this?

Where have you looked?

See above

Code of Conduct

  • I agree to follow this project's Code of Conduct
  • I have searched the open documentation issues and have found no duplicates for this bug report
@nyck33 nyck33 added the doc Improvements or additions to documentation label Oct 6, 2023
@jarmak-nv jarmak-nv added Needs Triage Need team to review and classify external This issue was filed by someone outside of the Morpheus team labels Oct 6, 2023
@jarmak-nv
Copy link
Contributor

Hi @nyck33!

Thanks for submitting this issue - our team has been notified and we'll get back to you as soon as we can!
In the mean time, feel free to add any relevant information to this issue.

@jarmak-nv jarmak-nv removed the Needs Triage Need team to review and classify label Dec 11, 2023
@jarmak-nv
Copy link
Contributor

Hey @nyck33

We used an open dataset of apache logs and a traditional regex parser, https://github.com/amandasaurus/apache-log-parser, to create an example training dataset.

Any specific bugs/features/documentation issues with Morpheus, please open an issue to resolve, otherwise for more free-form responses please open a discussion on GitHub. Thanks!

@github-project-automation github-project-automation bot moved this from Todo to Done in Morpheus Boards Dec 11, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
doc Improvements or additions to documentation external This issue was filed by someone outside of the Morpheus team
Projects
Status: Done
Development

No branches or pull requests

2 participants