Initial ML Model for TANNER #437

hinnazeejah · 2024-08-19T22:34:06Z

No description provided.

The test_data.csv file contains the dataset used to evaluate the performance of the machine learning model. It includes a variety of URLs and payloads, pre-processed using TF-IDF (Term Frequency-Inverse Document Frequency) to extract relevant features. The model uses this data to predict the type of web attacks, such as SQL Injection, Cross-Site Scripting (XSS), and Local/Remote File Inclusion (LFI/RFI), and compare its predictions against the actual labels to assess accuracy and other performance metrics.

The train_data.csv file contains the dataset used to train the machine learning model. It includes a variety of URLs and payloads, pre-processed using TF-IDF (Term Frequency-Inverse Document Frequency) to extract relevant features. This data teaches the model to recognize and categorize different types of web attacks, such as SQL Injection, Cross-Site Scripting (XSS), and Local/Remote File Inclusion (LFI/RFI). By learning from this data, the model becomes capable of making accurate predictions on new, unseen data.

This Jupyter Notebook outlines the complete machine learning pipeline for detecting web attacks using a Random Forest Classifier. It includes data loading, cleaning (removal of duplicates, handling missing values, and outlier removal), feature engineering (Label Encoding and TF-IDF transformation), model training, and evaluation. The pipeline also provides insights into class distribution, summary statistics, and feature correlations, all aimed at improving the accuracy and effectiveness of web attack detection in TANNER.

Update README.md

Modified notebook to import libraries, unzip dataset

cyberholics · 2024-08-27T10:08:04Z

Hello, It seems this project is about creating a ML based classifier for TANNER, I am interested in contributing to this project.

hinnazeejah and others added 30 commits August 16, 2024 11:16

Create README.md

df4cf05

Update README.md

59b5892

Add files via upload

ee277ff

Add Random Forest Model

72734a4

Update README.md

dd56f7b

Update README.md

15867bb

Update README.md

0ad02e8

Update README.md

92bb482

Update README.md

285d297

Update README.md

8173dd0

Update README.md

977ec60

Update README.md

7e9e7a4

Update README.md

6497434

Update README.md

a5bc058

Update README.md

1cad858

Update README.md

7df7f56

Update README.md

ef69d33

Update README.md

8cdeb86

Update README.md

c179276

Update README.md

6506324

Merge pull request #1 from richardp23/patch-1

95ba941

Update README.md

Update README.md

078a8ee

Update README.md

b45146e

Update README.md

e67b0ca

Update README.md

67898ee

Update README.md

863d0c1

Update README.md

077c505

hinnazeejah and others added 9 commits August 17, 2024 21:16

Update README.md

02aa39a

Update README.md

027ba7f

Update README.md

18a4ae7

Update README.md

d106cc2

Modified notebook to import libraries, unzip dataset

5976dd4

Merge pull request #2 from richardp23/main

ba026bc

Modified notebook to import libraries, unzip dataset

Update README.md

21a4aa8

Update README.md

6556585

Update README.md

c970d5c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial ML Model for TANNER #437

Initial ML Model for TANNER #437

hinnazeejah commented Aug 19, 2024

cyberholics commented Aug 27, 2024

Initial ML Model for TANNER #437

Are you sure you want to change the base?

Initial ML Model for TANNER #437

Conversation

hinnazeejah commented Aug 19, 2024

cyberholics commented Aug 27, 2024