The CSV data sets are as follows:
- UsersDumpForCodeTest.csv
  - Fields: UserID, country, certifications
  - Description: This data set contains the country and certifications associated with each user. UserId uniquely identifies the user.
- AssessmentScoreForCodeTest.csv
  - Fields: TestId, UserId, assessment_name, score
  - Description: This data set contains scores for the assessments (tests) that the user has taken. TestID uniquely identifies a test that the user has taken.
- UserSurveyResultsForCodeTest.csv
  - Fields: TestID, question, answer
  - Description: Surveys are presented to users after they finish taking a test. A survey consists of a collection of questions and answers and is tied to a TestID.
- AssessmentRealTimeEventsForCodeTest.csv
  - Fields: TestID, assessmentname, eventcode, timestamp
  - Description: While the user is taking an assessment (test), real-time events are collected that represent the various stages of the test. The event codes are:
    - STRASMNT: When the user chooses to start the assessment
    - RDYASMNT: When the assessment environment is loaded and ready for the student to work on it
    - GRDDOM: Grades a domain (sub-section) of an assessment
    - GRDASMNT: Grades the entire assessment
    - VIEWASMTDIAG: When the student views a diagram-hint on the assessment
    - VIEWASMTGRADINGHINT: When the student views a grading-hint
    - CANCASMNT: When the user chooses to cancel the assessment
    - ABNDNASMNT: When the user never completes the assessment or walks away from (abandons) it
    - CMPLASMNT: When the assessment is complete
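
The event codes above can be turned into a duration per test. The following is a minimal sketch, not a definitive solution. It assumes that "time taken" means the span from `STRASMNT` to `CMPLASMNT` for the same TestID, and that timestamps are ISO 8601 strings; neither is stated in the brief. The function name and the sample rows are made up for illustration.

```python
import csv
import io
from collections import defaultdict
from datetime import datetime
from statistics import mean

# Assumption: a completed run is bounded by these two event codes.
START, END = "STRASMNT", "CMPLASMNT"


def average_completion_seconds(rows):
    """Return {assessment name: mean seconds from START to END}."""
    starts, ends, names = {}, {}, {}
    for row in rows:
        test_id = row["TestID"]
        # Assumption: timestamps are ISO 8601 (e.g. 2021-01-01T10:00:00).
        ts = datetime.fromisoformat(row["timestamp"])
        names[test_id] = row["assessmentname"]
        if row["eventcode"] == START:
            # Keep the earliest start event for the test.
            starts[test_id] = min(ts, starts.get(test_id, ts))
        elif row["eventcode"] == END:
            # Keep the latest completion event for the test.
            ends[test_id] = max(ts, ends.get(test_id, ts))

    durations = defaultdict(list)
    for test_id, end in ends.items():
        # Tests that were cancelled or abandoned have no END and are skipped.
        if test_id in starts:
            seconds = (end - starts[test_id]).total_seconds()
            durations[names[test_id]].append(seconds)
    return {name: mean(values) for name, values in durations.items()}


# Made-up rows in the layout of AssessmentRealTimeEventsForCodeTest.csv.
SAMPLE_EVENTS = """TestID,assessmentname,eventcode,timestamp
1,AWS Basics,STRASMNT,2021-01-01T10:00:00
1,AWS Basics,CMPLASMNT,2021-01-01T10:30:00
2,AWS Basics,STRASMNT,2021-01-01T11:00:00
2,AWS Basics,CMPLASMNT,2021-01-01T11:10:00
3,Linux,STRASMNT,2021-01-01T09:00:00
3,Linux,ABNDNASMNT,2021-01-01T09:05:00
4,Linux,STRASMNT,2021-01-01T12:00:00
4,Linux,CMPLASMNT,2021-01-01T12:15:00
"""

averages = average_completion_seconds(csv.DictReader(io.StringIO(SAMPLE_EVENTS)))
print(averages)
```

To run it against the real file, pass `csv.DictReader(open("AssessmentRealTimeEventsForCodeTest.csv", newline=""))` in place of the sample reader. If the actual timestamps use another format, the `datetime.fromisoformat` call would need to be swapped for a matching `datetime.strptime` pattern.
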
- Analyze these data sets and present your analysis. This is an open-ended task to exercise your creativity.
- Find the average time taken to complete an assessment, grouped by assessment name.
- Generate a list of users that fit the following criteria:
  - Got 80% or above on an assessment.
  - Had a 100% chance of referring a friend.
  - Is based in the USA.
  - Has received AWS certifications.
- Sort the list by test score, with the most-taken assessments first.
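
The user-list criteria above can be sketched as a join across three of the files. This is one possible reading under several assumptions that the brief does not confirm: `score` is a percentage from 0 to 100, the country is stored as the literal `USA`, AWS certifications contain the substring `AWS`, and the referral likelihood is a survey row whose question mentions "refer" and whose answer is `100` or `100%`. The sort is interpreted as score descending, then by how many times the assessment was taken overall. The function name, its parameters, and the sample rows are hypothetical.

```python
import csv
import io
from collections import Counter


def qualifying_users(users, scores, surveys,
                     referral_keyword="refer", referral_answer="100"):
    """Return (UserId, assessment_name, score) tuples matching all criteria."""
    # Assumption: country is the literal "USA" and AWS certs contain "AWS".
    eligible_users = {
        u["UserID"] for u in users
        if u["country"].strip().upper() == "USA"
        and "AWS" in u["certifications"].upper()
    }
    # Assumption: the referral question contains the keyword and the
    # answer is "100" with an optional trailing percent sign.
    referred_tests = {
        s["TestID"] for s in surveys
        if referral_keyword in s["question"].lower()
        and s["answer"].strip().rstrip("%") == referral_answer
    }
    scores = list(scores)
    # How often each assessment was taken, across all users.
    taken = Counter(r["assessment_name"] for r in scores)
    hits = [
        r for r in scores
        if float(r["score"]) >= 80
        and r["UserId"] in eligible_users
        and r["TestId"] in referred_tests
    ]
    # Highest score first; ties broken by the most-taken assessment.
    hits.sort(key=lambda r: (-float(r["score"]), -taken[r["assessment_name"]]))
    return [(r["UserId"], r["assessment_name"], float(r["score"])) for r in hits]


def read(text):
    return list(csv.DictReader(io.StringIO(text)))


# Made-up rows in the layout of the three CSV files.
USERS = read("""UserID,country,certifications
u1,USA,AWS Solutions Architect
u2,USA,Azure Fundamentals
u3,Canada,AWS Developer
u4,USA,AWS Developer
""")
SCORES = read("""TestId,UserId,assessment_name,score
t1,u1,Cloud Basics,95
t2,u2,Cloud Basics,99
t3,u3,Cloud Basics,90
t4,u4,Networking,95
t5,u1,Networking,70
t6,u4,Cloud Basics,85
t7,u4,Cloud Basics,88
""")
QUESTION = "How likely are you to refer a friend?"
SURVEYS = read(
    "TestID,question,answer\n"
    + "".join(f"{t},{QUESTION},{a}\n" for t, a in [
        ("t1", "100%"), ("t2", "100%"), ("t3", "100%"), ("t4", "100%"),
        ("t5", "100%"), ("t6", "50%"), ("t7", "100%"),
    ])
)

result = qualifying_users(USERS, SCORES, SURVEYS)
print(result)
```

Keeping the referral match behind two parameters makes it easy to adjust once the actual survey wording and answer format have been inspected.
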