Refer to the data set: (a) How would you treat the missing values and why? (b) F
Question
Refer to the data set:
(a) How would you treat the missing values and why?
(b) Find the average (mean) for each quiz.
(c) How could you compare the grades of the midterm with the quizzes?
(d) How can we predict the scores of the final?
(e) Create a histogram for quiz 3 and quiz 4 using bins of width 1 starting from 0. Describe their shape.
Student Quiz1 Quiz2 Quiz3 Quiz5 Quiz6 Midterm EC
4.55 5.00 3.75 6.80 6.70 13.00 21.50 21.00 24.50 24.00 25.00 21.50 18.50 23.00 14.00 22.00 20.00 12.50 17.00 21.00 18.50 18.00 17.00 22.00 8.00 19.50 21.50 20.50 24.50 19.00 18.00 18.00 15.50 16.50 1.00 1.00 2.00 2 5.50 6.00 6.50 7.00 5.70 5.00 7.00 6.50 6.90 4 6.80 7.00 6.60 1.00 1.00 5.35 6 7.00 6.50 5.90 6.75 8 5.55 7.00 1.00 1.00 6.30 6.90 5.65 6.75 6.05 5.80 5.80 4.05 6.50 5.60 4.70 2.00 1.00 5.75 4.70 6.20 6.20 1.00 1.00 4.00 4.20 2.90 6.70 3.75 5.50 6.00 6.55 6.50 6.60 1.00 1.00 4.80 6.10 6.70 6.60 6.60 1.00 1.00 3.90 5.80 6.20 6.50 6.60 6.55 7.00 6.50 2.00 1.00 5.25 3.50 5.00 19.14 0.714286 0.862562 0.918132 0.639286 0.932759 0.804433 0.765517 0.58621
Explanation / Answer
Answer to part a)
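The missing values are left out of the statistical calculations rather than being replaced with zeros.
Counting a blank as a zero would pull the mean of that quiz down, so we do not use the missing entries for statistical inference.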
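As a rough illustration of why missing values are left out instead of being counted as zeros, the sketch below compares the two treatments on one quiz column. The scores are made up for the example and are not taken from the data set; `None` stands in for a blank cell.

```python
from statistics import mean

# Made-up scores for one quiz; None marks a student with no recorded score.
quiz_scores = [5.5, 6.0, None, 7.0, None, 6.5]

# Treatment 1: leave the missing values out of the calculation.
observed = [s for s in quiz_scores if s is not None]
mean_skipping = mean(observed)

# Treatment 2 (for contrast): count each missing value as a zero.
zero_filled = [0.0 if s is None else s for s in quiz_scores]
mean_zero_filled = mean(zero_filled)

print(f"mean over observed scores: {mean_skipping:.2f}")
print(f"mean with blanks as zeros: {mean_zero_filled:.2f}")
```

With these made-up numbers the zero-filled mean comes out well below the mean of the observed scores, which is the distortion that skipping the blanks avoids.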
.
Answer to part b)
Using the spreadsheet formula =AVERAGE(range of data) on each column gives the average score for each quiz.
The mean values appear to be already calculated in the image:
Quiz 1 mean = 5
Quiz 2 mean = 6.04
Quiz 3 mean = 6.43
Quiz 4 mean = 4.48
Quiz 5 mean = 6.53
Quiz 6 mean = 5.63
MT = 19.14
EC = 1.17
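Outside a spreadsheet, the same per-column averaging could be sketched in Python as follows. The `gradebook` dictionary and the `column_means` helper are hypothetical names, and the scores are placeholders rather than the values from the data set.

```python
from statistics import mean

# Hypothetical gradebook: one list per column, None where a score is missing.
gradebook = {
    "Quiz1": [5.0, 4.5, None, 5.5],
    "Quiz2": [6.0, None, 6.5, 5.5],
    "Midterm": [21.5, 18.0, 20.5, None],
}

def column_means(columns):
    """Return the mean of each column over its non-missing entries."""
    return {
        name: mean([v for v in values if v is not None])
        for name, values in columns.items()
    }

means = column_means(gradebook)
for name, value in means.items():
    print(f"{name}: {value:.2f}")
```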
.
Answer to part c)
We can run a regression analysis in which the Quiz 1 to Quiz 6 scores serve as the independent variables and the midterm score serves as the dependent variable.
The fit of the model then tells us whether the quiz scores have good predictive power for the midterm.
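A minimal sketch of the idea, under a simplifying assumption: instead of six separate quiz predictors, it uses one predictor, each student's quiz total, so that ordinary least squares can be written with the standard library alone. A regression on all six quizzes would normally be run with a statistics package. The data and the `fit_line` helper are hypothetical.

```python
from statistics import mean

# Hypothetical data: each student's quiz total and midterm score.
quiz_total = [30.0, 34.0, 36.0, 38.0, 42.0]
midterm = [15.0, 18.0, 19.0, 21.0, 22.0]

def fit_line(x, y):
    """Least-squares fit of y = intercept + slope * x.

    Returns (slope, intercept, r_squared).
    """
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    syy = sum((yi - y_bar) ** 2 for yi in y)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    r_squared = sxy ** 2 / (sxx * syy)  # share of variance in y explained by x
    return slope, intercept, r_squared

slope, intercept, r_squared = fit_line(quiz_total, midterm)
print(f"midterm = {intercept:.2f} + {slope:.2f} * quiz_total, R^2 = {r_squared:.2f}")
```

An R^2 close to 1 would suggest that the quiz scores explain most of the variation in the midterm scores; a value near 0 would suggest little predictive power.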
.
Answer to part d)
The final score would be taken as the sum of all the scores.
To compute it, create a new column, Total, whose value for each student is the sum of all of that student's other scores.
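The Total column described here could be built as in the sketch below. The rows, the `SCORE_COLUMNS` list, and the `add_total` helper are hypothetical, and the sketch assumes a missing score is skipped in the sum, which is also how a spreadsheet SUM treats blank cells.

```python
# Hypothetical rows; None marks a missing score.
rows = [
    {"Student": 1, "Quiz1": 5.0, "Quiz2": 6.0, "Midterm": 21.5, "EC": 1.0},
    {"Student": 2, "Quiz1": 4.5, "Quiz2": None, "Midterm": 18.0, "EC": 2.0},
]

# Columns that count toward the total (assumed; adjust to the real sheet).
SCORE_COLUMNS = ["Quiz1", "Quiz2", "Midterm", "EC"]

def add_total(row):
    """Return a copy of row with a Total column summing its non-missing scores."""
    total = sum(row[c] for c in SCORE_COLUMNS if row[c] is not None)
    return {**row, "Total": total}

with_totals = [add_total(r) for r in rows]
for r in with_totals:
    print(r["Student"], r["Total"])
```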