Academic Integrity: tutoring, explanations, and feedback — we don’t complete graded work or submit on a student’s behalf.

Refer to the data set: (a) How would you treat the missing values and why? (b) F

ID: 3301849 • Letter: R

Question

Refer to the data set:

(a) How would you treat the missing values and why?

(b) Find the average (mean) for each quiz.

(c) How could you compare the grades of the midterm with the quizzes?

(d) How can we predict the scores of the final?

(e) Create a histogram for quiz 3 and quiz 4 using bins of width 1 starting from 0. Describe their

shape.

Student Quiz1 Quiz2 Quiz3 Quiz5 Quiz6 Midterm EC 4.55 5.00 3.75 6.80 6.70 13.00 21.50 21.00 24.50 24.00 25.00 21.50 18.50 23.00 14.00 22.00 20.00 12.50 17.00 21.00 18.50 18.00 17.00 22.00 8.00 19.50 21.50 20.50 24.50 19.00 18.00 18.00 15.50 16.50 1.00 1.00 2.00 2 5.50 6.00 6.50 7.00 5.70 5.00 7.00 6.50 6.90 4 6.80 7.00 6.60 1.00 1.00 5.35 6 7.00 6.50 5.90 6.75 8 5.55 7.00 1.00 1.00 6.30 6.90 5.65 6.75 6.05 5.80 5.80 4.05 6.50 5.60 4.70 2.00 1.00 5.75 4.70 6.20 6.20 1.00 1.00 4.00 4.20 2.90 6.70 3.75 5.50 6.00 6.55 6.50 6.60 1.00 1.00 4.80 6.10 6.70 6.60 6.60 1.00 1.00 3.90 5.80 6.20 6.50 6.60 6.55 7.00 6.50 2.00 1.00 5.25 3.50 5.00 19.14 0.714286 0.862562 0.918132 0.6392860.932759 0.804433 0.765517 0.58621

Explanation / Answer

Answer to part a)

The missing values are not counted in the statistical calculations

We do not consider them for statistical inference purpose

.

Answer to part b)

Using the command =average(range of data) we get the average score of each quiz

The mean values seem to be already calculated in the image

with quiz 1 , mean = 5, quiz 2 mean = 6.04 , quiz 3 Mean = 6.43 , quiz4 mean = 4.48

Quiz 5 mean = 6.53

Quiz 6 mean = 5.63

MT = 19.14

EC = 1.17

.

Answer to part c)

we can run a regression analysis, where in quiz 1 to quiz 6 serves as the independent variable and the Midterm scores serve as the dependent variables

By this we would get to know if this model has good predictive power or not.

.

Answer to part d)

The scores of the final would be the sum of all the scores

for that create a new column of Total , and the values in it would be the sum of all the rest of the scores