
The director of the state's Public Health Department wants you and your team t


Question

The director of the state's Public Health Department wants you and your team to analyze the data that the department has been collecting through the biosurveillance system. The director wants to know whether you and your team are able to predict the occurrence of a new sexually transmitted disease, X. The researchers who discovered X state that ongoing investigation points to several factors that appear to magnify the spread of this disease.



1) What type of data do you need to extract from the biosurveillance system to formulate the prediction model? (You can make any assumptions you need about the data collected in the biosurveillance system.)

2) What prediction model will you be utilizing and why? (Hint: When is it appropriate to use linear regression, multiple regression, and survival analysis?)

3) Are there any advantages or disadvantages of utilizing your prediction model?

Explanation / Answer

(1)

Let's say that the disease X is magnified by four factors A, B, C and D.

So you need data on five variables: the occurrence of disease X and the four factors A, B, C and D. The data can be in any relevant units.

The assumption is that the four factors are independent of one another, so that the effect of each on X can be estimated separately.

(2)

In this case you will be using a multiple regression model, because there is a single dependent variable, X, and several independent variables, A, B, C and D. Simple linear regression would handle only one independent variable.
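As a rough illustration of the multiple regression described here, the sketch below fits X on A, B, C and D by ordinary least squares. Everything in it is an assumption made for illustration: the data are synthetic, the coefficient values are invented, and the helper names `fit_multiple_regression` and `predict` are hypothetical rather than part of any biosurveillance system.

```python
import numpy as np

def fit_multiple_regression(factors, outcome):
    """Ordinary least squares fit; returns [intercept, b_A, b_B, b_C, b_D]."""
    # Prepend a column of ones so the first coefficient is the intercept.
    design = np.column_stack([np.ones(len(factors)), factors])
    coefs, *_ = np.linalg.lstsq(design, outcome, rcond=None)
    return coefs

def predict(coefs, factors):
    """Predicted X for new rows of factor values."""
    design = np.column_stack([np.ones(len(factors)), factors])
    return design @ coefs

# Synthetic stand-in for the extracted data (assumption: four numeric factors).
rng = np.random.default_rng(0)
n = 200
factors = rng.normal(size=(n, 4))                   # columns: A, B, C, D
true_coefs = np.array([2.0, 1.5, -0.5, 0.8, 0.3])   # invented for the demo
x_cases = true_coefs[0] + factors @ true_coefs[1:] + rng.normal(scale=0.1, size=n)

coefs = fit_multiple_regression(factors, x_cases)
print("estimated coefficients:", np.round(coefs, 2))
print("prediction for one new record:", predict(coefs, np.array([[0.5, -1.0, 0.2, 1.3]])))
```

Each fitted coefficient estimates how much X changes per unit change in one factor while the other three are held fixed, which is why this model suits a single outcome with several candidate factors.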

(3)

The main disadvantage is that the error term in the regression model is assumed to be normally distributed with zero mean, which may not always be true.

Since regression analysis rests on these assumptions, the predicted values of X will not be accurate when the assumptions are not met.
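One way to see whether the error assumption is plausible is to inspect the residuals of the fitted model. The sketch below is a minimal illustration on synthetic data, not a full diagnostic: the helper `residual_summary` is hypothetical, and skewness and excess kurtosis are used only as rough indicators of departure from normality.

```python
import numpy as np

def residual_summary(factors, outcome):
    """Fit by least squares and summarize the shape of the residuals."""
    design = np.column_stack([np.ones(len(factors)), factors])
    coefs, *_ = np.linalg.lstsq(design, outcome, rcond=None)
    residuals = outcome - design @ coefs
    centered = residuals - residuals.mean()
    std = centered.std()
    return {
        "mean": residuals.mean(),
        # Both values are close to 0 for normally distributed errors.
        "skewness": np.mean(centered**3) / std**3,
        "excess_kurtosis": np.mean(centered**4) / std**4 - 3.0,
    }

# Synthetic data (assumption): same signal, two different error distributions.
rng = np.random.default_rng(1)
n = 2000
factors = rng.normal(size=(n, 4))
signal = 2.0 + factors @ np.array([1.5, -0.5, 0.8, 0.3])

normal_case = residual_summary(factors, signal + rng.normal(size=n))
skewed_case = residual_summary(factors, signal + rng.exponential(size=n))

print("normal errors:", normal_case)
print("skewed errors:", skewed_case)
```

Because the model includes an intercept, the residual mean is zero by construction in both cases, so the shape measures are the more informative check here. Clearly non-zero skewness suggests the normality assumption is violated.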