MAT 300 Homework Solution

All calculations and relevant Minitab output must be included to receive full credit. Be sure to word-process your solutions and copy and paste the appropriate outputs from Minitab. Show all steps used in arriving at the final answers. Incomplete solutions will receive partial credit. This exam covers content from Modules One through Three.


(1) Consider the GASTURBINE data set and corresponding output from Minitab. Note that all tests should be performed at the α = 0.05 level. Use the complete data set in your analysis. The first 10 observations are given for illustrative purposes. Complete parts a) through f) below.



ENGINE


SHAFTS


RPM


CPRATIO


INLET-TEMP


EXH-TEMP


AIRFLOW


POWER


HEATRATE


Traditional


1


27245


9.2


1134


602


7


1630


14622


Traditional


1


14000


12.2


950


446


15


2726


13196


Traditional


1


17384


14.8


1149


537


20


5247


11948


Traditional


1


11085


11.8


1024


478


27


6726


11289


Traditional


1


14045


13.2


1149


553


29


7726


11964


Traditional


1


6211


15.7


1172


517


176


52600


10526


Traditional


1


6210


17.4


1177


510


193


57500


10387


Traditional


1


3600


13.5


1146


503


315


89600


10592


Traditional


1


3000


15.1


1146


524


375


113700


10460


Traditional


1


3000


15


1171


525


514


164300


10086



Regression Analysis: HEATRATE versus RPM, CPRATIO, ...



The regression equation is


HEATRATE = 14314 + 0.0806 RPM - 6.8 CPRATIO - 9.51 INLET-TEMP + 14.2 EXH-TEMP


- 2.55 AIRFLOW + 0.00426 POWER



Predictor Coef SE Coef T P


Constant 14314 1112 12.87 0.000


RPM 0.08058 0.01611 5.00 0.000


CPRATIO -6.78 30.38 -0.22 0.824


INLET-TEMP -9.507 1.529 -6.22 0.000


EXH-TEMP 14.155 3.469 4.08 0.000


AIRFLOW -2.553 1.746 -1.46 0.149


POWER 0.004257 0.004217 1.01 0.317



S = 458.757 R-Sq = 92.5% R-Sq(adj) = 91.7%


Analysis of Variance



Source DF SS MS F P


Regression 6 155269735 25878289 122.96 0.000


Residual Error 60 12627473 210458


Total 66 167897208



Write a first-order model in general form for the model that includes RPM, CPRATIO, INLET-TEMP, EXH-TEMP, AIRFLOW, and POWER to predict HEATRATE.

Write out the least squares prediction equation for the model that was fit in Minitab.

Calculate and give an interpretation of the coefficients based on a one-unit change in each xi. Calculate and give an interpretation of the effect on HEATRATE based on a 1-unit change in AIRFLOW together with a 200-unit change in POWER.

Interpret the overall model F-test. State the appropriate hypothesis test and associated numerator and denominator degrees of freedom used for this test as well as the critical value that the test statistic is compared to. State the conclusion you would make regarding the null hypothesis. Specifically, would you reject or fail to reject the null hypothesis, and what does this conclusion means about the model parameters? Does this tell us anything about the significance of the individual predictors? Why or why not?

Report and interpret the model R2.

Which predictors are significant in the model? Report the appropriate hypothesis test and formal conclusion you would make regarding RPM and CPRATIO. In your conclusion, state their p-values and test statistics. Would you suggest removing all non-significant predictors at once and refitting the model? Why or why not?




(2) Using the FLAG data set (first 10 observations given), fit a model that predicts COST based on DOTEST. Use the complete data set in your analysis. Show the relevant output from Minitab in your answer. The first 10 observations are given for informational purposes.



CONTRACT


COST


DOTEST


STATUS


1


1379.43


1386.29


1


2


134.03


85.71


1


3


202.33


248.89


0


4


397.12


467.49


0


5


158.54


117.72


1


6


1128.11


1008.91


1


7


400.33


472.98


1


8


581.64


785.39


0


9


353.96


370.02


0


10


138.71


174.25


0



Calculate a confidence and prediction interval for DOTEST = 100. Interpret the confidence and prediction intervals given in the output. Do you see any problems with the interpretation of the prediction interval in terms of what we are trying to predict? Why are confidence intervals always more narrow than prediction intervals?


(3) Consider the EXPRESS data set (first 10 observations given). Use the complete data set in your analysis. Show the relevant output from Minitab in your answers. The first 10 observations are given for illustrative purposes.



Weight


Distance


Cost


5.9


47


2.6


3.2


145


3.9


4.4


202


8


6.6


160


9.2


0.75


280


4.4


0.7


80


1.5


6.5


240


14.5


4.5


53


1.9


0.6


100


1


7.5


190


14



Draw a scatterplot of Cost vs. each of the predictors. Do you see any evidence of a quadratic relationship?



Write a general second-order model (not including interaction terms) for Cost(y).



Give the null and alternative hypothesis for determining whether both of the second-order terms are statistically significant (nested model hypothesis).



Identify which of the two general nested models is the complete model and which is the reduced model.



Using Minitab, produce an output and write the least squares regression equation for the second-order model AND the reduced model that was fit in Minitab.



Compute the test statistic and perform the appropriate F-test. Be sure to state the degrees of freedom and the correct F critical value that you are comparing your test statistic to. Formally state your conclusion. Hint: To compute the test statistic, you need to separately fit both the complete and reduced models. To fit the complete model, you need to add the appropriate variables to your data set.



(4) Consider the EXPRESS data set (first 10 observations given). Use the complete data set in your analysis. Show the relevant output from Minitab in your answers. The first 10 observations are given for illustrative purposes.



Weight


Distance


Cost


5.9


47


2.6


3.2


145


3.9


4.4


202


8


6.6


160


9.2


0.75


280


4.4


0.7


80


1.5


6.5


240


14.5


4.5


53


1.9


0.6


100


1


7.5


190


14



Write out a complete general first-order model including an interaction term for Cost as the outcome.



Using Minitab, produce an output and write the least squares regression equation with the interaction term that was fit in Minitab.



State the null and alternative hypothesis to test if there is a significant interaction effect between weight and distance. Test the hypothesis at the α= 0.01 level. Write your formal conclusion in terms of the variable names



If there is a significant interaction effect, but the individual predictors (main effects) that make up that interaction are not significant in the model, would you suggest removing the main effects? Why or why not