R Questions STA 106 SS2 2020 R Quesitons – Due Monday, August 24th by 5pm. Details * This is an individual asignment and should be completed on y

R Questions

STA 106 SS2 2020
R Quesitons – Due Monday, August 24th by 5pm.

Don't use plagiarized sources. Get Your Custom Assignment on
R Questions STA 106 SS2 2020 R Quesitons – Due Monday, August 24th by 5pm. Details * This is an individual asignment and should be completed on y
From as Little as $13/Page

Details

* This is an individual asignment and should be completed on your own.

* You may use R Markdown, Word, LaTex, Google Docs, etc. to submit your work.

* All answers with necessary output should be in the body of the work, and all code should be placed in an appendix.

* You may use resources on Canvas to complete the work.

* Submit your document to Canvas.

Problem I

I. Online you will find the file GSK.csv. The csv file has the following columns:

Column 1. sysbp: The systolic blood pressure of the subject (mmHg).

Column 2. gender: The gender, with levels F and M.

Column 3. married: Y if the subject was married, N if not.

Column 4. exercise: With levels L = low, M = medium, H = high.

Column 5. age: The age of the subject in years.

Column 6. stress: With levels LS = low, MS = medium, HS = high.

Column 7. educatn: With levels LE = low, ME = medium, HE = high.

Source: Data are part of a larger case study for the 2003 Annual Meeting of the Statistical Society of Canada.

(a) Find the average and standardard deviation of systolic blood pressure by stress level. Which group had the highest
average? Does it appear that the standard deviations are approximately equal?

(b) Find the average and standard deviation of age by exercise level. Which group has the lowest average age? Which
group seems to differ the most from its group mean?

(c) Create a boxplot of systolic blood pressure by education level. Does there appear to be a trend? Explain your
answer.

(d) Create a histogram of systolic blood pressure by marriage category. Does one group tend to vary more than the
other? Explain your answer.

II. For each of the following, use a plot or a function to justify your answers

(a) Which exercise group had the most subjects?

(b) Which stress group had the most highly educated subjects?

(c) Which stress group had the highest average age?

(d) Which gender group had the lowest average systolic blood pressure?

III. Using R, and assuming equal variance by group, test if the average systolic blood pressure for married vs. non-married
subjects is equal.

(a) Find the test-statistic.

(b) Find the exact p-value.

(c) Find and interpret the 95% confidence interval for the true difference.

(d) What is your conclusion about how systolic blood pressure may differ by marriage category? Explain and be specific.

1

Problem 2

I. Use the data Cancer.csv. The csv file has the following columns:

Column 1. Survival: The survival time of the patient in days

Column 2. Organ: The organ where cancer was present – Stomach, Bronchus, Colon, Ovary, Breast

Data Source : From the article Supplemental Ascorbate in the Supportive Treatment of Cancer: Reevaluation of
Prolongation of Survival Times in Terminal Human Cancer by Ewan Cameron and Linus Pauling, Proceedings
of the National Academy of Sciences of the United States of America, Vol. 75, No. 9 (Sep., 1978), pp. 4538-4542.

(a) Create a group box plot of Survival by Organ type. Does there appear to be significant differences in the
groups? Explain your answer.

(b) Find the sample averages of Survival by Organ type. Do you believe you would reject the null hypothesis of
Single Factor ANOVA based on these values? Explain.

(c) Do you believe the standard deviations of each population are equal? Explain.

(d) What level of would you suggest if concluding that the true average survival time was equal when in reality
it was not, would be considered the most severe error?

II. Continue with the Cancer dataset.

(a) Find the value of SSTO, SSA, SSE.

(b) Find the value of MSTO, MSA, MSE.

(c) Find the value of the test-statistic, and the corresponding p-value.

(d) State your conclusion in terms of the problem if = 0.05.

III. Diagnostic Tests

(a) Plot the QQplot, and residuals vs. fitted values. Does there appear to be a violation of the assumptions of
ANOVA? Explain your answer.

(b) Find and interpret the p-value of the Shaprio-Wilks test.

(c) Find and interpret the p-value of the Brown-Forsythe test.

(d) Based on your analyses above, would you want to transform the data? Explain.

2 sbp gender married exercise age stress educatn

133 F N H 60 MS ME

115 M N L 55 MS ME

140 M N L 18 HS HE

132 M Y M 19 HS ME

133 M N M 58 MS HE

138 F N H 55 MS HE

133 F Y L 22 HS LE

67 F Y H 52 MS ME

138 M Y L 46 MS LE

130 M Y H 38 MS LE

103 F N M 28 HS ME

137 M N M 54 MS LE

140 M N L 38 MS HE

131 F Y L 23 MS HE

134 M N H 23 LS HE

107 F Y H 18 LS ME

131 F Y M 24 HS HE

120 M Y H 40 MS LE

113 M Y L 18 HS LE

127 M Y H 56 MS HE

117 F N H 20 MS ME

139 F N M 56 HS HE

132 M N H 57 HS ME

124 F N H 45 LS LE

116 M N H 24 LS HE

115 M Y H 43 LS LE

131 F N L 61 MS ME

130 F N M 22 MS LE

124 F Y M 28 LS LE

139 M Y L 25 MS ME

130 M N M 61 LS HE

103 F Y L 44 MS LE

114 F N L 55 HS LE

135 M N M 53 HS LE

126 M N M 43 LS LE

133 M N L 43 HS HE

125 M Y H 50 MS LE

138 M N L 23 MS LE

138 M N M 33 HS LE

132 M N L 64 HS HE

114 M N M 29 LS ME

130 F N H 31 LS LE

127 F Y L 24 LS HE

131 F Y H 26 LS HE

101 M Y L 54 LS ME

130 F N L 42 MS HE

130 M Y M 24 HS HE

115 F Y M 40 HS ME

135 F N M 54 HS ME

134 F Y L 19 HS LE

131 M Y H 29 LS ME

112 F N H 36 MS LE

86 F N M 58 MS LE

132 M N L 25 MS HE

134 M N L 49 LS ME

122 M N L 43 HS LE

122 M N L 61 MS HE

137 M Y H 60 MS ME

137 M N L 32 MS HE

105 F N H 43 HS LE

136 M N H 53 LS HE

119 F N H 35 MS ME

139 F N H 41 MS ME

131 F N M 26 MS ME

125 M Y L 30 LS ME

121 M N H 54 HS HE

114 M N H 34 HS LE

100 M Y M 46 LS LE

135 M Y H 45 MS ME

129 F Y M 26 LS ME

120 M Y H 22 MS LE

137 M Y L 47 LS HE

132 M Y H 59 MS HE

113 F N H 23 HS ME

135 F Y M 38 HS LE

128 F Y H 56 LS ME

135 F Y L 42 MS LE

127 M Y H 33 LS ME

133 M Y L 45 HS LE

131 F Y L 57 HS LE

135 F Y L 25 MS ME

132 F Y H 32 LS HE

137 F N L 53 HS HE

138 M N L 22 MS HE

116 F Y H 40 LS HE

139 F N H 61 HS LE

137 M Y H 39 LS ME

128 M Y H 37 HS HE

134 F N H 36 HS HE

138 M N L 19 HS ME

134 F Y L 59 LS ME

111 F Y M 48 MS ME

139 M N L 45 HS HE

100 F N H 33 LS ME

135 F N H 40 MS HE

139 F N L 56 LS HE

125 M N H 38 MS HE

111 F Y H 42 HS HE

113 F Y H 56 LS HE

131 M Y L 62 MS HE

104 M N L 60 HS LE

134 M N L 26 LS LE

109 M Y L 51 LS ME

102 M Y L 27 MS HE

130 F N L 18 MS LE

139 F N H 41 LS ME

77 F Y H 20 LS LE

100 F N M 21 MS ME

135 M Y L 59 MS LE

139 M N M 62 HS LE

127 M N H 55 MS ME

110 F Y H 18 LS HE

132 F N M 38 HS ME

136 F N M 39 LS ME

135 M Y L 27 MS HE

139 M Y H 53 HS LE

123 M Y M 46 MS LE

138 F Y H 58 MS ME

123 M Y M 24 MS HE

134 M N H 52 LS ME

121 F Y M 41 MS LE

139 M Y M 60 LS HE

133 F N M 46 HS HE

137 F N L 26 LS HE

77 M Y H 40 LS HE

105 F Y L 57 LS HE

106 F N H 36 LS ME

102 M Y L 51 LS ME

139 M Y H 36 LS ME

130 F Y L 64 LS HE

135 F Y L 38 LS HE

135 F N L 21 MS LE

125 M N H 23 LS LE

134 F N L 45 HS ME

109 F Y H 26 HS HE

132 F Y H 44 MS ME

134 M N L 62 LS HE

125 M Y L 45 LS ME

124 M N M 32 HS HE

125 M N L 58 LS LE

138 F N M 53 HS LE

138 F Y L 30 MS ME

113 M N H 56 MS LE

111 F Y H 56 MS ME

112 M N H 22 LS HE

130 F N H 48 HS LE

114 F Y L 56 LS LE

108 F N L 36 MS LE

101 F N M 39 MS ME

134 F N L 25 MS ME

135 F Y H 63 MS LE

109 F Y M 19 MS ME

128 M N H 58 MS LE

137 F N H 40 HS ME

130 F N L 30 HS ME

135 F N L 37 LS LE

102 F N L 38 HS LE

137 M N H 44 LS LE

128 M N M 18 MS LE

112 F N M 32 LS ME

140 F N L 45 MS HE

138 F N H 53 LS ME

132 M N L 42 MS LE

130 M Y M 32 HS LE

88 F N M 38 LS ME

95 F N L 55 LS HE

130 F Y M 50 MS ME

138 F N H 46 LS LE

138 F N L 63 LS HE

133 F N H 35 LS ME

122 F N M 46 HS HE

120 F N M 23 MS HE

135 F Y M 61 HS LE

114 F Y H 45 LS LE

133 M Y M 40 HS ME

132 M Y M 35 MS HE

137 F N L 20 LS LE

120 F N H 27 LS HE

137 F Y L 23 MS HE

130 F Y H 61 MS LE

93 F Y H 30 HS ME

139 M N H 57 MS ME

122 M N M 25 HS HE

115 F N H 27 HS HE

135 M N H 47 LS ME

112 M N L 24 HS LE

72 M Y H 32 HS HE

104 M N M 57 LS HE

126 M N L 36 HS LE

100 F Y M 23 LS HE

139 M N M 48 LS HE

112 F Y L 18 LS HE

139 M Y H 40 LS HE

128 F N M 39 LS LE

130 M N L 50 HS ME

109 F Y H 33 LS LE

132 F N H 33 LS HE

136 M N H 23 MS LE

138 M Y M 38 LS LE

113 M N H 28 MS ME

131 M N H 38 MS LE

133 F N M 61 HS LE

80 M N H 52 MS ME

131 M Y H 63 HS LE

112 F Y M 32 MS HE

120 F N M 33 MS LE

107 M Y M 50 LS LE

133 M N M 26 LS ME

138 F N H 62 HS ME

134 M N L 20 HS LE

121 M Y L 24 LS HE

121 M N M 50 HS LE

118 M N H 46 HS LE

84 F N H 39 HS LE

117 F Y H 31 MS LE

84 M N H 43 HS LE

135 M N M 32 HS ME

132 M Y H 20 HS LE

132 M N H 37 MS ME

111 M N L 60 HS HE

114 M N L 62 LS HE

134 M Y L 27 LS HE

132 M Y L 46 MS ME

138 M N M 51 MS ME

109 F Y L 22 HS ME

114 F N L 39 HS HE

104 F N M 26 HS ME

130 F Y L 63 LS ME

136 M N L 49 LS ME

130 M Y L 46 MS ME

111 M N M 58 MS ME

102 F N H 35 HS HE

120 M Y H 52 LS LE

137 F N L 18 HS LE

131 M N L 39 MS LE

113 F N L 18 LS HE

131 F N L 37 MS LE

137 M N M 62 HS HE

119 M N M 30 MS HE

137 M N H 61 MS ME

138 F N M 21 LS ME

105 M N M 40 HS ME

73 M Y M 55 MS LE

126 F Y M 30 HS HE

107 F N M 29 LS LE

124 F N H 30 HS LE

80 M N H 36 MS ME

135 M N H 36 HS LE

136 F Y H 30 LS ME

134 M N H 45 HS HE

154 M N M 48 MS LE

151 M Y M 50 MS ME

174 F Y M 64 MS HE

146 F Y M 57 LS ME

148 F Y L 18 HS HE

172 F Y M 48 HS HE

153 M N L 25 HS ME

172 M N M 30 HS HE

171 F N M 33 HS HE

146 M Y H 28 LS ME

174 F Y L 54 MS ME

180 M N L 27 HS HE

165 F N L 59 HS HE

154 F Y M 44 HS LE

142 F Y L 39 LS ME

177 M Y M 56 MS HE

152 M Y M 28 MS HE

213 F Y H 36 HS HE

191 M Y M 55 MS ME

150 F Y L 26 HS HE

159 M Y L 43 HS ME

144 M N L 19 LS LE

146 M N H 25 HS LE

145 F N M 48 MS HE

196 F N L 51 LS LE

172 M Y H 35 HS HE

178 F N H 35 MS ME

150 F N M 33 HS ME

222 F Y L 42 HS ME

161 M Y L 61 LS HE

144 F Y L 58 MS ME

148 F N M 41 LS HE

166 M Y H 18 LS LE

154 F N L 58 HS ME

141 F N H 41 HS ME

148 F N L 61 MS ME

205 M Y L 50 HS ME

144 M Y H 50 HS LE

143 M N H 27 LS LE

203 M N L 48 MS LE

156 F N L 35 HS LE

176 M Y L 33 MS ME

174 F Y H 59 HS ME

161 M Y L 22 HS ME

152 M Y L 25 LS LE

169 M N M 19 LS ME

209 M N H 36 HS ME

201 F N H 60 LS LE

148 M Y M 21 HS LE

194 F Y H 20 MS HE

155 M N L 51 LS LE

176 F Y L 23 LS ME

145 F Y M 25 HS ME

142 F Y L 30 MS HE

182 M N H 57 MS LE

168 F N L 23 LS LE

165 M N H 28 MS LE

141 F Y L 26 MS HE

197 M Y L 54 LS LE

191 M N M 45 LS ME

196 F Y L 56 HS ME

149 F Y L 18 LS ME

180 F Y M 56 HS ME

174 F N H 21 HS HE

160 M Y H 43 LS ME

169 M N L 43 LS LE

147 F Y H 55 MS LE

149 M Y M 29 LS ME

178 F N H 28 MS HE

155 M Y M 60 HS LE

143 F N H 52 MS HE

203 F N H 49 LS LE

187 F Y H 28 MS ME

168 M Y M 26 LS LE

179 F N H 59 HS HE

169 M Y H 33 LS HE

153 M Y M 22 HS HE

173 M N M 45 HS LE

188 F Y L 53 HS ME

153 M Y M 21 LS LE

163 F Y H 48 MS ME

142 F Y L 52 MS LE

170 F Y L 21 LS LE

179 F N L 46 HS HE

160 F Y L 43 MS LE

176 M N L 40 MS HE

143 M N H 32 MS HE

162 F N M 46 LS LE

221 M N L 58 HS LE

142 F Y L 26 MS HE

169 F Y H 55 HS ME

212 F N L 53 MS HE

201 M Y L 50 LS ME

175 F Y H 25 HS LE

149 M N M 32 MS HE

141 F Y L 30 MS HE

149 F Y M 49 MS HE

154 M N M 43 HS HE

147 M N L 48 LS LE

141 F N H 55 HS HE

175 M N H 36 MS LE

164 M N L 32 MS LE

168 F N L 34 MS HE

148 F N H 32 HS HE

224 M N L 23 LS ME

210 F N L 25 MS LE

148 M N L 47 MS LE

198 F N M 27 HS LE

144 M Y M 21 HS ME

159 F Y L 35 HS HE

206 M N H 54 MS HE

154 M Y L 50 HS LE

144 F N L 43 LS ME

149 M N L 39 HS ME

142 F N M 31 HS ME

207 F Y H 33 MS ME

192 M Y L 38 MS ME

182 M Y M 61 LS ME

183 M N H 31 MS HE

148 M N M 31 MS HE

147 M Y L 43 MS HE

146 M Y L 34 HS HE

151 F Y H 47 HS ME

144 F Y H 24 MS LE

211 M N L 28 MS LE

141 F N H 38 HS HE

147 F Y M 59 HS LE

164 M Y H 51 HS ME

193 F Y L 51 MS HE

169 M N H 53 MS ME

172 M Y L 19 LS LE

187 M N L 63 HS ME

142 F Y L 55 HS ME

194 F N L 37 LS HE

149 M N M 40 HS LE

143 F Y M 45 LS LE

184 F Y L 36 LS LE

198 M N L 53 HS ME

143 F Y L 33 MS LE

161 F Y M 38 MS ME

188 F N M 62 LS HE

148 F N L 38 MS HE

167 M Y H 42 HS LE

181 F Y L 53 HS HE

144 F N L 53 MS LE

150 M Y M 49 MS ME

157 F N M 36 MS HE

165 M Y M 34 MS LE

175 F Y L 57 LS HE

162 F N L 44 LS HE

199 F Y L 59 MS HE

192 F N H 27 MS LE

216 F Y M 18 HS HE

199 M Y M 52 HS LE

174 F Y H 21 MS HE

141 M N M 22 LS ME

146 F Y H 36 MS HE

192 M Y L 30 HS LE

164 F Y H 28 LS ME

162 M N L 29 MS LE

178 M N L 63 HS HE

168 M N M 28 HS ME

141 F N M 28 LS LE

186 F Y M 36 LS HE

145 M Y H 44 HS LE

142 F N M 56 MS LE

175 F N L 45 LS HE

145 F N H 51 MS HE

147 F Y H 50 MS HE

179 M N H 55 MS LE

191 M N H 38 HS ME

184 F Y H 43 MS HE

144 F Y L 39 MS HE

178 M N L 23 HS ME

182 M N L 36 HS ME

168 M Y L 18 MS HE

214 F Y L 20 LS LE

148 F N H 50 MS HE

181 M Y M 29 MS LE

142 F Y L 64 HS HE

163 M Y H 50 LS LE

173 F N H 54 MS HE

181 F N H 43 MS LE

184 M N L 35 HS LE

147 M Y M 34 HS ME

176 F N M 19 MS HE

151 M Y H 29 HS ME

142 F Y L 20 LS HE

147 M Y L 40 LS ME

185 F N M 54 MS ME

203 F Y M 59 MS LE

142 F Y H 59 HS HE

168 M Y M 35 LS LE

147 F N H 34 HS HE

148 F Y H 41 HS ME

198 F Y L 58 HS LE

158 F N L 42 MS LE

165 F N M 27 LS HE

145 F N M 62 MS ME

148 F Y H 53 HS ME

172 F Y L 36 HS ME

162 F N M 42 LS LE

182 M N L 19 HS LE

148 F Y L 23 MS HE

148 F N H 39 HS ME

150 F N L 55 HS LE

143 F Y L 19 MS ME

209 M Y H 39 MS LE

151 F Y H 55 LS HE

152 M N L 42 MS LE

163 M N L 45 HS HE

212 F Y L 32 LS HE

159 M N L 27 HS LE

188 F Y L 51 LS LE

169 F N H 18 LS HE

145 M Y M 20 HS ME

188 F Y H 47 LS HE

142 F N L 41 LS HE

197 M N M 44 HS ME

142 M Y M 53 HS ME

175 F Y L 61 MS LE

141 M N L 59 HS ME

148 F Y M 43 MS LE

215 M Y H 52 HS LE

151 F Y L 41 MS HE

159 M N M 45 HS LE

160 F Y L 42 HS HE

167 M Y L 56 LS ME

142 M Y L 60 HS HE

144 F Y H 59 LS LE

143 M Y L 39 HS HE

173 M N M 23 MS ME

148 F Y M 46 HS HE

142 M N L 26 HS HE

144 F N H 62 LS ME

188 M Y L 30 LS HE

147 F N H 55 MS LE

158 M Y L 19 MS ME

179 F N M 32 HS LE

167 M Y L 28 LS ME

148 F N H 48 LS LE

162 F Y M 34 MS HE

165 F Y L 31 HS LE

181 F Y L 40 HS ME

142 F Y L 54 MS HE

146 F N H 49 HS HE

181 F N M 22 MS ME

145 M Y H 42 LS HE

180 M Y H 60 LS LE

174 F N L 24 LS LE Survival Organ

124 Stomach

42 Stomach

25 Stomach

45 Stomach

412 Stomach

51 Stomach

1112 Stomach

46 Stomach

103 Stomach

876 Stomach

146 Stomach

340 Stomach

396 Stomach

81 Bronchus

461 Bronchus

20 Bronchus

450 Bronchus

246 Bronchus

166 Bronchus

63 Bronchus

64 Bronchus

155 Bronchus

859 Bronchus

151 Bronchus

166 Bronchus

37 Bronchus

223 Bronchus

138 Bronchus

72 Bronchus

245 Bronchus

248 Colon

377 Colon

189 Colon

1843 Colon

180 Colon

537 Colon

519 Colon

455 Colon

406 Colon

365 Colon

942 Colon

776 Colon

372 Colon

163 Colon

101 Colon

20 Colon

283 Colon

1234 Ovary

89 Ovary

201 Ovary

356 Ovary

2970 Ovary

456 Ovary

1235 Breast

24 Breast

1581 Breast

1166 Breast

40 Breast

727 Breast

3808 Breast

791 Breast

1804 Breast

3460 Breast

719 Breast