UNIVERSITY OF THE WITWATERSRAND, JOHANNESBURG

Stats

Statistical Research Design and Analysis (Stat4013/7009)

Copyright information, course outline and references

University of the Witwatersrand

School of Statistics and Actuarial Science

Course notes for STAT4013/STAT7009

Statistical Research Design and Analysis

or

?Using statistical methods for those who hate stats?

JS Galpin

2007

Note: This material may be freely used provided it is referenced as being the intellectual property of the University of the Witwatersrand, and has been developed by JS Galpin.

Statistical Research Design and Analysis (STAT 4013/7009)

Course aims and format for 2007

Note: This course is offered at both honours and masters level. It consists of weekly lectures and assignments, an exam, and a major project / long essay.

The same set of notes serve as reference to both groups, as these notes are intended to assist students in their future studies and work. The exams for the two levels are different, with Masters level students being asked more in-depth questions, and expected to show more insight into the methods. They are also expected to produce a project with higher level statistical components than those of honours students. The project should ideally be the stats component of the student?s honours project or masters research report / long essay.

AIMS

The aim of this course is to introduce participants to the statistical way of thinking, and to provide sufficient background to statistical terminology and procedures that many research projects may be tackled without recourse to expert statisticians. On completion of the course, participants should be able to:

  • phrase the aims of a study in such a way that one can collect data and analyze it in order to fulfill those aims;

  • identify what issues are important in designing a study;

  • design a study;

  • identify the most appropriate statistical methods to apply to the data to answer the questions posed, and to check the relevant assumptions of those methods;

  • consolidate the results obtained from different statistical analyses in terms of the aims of the study, and to identify any problems with the study;

  • recognize situations beyond their expertise, for which expert help will be necessary; and

  • identify and understand in broad terms the important statistical issues and problems addressed in the literature of their research area.

In view of the last objective, several sections are included in the notes that sketch the background for a term often used in the literature, without giving details of the method, or examples of its application. Thus several methods currently more prevalent in research articles than in textbooks are included.

Although some of the participants attending this course may have completed previous statistical courses, many have not. For this reason, no statistical knowledge is assumed, and all concepts will be introduced. There will be no attempt at derivation of procedures or formulae. These may be found in the list of recommended books.

The course is essentially a practical course, aimed at issues such as

  • ?how should I go about getting data to answer the question(s) that I am researching?

  • ?what technique should I use and why?

  • ?what does this computer output mean?

  • ?can I trust the results?,

rather than the nuts and bolts of how the techniques function, or the statistical reasoning of why they should work. In line with this, the weekly group assignments are aimed at implementation and interpretation of material covered in lectures.

COURSE FORMAT

The course comprises a double lecture and an optional tutorial period each week, together with a weekly hand-in group assignment.

Students will also be expected to individually complete a major assignment describing the design and analysis of a study of interest to them. This is usually their major project / research report etc for the course for which they are enrolled.

The marks for the course will consist of 50% from the May examination, 10% from the class assignments, and 40% from the major assignment.

(a) Lectures

The course will be run with two parallel groups, a Monday class aimed at SEBS and other CLM students, and a Wednesday class aimed students from APES and other faculties and occasional students (main group). Although both groups will cover certain core sections, other sections differ, and the examples and order in which the material is covered may also differ. All students are welcome to attend any lectures.

Lectures for the Business Economics group will be held on Mondays from 14h15-16h00, and those for the main group will be held on Wednesdays from 15h15-17h00, in CB248. Lectures will start on 12/14 February, and continue till 21 / 23 May, with recap lectures on Thursday 31 May (9h30-13h00, main group) and Monday 4 June (9h30 ?13h00, CLM students).

(b) Weekly assignments and tutorial period

After each double lecture, students will be expected to complete an assignment relating to the material covered in class that week. These may be completed by groups of up to 5 students working together.

A tutorial session will be offered on Thursdays between 08h00-13h15, in the Math Sciences Laboratories on the 2nd floor of Senate house. Assignments generally take a maximum of 2 hours to complete, and you may come at any time during this period. The length of the tutorial session is to allow students to fit this in between other lectures and labs, and also to allow students and tutors more time for discussion of statistical problems arising in their research, as well as discussion of problems with course work and assignments.

The assignments may be handed in during the lab session or at at CB141. Deadlines are 14h00 (CLM students) on the Friday, and for other students on the following Monday (12h00). There is no penalty for early hand-in!

Completion of the class assignments will require the use of a computer package. There are a number of statistical packages available - you will need to be able to handle multiple regression and analysis of variance as a minimum requirement. Some spreadsheet programs, such as Excel, can handle some of the statistics required, and may be used for those purposes, but these packages can in general not handle all required methods, such as residual plots and the diagnostics required for multiple regression. Packages used in some departments are SAS, Stata, Statistica, SPSS and Systat.

The package used in the lab will be R (a freeware package), and instructions on the use of R will be given. The package is downloadable from http://www.r-project.org or from the Stats and Actuarial Science website. In order to access the package, see option R notes. The hints for each week?s assignment will be put on the website, on the Wednesday, and handed out at the tut on the Thursday.

(c) Exam

The examination for the course will be a 3 hour OPEN BOOK examination, held on 6 June 2007 at 14h00. The examination will be aimed at examining understanding of principles, and of what procedure to use when, and why. Interpretation of computer output (similar to that in these notes and in the weekly assignments), will be expected. Very few calculations will be required in the examination - these will be stated in the recap sessions on Thursday 31 May (9h30-13h00, main group) and Monday 4 June (9h30 ?13h00, CLM students). These sessions will provide an overview of the work, handle questions on the material, and talk through a previous exam paper. Not all material is applicable to the different groups of students. Examinable sections will be noted during lectures, and in the recap sessions.

The exam will consist of 3-4 sections. The first will cover the material common to all students. The other sections will cover material given to subgroups only, and students may attempt only ONE of these sections.

(d) Project/ Major Assignment

Students will also be expected to individually complete a major assignment describing the design and analysis of a study of interest to them, and is usually their major project / research report etc for the course for which they are enrolled. The hand-in date varies by school, with SEBS and APES setting their own deadlines. For other students, the 1st hand-in date will be 20 September 2007. Students handing in by this date will be allowed to make corrections to their projects. The second hand-in date will be 5 November 2007. Although an extension may be granted, this will be under the condition that students accept that the projects may not be marked before April 2008, or even later.

This should describe a research study in which the student is involved, covering all stages from problem definition, data collection, analysis of the data, and synthesis of the results. It is hoped that this will be one of the projects to be done for your degree, otherwise a project concerning an issue of interest to you. For honours and masters students, the statistical section of your major project would be ideal. For students whose honours projects do not involve statistics, other data will have to be used - please discuss this with your lecturers.

The write-up should:

  • clearly state the objectives of the study, providing a short resume of the background to the area (maximum 4 pages), in terms that a non expert in your area can understand;

  • formulate these objectives in terms that one can use to design a study;

  • discuss the design on the study, giving a motivation for the sample design used, and discussing any possible problems with this design, and how you have tried to overcome these;

  • include a discussion as to the appropriate statistical techniques for answering the questions posed, (and for checking the data);

  • include a discussion of the application of the techniques, checks for the appropriateness of the assumptions of those techniques, and of the results obtained;

  • include a synthesis of the results of different techniques used; and

  • have a conclusion chapter giving a discussion of the study as a whole, including aims, conclusions and any problems encountered, further work recommended, etc.

It should be written in such a way that a non-expert in your research area can understand what the problem is, what you have done, and why. Remember that marks can only be given for what is written and motivated, not for what you assume the examiner will deduce or what you thought.

The report should not be a historical review of what has been done, but should be aimed at outlining the problem and the questions being asked, motivating and discussing the methods of data collection and analysis used, and drawing conclusions from the data.

You will need to include enough printouts so that I can see if you have interpreted them correctly. You do not need to include your data. Try not to copy and paste descriptions of multiple analyses. For example, if you are going to do identical tests for 20 variables, describe the 1st one in detail, then do a summary table for all 20. Please try to set the font size of tables so that I can read the numbers, and that they fit on 1 line. For example: use

0.9874327

rather than

0.9874327

Do NOT forget to put your name, student number, contact details and A TITLE on the front page.

(e) Notes

The notes cover the material included in the course in more detail than is possible in the lectures. Not all sections are of interest to every student. Some sections are not of interest to you now, but may be of interest in later years. Details of the calculation of the formulae for some specific data sets are included in the notes, partly to illustrate the procedures, and partly for those students who are unhappy with blindly trusting the computer. Although almost all computations needed for this course are to be done by computer, the worked examples in the text are also there to serve as a reference for later use when a package may not always be available. FORMULAE ARE NOT EXAMINABLE. YOU DO NOT HAVE TO UNDERSTAND THE FORMULAE TO PASS THE COURSE. They will seldom be used in lectures. Further details on the material covered may be found in the list of recommended books.

These notes are downloadable from this page under STAT4013/7009. Notes are updated every year, so that you can access revised sections of the notes should you wish.

(f) Lecturer and contact details for consultation

Course lecturer: Prof J Galpin, contact via Mrs Maud Manuel on 717-6277, CB152, or jacky@galpin.co.za. You are welcome to make an appointment to see me, with Mrs Manuel ? as I am Head of School I am not able to give fixed consultation hours.

LIST OF RECOMMENDED BOOKS

General, introductory:

Anderson, TW and Finn, JD (1996). The new statistical analysis of data. Springer.

Brown, BW and Hollander, M. (1977) Statistics: A Biomedical introduction. Wiley.

Campbell, RC. (1989) Statistics for biologists, 3rd edition. Cambridge University Press.

D?Agostino, RB, Sullivan, LM and Beiser, AS. (2006). Introductory Applied Biostatistics. Duxberry.

Dunn, OJ. (1977) Basic Statistics. A primer for the Biomedical Sciences. Wiley. Maths

Green. Sampling design and statistical methods for environmental biologists.

Iversen, GR and Gergen, M. (1997) Statistics: the conceptual approach. Springer.

Ludwig, JA and Reynolds, RF. (1988) Statistical ecology: a primer on methods and computing. Wiley.

Moore, DS. (1991) Statistics: Concepts and controversies. WH Freeman.

Moore, DS, and McCabe. (1993) Introduction to the practice of statistics. WH Freeman.

Mulhotra, NK. (1993) Marketing Research: an applied orientation. Prentice Hall.

Pagano, M and Gauvreau, K. (1993) Principles of Biostatistics. Duxberry.

Rosner, B. (2006) Fundamentals of Biostatistics. 6th Edition. Duxberry.

Sokal and Rohlf. (1992) Biometry. WH Freeman.

Woolson, I. (1987) Statistical methods for the analysis of biomedical data. Wiley.

Zar. Biostatistical analysis.

General, more advanced:

Bethea, et al. Statistical methods for engineers and scientists. Marcel Dekker.

Fleiss, JL, Levin, B and Paik, MC. (2003) Statistical methods for rates and proportions. 3rd edition. Wiley.

Fleiss, JL. (1986) The design and analysis of clinical experiments. Wiley.

Keeping, ES. (1962) Introduction to statistical inference. van Nostrand.

Sample size calculations:

Cohen, J. (1977) Statistical power analysis for the behavioural sciences. Academic Press.

Odeh and Fox. Sample size choice. Marcel Dekker.

Sampling:

Cochran, WG. (1977) Sampling techniques. Wiley.

Kish, L. (1965) Survey sampling. Wiley.

Reliability and validity:

Anastasi, A, Urbina, S and Anasasti, A. (1997). Psychological Testing. Prentice Hall.

Nonparametric statistics:

Conover, WJ. (1971) Practical Nonparametric Statistics, Second edition, Wiley.

Edginton, ED. (1995). Randomization tests, 3rd Edition. Marcel Dekker.

Experimental design:

Box, GEP, Hunter, WG and Hunter, JS. (1978) Statistics for Engineers. Wiley.

Cornell, JA. (1990) Experiments with mixtures. Second edition. Wiley.

Jones, B and Kenward, MG. (1989) Design and analysis of cross-over trials. Chapman and Hall.

Mason, RL, Gunst, RF, and Hess JL. (1989) Statistical design and analysis of experiments. Wiley.

Montgomery, DC. (1986) Design and analysis of experiments. Second edition. Wiley.

Neter, J, Kutner, MH, Nachtsheim, CJ and Wasserman, W. (1996) Applied linear statistical models. Irwin.

Regression, introductory:

Chatterjee, S and Price, B. (1977) Regression Analysis by example. Wiley.

Montgomery, D C and Peck, E A. (1992) Introduction to linear regression analysis. Second edition, Wiley.

Regression, more advanced:

Bates, DM and Watts, DG (1988) Nonlinear regression and its applications. Wiley.

Belsley, DA, Kuh, E and Welsch, RE. (1980) Regression diagnostics: identifying influential data and sources of collinearity. Wiley.

Draper, N R and Smith, H. (1966) Applied regression analysis. Wiley.

Hamilton, LC. (1992) Regression with graphics: a second course in applied statistics. Duxberry.

Neter, J, Kutner, MH, Nachtsheim, CJ and Wasserman, W. (1996) Applied linear statistical models. Irwin.

Seber, GAF and Wild, CJ. (1989) Nonlinear regression. Wiley

Categorical data:

Greenacre, MJ. (1984). Theory and Applications of correspondence analysis. Academic Press.

Hosmer, DJ. and Lemeshow, S. (1989). Applied logistic regression. Wiley.

Upton, GJG. (1978) The analysis of cross-tabulated data. Wiley.

Categorical data, advanced:

Christensen, R. (1990) Log-linear models. Springer-Verlag.

Feinberg, S. (1980). The analysis of cross-classified categorical data. The MIT Press.

Freeman, DH. (1987) Applied categorical data analysis. Wiley.

Survival analysis, introductory:

Lee, ET. (1980) Statistical methods for survival data analysis. Lifetime Learning Publications. Armitage, P. (1977)

Survival analysis, more advanced:

Eland-Johnson and Johnson. (1980). Survival models and data analysis. Wiley. Miller, RG. (1981). Survival analysis.

Gross, AJ and Clark, VA. (1975) Survival distributions, reliability and applications in the medical sciences. Wiley.

Klein, JP and Moeschberger. (1997) Survival analysis: techniques for censored and truncated data. Springer.

Multivariate analysis, introductory:

Chatfield, C and Collins, AJ. (1980) Introduction to Multivariate Analysis. Chapman and Hall.

Manly, BFJ. (1986) Multivariate statistical methods: a primer. Chapman and Hall.

Multivariate analysis, more advanced:

Johnson, RA and Wichern, DW. (1988) Applied Multivariate Statistical Analysis, second edition. Prentice Hall.

Kryzanowski, WJ. (1988) Principles of multivariate analysis. Oxford.

Seber, GAF. (1984) Multivariate observations. Wiley.

Repeated measures, advanced:

Crowder, MJ and Hand, DJ. (1990) Analysis of repeated measures. Chapman and Hall.

Jones, RH. (1993) Longitudinal data with serial correlation: A State-space approach. Chapman and Hall.

Time series:

Brockwell, PJ and Davis, RA. (1996). Introduction to time series and forecasting. Springer.

Box, GEP and Jenkins, GM. (1976) Time Series analysis: forecasting and control. Holden Day.

Chatfield, C. (1984) The analysis of time series: an introduction. Third edition. Chapman and Hall.

Chatfield, C. (1996) The analysis of time series: an introduction. Fifth edition. Chapman and Hall.

Diggle, PJ. (1990) Time series: a biostatistical introduction. Oxford Science Publications.

Kendall, M and Ord, JC. (1990) Time Series. Third Edition. Oxford.

Makridakis, S, Wheelwright, SC and McGee, VE. (1983) Forecasting methods and applications. Second Edition. Wiley.

Geostatistics

Haining, R. (1990) Spatial data analysis in the social and environmental sciences. Cambridge.

Is cs, EH and Srivastava, RH. (1989) An introduction to applied Geostatistics. Oxford.

Tables: Most introductory statistics books contain tables.

Hald, A. (1952) Statistical Tables and Formulas. Wiley.

Stoker, DJ. (1977) Statistical Tables. Academica.

TABLE OF CONTENTS

Chapters

Description

Pages

1

Introduction and descriptive techniques

1.1 Random Variability

1.2 Mean, median, variance, standard deviation and range

1.3 Getting to know the data - plots

1.4 Types of data

1.5 Independent versus dependent observations

1.6 Independent versus dependent variables

1.7 The scientific method

1

3

6

5

10

11

12

12

2

Confidence intervals and hypothesis tests

2.1 Population versus sample

2.2 Normal (or Gaussian) distribution

2.3 Distribution of the mean of a set of data

2.4 Confidence intervals

2.5 Hypothesis tests

2.6 One sided versus two sided tests

2.7 One sample t test

2.8 Two sample t test

2.9 Paired sample t test

2.10 Summary of hypothesis testing

14

15

16

18

20

22

26

27

30

34

37

3

Sample size determination

3.1 Sample size for a single sample

3.2 Power of a test

3.3 Sample size for the two sample case

39

39

44

46

4

Sampling techniques

4.1 Drawing a random sample

4.1.1 Drawing numbers from random number tables

4.1.2 Obtaining random numbers from a computer

4.2 Sampling with and without replacement

4.3 Finite versus infinite populations

4.4 Types of sampling

4.4.1 Non-random sampling

4.4.2 Random sampling

4.4.3 Weighting

4.5 Bias in sampling

4.6 Digression: Accuracy and precision

4.7 Sampling protocol

4.8 Deciding on the method of interviewing

4.9 Choice and training of interviewers

4.10 Questionnaire design

4.11 Pilot surveys

4.12 Reliability and validity

48

49

49

50

50

51

51

51

52

55

56

57

58

59

59

59

61

62

5

Tests for categorical data

5.1 Testing a single proportion

5.2 Testing two proportions

5.3 Comparing proportions in various categories for a single sample

5.4 Goodness of fit tests

5.5 Contingency tables - one sample, two classifications

5.6 Contingency tables: 2 or more samples, 1 variable

64

65

67

69

70

70

74

6

Nonparametric tests

6.1 Sign test

6.2 The Wilcoxon signed rank test

6.3 Tests for two independent samples

6.4 McNemar test

6.5 Kolmogorov-Smirnov tests

6.6 Permutation tests

6.7 Tests for several independent samples

6.8 Testing several related samples

76

77

78

80

82

83

85

86

89

7

Hypothesis tests for more that two groups (Analysis of variance)

7.1 Comparison of several groups on one factor

7.2 Multiple comparisons

7.3 Assumptions of the ANOVA test

7.3.1 Independence of observations

7.3.2 Normality of the error

7.3.3 Error variance the same for all groups (homoscedasticity)

7.3.4 Transformations

7.3.5 No change over time

91

91

94

96

97

97

97

100

102

8

Hypothesis tests for two or more factors (Multifactor ANOVA)

8.1 ANOVA for 2 factors

8.2 Interactions

8.3 Multifactor experiments

8.4 Analysis of repeated measurement data

8.5 Random effects models

104

104

106

110

111

112

9

Principles of experimental design

9.1 Defining the scope of the problem

9.2 Balancing

9.3 Blocking

9.4 Randomization

9.5 Replicates

9.6 Guidelines for experimental design

115

115

116

117

119

120

121

10

Factorial and fractional factorial designs

10.1 2n factorial designs

10.2 Fractional factorial designs

10.3 Blocking and stratification

10.4 Model checking

10.5 Other factorial experiments

122

122

123

127

127

128

11

Other types of designs

11.1 Nested or hierarchical designs

11.2 Latin and Graeco-Latin square designs

11.3 Split-plot designs

11.4 Mixture experiment designs

11.5 Cross-over designs

11.6 Randomized block designs and balanced incomplete block designs

129

129

130

130

130

131

136

12

Correlation

12.1 Pearson correlation coefficient

12.2 Rank correlation coefficients

12.3 Measuring agreement between continuous observations-the Bland-Altman plot

12.4 Measuring agreement in questionnaires - reliability

12.5 Measuring correlation in a contingency table

12.5.1 2x2 tables

12.5.2 rxc tables

12.5.3 Three and higher way tables

139

139

140

142

143

145

145

145

147

13

Simple linear regression

13.1 The simple linear regression model

13.2 Regression through the origin: no intercept model

13.3 Confidence and prediction intervals

13.4 Analysis of residuals

13.4.1 Model assumptions

13.4.2 Plot of residuals versus predicted values

13.4.3 Plotting residuals versus the predictor variable

13.4.4 Plotting residuals versus variables not included in the model

13.4.5 Plotting versus time or order of data collection

13.4.6 Outliers

13.4.7 Normal probability plots

13.5 Transformations

13.5.1 Transformations of the response (dependent) variable

13.5.2 Transformations of the predictor variables

13.5.3 Digression - Lack of fit test

13.5.4 Transformations to linearity

13.5.5 Back transformation

13.6 Uses of the regression line

13.7 Nonlinear regression

13.8 Summary of the procedure of simple linear regression

149

150

155

156

157

157

158

160

161

161

163

165

166

166

171

172

173

175

175

177

179

14

Multiple regression

14.1 Examining the fit of the regression equation

14.2 Diagnostics

14.3 Variable selection

14.3.1 Best subsets regression

14.3.2 Stepwise methods

14.3.3 Choice of model

14.4 Weighted regression

14.5 Multicollinearity

14.6 Ridge regression

14.7 Indicator variables

14.8 Standardized regression coefficients

182

182

184

185

186

186

187

188

188

195

196

198

15Analysis of covariance200
16Response surface designs208
17

Models for categorical data

17.1 Odds ratio and relative risk

17.2 Linear logistic regression

17.3 Loglinear models

17.3.1 Model definition

17.3.2 Model selection

17.3.3 Checking model fit

17.3.4 Interpreting model parameters

17.3.5 Sampling versus structural zeros

17.3.6 Response versus predictor variables

17.4 Correspondence analysis

17.5 Automatic interaction detection

17.6 Automatic interaction detection for a continuous response variable

213

213

217

220

220

224

228

228

230

230

231

234

236

18

Survival analysis

18.1 Survival, destiny and hazard functions

18.2 Non-parametric analyses

18.2.1 The kaplan-meier estimator

18.2.2 The actuarial estimator

18.3 Methods of comparing two survival curves

18.3.1 Point by point comparison

18.3.2 Total curve comparison

18.3.3 Mantel-Haenszel test

18.4 Estimation of survival times, and corrected and standardized survival rates

18.5 Covariate adjusted analyses

18.6 Parametric analyses of survival functions

238

239

241

241

243

244

245

245

248

249

252

252

19

Multivariate analysis

19.1 Testing hypotheses

19.2 Principal component analysis

19.3 Factor analysis

19.4 Cluster analysis

19.5 Discriminant analysis

19.6 Canonical correlation analysis

19.7 Multidimensional scaling

253

254

256

259

266

268

272

272

20

Analysis of time series data

20.1 Model structure for a single time series

20.1.1 Transformations

20.1.2 Outliers

20.1.3 US Bureau of the Census X11 program

20.1.4 Trading or calendar day effects

20.2 Smoothing methods

20.3 Box Jenkins models

20.3.1 Autoregressive models

20.3.2 Moving average models

20.3.3 Autoregressive moving average models

20.3.4 Duality between AR and MA models

20.3.5 Identifying potential models

20.3.6 Model fitting

20.3.7 Model checks

20.3.8 Data that is nonstationary in the mean

20.3.9 SARIMA models

20.3.10 Examples

20.4 Transfer functions

20.5 Intervention analysis

20.6 ARCH and GARCH and other nonlinear models

20.6.1 Bilinear models

20.6.2 Threshold autoregression models

20.6.3 Random coefficients models

20.6.4 ARCH and GARCH models

20.7 Other models and methods

20.7.1 ARUMA models

20.7.2 Fractional ARIMA models

20.7.3 Other methods for identification of ARIMA models

20.8 Trend vs autocorrelation structure

20.9 Missing values and unequal spacing

273

275

277

277

277

279

280

281

282

283

283

284

284

285

285

286

287

288

295

296

297

297

297

298

298

299

299

299

299

299

300

Stats511/414 Assignments and tutorials using R

STAT 511 Assignment Questions

Download R from http://cran.za.r-project.org/

Introduction to R

Running regression and ANOVA in R

Using R for STAT assignment questions requiring analysis Assign1-Q2 to 3

Using R for STAT assignment questions requiring analysis Assign2-Q5 to 8

Using R for STAT assignment questions requiring analysis Assign3-Q6 to 11

Using R for STAT assignment questions requiring analysis Assign4-Q8, Q17, Q18

Using R for STAT assignment questions requiring analysis Assign5-Q19 to Q22

Using R for STAT assignment questions requiring analysis Assign6-Q22 to Q25

Using R for STAT assignment questions requiring analysis Assign7-Q27 to Q33

Using R for STAT assignment questions requiring analysis Assign8-Q33 to Q34

Using R for STAT assignment questions requiring analysis Assign9-Q33 to Q34

Using R for STAT assignment questions requiring analysis Assign10-Q38 to Q40

Using R for STAT assignment questions requiring analysis Assign11-Q42 to Q44 and Q46

Using R for STAT assignment questions requiring analysis Assign12-Q48, Q49 and Q54

Using R for STAT assignment questions requiring analysis Assign13-Q54 and Time Series

Loglinear example using R

301