20150 - STATISTICS FOR ECONOMICS AND BUSINESS
EMIT
Department of Decision Sciences
Course taught in English
RAFFAELLA PICCARRETA
Class 22: RAFFAELLA PICCARRETA, Class 22: TO BE DEFINED, Class 22: TO BE DEFINED
Course Objectives
The key aim of this course is providing the students with basic skills in multivariate data analysis. In particular, students learn techniques and methods useful to analyze and synthesize rich data sets (e.g. cluster and factor analyses), with respect to both the number of variables and the number of observations. All methods are taught through hands-on classes, during which the students analyze a number of databases relevant to their studies (e.g. R&D data, patent data, investment data).
Course Content Summary
Introduction
-
Matrix algebra.
-
Multivariate random variables. Moments of multivariate distributions. Multivariate samples, summary statistics for multivariate samples. Geometric interpretation of data matrices. Total and generalized variance and their geometric interpretation.
Factorial Techniques
-
Principal component (PC) analysis. PC transformation. Property of PCs and their interpretation. Evaluation of results. Sample PC.
-
Factor analysis. The Factor model: definition and assumptions. Parameter estimates: the principal component and the principal factor methods. Interpretation of factors: factors rotation. Factor Scores.
-
Association for qualitative variables. Simple and multiple correspondence analysis. Profiles and Chi-square metric. Indicator matrices and Burt matrix. Factors and their interpretation. Graphical representation and analysis of results.
Dissimilarity matrices and clustering
-
Cluster analysis. Distance and dissimilarity matrices. Hierarchical classification methods. Choice of the number of cluster. Partitioning methods: the k-means method. Evaluation of results.
-
Multidimensional scaling (MDS). Representing one or more dissimilarity matrices in a factorial plane. Relationship with factor analysis and cluster analysis.
Detailed Description of Assessment Methods
Attending students
The course grade is based upon
-
2 assignments (handed in and discussed during the lessons)
-
Practical analysis - Analysis of a real data set (Pc-lab session 4 hours)
-
Theoretical exam (written exam concerning the methodological issues discussed during the course).
A student is considered as attending if he/she attended at least one half of the lab sessions, handed in and discussed both the assignments.
Not attending students
For not attending students the final grade is based on an
-
extended practical analysis
-
extended theoretical exam.
The practical and theoretical exam must be given in the same session. It is not possible to combine results from different sessions.
Textbooks
-
R.A. JOHNSON, D.W. WICHERN, Applied Multivariate Statistical Analysis, Prentice Hall, 2002, 5th ed.
or:
-
J. LATTIN, J.D. CARROLL, P.E. GREEN, Analyzing Multivariate Data, Thomson, 2003
Prerequisites
-
Basic notions of statistics. Descriptive statistics univariate and bivariate. Most relevant inferential concepts (samples, statistics, estimators, hypothesis testing, p-values)
-
Students are expected to be able to work with Excel and Word (basic skills).