Computer-Intensive Statistics and Data Mining

10 credits

Syllabus, Master's level, 1MS009

A revised version of the syllabus is available.
Code
1MS009
Education cycle
Second cycle
Main field(s) of study and in-depth level
Mathematics A1N, Mathematics A1N
Grading system
Fail (U), Pass (3), Pass with credit (4), Pass with distinction (5)
Finalised by
The Faculty Board of Science and Technology, 24 April 2013
Responsible department
Department of Mathematics

Entry requirements

120 credit points including Analysis of Regression and Variance or corresponding course

Learning outcomes

In order to pass the course (grade 3) the student should be able to

  • give an account for the theoretical foundation of Markov Chain Monte Carlo-methods and to use such techniques to solve given statistical problems;
  • give an account for the principles behind random number generators;
  • use simulation methods such as Bootstrap and SIMEX;
  • use EM methods;
  • use non-parametric statistical models;
  • use statistical software, preferably R.

Content

The purpose of the course is to give the student a good overview about statistical techniques that have been developed during the last years due to increasing computer capacity. Resampling techniques, Jack-knife, bootstrap. . EM algorithms. SIMEX methodology. Markov Chain Monte Carlo (MCMC) methods. Random number generators. Smoothing techniques. Kernel estimators, nearest neighbour estimators, orthogonal and local polynomial estimators, wavelet estimators. Splines. Choice of bandwidth and other smoothing parameters. Applications. Use of statistical software.

Instruction

Lectures, problem solving sessions and computer-assisted laboratory work.

Assessment

Written examination at the end of the course combined with compulsory assignments and oral presentation according to directions given at course start.

FOLLOW UPPSALA UNIVERSITY ON

facebook
instagram
twitter
youtube
linkedin