Department of Mathematical Sciences

Seminar Archives

Statistics Seminars: Mixture Model Component Cluster Trees

Presented by Nema Dean, University of Glasgow

20 February 2009 14:00 in CM221

Model-based clustering, one of the most commonly used parametric clustering methods, assumes that continuous data (possibly after a transformation) come from a mixture of Gaussian components. The common implicit assumption is that, once the best such mixture has been chosen to fit the data, each mixture component is a cluster estimating an underlying (sub-population) group. This assumption breaks down if the underlying groups do not have Gaussian distributions. The mixture will still fit the data well, but if the true underlying groups are non-symmetric, skewed, heavy-tailed or curvilinear, or if there are outliers, the number of components in the model is likely to overestimate the number of groups. We look at using hierarchical clustering methods, based on a distance defined by the estimated mixture, to create a dendrogram with components as leaves: a component cluster tree. This can be used to identify sub-mixtures (combinations of components) that will better estimate the underlying groups.
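As a rough illustration of the idea, the sketch below builds a small tree over the components of an already-fitted univariate Gaussian mixture. It is not the speaker's method: the abstract does not say which distance or linkage is used, so the Bhattacharyya distance and single linkage here are assumptions chosen for simplicity, and the function names and the three example components are hypothetical.

```python
import math
from itertools import combinations


def bhattacharyya_1d(m1, v1, m2, v2):
    """Bhattacharyya distance between N(m1, v1) and N(m2, v2) (v = variance).

    Assumption: the talk does not name its distance; this is a stand-in.
    """
    v = 0.5 * (v1 + v2)
    return 0.125 * (m1 - m2) ** 2 / v + 0.5 * math.log(v / math.sqrt(v1 * v2))


def component_cluster_tree(components, linkage=min):
    """Agglomerate mixture components into a tree.

    components: list of (mean, variance) pairs from a fitted mixture.
    linkage:    reduces the pairwise component distances between two
                clusters to one number (min = single linkage, an assumption).
    Returns the merges in order as (cluster_a, cluster_b, distance),
    where each cluster is a tuple of component indices.
    """
    # Pairwise distances between individual components (the leaves).
    dist = {
        frozenset((i, j)): bhattacharyya_1d(*components[i], *components[j])
        for i, j in combinations(range(len(components)), 2)
    }
    clusters = [(i,) for i in range(len(components))]
    merges = []
    while len(clusters) > 1:
        # Find the closest pair of current clusters under the linkage rule.
        best = None
        for a, b in combinations(clusters, 2):
            d = linkage(dist[frozenset((i, j))] for i in a for j in b)
            if best is None or d < best[0]:
                best = (d, a, b)
        d, a, b = best
        clusters.remove(a)
        clusters.remove(b)
        clusters.append(tuple(sorted(a + b)))
        merges.append((a, b, d))
    return merges


# Hypothetical fitted mixture: components 0 and 1 overlap heavily (as when two
# Gaussians jointly approximate one skewed group); component 2 is well separated.
fitted = [(0.0, 1.0), (1.0, 1.5), (8.0, 1.0)]
for a, b, d in component_cluster_tree(fitted):
    print(f"merge {a} + {b} at distance {d:.3f}")
```

In this toy case the two overlapping components merge first at a small distance, while the third joins only at a much larger one, which suggests two groups rather than three. Where to cut the tree in practice is a separate question that this sketch does not address.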

Contact sunil.chhita@durham.ac.uk for more information