Robust Analysis of Large Data Sets

June 5 - June 19, 2004

Organizers: Ruben Zamar (UBC), Stefan Van Aelst (University of Ghent, Belgium).

Objectives

Traditional statistical theory mainly deals with the uncertainty of estimates and predictions in the presence of sampling variability. At the same time, the increasing computing and storage capacity of computers creates the need for thorough statistical analysis of large databases containing very large amounts of data of uneven quality. The issue of "data quality", as opposed to "data quantity", is therefore becoming more and more important. This is particularly true for large scientific databases (e.g. microarray data in statistical genetics) and for economic decision making (e.g. company policy based on knowledge obtained from a customer database).

The purpose of the workshop is to bring together a group of scientists who have expressed their interest in the analysis of large, complex databases that include data of uneven quality. Given the increasing size of databases, the need for such methods is both pertinent and urgent. By exchanging ideas, viewpoints and experience, participants will address all aspects of the problem in a constructive manner, so that significant progress becomes possible.

Confirmed Participants

Final Report (pdf file)