2CS24 - TOPICS IN INFORMATION PROCESSING

DATA MINING


1. Overview

Data Mining deals with the discovery of hidden knowledge, unexpected patterns and new rules from large databases. It is currently regarded as the key element of a much more elaborate process called Knowledge Discovery in Databases (KDD), which is closely linked to another important development - data warehousing. A data warehouse is a central store of data that has been extracted from operational data. The information in a data warehouse is subject-oriented, non-volatile, and of an historic nature; so data warehouses tend to contain extremely large data sets. The combination of data warehousing, decision support, and data mining indicates an innovative and totally new approach to information management. Until now, information systems have been built and operated mainly to support the operational processes of an organisation. KDD and data warehousing view the information in an organisation in an entirely new way - as a strategic source of opportunity. (Adriaans 1996)


2. Reading

Library books on the subject are very limited,

is generally regarded as a good reference work, also

A better source of information is the WWW, try the following:

Both sites contain links too many other sites which you can follow. There is also a journal, called "Data Mining and Knowledge Discovery", which maintains a web page.


3. Lecture notes

Click here to obtain a printed copy of the lecture slides.


4. Seminar title

Suggested seminar titles:

  1. The KDD knowledge discovery process - what are the steps involved?
  2. Data miming and the data warehouse
  3. Data mining and knowledge discovery - what techniques are available?
  4. Setting up a KDD environment



Created and maintained by Frans Coenen. Last updated 11 October 1999