Supervised by John Ebden

Statement of the Problem

Data mining also known as knowledge discovery, involves finding unexpected but interesting patterns within enormous amounts of data that are normally stored in databases and data warehouses. Data Mining has three major components Clustering or classification, Association Rules and Sequence Analysis which come in form of many algorithms proposed. Some of the algorithms have had better success than the others.

However, the commercial world is fast reacting to the growth and potential in this area as a wide range of tools are marketed under the label of data mining. The main objective of this project is to investigate two types of algorithms available in Oracle for data mining. Apply the two algorithms to actual data. Then, analyse the results and compare outcome in terms of accuracy, efficiency and effectiveness.

The diagram below gives a brief idea of the data mining steps. however, there will be quite a number of techiques that will be adopted as the algorithms being investigated are kind of sofisticated in terms of determining both their efficiency and effectiveness.



 

 

 

 

 

Last updated on the 11 October ©2005 Nhamo Mdzingwa