Wisconsin Breast Cancer Dataset available


Frequently I use the Wisconsin Breast Cancer Dataset for demonstrating the Data Mining Addins for Office – enough people asked, so I made it available as an Excel 2007 file (free login required).  For purists, the original data is available at the Machine Learning repository, which is a great location for many sample datasets.


Here are some screenshots of the data mining add-ins applied to this dataset


Figure 1:  Key Factor Analysis showing differences between benign and malignant tumors


Key factors discriminating malignant and benign tumors


Figure 2: Detect categories showing malignancy across detected groups.  Note two purely malignant categories suggesting differing classes of malignant tumors.


Malignancy across categories detected by Table Analysis Tools


Figure 3: Decision tree to predict diagnosis, with nodes shaded based on likelihood of malignancy.


Diagnosis Decision Tree

Comments (0)