Insider threats continue to present a major challenge for the information
security community. Despite constant research taking place in this area; a
substantial gap still exists between the requirements of this community and the
solutions that are currently available. This paper uses the CERT dataset r4.2
along with a series of machine learning classifiers to predict the occurrence
of a particular malicious insider threat scenario - the uploading sensitive
information to wiki leaks before leaving the organization. These algorithms are
aggregated into a meta-classifier which has a stronger predictive performance
than its constituent models. It also defines a methodology for performing
pre-processing on organizational log data into daily user summaries for
classification, and is used to train multiple classifiers. Boosting is also
applied to optimise classifier accuracy. Overall the models are evaluated
through analysis of their associated confusion matrix and Receiver Operating
Characteristic (ROC) curve, and the best performing classifiers are aggregated
into an ensemble classifier. This meta-classifier has an accuracy of
\textbf{96.2\%} with an area under the ROC curve of \textbf{0.988}.