VanBuren, VictoriaVillarreal, DavidMcMillen, Thomas A.Minnicks, Andrew L.2009-10-142012-02-242009-09-25VanBuren, V., Villarreal, D., McMillen, T. A., & Minnick, A. L. (2009). Enron dataset research: E-mail relevance classification (Report No. TXSTATE-CS-TR-2009-12). Texas State University-San Marcos, Department of Computer Science.https://hdl.handle.net/10877/2583This paper discusses a probabilistic approach to address the problem of searching through large amount of data to find case-relevant documents. Using a valuable collection of data, e-mail communications from Enron, an actual corporation, we train a Bayes-based text classifier algorithm to identify e-mails known to be case-relevant and those known to be case-irrelevant.Text16 pages1 file (.pdf)enenron datasete-mail Relevancee-mail classificationBayes classifierelectronic discoveryforensicsComputer ScienceEnron Dataset Research: E-mail Relevance ClassificationTechnical Report