This module implements the classic "Naive Bayes" machine learning algorithm,
a well-studied probabilistic algorithm often used in automatic text
categorization. Compared to other algorithms (kNN, SVM, Decision Trees),
it is fast and reasonably competitive in the quality of its results.
A paper by Fabrizio Sebastiani provides a good introduction to
text categorization:
http://faure.iei.pi.cnr.it/~fabrizio/Publications/ACMCS02.pdf
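
For readers new to the technique, here is a minimal sketch of a multinomial
Naive Bayes text categorizer. It is written in Python for illustration only
and is not this module's interface: the function names train and predict, the
add-one (Laplace) smoothing, and the toy training data are all assumptions
made for the example.

```python
import math
from collections import Counter, defaultdict


def train(labeled_docs):
    """Count documents and words per label.

    labeled_docs is an iterable of (list_of_words, label) pairs.
    (Hypothetical helper, not part of the module described above.)
    """
    doc_counts = Counter()              # label -> number of documents
    word_counts = defaultdict(Counter)  # label -> word -> occurrences
    vocab = set()                       # every word seen in training
    for words, label in labeled_docs:
        doc_counts[label] += 1
        word_counts[label].update(words)
        vocab.update(words)
    return doc_counts, word_counts, vocab


def predict(model, words):
    """Return the label with the highest log-posterior score."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in doc_counts.items():
        # Log prior: fraction of training documents with this label.
        score = math.log(n_docs / total_docs)
        # Add-one smoothing so unseen word/label pairs keep nonzero probability.
        denom = sum(word_counts[label].values()) + len(vocab)
        for w in words:
            if w in vocab:  # ignore words never seen in training
                score += math.log((word_counts[label][w] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy training set, invented for this example.
model = train([
    ("cheap pills buy now".split(), "spam"),
    ("buy cheap watches".split(), "spam"),
    ("meeting agenda for monday".split(), "ham"),
    ("lunch on monday".split(), "ham"),
])
```

Calling predict(model, "buy cheap pills".split()) selects the "spam" label on
this toy data. The "naive" part is the assumption that words are independent
given the label, which is why the per-word log probabilities are simply summed.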