Combining individual probabilities in Naive Bayesian spam filtering
Question

I'm trying to build a spam filter by analyzing a corpus I've amassed. I'm using the Wikipedia entry http://en.wikipedia.org/wiki/Bayesian_spam_filtering to develop my classification code.

I've written code to calculate the probability that a message is spam given that it contains a specific word, using the following formula from the wiki:

$$\Pr(S \mid W) = \frac{\Pr(W \mid S)\,\Pr(S)}{\Pr(W \mid S)\,\Pr(S) + \Pr(W \mid H)\,\Pr(H)}$$

My PHP code:

public function pSpaminess($word) {
    $ps = $this->pContentIsSpam();
    $ph = $this->pContentIsHam();
    $pws = $this-