发表新帖

发表新帖

Neural networks for email spam detection

前端未结

关注

 4  832

半阙折子戏 2020-12-23 18:07

Let\'s say you have access to an email account with the history of received emails from the last years (~10k emails) classified into 2 groups

genuine email

4条回答

轮回少年 (楼主)

2020-12-23 18:30
1. You'll basically have an entire problem, of similar scope to designing and training the neural net, of feature extraction. Where I would start, if I were you, is in slicing and dicing the input text in a large number of ways, each one being a potential feature input along the lines of "this neuron signals 1.0 if 'price' and 'viagra' occur within 3 words of each other", and culling those according to best absolute correlation with spam identification.
2. I'd start by taking my best 50 to 200 input feature neurons and hooking them up to a single output neuron (values trained for 1.0 = spam, -1.0 = not spam), i.e. a single-layer perceptron. I might try a multi-layer backpropagation net if that worked poorly, but wouldn't be holding my breath for great results.
Generally, my experience has led me to believe that neural networks will show mediocre performance at best in this task, and I'd definitely recommend something Bayesian as Chad Birch suggests, if this is something other than a toy problem for exploring neural nets.
0 讨论(0)

查看其它4个回答
发布评论:

提交评论
- 加载中...

热议问题