博弈论在邮件特征选择中的应用

Application of Game Theory for Email Feature Selection

  • 摘要: 特征选择在垃圾邮件过滤中起着十分重要的作用,本文分析讨论了现有邮件特征选择方法所存在的不足,并在此基础上,提出一种基于博弈论的邮件特征选择模型。该模型将博弈论应用于邮件特征选择中,以达到约减信息规模,提高垃圾邮件过滤效率的目的。在设计特征选择模型时,考虑到邮件样本自身的模糊隶属性对特征选择所产生的影响,在特征点对邮件类别的区分度定义中,通过引入由相融性度量定义的样本模糊隶属度函数,提高博弈邮件特征选择模型对实际问题的处理能力。在CDSCE语料库上的实验表明,该邮件特征选择模型的性能优于同类其他特征选择方法,验证了该邮件特征选择模型的有效性。

     

    Abstract: The shortages existed in the e-mail feature selection method is first analyzed. A new spam filtering feature selection model based on game theory is then proposed. The game theory is applied to feature selection of mail in order to reduce the scale of information and improve the efficiency of spam filtering. When designing the feature selection model, the impact acted by fuzzy membership of mail samples on feature selection is considered. The feature selection model’s handling capacity for practical problems is enhanced by using a blending sample measure of fuzzy membership function in the definition of feature points to mail category discrimination. The experiments performed on CDSCE Corpus show that the mail feature selection is better than other feature selection methods.

     

/

返回文章
返回