A systematic analysis of random forest based social media spam classification
Abstract
Recently random forest classification became a popular choice machine learning applications aimed to detect spam content in online social networks. In this paper, we report a systematic analysis of random forest classification for this purpose. We assessed the impact of key parameters, such as number of trees, depth of trees and minimum size of leaf nodes on classification performance. Our results show that controlling the complexity of random forest classifiers applied to social media spam is important in order to avoid overfitting and optimize performance We also conclude that in order to support reproducibility of experimental results it is important to report key parameters of random forest classifiers.
Citation
(2017). A systematic analysis of random forest based social media spam classification. In Network and System Security. NSS 2017 (427-438)
Acceptance Date | Jun 12, 2017 |
---|---|
Publication Date | Aug 1, 2017 |
Pages | 427-438 |
Series Title | International Workshop on Security Measurements of Cyber Networks |
Book Title | Network and System Security. NSS 2017. |
ISBN | 9783319647005 |
Publisher URL | http://doi.org/10.1007/978-3-319-64701-2_31 |
Downloadable Citations
About Keele Repository
Administrator e-mail: research.openaccess@keele.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search