These labels were automatically added by AI and may be inaccurate. For details, see About Literature Database.
Abstract
The proliferation of spam on the Web has necessitated the development of
machine learning models to automate their detection. However, the dynamic
nature of spam and the sophisticated evasion techniques employed by spammers
often lead to low accuracy in these models. Traditional machine-learning
approaches struggle to keep pace with spammers' constantly evolving tactics,
resulting in a persistent challenge to maintain high detection rates. To
address this, we propose blockchain-enabled incentivized crowdsourcing as a
novel solution to enhance spam detection systems. We create an incentive
mechanism for data collection and labeling by leveraging blockchain's
decentralized and transparent framework. Contributors are rewarded for accurate
labels and penalized for inaccuracies, ensuring high-quality data. A smart
contract governs the submission and evaluation process, with participants
staking cryptocurrency as collateral to guarantee integrity. Simulations show
that incentivized crowdsourcing improves data quality, leading to more
effective machine-learning models for spam detection. This approach offers a
scalable and adaptable solution to the challenges of traditional methods.
External Datasets
phishing websites dataset
ISCX-URL2017
References
white paper
A next-generation smart contract and decentralized application platform
Buterin, V., et al.
Published: 2014
Proceedings of the 3rd international workshop on Adversarial information retrieval on the web
A taxonomy of javascript redirection spam
Chellapilla, K., Maykov, A.
Published: 2007
IEEE
Spam e-mail identification using blockchain technology
Choudhari, S., Das, S.
Published: 2021
Journal of Big Data
Survey of review spam detection using machine learning techniques
Crawford, M., Khoshgoftaar, T.M., Prusa, J.D., Richter, A.N., Al Najada, H.
Published: 2015
Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication
Measuring and fingerprinting click-spam in ad networks