CONDENSER: A Graph-Based Approachfor Detecting Botnets

TOP Literature Database CONDENSER: A Graph-Based Approachfor Detecting Botnets

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1410.8747

PDF

https://arxiv.org/pdf/1410.8747

Paper Information

Author: Pedro Camelo,Joao Moura,Ludwig Krippahl
Published: 10-31-2014
Affiliation: AnubisNetworks R&D
Country: Portugal
Conference

Labels Estimated by AI

Clustering methods Network Packet Management Communication Efficiency

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Botnets represent a global problem and are responsible for causing large financial and operational damage to their victims. They are implemented with evasion in mind, and aim at hiding their architecture and authors, making them difficult to detect in general. These kinds of networks are mainly used for identity theft, virtual extortion, spam campaigns and malware dissemination. Botnets have a great potential in warfare and terrorist activities, making it of utmost importance to take action against. We present CONDENSER, a method for identifying data generated by botnet activity. We start by selecting appropriate the features from several data feeds, namely DNS non-existent domain responses and live communication packages directed to command and control servers that we previously sinkholed. By using machine learning algorithms and a graph based representation of data, then allows one to identify botnet activity, helps identifying anomalous traffic, quickly detect new botnets and improve activities of tracking known botnets. Our main contributions are threefold: first, the use of a machine learning classifier for classifying domain names as being generated by domain generation algorithms (DGA); second, a clustering algorithm using the set of selected features that groups network communication with similar patterns; third, a graph based knowledge representation framework where we store processed data, allowing us to perform queries.

External Datasets

Top 10,000 domains from Alexa

approximately 10,000 DGA domains provided by AnubisNetworks