Using convolutional neural networks for classification of malware represented as images
MetadataShow full item record
The number of malicious files detected every year are counted by millions. One of the main reasons for these high volumes of different files is the fact that, in order to evade detection, malware authors add mutation. This means that malicious files belonging to the same family, with the same malicious behavior, are constantly modified or obfuscated using several techniques, in such a way that they look like different files. In order to be effective in analyzing and classifying such large amounts of files, we need to be able to categorize them into groups and identify their respective families on the basis of their behavior. In this paper, malicious software is visualized as gray scale images since its ability to capture minor changes while retaining the global structure helps to detect variations. Motivated by the visual similarity between malware samples of the same family, we propose a file agnostic deep learning approach for malware categorization to efficiently group malicious software into families based on a set of discriminant patterns extracted from their visualization as images. The suitability of our approach is evaluated against two benchmarks: the MalImg dataset and the Microsoft Malware Classification Challenge dataset. Experimental comparison demonstrates its superior performance with respect to state-of-the-art techniques.
Is part ofJournal of Computer Virology and Hacking Techniques, 2019, vol. 15, núm. 1, p. 15–28
European research projects
Showing items related by title, author, creator and subject.
The rise of machine learning for detection and classification of malware: Research developments, trends and challenge Gibert Llauradó, Daniel; Mateu Piñol, Carles; Planes Cid, Jordi (Elsevier, 2020)The struggle between security analysts and malware developers is a never-ending battle with the complexity of malware changing as quickly as innovation grows. Current state-of-the-art research focus on the development and ...
An argumentative approach for discovering relevant opinions in Twitter with probabilistic valued relationships Alsinet, Teresa; Argelich Romà, Josep; Béjar Torres, Ramón; Fernàndez Camon, César; Mateu Piñol, Carles; Planes Cid, Jordi (Elsevier, 2017-07-08)Twitter is one of the most widely used social networks when it comes to sharing and criticizing relevant news and events. In order to understand the major opinions accepted and rejected in different domains by Twitter ...
Gómez, Sergio Alejandro; Chesñevar, Carlos Iván (Iberoamerican Science & Technology Education Consortium, 2004)Many classification systems rely on clustering techniques in which a collection of training examples is provided as an input, and a number of clusters c1,...cm modelling some concept C results as an output, such that every ...