A framework to create malware detectors based on machine learning.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

DetEXE

A tool to build and analyze static malware detectors, based on machine learning.

DetEXE allows the selection of different features to train malware detectors through the LightGBM framework. The project is developed in a way that users can contribute by adding new features and combining them. It also offers the the options to compare the created models and evaluate the detectors' robustness by perturbing malware files.

Installation

To install the latest version:

$ pip install detexe

Setup

Set up DetEXE environment variable.

$ export DETEXE_ROOT=$PWD

Create a project layout containing the needed directories to store the data of the project.

$ detexe setup

Add executable samples to the benign and malware directories. You can obtain them from different sources. SOREL, ViruSshare... (As you are working with malware samples, please, take the safety measures).
Configure the features_selection.txt file with the features you wish to extract from the files.
In case you would like to select the feature OpCodeVectors, you will need to use previously the following command, to create the W2V model.

$ detexe opcodesw2v

How to use

CLI

Train your model.

$ detexe train --model="foo"

Execute adversarial attacks on your trained model.

It is possible to select one specific attack, or all ddiferent attacks with one command:

$ detexe attack padding --model="foo" --malware="/malware/path.exe"

$ detexe attack all --model="foo" --malware="/malware/path.exe"

Compare the trained models.

$ detexe compare

Search for optimal parameters to obtain better result in training. These parameters will be saved in the model directory.

$ detexe tune --model="foo"

Scan a PE file with a trained model.

$ detexe scan --model="foo" --exe="/malware/path.exe"

Python

Import functions and classes.

import os
from detexe import configure_layout, train_opcode_vectors, Detector, Attacker, compare

Setup project directories.

os.environ["DETEXE_ROOT"] = os.path.dirname(os.path.abspath(__file__))
configure_layout()

Configure the features_selection.txt file with the features you wish to extract from the files.
In case you would like to select the feature OpCodeVectors, you will need to train previously the W2V model.

train_opcode_vectors()

Instanciate a detector object

detector = Detector(model="model_foo", config_features="/path/to/features_selection.txt")

With the instance of detector you will be able to train, tune and scan.

detector.train()  # Train the model
detector.tune()  # Tune the hyperparameters
detector.scan("/path/to/exe")  # Scan a file

The efficiency of the created models can be compared, and visualized in a created graph.

compare("model_comparation.png")

Evaluate the robustness of a certain model.

attacker = Attacker(model="model_foo")
attacker.malware("/path/to/malware.exe")  # Choose the malware to ve modified for attacking the model.
attacker.all_attacks() # Choose one specific attack or all.

Add your own features

Add new feature class in separated file under ./detexe/ped/features/your_feature.
Update ./features_selection.txt file.

Built With

LIEF - A cross-platform library which can parse, modify and abstract ELF, PE and MachO formats.
EMBER - Elastic Malware Benchmark for Empowering Researchers.
SecML Malware - Python library for creating adversarial attacks against Windows Malware detectors.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.0.2.4

Mar 29, 2022

0.0.2.3

Jan 16, 2022

0.0.2.2

Dec 27, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

detexe-0.0.2.4.2.tar.gz (50.5 kB view hashes)

Uploaded Mar 29, 2022 Source

Hashes for detexe-0.0.2.4.2.tar.gz

Hashes for detexe-0.0.2.4.2.tar.gz
Algorithm	Hash digest
SHA256	`c7f8884c1d312d875202047fd21090b0df03b0afcdecc5475762685261c2bc4b`
MD5	`c8498a61ff3fcffde568f56509e797b7`
BLAKE2b-256	`9fd3e5eb84e14ad41c0f762ddc9b72549498a1cf43a6d63b7e8a363032c618be`