Malicious Macro Bot Project

The main goals of this project are to:

Provide a powerful malicious file triage tool for cyber responders.
Help fill existing detection gaps for malicious office documents, which are still a very prevalent attack vector today.
Deliver a new avenue for threat intelligence: a way to group similar malicious office documents together to identify phishing campaigns and track the use of specific malicious document templates.

These goals are achieved through clever feature engineering and applied machine learning techniques like Random Forest and TF-IDF.
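
To illustrate how TF-IDF can help group similar macros, here is a minimal, standard-library-only sketch. It is not the project's actual feature pipeline: the tokenizer, the smoothed IDF formula, the helper names (tokenize, tfidf_vectors, cosine_similarity), and the sample macro strings are all assumptions made for this example.

```python
import math
import re
from collections import Counter


def tokenize(vba_text):
    """Split macro source into lowercase word tokens (a deliberately crude tokenizer)."""
    return [tok for tok in re.split(r"[^A-Za-z0-9_]+", vba_text.lower()) if tok]


def tfidf_vectors(documents):
    """Return one sparse {term: weight} dict per document, using a smoothed IDF."""
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))  # count each term once per document
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            term: (count / total) * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical macro snippets: two variants of one template, one unrelated macro.
samples = [
    'Sub AutoOpen() Shell "powershell -enc AAAA" End Sub',
    'Sub AutoOpen() Shell "powershell -enc BBBB" End Sub',
    'Sub FormatCells() Range("A1").Font.Bold = True End Sub',
]
vecs = tfidf_vectors(samples)
sim_related = cosine_similarity(vecs[0], vecs[1])
sim_unrelated = cosine_similarity(vecs[0], vecs[2])
print(sim_related > sim_unrelated)
```

The two macros that share a template score higher against each other than against the unrelated macro, which is the property that makes this kind of weighting useful for clustering documents from the same campaign.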

Installation

sudo pip install mmbot

That's it! Otherwise, check out the source in this Git repository.

Usage Examples

Triage office files with five lines of code

Import, instantiate, predict:

from mmbot import MaliciousMacroBot

mmb = MaliciousMacroBot()
mmb.mmb_init_model()  # initialize the model before predicting
result = mmb.mmb_predict('./your_path/your_file.xlsm', datatype='filepath')
print(result.iloc[0])  # first row of the returned DataFrame

Note: mmb_predict() returns a pandas DataFrame. If you are unfamiliar with DataFrames, a helper function converts a useful summary of the prediction result to JSON.

Convert result from pandas DataFrame to JSON

print(mmb.mmb_prediction_to_json(result))

This package was designed for flexibility. The mmb_predict() function accepts input in any of the following forms:

A path to a single office document.
A path to a directory, in which case all files in it and its subdirectories are analyzed recursively.
The raw byte stream of a file.
A string of VBA text that a different tool has already extracted.

Any of these can also be processed in bulk mode, where the input is a pandas DataFrame. The method decides how to handle the input based on the "datatype" argument and the actual Python object type passed in.

More Information

Python 3.6+ is fully supported. We have tested with Python 3.6.2.

Update v1.0.10

Added Python 3.6 support
Reimplemented the model so it loads with joblib
The prediction now includes a "confidence" score on a scale from 0 to 1.0
Larger model, factoring in approx. 40,000 macro-enabled samples, with 10,000 benign
Tuned RandomForest model to use 100 prediction trees and up to 20% of the features in the prediction
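
For readers who want to see what such a configuration might look like, the following sketch uses scikit-learn's RandomForestClassifier with 100 trees and max_features=0.2. The mapping of "up to 20% of the features" to max_features=0.2, the use of predict_proba as the confidence score, and the toy feature matrix are assumptions for illustration; the project's real features and training code are not shown here.

```python
from sklearn.ensemble import RandomForestClassifier

# Toy feature matrix: rows are documents, columns are hypothetical numeric features.
X = [
    [0, 0, 1, 0, 5],
    [0, 1, 1, 0, 4],
    [1, 0, 0, 1, 0],
    [1, 1, 0, 1, 1],
    [0, 0, 1, 0, 6],
    [1, 0, 0, 1, 0],
]
y = [0, 0, 1, 1, 0, 1]  # 0 = benign, 1 = malicious (labels invented for this sketch)

# 100 trees; each split considers a 20% fraction of the features (assumed mapping).
clf = RandomForestClassifier(n_estimators=100, max_features=0.2, random_state=0)
clf.fit(X, y)

# The class probability for "malicious" serves as a 0 to 1.0 confidence value here.
proba = clf.predict_proba([[1, 0, 0, 1, 0]])[0]
malicious_index = list(clf.classes_).index(1)
confidence = proba[malicious_index]
print(confidence)
```

In scikit-learn, a float max_features is interpreted as a fraction of the total feature count considered at each split, so the setting scales with however many features the model uses.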

License

Free software: MIT License
Documentation: https://maliciousmacrobot.readthedocs.io.
