Collection of scripts to pinpoint obfuscated code

Last update: Nov 26, 2022

Overview

Obfuscation Detection (v1.0)

Author: Tim Blazytko

Automatically detect control-flow flattening and other state machines

Description:

Scripts and binaries to automatically detect control-flow flattening and other state machines in binaries.

The implementation is based on Binary Ninja. See the following blog post for more information:

Automated Detection of Control-flow Flattening
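
To make the idea of a flattening score concrete, here is a minimal sketch of one dominance-based heuristic on a toy control-flow graph. It is an illustration only and is not taken from this repository: the graph representation (a dict of successor lists), the function names compute_dominators and flattening_score, and the scoring rule itself are assumptions, and the sketch does not use the Binary Ninja API. The assumed rule is: a block that dominates most of the function and is the target of a jump from a block it dominates looks like a dispatcher, so the score is the largest such dominated fraction.

```python
from typing import Dict, List, Set

Cfg = Dict[str, List[str]]  # hypothetical representation: block -> successors


def compute_dominators(cfg: Cfg, entry: str) -> Dict[str, Set[str]]:
    """Return, for every block, the set of blocks that dominate it."""
    nodes = set(cfg)
    preds: Dict[str, Set[str]] = {n: set() for n in nodes}
    for src, succs in cfg.items():
        for dst in succs:
            preds[dst].add(src)

    # Classic iterative data-flow formulation: start from "everything
    # dominates everything" and shrink the sets until nothing changes.
    dom = {n: set(nodes) for n in nodes}
    dom[entry] = {entry}
    changed = True
    while changed:
        changed = False
        for n in nodes - {entry}:
            incoming = [dom[p] for p in preds[n]]
            new = {n} | (set.intersection(*incoming) if incoming else set())
            if new != dom[n]:
                dom[n] = new
                changed = True
    return dom


def flattening_score(cfg: Cfg, entry: str) -> float:
    """Assumed heuristic: largest fraction of blocks dominated by a block
    that is jumped back to from one of the blocks it dominates."""
    dom = compute_dominators(cfg, entry)
    best = 0.0
    for block in cfg:
        dominated = {n for n in cfg if block in dom[n]}
        has_back_edge = any(block in cfg[n] for n in dominated)
        if has_back_edge:
            best = max(best, len(dominated) / len(cfg))
    return best


# Toy flattened function: every case block returns to the dispatcher.
flattened = {
    "entry": ["dispatcher"],
    "dispatcher": ["case1", "case2", "case3", "exit"],
    "case1": ["dispatcher"],
    "case2": ["dispatcher"],
    "case3": ["dispatcher"],
    "exit": [],
}
# Toy straight-line function without any loop.
linear = {"a": ["b"], "b": ["c"], "c": []}

print(flattening_score(flattened, "entry"))
print(flattening_score(linear, "a"))
```

In the toy flattened graph the dispatcher dominates five of the six blocks and is re-entered from the case blocks, so the sketch assigns it a high score, while the straight-line graph scores zero. Ordinary loops also produce non-zero scores under this rule, so a threshold would be needed in practice.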

Usage

$ ./detect_flattening.py samples/finspy 
Function 0x401602 has a flattening score of 0.9473684210526315.
Function 0x4017c0 has a flattening score of 0.9981378026070763.
Function 0x405150 has a flattening score of 0.9166666666666666.
Function 0x405270 has a flattening score of 0.9166666666666666.
Function 0x405370 has a flattening score of 0.9984544049459042.
Function 0x4097a0 has a flattening score of 0.9992378048780488.
Function 0x412c70 has a flattening score of 0.9629629629629629.
Function 0x412df0 has a flattening score of 0.9629629629629629.
Function 0x412f70 has a flattening score of 0.9927007299270073.
Function 0x4138e0 has a flattening score of 0.9629629629629629.
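
When the output is long, it can help to post-process it. The following is a small sketch that parses lines in the format shown above and keeps only functions above a chosen score. The helper names parse_scores and above_threshold, and the 0.99 threshold, are hypothetical and not part of the repository.

```python
import re
from typing import Iterable, List, Tuple

# Matches lines such as:
#   Function 0x401602 has a flattening score of 0.9473684210526315.
LINE_RE = re.compile(
    r"Function (0x[0-9a-fA-F]+) has a flattening score of "
    r"([0-9]+(?:\.[0-9]+)?)\.?\s*$"
)


def parse_scores(lines: Iterable[str]) -> List[Tuple[int, float]]:
    """Extract (function address, score) pairs from script output lines."""
    results = []
    for line in lines:
        match = LINE_RE.search(line.strip())
        if match:
            results.append((int(match.group(1), 16), float(match.group(2))))
    return results


def above_threshold(
    results: List[Tuple[int, float]], threshold: float
) -> List[Tuple[int, float]]:
    """Keep only entries whose score is at least `threshold`."""
    return [(addr, score) for addr, score in results if score >= threshold]


sample_output = [
    "Function 0x401602 has a flattening score of 0.9473684210526315.",
    "Function 0x4017c0 has a flattening score of 0.9981378026070763.",
    "Function 0x405150 has a flattening score of 0.9166666666666666.",
]

parsed = parse_scores(sample_output)
for addr, score in above_threshold(parsed, 0.99):
    print(f"{addr:#x}: {score}")
```

The same functions could be fed from a pipe, for example by reading sys.stdin instead of the hard-coded sample list.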

Note

The password for the zipped malware samples is "infected". To unpack, use the following command line:

$ unzip -P infected samples.zip

Contact

For more information, contact @mr_phrazer.

Comments

plugin?

Are you interested in a PR that adds a plugin.json? That would let the scripts be used either in headless mode on the command line or via the UI inside Binary Ninja itself, and would make them installable via the plugin manager.

opened by psifertex 2

Replace Counter.total() for users with Python < 3.10

I'm running Binary Ninja on Windows 10 with Python 3.9.2, which means the Counter.total() call in calc_uncommon_instruction_sequences_score() doesn't work. I've replaced it with sum(counter.values()), which should do the same thing.

opened by samrussell 1
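
The replacement described in this comment can be sketched as follows. Counter.total() was added in Python 3.10, while sum(counter.values()) works on older versions as well. The helper name counter_total and the sample instruction mnemonics are made up for illustration and are not taken from the repository.

```python
from collections import Counter


def counter_total(counter: Counter) -> int:
    """Sum of all counts; equivalent to Counter.total() on Python >= 3.10."""
    return sum(counter.values())


# Hypothetical sample data standing in for counted instruction sequences.
mnemonics = Counter(["mov", "mov", "xor", "add"])
print(counter_total(mnemonics))  # 4
```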

Releases (v1.4)

v1.4 (Feb 23, 2022)

v1.3 (Feb 14, 2022)

v1.2 (Aug 14, 2021)

v1.1 (Aug 10, 2021)

v1.0 (Mar 5, 2021)

Owner

Tim Blazytko
