Skip to content

Latest commit

 

History

History
140 lines (109 loc) · 3.92 KB

File metadata and controls

140 lines (109 loc) · 3.92 KB

Malware_Classifier_For_PE_Files

Command line tool for scoring PE format file on PE header's data.

Dataset Features characteristics

Size of the dataset

  • Rows : 50211
  • COLS : 28

BaseOfCode

  • DESCRIPTION : The address that is relative to the image base of the beginning-of-code section when it is loaded into memory.
  • Type : Categorical
  • Distinct values : 665

BaseOfData

  • DESCRIPTION : The address that is relative to the image base of the beginning-of-data section when it is loaded into memory.
  • Type : Categorical
  • Distinct values : 1667

Characteristics

  • DESCRIPTION : The flags that indicate the attributes of the file.
  • Type : Categorical
  • Distinct values : 72

DllCharacteristics

  • Description : (Not Found)
  • Type : Categorical
  • Distinct values : 60

Entropy **

  • Description : (Not Found)
  • Type : Categorical
  • Distinct values : 60

FileAlignment

  • DESCRIPTION : The alignment factor (in bytes) that is used to align the raw data of sections in the image file.
  • Type : Numerical
  • Distinct values : 7

ImageBase

  • DESCRIPTION : The preferred address of the first byte of image when loaded into memory.
  • Type : Identifier
  • Distinct values : 3809

ImportedDlls **

  • DESCRIPTION : The preferred address of the first byte of image when loaded into memory.
  • Type : Identifier
  • Distinct values : 3809

ImportedSymbols **

  • DESCRIPTION : The preferred address of the first byte of image when loaded into memory.
  • Type : Identifier
  • Distinct values : 3809

Machine

  • DESCRIPTION : Type of target machine on which the image file will be able to run.
  • Type : Categorical
  • Distinct values : 3

Magic

  • DESCRIPTION : The unsigned integer that identifies the state of the image file.
  • Type : Categorical
  • Distinct values : 1

NumberOfRvaAndSizes

  • Description : The number of data-directory entries in the remainder of the optional header. Each describes a location and size.
  • Type : Numerical
  • Distinct values : 31

NumberOfSections

  • Description : This indicates the size of the section table, which immediately follows the headers.
  • Type : Numerical
  • Distinct values : 21

NumberOfSymbols

  • Description : The number of entries in the symbol table.
  • Type : Numerical
  • Distinct values : 77

PointerToSymbolTable

  • Description : The file offset of the COFF symbol table, or zero if no COFF symbol table is present.
  • Type : Categorical
  • Distinct values : 92

Size

  • Description : (Not Found)
  • Type : Numerical
  • Distinct values : 18419

SizeOfCode

  • DESCRIPTION : The size of the code (text) section, or the sum of all code sections.
  • Type : Numerical
  • Distinct values : 5431

SizeOfHeaders

  • DESCRIPTION : The combined size of an MS-DOS stub, PE header, and section headers rounded up to a multiple of FileAlignment.
  • Type : Numerical
  • Distinct values : 15

SizeOfImage

  • Description : The size (in bytes) of the image, including all headers, as the image is loaded in memory. It must be a multiple of SectionAlignment.
  • Type : Numerical
  • Distinct values : 3408

SizeOfInitializedData

  • DESCRIPTION : The size of the initialized data section, or the sum of all such sections.
  • Type : Numerical
  • Distinct values : 4728

SizeOfOptionalHeader

  • DESCRIPTION : The size of the optional header, which is required for executable files but not for object files.
  • Type : Numerical
  • Distinct values : 1

SizeOfUninitializedData

  • DESCRIPTION : The size of the uninitialized data section (BSS), or the sum of all such sections.
  • Type : Numerical
  • Distinct values : 741

TimeDateStamp

  • DESCRIPTION : The time and date that the debug data was created.
  • Type : Identifier
  • Distinct values : 24241

Data to be removed from csv

FirstSeenDate

  • Index is : 6

Identify

  • Index is : 7

Label

  • Index is : 11

PE_TYPE

  • Index is : 17

SHA1

  • Index is : 19