Skip to content

Latest commit

 

History

History
254 lines (164 loc) · 30.8 KB

README.md

File metadata and controls

254 lines (164 loc) · 30.8 KB

Malware Behavior Catalog Frequently Asked Questions

What's New?

  • MBC v3.0 - The latest MBC release includes behavior detection information, expanded descriptions, and new properties, such as version and created and modified dates.

  • STIX 2.1 Representation - MBC content is available in an updated STIX format based on the new STIX 2.1 Malware Behavior Extension.

  • Attack Flow Examples - Example attack flows for Shamoon and SearchAwesome reference MBC behaviors.

  • CACAO Playbook Example - An example CACAO playbook for Locky Bart references MBC behaviors.

  • MBC Website - We're developing a website to replace the current markdown content.

  • MBC Mailing List - To join the MBC mailing list, please send a request to [email protected].

Overview

What are the primary use cases of MBC?

  • Standardized reporting: Enables consistent interpretation of result data to improve detection, mitigation, and remediation.
  • Correlation of malware analysis results: Correlates results of automated tools (e.g., sandboxes) or manual analysis, validating results or identifying areas for further investigation.
  • Labeled data sets for malware research: Using MBC to label malware samples provides a collection of malware meeting chosen criteria that can support research (e.g., assessment of tool effectiveness, similarity analysis).
  • Malware analysis support: MBC's set of identified, organized behaviors helps an analyst know what to look for, informing the malware analysis process.

What are objectives?

Objectives correspond to the intentions behind malware behaviors. For example, malware may use Hooking (behavior) to load and execute code within the context of another process either to hide its execution (defense evasion objective), to gain elevated privileges (privilege escalation objective), or to access the process's memory (credential access objective).

How were behaviors identified?

As discussed below, many MBC behaviors stemmed from MAEC and EMA. Mapping capa rules to MBC was another avenue for defining behaviors (especially MBC's micro-behaviors). We also identified MBC behaviors by studying publicly available analysis reports and output of automated analysis engines (some of which map behavior indicators to ATT&CK). Such resources enabled identification of MBC-enhanced ATT&CK techniques, as well as the definition of new behaviors for results that could not be mapped to ATT&CK.

What are methods?

Methods are variations of behaviors and are provided to help explain behaviors. Although it would be hard to enumerate all methods associated with a behavior, our aim is to define those most commonly used by malware.

What are micro-behaviors?

Micro-behaviors are low-level malware behaviors that support other objectives and behaviors.

What are code snippets?

A code snippet illustrates a behavior or method implementation. See Anti-Behavioral Analysis::Sandbox Detection::Product Key/ID Testing for an example snippet.

Are all behaviors for a malware sample captured in its MBC corpus entry?

A behavior is listed for an sample in the MBC corpus only if an associated analysis report contains a narrative for the behavior or if the behavior is identified by an automated tool. For automated tools, the description is taken from the rule name (e.g., "initialize Winsock library" --> "Shamoon initializes a Winsock library").

While under development, a corpus example may contain a limited number of behaviors (e.g., behaviors of particular interest may be added first).

Are all known IoCs captured in a corpus entry?

Corpus entries include some but not all indicators of compromise (IoCs) due to space considerations.

What is the MBC matrix?

The MBC matrix is a visual representation of MBC based on the ATT&CK Navigator. Two views are available:

Does MBC have version control?

There was no version control when MBC v1.0 was released at the end of January 2020. However, further changes have been tracked:

  • MBC v2.0 was released in September 2020 and includes micro-behaviors and changes associated with ATT&CK sub-techniques.
  • MBC v2.1 was released in February 2021 and includes additional micro-behaviors and behavior methods.
  • MBC v2.2 was released in February 2022 and includes additional micro-behaviors and behavior methods. Added code snippets to certain methods.
  • MBC v2.3 was released in September 2022 and aligns with ATT&CK v11 and includes an updated malware corpus.
  • MBC v3.0 was released in September 2023 and includes behavior detection information, expanded descriptions, and new properties.

Using MBC

What does it mean to map a malware analysis report to MBC?

Malware analysis reports contain information about a malware's behaviors, but the information is in text and not standardized. Mapping a report entails interpreting the sentences to identify the associated MBC behaviors and ATT&CK techniques.

Most of the content in the MBC malware corpus came from mapping malware analysis reports. Please see Kovter and Poison Ivy as good examples of report mappings.

Are there any tips for successful report mappings?

While building the malware corpus, the MBC team compiled the following tips for mapping malware analysis reports to MBC and ATT&CK.

  • Review ATT&CK and MBC content to become familiar with options for mapping. The MBC review should include micro-behaviors.
  • If a malware analysis report contains explicit mappings to ATT&CK techniques, start by reviewing and validating them. We found it was easier to validate ATT&CK mappings before identifying unmapped behaviors because ATT&CK mappings usually provide an overview.
  • Check whether an ATT&CK technique has been enhanced in MBC and if so, update the mapping to reflect the MBC identifier.
  • If a report does not explicitly reference ATT&CK techniques, it’s still easier to consider ATT&CK mappings before MBC mappings because ATT&CK techniques are generally higher level than MBC behaviors.
  • The ATT&CK Powered Suit browser extension is useful for quickly finding potential ATT&CK techniques.
  • The MBC matrix is useful for identifying MBC behaviors and methods (although it doesn’t have search capabilities).
  • If the mapping of a behavior is unclear, mark it for further consideration and move on. An uncertain behavior mapping often becomes clearer later in the process.
  • Analysis reports often contain initial compromise information that is out of scope of MBC (i.e., activity independent of the malware code).

How do I map an analysis product's output to MBC when I don't know the details behind the behavior indicators?

Ideally, the product vendor will provide the MBC mapping, but if not and it's not possible to ask the signature developer for details, it may be best to map the indicator to all potentially relevant MBC behaviors (a one-to-many mapping). For example, the indicator "opened listening port" could be mapped to both COMMAND AND CONTROL::C2 Communication and IMPACT::Remote Access.

Do objectives and behaviors have to be specified in pairs? Can MBC behaviors be used without MBC objectives? Can objectives be used without behaviors?

Objectives and behaviors do not need to be specified in pairs. Because it's not always possible to know intent, MBC behaviors can be used without objectives. For example, automated sandbox analysis may indicate hooking behavior without corresponding information on intent, in which case objectives might not be specified; alternatively, all objectives associated with a behavior might be noted.

If lower level behaviors are not known, it may be appropriate to only reference an MBC objective. For example, if a sandbox reports that a malware sample exhibits "self-defense" with no other details, the information is best captured by the Defense Evasion objective, without specifying any specific behaviors.

When should methods be used?

Methods may be referenced in analyses as a refinement of behaviors.

Can methods be used alone?

Because methods are specific to a behavior, they're always associated with a behavior and aren’t used on their own.

When should micro-behaviors be used?

Micro-behaviors should be used when context isn’t clear but there is value in capturing basic mechanics. For example, a behavior of “write and execute a file” can ideally map to Execution::Install Additional Program, but if the context isn't clear, the behaviors can be captured as File System::Create File and Process::Create Process.

Can malware behaviors identified via manual analysis map to multiple MBC behaviors, or should correspondence be one-to-one?

Each malware behavior identified during analysis and associated with a specific code snippet should map to a single MBC behavior. If multiple behaviors seem to apply, the reported behavior should be considered to see whether it can be broken into smaller components that lead to one-to-one mappings.

For example, a GoBotKR report reflecting manual analysis reads, "The malware installs two instances of itself on the system. The second instance (watchdog) monitors whether the first instance is still active and reinstalls it if it has been removed from the system" [2]. Initially, one might think this is a single behavior, but it can be broken apart and mapped into two behaviors:

Although one-to-one mappings are best practice, there may be some instances when an analyst may choose to map a snippet to multiple MBC behaviors.

Can malware behaviors identified by automated sandboxes and tools map to multiple MBC behaviors or should correspondence be one-to-one?

Behaviors identified by automated tools are often intentionally broad to give an overview of the malware sample's behavior. Broad behaviors will often map to multiple MBC behaviors. For example, if a tool reports that Armadillo was used on a malware sample, both Executable Code Obfuscation and Software Packing behaviors apply.

How are MBC behaviors, possibly at different levels of abstraction, associated?

MBC does not define relationships between behaviors, so association of behaviors must be done at the reporting level. The GoBotKR example above illustrates how two behaviors can be associated by the human-readable text: "The malware installs two instances of itself on the system. The second instance (watchdog) monitors whether the first instance is still active and reinstalls it if it has been removed from the system" [2].

Can variant names captured by an anti-virus tool be captured in MBC?

No. MBC captures behaviors and characteristics directly associated with malware code. Variant names are outside MBC's scope. A variant name may lead to published reports, in which case, one could use the report to map the sample to its associated behaviors.

Can MBC capture specific data values associated with a behavior?

No. For example, MBC might capture the behavior ANTI-BEHAVIORAL ANALYSIS::Emulator Detection::Check Emulator-related Registry Keys, but the specific registry key (e.g., SYSTEM\ControlSet001\Services\VBoxMouse (VBOX)) would be captured outside of MBC (e.g., as STIX Windows Registry Key Object).

If malware displays only some attributes defining a behavior, it is correct to say it exhibits the behavior?

Yes. In many cases, malware will display only a subset of a behavior's attributes. For example, the System Information Discovery behavior includes discovery of a hostname, operating system version, patch information, processor architecture, etc. Malware that gathers only a hostname would still be said to exhibit the System Information Discovery behavior.

What if no MBC behavior is defined for something I need to capture?

The MBC will evolve to better support the malware analysis community. If you have a suggestion for a new behavior (or any content change), please open an issue on GitHub.

Some MBC behaviors seem to be characteristics of code, not actual behaviors (e.g., Executable Code Optimization). Why are they captured and why are they called behaviors?

MBC captures traits of malware that are evident through code analysis to support malware analysis use cases. For example, capturing a characteristic such as Executable Code Optimization supports use cases such as attribution and detection by indicating that optimization has been used to make the code harder to analyze. Such "characteristics" are captured as "behaviors" to simplify MBC's structure; however, their descriptions identify them as "characteristics."

Characteristics are evident by looking at the code and may not relate to specific code snippets. For example, a high-level look at a malware instance can reveal it is encrypted (Obfuscated Files or Information ).

Observables that do not reflect on the malware's code are not captured in MBC; for example, strings and data in the code are not captured in MBC.

Do malware behaviors and adversary behaviors overlap?

Malware behaviors and adversary behaviors can overlap because adversaries sometimes use malware to achieve their goals. However, MBC only captures behaviors associated with malware code. In other words, MBC behaviors are identified through analysis of a malware sample's binary code or by observing the malware on a network (or in a disassembler), whereas adversary behaviors may be derived from a variety of indicators on a system or network.

STIX Representation

How are MBC behaviors captured in STIX 2?

MBC content is available in STIX 2.1 format. See the mbc-stix2.1 repository for details. MBC content is also available in an older STIX 2.1 representation based on the representation used for ATT&CK.

Why was MBC's STIX representation updated?

The previous STIX 2.1 representation is valid, but it doesn't take advantage of the STIX 2.1 Extension Definition Object. MBC users said they found the MBC representation (and ATT&CK) kludgy, so we defined new SDOs and extended the STIX Malware Object in a malware behavior extension. We think it works much better!

How are malware corpus examples captured in STIX?

Corpus examples are captured using the Malware SDO. Three new properties are defined in the STIX 2.1 Malware Behavior extension:

  • obj_defn (required) provides a reference to the malware that includes the MBC ID.
  • year (optional) denotes the year the malware was first seen.
  • platforms (optional) denotes the operating system affected by the malware.

Why is the STIX "is_family" property set to true for all the malware in the corpus?

The "is_family" property (defined on the Malware SDO) indicates whether an object represents a malware family (if true) or a malware instance (if false). All the malware in the corpus represent malware families.

How is MBC versioning captured in STIX?

The "Created" and "Last Modified" metadata for versioned MBC objects are captured in the corresponding STIX object's "created" and "modified" common properties. In such cases, the STIX properties do not correspond to the created and modified dates of the STIX object, but they remain effective for versioning because the modified property is updated each time MBC content changes.

Malware Analysis Tools

MBC is used in the following malware analysis tools.

capa

The capa tool is an open-source static analysis tool developed by the FLARE team. Most capa rules are associated with MBC behaviors and/or ATT&CK techniques. The MBC team reviews new capa rules quarterly, sometimes suggesting additions or updates.

CAPE

The CAPE sandbox was derived from Cuckoo with the goal of adding automated malware unpacking and config extraction. CAPE maps its signatures to MBC and ATT&CK. The MBC team reviewed the mappings and suggested additions and updates.

Cuckoo Sandbox

Cuckoo Sandbox 2.0.7 includes mappings between its signatures and ATT&CK. The MBC team updated the mappings using MBC, which increased the accuracy due to MBC’s malware focus. A fork of the Cuckoo community repo is available on MBCProject (see community and cuckoo).

MBC and Other Community Efforts

What is the relationship between MAEC, EMA, and MBC?

In short, the Malware Attribute Enumeration and Characterization (MAEC) led to the Encyclopedia of Malware Attributes (EMA), and EMA led to the Malware Behavior Catalog.

MAEC is a community-developed structured language for encoding and sharing high-fidelity information about malware based upon attributes such as behaviors, artifacts, and relationships between malware samples. The vocabularies associated with MAEC define malware capabilities and behaviors, which are further defined and specified in EMA, along with "behavior instances" that capture specific malware instances/families exhibiting behaviors.

After noting clear overlap between EMA and ATT&CK, we initially decided that EMA would only capture malware behaviors not captured by ATT&CK (primarily anti-analysis behaviors). However, ATT&CK defines many techniques not applicable to malware, so to provide a well-defined, single collection of malware behaviors, MBC was created. MBC differs from EMA in that MBC was created in ATT&CK's image with the hope that by following ATT&CK's philosophy and methodology, MBC would be more useful and readily adopted.

What is an "attack flow" and how is MBC used in a flow?

Developed by MITRE Engenuity, Attack Flow is a data model with supporting tooling and examples for describing sequences of adversary behaviors. Attack flows can also describe sequences of malware behaviors, and MBC (along with ATT&CK) provides a supporting catalog of malware objectives and behaviors.

To illustrate MBC's use, we defined attack flows for Shamoon and SearchAwesome that reference MBC behaviors.

What is CACAO and how is MBC used in a playbook?

The OASIS Collaborative Automated Course of Action Operations (CACAO) project defines the standard for implementing course of action playbooks for cybersecurity operations. The sequence of behaviors exhibited by malware can be captured as an "attack" type playbook.

To illustrate MBC's use, we defined a playbook for Locky Bart, which is available in the CACAO example repository.

Can Pafish behaviors be mapped to MBC?

Yes. Pafish is a tool for detecting virtual machines and malware analysis environments. Malware analysts use Pafish to determine whether their analysis environment is likely to be detected by malware. [1] We studied the Pafish source code to identify its behaviors and mapped those behaviors to MBC. The results are here.

Relationship to ATT&CK

We have ATT&CK, so why do we need the Malware Behavior Catalog?

As a publicly available framework, The Malware Behavior Catalog (MBC) aims to directly and explicitly define malware behaviors and code characteristics to support malware analysis-oriented use cases. MBC defines behaviors outside ATT&CK's scope and enhances some ATT&CK techniques and sub-techniques to be malware-focused.

Operationally, most malware behaviors are identified via analysis, either direct analysis or from outside reporting or threat feeds. Subsequent tagging with MBC behaviors supports a variety of malware-related use cases: detection, mitigation and remediation, attribution, provenance and similarity scoring, and standardized reporting. Clearly defining methods malware uses to compromise systems or defeat analysis can also drive research toward improved detection, analysis, and prevention capabilities.

Why not merge malware behaviors into ATT&CK?

It's not feasible for ATT&CK to expand to cover all problem spaces, including the malware analysis space. As stated in the ATT&CK design and philosophy document [1], "The basis of ATT&CK is the set of individual techniques that represent actions that adversaries can perform to accomplish objectives." This differs from the malware community's need to capture information discovered via analysis of malware code.

While malware sometimes acts as a surrogate for an adversary (i.e., there is overlap between malware and adversary behavior), there are aspects unique to malware, mostly notably anti-analysis characteristics that are only identified during analysis: ANTI-BEHAVIORAL ANALYSIS and ANTI-STATIC ANALYSIS. Some MBC behaviors may be appropriate for inclusion in ATT&CK, but it is up to the ATT&CK team to identify them and integrate the content.

Is there a formal relationship between ATT&CK and MBC?

No. Given that MBC aims to serve as a model and test case for how ATT&CK can be expanded by second parties, we consult with the ATT&CK team to keep them apprised of our work, but there is no formal relationship between ATT&CK and MBC. While content of MBC was not coordinated with the ATT&CK team, we borrowed from ATT&CK's philosophy and methodology [1]. Namely, MBC will maintain a malware analysis-oriented perspective; focus on real-world use of behaviors through empirical malware examples, drawing upon publicly available analysis and reporting; and maintain a level of abstraction appropriate for supporting malware analysis use cases (e.g., develop malware signatures, mitigate infection, drive analytics, attribution).

ATT&CK defines "tactics" and "techniques." Why does MBC instead define "objectives" and "behaviors"?

The cyber adversary and malware analysis realms each have their own vocabulary, and MBC aims to reflect malware analysis common word usage. Just as "tactics," "techniques," and "procedures" (TTPs) are considered when discussing advanced persistent threats (APTs), "objectives" and "behaviors" are commonly considered when analyzing malware. Maintaining a distinct vocabulary also helps to differentiate MBC and ATT&CK content.

Why don't MBC behavior names always match ATT&CK technique names?

MBC maintains a malware, code-oriented perspective, so MBC behavior names will not always match related ATT&CK names. This is especially true when an MBC behavior was defined before an ATT&CK technique. In such cases, the preexisting MBC identifier is preserved and the new ATT&CK technique is referenced.

Are different MBC behaviors linked (related) to the same ATT&CK technique?

Yes. Sometimes an ATT&CK technique does not provide enough granularity for malware analysis-oriented use cases; therefore, multiple, more granular MBC behaviors may link to the same ATT&CK technique.

For example, MBC defines separate behaviors for detecting sandboxes (Sandbox Detection) and virtual machines (Virtual Machine Detection). When ATT&CK was updated to include the Virtualization/Sandbox Evasion technique, which includes detection of virtualization and sandboxes (although not obstruction in the sense of MBC's Dynamic Analysis Evasion behavior), the MBC behaviors were not combined. Rather, both were updated to show a relationship to the new ATT&CK technique.

Will MBC always remain distinct from ATT&CK, or someday might its content be merged into ATT&CK?

We can't predict what will happen in the long run, but currently there are no plans to merge MBC into ATT&CK. Over time, new ATT&CK techniques will be defined and some may directly pertain to malware in expanded ways. For example, the April 2019 ATT&CK update included a new technique - Virtualization/Sandbox Evasion - which may seem to blur the line between ATT&CK and MBC. However, the technique focuses on checks made by a human adversary (although malware is mentioned), and it's defined in the context of Defense Evasion and Discovery. Two related MBC behaviors pertain to malware anti-analysis objectives: [Sandbox Detection] and [Virtual Machine Detection] (ATT&CK doesn't define anti-analysis tactics).

Why aren't ATT&CK Initial Access techniques included in MBC?

Initial Access - the way malware gains an initial foothold within a system - is not typically determined by analyzing the malware's code. For example, in ATT&CK's Spearphishing Attachment sub-technique, the attachment may be a malicious executable file, but the code of the attached file's isn't related to its email delivery. This is not to say an analyst can't surmise initial access by studying a malware instance, but determining initial access requires evidence beyond the malware's code.

Some malware self-replicates or distributes other malware, but in keeping with MBC's malware, code-oriented perspective, such behaviors would be associated with the LATERAL MOVEMENT, EXECUTION, and/or IMPACT objectives in MBC. For example, if malware sends spearphishing email, its behavior would be captured by the Send Email behavior, which is associated with execution and lateral movement.

Why aren't PRE-ATT&CK techniques used by malware authors included in MBC?

For example, why doesn't the Obfuscate or Encrypt Code technique under the Adversary OPSEC tactic apply to malware showing signs of anti-analysis techniques?

PRE-ATT&CK is independent of technology and models an adversary's behavior as they attempt to gain access. By contrast, MBC captures behaviors associated with malware executable code. It does not capture human behaviors, even if the behaviors relate to malware. (Malware with obfuscated code, which makes analysis difficult, would be captured using the MBC Executable Code Obfuscation behavior.)

ATT&CK is organized by the Enterprise and Mobile technology domains. Why isn't MBC?

ATT&CK separates the Enterprise and Mobile domains because of how ATT&CK evolved. While both domains are included in MBC to support malware analysis use cases, we didn't see value in distinguishing between enterprise and mobile malware.

ATT&CK is a mid-level adversary model. Does MBC model malware in a similar way?

ATT&CK models the life cycle of a human adversary, which results in an ordering of tactics (not necessarily strict), starting with Initial Access and ending with Impact. Software (malware code) executes sequentially but from a timing, life cycle perspective, it executes simultaneously. Therefore, MBC objectives are presented alphabetically, and MBC doesn't provide a model in the same sense as ATT&CK: rather MBC captures an unordered collection of objectives and behaviors applicable to a malware sample.

MBC v1.0 included ATT&CK techniques? Why is MBC v2.x a stand-alone supplement to ATT&CK?

When MBC provided a complete set of malware-related behaviors (MBC v1.0), updates to ATT&CK, including addition of new platforms (e.g., ICS and cloud), made it difficult to keep MBC current. Also, most malware vendors using ATT&CK in their reporting consider the complete set of ATT&CK techniques, even though some techniques are never used. Consequently, MBC v2.x defines new, malware-related behaviors, as well as malware-focused enhancements to ATT&CK techniques, but it no longer defines a restricted set of ATT&CK techniques that pertains to malware.

References

[1] MITRE ATT&CK(R): Design and Philosophy, https://www.mitre.org/sites/default/files/publications/pr-18-0944-11-mitre-attack-design-and-philosophy.pdf

[2] https://www.welivesecurity.com/2019/07/08/south-korean-users-backdoor-torrents/