GitHub - inchiosa/terraform-azurerm-avm-ptn-ai-reference-implementation: AI Pattern Module

AI Reference Implementation Baseline Pattern Module

Overview

The AI Reference Implementation Baseline Pattern Module provides a secure, observable by default, scalable, and highly configurable foundation for deploying AI workloads on Azure. This pattern module integrates multiple Azure resources, following best practices and architectural standards, to deliver a comprehensive AI Reference Implementation. The goal is to accelerate the deployment of AI solutions by providing a ready-to-use infrastructure that adheres to Azure's Well-Architected Framework.

This pattern module is opinionated, meaning it comes with pre-configured defaults for security, observability, and essential AI resources. However, it remains flexible, allowing users to customize the environment to meet specific project needs by enabling or disabling various components.

Architecture

The pattern module is designed to be modular and composable. The default deployment includes a minimum set of resources required to establish a secure and observable AI environment, but additional resources can be added based on project requirements. Below is a high-level architecture diagram:

Key Features and Goals

This AI Reference Implementation pattern module is designed to accelerate the deployment of AI solutions on Azure, while ensuring security, compliance, flexibility, and observability. The primary objectives and functionalities of this module include:

Security by Default: Ensuring that all deployed resources adhere to Azure's security best practices, including network isolation, encryption, and identity management. This guarantees that AI environments are secure from the outset. For more details, see Security practices.
Observability: Providing out-of-the-box integration of logging, monitoring, and alerting, making the AI environment fully observable from day one. This ensures that all deployments are transparent and issues can be quickly identified and resolved. More details are available in Observability practices.
Modular and Flexible Design: Composed of several modules that can be individually enabled or disabled, this pattern offers a flexible architecture that can be tailored to specific project needs. This modular approach allows teams to start with a minimal setup and expand as required, ensuring the AI environment is scalable and adaptable.
Compliance with Azure Best Practices: Adhering to the recommendations of the Azure Well-Architected Framework, this module ensures that all resources and configurations are optimized for performance, reliability, security, and cost management.
Rapid Deployment and Consistency: By providing a standardized reference implementation, this module accelerates the deployment process and ensures consistency across different projects and teams. This reduces variability and guarantees that best practices are consistently applied in all AI environments.

Usage

This pattern module is designed for both data scientists and engineers who need to quickly stand up a secure, scalable AI environment on Azure. It is also suitable for organizations that require a compliant and secure environment for their AI workloads with the flexibility to customize the setup based on project-specific needs.

Example Usage

To deploy the AI Reference Implementation Baseline with minimal configuration:

module "ai_reference_implementation" {
  source  = "Azure/avm-ptn-ai-reference-implementation/azurerm"
  version = "x.x.x"

  resource_group_name = "<your_resource_group>"
  location            = "<your_location>"
}

This example sets up the AI Reference Implementation with all default resources.

Use Cases

This module is ideal for:

Data Scientists: Who need a secure, scalable, and integrated environment to experiment, develop, and train machine learning models.
ML Engineers: Looking to deploy machine learning models into production with robust monitoring, scaling, and management capabilities.
Organizations: That require a compliant and secure environment for their machine learning workloads, with the flexibility to integrate with existing Azure services.

Extending the Pattern Module

The AI Reference Implementation Baseline Pattern Module is designed to be extended. You can add additional resources or services by integrating other Azure Verified Modules (AVM). For example, you can include additional machine learning environments, data lakes, or advanced AI services by simply integrating their respective modules and configuring them within the pattern module.

Additional Resources

Azure Well-Architected Framework: Azure WAF
Azure AI Documentation: Azure AI Services
Terraform Registry: Terraform AzureRM Provider

Contributing

This module is part of the Azure Verified Modules (AVM) ecosystem, and contributions are welcome. Please follow the standard contribution guidelines if you wish to submit enhancements or report issues.

Requirements

The following requirements are needed by this module:

terraform (~> 1.7)
azurerm (3.112)
modtm (~> 0.3)
random (~> 3.5)

Resources

The following resources are used by this module:

azurerm_management_lock.this (resource)
azurerm_public_ip.bastion_ip (resource)
azurerm_role_assignment.this (resource)
modtm_telemetry.telemetry (resource)
random_uuid.telemetry (resource)
azurerm_client_config.current (data source)
azurerm_client_config.telemetry (data source)
azurerm_resource_group.base (data source)
modtm_module_source.telemetry (data source)

Required Inputs

The following input variables are required:

location

Description: The location/region where the resources will be deployed.

Type: string

name

Description: The name of the this resource.

Type: string

resource_group_name

Description: The resource group where the resources will be deployed.

Type: string

Optional Inputs

The following input variables are optional (have default values):

azure_bastion_subnet_address_spaces

Description: The address space that is used for the Azure Bastion subnet

Type: list(string)

Default:

[
  "10.1.3.0/24"
]

bastion_name

Description: The name of the Azure Bastion resource. if not provided, a name will be generated.

Type: string

Default: ""

bastion_network_security_group_name

Description: The name of the Network Security Group for the Azure Bastion subnet. If not provided, a name will be generated.

Type: string

Default: ""

container_registry_name

Description: The name of the Azure Container Registry. If not provided, a name will be generated.

Type: string

Default: ""

enable_telemetry

Description: This variable controls whether or not telemetry is enabled for the module.
For more information see https://aka.ms/avm/telemetryinfo.
If it is set to false, then no telemetry will be collected.

Type: bool

Default: true

jumpbox

Description: This creates a jumpbox if configured with jumpbox.create = true and defaults to a Windows machine.

Type:

object({
    create  = bool
    name    = optional(string, "jumpbox")
    os_type = optional(string, "Windows")
    size    = optional(string, "Standard_D2s_v3")
    zone    = optional(string, "1")
    image_ref = optional(object({
      publisher = string
      offer     = string
      sku       = string
      version   = string
      }), {
      publisher = "microsoftwindowsdesktop"
      offer     = "windows-11"
      sku       = "win11-22h2-ent"
      version   = "latest"
    })
  })

Default:

{
  "create": false
}

key_vault_name

Description: The name of the Azure Key Vault. If not provided, a name will be generated.

Type: string

Default: ""

lock

Description: Controls the Resource Lock configuration for this resource. The following properties can be specified:

kind - (Required) The type of lock. Possible values are \"CanNotDelete\" and \"ReadOnly\".
name - (Optional) The name of the lock. If not specified, a name will be generated based on the kind value. Changing this forces the creation of a new resource.

Type:

object({
    kind = string
    name = optional(string, null)
  })

Default: null

log_analytics_workspace_name

Description: The name of the Log Analytics Workspace. If not provided, a name will be generated.

Type: string

Default: ""

machine_learning_workspace_name

Description: The name of the Azure Machine Learning Workspace. If not provided, a name will be generated.

Type: string

Default: ""

pe_network_security_group_name

Description: The name of the Network Security Group for the private endpoints subnet. If not provided, a name will be generated.

Type: string

Default: ""

private_endpoints_subnet_address_spaces

Description: The address space that is used for the private endpoints subnet

Type: list(string)

Default:

[
  "10.1.2.0/24"
]

role_assignments

Description: A map of role assignments to create on the . The map key is deliberately arbitrary to avoid issues where map keys maybe unknown at plan time.

role_definition_id_or_name - The ID or name of the role definition to assign to the principal.
principal_id - The ID of the principal to assign the role to.
description - (Optional) The description of the role assignment.
skip_service_principal_aad_check - (Optional) If set to true, skips the Azure Active Directory check for the service principal in the tenant. Defaults to false.
condition - (Optional) The condition which will be used to scope the role assignment.
condition_version - (Optional) The version of the condition syntax. Leave as null if you are not using a condition, if you are then valid values are '2.0'.
delegated_managed_identity_resource_id - (Optional) The delegated Azure Resource Id which contains a Managed Identity. Changing this forces a new resource to be created. This field is only used in cross-tenant scenario.
principal_type - (Optional) The type of the principal_id. Possible values are User, Group and ServicePrincipal. It is necessary to explicitly set this attribute when creating role assignments if the principal creating the assignment is constrained by ABAC rules that filters on the PrincipalType attribute.

Note: only set skip_service_principal_aad_check to true if you are assigning a role to a service principal.

Type:

map(object({
    role_definition_id_or_name             = string
    principal_id                           = string
    description                            = optional(string, null)
    skip_service_principal_aad_check       = optional(bool, false)
    condition                              = optional(string, null)
    condition_version                      = optional(string, null)
    delegated_managed_identity_resource_id = optional(string, null)
    principal_type                         = optional(string, null)
  }))

Default: {}

storage_account_name

Description: The name of the Azure Storage Account. If not provided, a name will be generated.

Type: string

Default: ""

virtual_machines_subnet_address_spaces

Description: The address space that is used for the virtual machines subnet

Type: list(string)

Default:

[
  "10.1.1.0/24"
]

virtual_network_name

Description: The name of the Virtual Network. If not provided, a name will be generated.

Type: string

Default: ""

vm_network_security_group_name

Description: The name of the Network Security Group for the virtual machines subnet. If not provided, a name will be generated.

Type: string

Default: ""

vnet_address_spaces

Description: The address space that is used the virtual network

Type: list(string)

Default:

[
  "10.1.0.0/16"
]

Outputs

The following outputs are exported:

resource

Description: This is the full output for the resource.

resource_id

Description: The Azure resource id of the resource.

Modules

The following Modules are called:

aml

Source: Azure/avm-res-machinelearningservices-workspace/azurerm

Version: 0.1.1

avm_res_containerregistry_registry

Source: Azure/avm-res-containerregistry-registry/azurerm

Version: ~> 0.2

azure_bastion

Source: Azure/avm-res-network-bastionhost/azurerm

Version: 0.3.0

ba_network_security_group

Source: Azure/avm-res-network-networksecuritygroup/azurerm

Version: ~> 0.2.0

jumpbox

Source: Azure/avm-res-compute-virtualmachine/azurerm

Version: 0.15.1

key_vault

Source: Azure/avm-res-keyvault-vault/azurerm

Version: ~> 0.5

log_analytics_workspace

Source: Azure/avm-res-operationalinsights-workspace/azurerm

Version: ~> 0.1

pe_network_security_group

Source: Azure/avm-res-network-networksecuritygroup/azurerm

Version: ~> 0.2.0

private_dns_container_registry

Source: Azure/avm-res-network-privatednszone/azurerm

Version: ~> 0.1.1

private_dns_keyvault

Source: Azure/avm-res-network-privatednszone/azurerm

Version: ~> 0.1.1

private_dns_storage

Source: Azure/avm-res-network-privatednszone/azurerm

Version: ~> 0.1.1

private_dns_workspace

Source: Azure/avm-res-network-privatednszone/azurerm

Version: ~> 0.1.1

storage_account

Source: Azure/avm-res-storage-storageaccount/azurerm

Version: 0.2.1

virtual_network

Source: Azure/avm-res-network-virtualnetwork/azurerm

Version: ~> 0.2.0

vm_network_security_group

Source: Azure/avm-res-network-networksecuritygroup/azurerm

Version: ~> 0.2.0

Data Collection

The software may collect information about you and your use of the software and send it to Microsoft. Microsoft may use this information to provide services and improve our products and services. You may turn off the telemetry as described in the repository. There are also some features in the software that may enable you and Microsoft to collect data from users of your applications. If you use these features, you must comply with applicable law, including providing appropriate notices to users of your applications together with a copy of Microsoft’s privacy statement. Our privacy statement is located at https://go.microsoft.com/fwlink/?LinkID=824704. You can learn more about data collection and use in the help documentation and our privacy statement. Your use of the software operates as your consent to these practices.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.devcontainer		.devcontainer
.github		.github
.vscode		.vscode
examples		examples
media		media
modules		modules
tests		tests
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.terraform-docs.yml		.terraform-docs.yml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
_footer.md		_footer.md
_header.md		_header.md
avm		avm
avm.aml.tf		avm.aml.tf
avm.bastion.tf		avm.bastion.tf
avm.bat		avm.bat
avm.container_registry.tf		avm.container_registry.tf
avm.key_vault.tf		avm.key_vault.tf
avm.log_analytics_workspace.tf		avm.log_analytics_workspace.tf
avm.network_security_groups.tf		avm.network_security_groups.tf
avm.storage.tf		avm.storage.tf
avm.virtual_network.tf		avm.virtual_network.tf
data.tf		data.tf
locals.tf		locals.tf
main.telemetry.tf		main.telemetry.tf
main.tf		main.tf
observability_practices.md		observability_practices.md
outputs.tf		outputs.tf
security_practices.md		security_practices.md
terraform.tf		terraform.tf
variables.tf		variables.tf

License

inchiosa/terraform-azurerm-avm-ptn-ai-reference-implementation

Folders and files

Latest commit

History

Repository files navigation

AI Reference Implementation Baseline Pattern Module

Overview

Architecture

Key Features and Goals

Usage

Example Usage

Use Cases

Extending the Pattern Module

Additional Resources

Contributing

Requirements

Resources

Required Inputs

Optional Inputs

Outputs

Modules

Data Collection

About

Resources

License

Stars

Watchers

Forks

Languages