-
Notifications
You must be signed in to change notification settings - Fork 0
Self hosting snowplow js
HOME > SNOWPLOW SETUP GUIDE > Step 2: setup a Tracker > Javascript tracker > Self-hosting Snowplow.js
We recommend self-hosting the Snowplow tracking JavaScript, snowplow.js
as it does have some definite advantages over using a third-party-hosted JavaScript:
- Hosting your own JavaScript allows you to use your own JavaScript minification and asset pipelining approach (e.g. bundling all JavaScripts into one minified JavaScript)
- As [Douglas Crockford] crockford put it about third-party JavaScripts: "it is extremely unwise to load code from servers you do not control."
The alternative to self-hosting snowplow.js
is to use the version hosted by Snowplow Analytics, which is okay too. (Details of all the assets including sp.js
that we house on behalf of the community can be found here). But if you want to self-host snowplow.js
, please read on...
Note: We recommend running all Snowplow AWS operations through an IAM user with the bare minimum permissions required to run Snowplow. Please see our IAM user setup page for more information on doing this.
## Pre-requisitesFor the purposes of this guide, we are going to assume that you want to serve the standard snowplow.js
from CloudFront. (We discuss other approaches in the Advanced options section below.). To accomplish this, you will need the following:
- An account with [Amazon Web Services] aws
- S3 and CloudFront enabled within your AWS account
- Some technical chops (not too many)
Once you have those ready, please read on...
## Self-hosting instructionsFirst create a new bucket within your Amazon S3 account to store the pixel. Call this bucket something like snowplow-static-js
:
A couple of notes on this:
- Don't enable logging on this bucket
- You won't be able to call this bucket exactly
snowplow-static-js
. This is because Amazon S3 bucket names have to be globally unique, andsnowplow-static-js
is unfortunately taken by us!
You want to upload the minified version of the Snowplow JavaScript, which is called sp.js
. You can obtain the latest version of the Javascript from the hosted assets section or review the Snowplow Github repo.
Now you're ready to upload the JavaScript file into S3. Within the S3 pane, hit Upload and browse to your file:
Then hit Open and you will see the following screen:
Hit Set Details >, then hit Set Permissions > to set permissions on this file allowing Everyone to Open/Download it:
Now hit Start Upload to upload the JavaScript file into your bucket. When done, you should have something like this:
Now that sp.js
has been uploaded, we recommend that you set the Cache-Control max-age
property on the file. This property determines both how long Cloudfront caches sp.js
in its edge locations, and crucially, how long individual browsers cache sp.js
before repinging Cloudfront for a fresh copy. By setting a long expiration date, you can reduce the number of browser requests for sp.js
, which can significantly decrease your Cloudfront costs. (Especially if you are a large website or network of sites.)
The only disadvantage of a long expiration is that you need to find a way to force end users to fetch a fresh copy of sp.js
when you upgrade to a newer version. This is easily managed by saving your new version to a new folder in your S3 bucket, and updating your Snowplow tags to point to the new version.
To set a long expiration date on sp.js
, right click on it in the S3 console, and select Properties:
Click on the Metadata dropdown and then click on the Add more metadata button. New drop downs appear to enable you to enter a new key/value pair:
In the Key dropdown, select Cache-Control. In the value field, enter
max-age=$value_in_seconds
For example, if you want to set your items to expire in 10 years, enter, that is 10x365x24x60x60 = 315,360,000
max-age=315360000
![entered-values] key-value-pair-entered
Now click save button (bottom right of the screen). You area ready to create your CloudFront distribution!
Now you are ready to create the CloudFront distribution which will serve your JavaScript. In the CloudFront tab, hit the Create Distribution button:
Select the Download option and hit Continue:
For the Origin Domain Name, choose your S3 bucket (snowplow-static-js
) from the dropdown, and accept the suggested Origin ID. Now hit Continue:
The defaults are fine on this screen, hit Continue again:
On this screen leave Logging as Off and hit Continue to review a summary of your new distribution:
Hit Create Distribution and then you should see something like this:
Write down your CloudFront distribution's Domain Name - e.g. http://d1fc8wv8zag5ca.cloudfront.net
. You will need this later when you integrate Snowplow into your website.
Before testing, take a 10 minute coffee or brandy break (that's how long CloudFront takes to synchronize).
Done? Now just check that you can access your JavaScript file over both HTTP and HTTPS using a browser, wget
or curl
:
http://{{SUBDOMAIN}}.cloudfront.net/sp.js
https://{{SUBDOMAIN}}.cloudfront.net/sp.js
If you have any problems, then double-check your CloudFront distribution's URL, and check the permissions on your sp.js
file: it must be Openable by Everyone.
That's it - you now have a CloudFront distribution which will serve your Snowplow JavaScript to anybody anywhere in the world, fast. Now all that remains is to update your Snowplow header tag to fetch your own version of sp.js
, rather than the version hosted by the Snowplow team.
The standard Snowplow tracking tag looks something like:
<!-- Snowplow starts plowing -->
<script type="text/javascript">
var _snaq = _snaq || [];
_snaq.push(['setCollectorCf', '{{YOUR COLLECTOR\'S CF SUBDOMAIN}}']);
_snaq.push(['trackPageView']);
_snaq.push(['enableLinkTracking']);
(function() {
var sp = document.createElement('script'); sp.type = 'text/javascript'; sp.async = true; sp.defer = true;
sp.src = ('https:' == document.location.protocol ? 'https' : 'http') + '://d1fc8wv8zag5ca.cloudfront.net/0.12.0/sp.js';
var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(sp, s);
})();
</script>
<!-- Snowplow stops plowing -->
The reference to '://d1fc8wv8zag5ca.cloudfront.net/0.12.0/sp.js'
loads sp.js
, the Snowplow JavaScript tracker. The version loaded is the version hosted by the Snowplow team from our own Cloudfront subdomain (and provided free to the community).
To use the version hosted yourself, update the sp.src
line to point to your own self-hosted sp.js
:
sp.src = ('https:' == document.location.protocol ? 'https' : 'http') + '://{{YOUR SP.JS CF SUBDOMAIN}}.cloudfront.net/sp.js';
The setCollectorCf
method is used to determine the Cloudfront subdomain where your tracking pixel is served from. This should not be confused with the Cloudfront subdomain used to serve sp.js
.
This page of documentation relates to self-hosting sp.js
. You should be using a different Cloudfront distribution for you setCollectorCf
method in the Snowplow tag. (Or if you're not using the Cloudfront collector, setCollectorUrl
.) If you are using the Cloudfront collector, see Cloudfront collector setup for details on setting up a Cloudfront distribution for your tracking pixel, and setting the collector endpoint of your Javascript tracker for details on configuring your Snowplow tags.
The guide above assumed that you were happy to take the already-minified sp.js
and host it on CloudFront. In fact, you may prefer to:
- Update the contents of
snowplow.js
and then minify it yourself, and/or: - Add the un-minified
snowplow.js
into your own asset pipelining process and serve it using something other than CloudFront
The first option above is explored in more detail in the guide to Modifying snowplow.js.
The second option is out of the scope of the Snowplow documentation but you should get some ideas as to how the minification should be handled from that same guide, Modifying snowplow.js.
## Next stepsYou may want to setup [campaign tracking](tracking your marketing campaigns), so that you can tie user behaviour on your website to any paid campaigns that drove those users to your website.
Finished setting up the [Javascript tracker] (javascript-tracker-setup)? Then you are ready to [setup EmrEtlRunner] (Setting-up-Snowplow#wiki-step3).
Return to the [setup guide] (Setting-up-Snowplow).
Home | About | Project | Setup Guide | Technical Docs | Copyright © 2012-2013 Snowplow Analytics Ltd
HOME > SNOWPLOW SETUP GUIDE > Step 2: Setup a Tracker > Javascript tracker setup
- [Step 1: Setup a Collector] (setting-up-a-collector)
- [Step 2: Setup a Tracker] (setting-up-a-tracker)
- Javascript tracker setup
- Integrating Snowplow tags directly on your website
- Integrating Snowplow tags via Google Tag manager
- [Integrating Snowplow tags via QuBit OpenTag](Integrating Javascript tags with QuBit OpenTag)
- [Testing the Javascript tracker is firing](Testing the Javascript tracker is firing)
- Hosting Snowplow.js yourself
- Setting up campaign tracking
- [Step 3: Setup EmrEtlRunner] (setting-up-EmrEtlRunner)
- [Step 4: Setup the StorageLoader] (setting-up-storageloader)
- [Step 5: Analyse your data!] (Getting started analysing Snowplow data)
Useful resources