Sample data #1

mccalluc · 2016-10-23T15:07:48Z

but can you point me at a dataset that would be good to used for a first demo? Or suggest how many rows and columns should be accommodated?
For the volcano plot and the pca, is it worthwhile to do it in JS? It would make it easier to plug in new sample data, and maybe it could be useful for smaller datasets in the long run, too.

ngehlenborg · 2016-11-07T22:43:09Z

Still looking for a good sample data set.

Typical data set sizes we need to be prepared for: ~25,000 genes (= rows) and say 100 conditions/samples (= columns).

In heatmaps, the genes will be filtered to something more managable (dozens to hundreds) and obivously we can't render 25000 rows even if we make them 1 pixel high. There will either be overplotting (initial solution) or we find smart ways to aggregate (later solution?).

The scatterplots (PCA and volcano) should support the same number of items as the worst case (~25,000), which is best handled using opacity (which will make rendering much slower). Note that PCA can be done on both the genes and the samples (which is much easier to handle in the visualization). I suggest you build on the canvas-based scatter plot that @sgratzl started to work on.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sample data #1

Sample data #1

mccalluc commented Oct 23, 2016

ngehlenborg commented Nov 7, 2016

Sample data #1

Sample data #1

Comments

mccalluc commented Oct 23, 2016

ngehlenborg commented Nov 7, 2016