You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When there are super nodes in the graph with hundreds of thousands or millions of out-edges, the PageRank join operation performs very poorly (stragglers) with skewed partitions.
The text was updated successfully, but these errors were encountered:
Super nodes in this case might actually not be very big -- like 100k out edges even. Consider a broadcast join (although this can be huge) or a "duplicate" join, where the vertices dataset is artificially duplicated to add randomness and allow for uniformly distributed partitions.
When there are super nodes in the graph with hundreds of thousands or millions of out-edges, the PageRank join operation performs very poorly (stragglers) with skewed partitions.
The text was updated successfully, but these errors were encountered: