Home

stared edited this page Nov 2, 2014 · 16 revisions
Clone this wiki locally

Intro

Below there are some graph maps made from co-occurrence of tags at StackExchange sites.

Started as an entry at StackExchange visualization competition at Kaggle.

Code is here. The recipe to make similar graphs is in the README file. If you cannot redo anything, just write an e-mail to me.

See also: TagOverflow - dynamic, interactive graphs of Stack Exchange.

Tag Maps

256 Tags of StackOverflow

An example graph for StackOverflow (I really recommend to see the actual hi res jpg file):

Map of 256 Tags of StackOverflow

Why only 256 Tags?

It isn't little.

Even tough they make only 0.6% of the total tag count, 90% of questions have at least one of them.

Moreover, for more tags the graph looks cluttered, and the resulting Top Tens may not show typical features of the Stack Overflow.

But if you still don't like the idea of using 256 tags you are not a programmer, are you?

What is actually on the map?

It is a graph, where:

  • nodes are tags (their size reflects their popularity),
  • edges are frequent co-occurrences of them. More precisely, an edge between two tags (tag1 and tag2) is drawn if their observed to expected ratio is greater or equal to 4.

Colors mark different communities in tags, as detected by a community detection algorithm found in Gephi, in which it was created.

StackExchange network

Stack Exchange sites (except the 3 biggest)

See also the related thread at meta.stackoverflow.com - especially take a look at an interpretation of clusters.

Other StackExchange sites

Or for Physics.SE:

Tag Map of Physics.SE

Or Math.SE (and related thread is here):

Tag Map of Math.SE

Or for Stats.SE a.k.a. CrossValidated:

Tag Map of Stats.SE

OR for Photo.SE:

Photo.SE Tag Map