Community validation
See also Cluster validation for general clustering validation.

When ground truth is available, mutual information is often used to compare the detected communities against it. An important but often overlooked question is how good the ground truth itself is. There is some work on this using tagged networks[^1][^3].
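
As a rough illustration of the comparison against ground truth, here is a minimal sketch of normalized mutual information (NMI) between two hard partitions. The function name and the per-node label-list input format are assumptions made for this example, and the normalization 2I(A;B)/(H(A)+H(B)) is only one of several in use.

```python
from collections import Counter
from math import log


def normalized_mutual_information(labels_a, labels_b):
    """NMI between two hard partitions given as per-node label lists.

    labels_a[i] and labels_b[i] are the community labels of node i
    (for example, detected communities and ground truth).
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("label lists must cover the same nodes")
    n = len(labels_a)
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    count_ab = Counter(zip(labels_a, labels_b))

    # Mutual information I(A;B) from joint and marginal label frequencies.
    mi = sum(
        (n_ab / n) * log(n_ab * n / (count_a[a] * count_b[b]))
        for (a, b), n_ab in count_ab.items()
    )

    # Shannon entropy of each partition.
    h_a = -sum((c / n) * log(c / n) for c in count_a.values())
    h_b = -sum((c / n) * log(c / n) for c in count_b.values())

    if h_a + h_b == 0:
        # Both partitions put every node in one community, so they agree.
        return 1.0
    return 2 * mi / (h_a + h_b)
```

The score depends only on how nodes are grouped, not on the label names, so relabeled but identical partitions score 1. It says nothing about whether the ground truth itself is meaningful.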

Statistical significance of communities: ;

Overlapping community structure makes the problem harder. For instance, mutual information has to be extended from partitions to covers, where a node may belong to several communities. There are two proposals for this[^4][^5]. Another suggestion is to use the conductance of boundary nodes[^6].
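
For orientation, the sketch below computes the standard conductance of a single node set, cut(S) / min(vol(S), vol(V \ S)). This is the usual set-level definition, not necessarily the boundary-node variant of the cited work. The adjacency-dictionary input format and the function name are assumptions made for this example.

```python
def conductance(adjacency, community):
    """Conductance of a node set in an undirected, unweighted graph.

    adjacency: dict mapping each node to the set of its neighbors.
    community: iterable of nodes forming the set S.
    """
    inside = set(community)
    cut = 0      # edges with exactly one endpoint in S
    vol_in = 0   # sum of degrees of nodes in S
    vol_out = 0  # sum of degrees of nodes outside S
    for node, neighbors in adjacency.items():
        if node in inside:
            vol_in += len(neighbors)
            cut += sum(1 for v in neighbors if v not in inside)
        else:
            vol_out += len(neighbors)

    denom = min(vol_in, vol_out)
    if denom == 0:
        raise ValueError("conductance is undefined when one side has no edges")
    return cut / denom
```

Because each community is scored on its own, the measure applies unchanged when communities overlap: a shared node simply counts as inside for every community that contains it.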

Another approach validates communities using data[^7].

How can we define statistical significance? What will be the proper null models for community structure?
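
One possible answer, offered only as a sketch, is to take degree-preserving rewiring (a configuration-model-style null) and ask how often a given partition scores at least as high a modularity on rewired graphs as on the observed one. The choice of modularity as the statistic, the fixed partition, the number of swaps, and all function names are assumptions of this example, not an established definition of significance.

```python
import random


def modularity(edges, membership):
    """Newman-Girvan modularity of a hard partition (undirected simple graph)."""
    m = len(edges)
    degree = {}
    for u, v in edges:
        degree[u] = degree.get(u, 0) + 1
        degree[v] = degree.get(v, 0) + 1
    intra = sum(1 for u, v in edges if membership[u] == membership[v])
    deg_sum = {}
    for node, d in degree.items():
        c = membership[node]
        deg_sum[c] = deg_sum.get(c, 0) + d
    return intra / m - sum((d / (2 * m)) ** 2 for d in deg_sum.values())


def rewire(edges, n_attempts, rng):
    """Degree-preserving randomization by attempted double-edge swaps."""
    edges = [tuple(e) for e in edges]
    present = {frozenset(e) for e in edges}
    for _ in range(n_attempts):
        i, j = rng.sample(range(len(edges)), 2)
        (a, b), (c, d) = edges[i], edges[j]
        if rng.random() < 0.5:
            c, d = d, c  # pick one of the two possible pairings at random
        if a == d or c == b:
            continue  # would create a self-loop
        new1, new2 = frozenset((a, d)), frozenset((c, b))
        if new1 in present or new2 in present or new1 == new2:
            continue  # would create a multi-edge
        present -= {frozenset((a, b)), frozenset((c, d))}
        present |= {new1, new2}
        edges[i], edges[j] = (a, d), (c, b)
    return edges


def empirical_p_value(edges, membership, n_null=200, seed=0):
    """Observed modularity and the fraction of null graphs scoring at least as high."""
    rng = random.Random(seed)
    observed = modularity(edges, membership)
    hits = 0
    for _ in range(n_null):
        null_edges = rewire(edges, 10 * len(edges), rng)
        if modularity(null_edges, membership) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return observed, (hits + 1) / (n_null + 1)
```

Keeping the partition fixed tests only whether it has more internal edges than the degree sequence alone would explain. A stricter test would rerun the detection algorithm on every null graph, and whether this null model is the proper one is exactly the open question.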

There are approaches to use online social network services such as Facebook (

