See also Cluster validation for general clustering validation.
Given ground truth, Mutual information is often used. An important, overlooked question is how good the ground truth is. Some work about tagged networks^1[^2][^3].
Statistical significance of communities: http://www.pnas.org/content/108/18/7321.short ; http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0018961
Overlapping community structure makes the problem harder. For instance, Mutual information should be extended to covers. There are two proposals[^4][^5]. There are other suggestions: using conductance of boundary nodes[^6].
An approach to use data[^7].
References #
- http://www.cse.chalmers.se/~moradi/Lic/SEA.pdf (http://sea2012.labri.fr/index.php?n=Main.Papers)
- http://arxiv.org/abs/1006.0375 - Information theoretic model validation for clustering
How can we define statistical significance? What will be the proper null models for community structure?
- http://pre.aps.org/abstract/PRE/v81/i4/e046110 - Statistical significance of communities in networks
- http://arxiv.org/abs/1012.2363 - Finding statistically significant communities in networks
- http://prl.aps.org/abstract/PRL/v105/i22/e220601 - Significance Analysis and Statistical Mechanics: An Application to Clustering
- http://arxiv.org/abs/1110.0305 - Significant communities in large sparse networks
There are approaches to use online social network services such as Facebook (http://fellows-exp.com/).
- Comparing network covers using mutual information
- Defining and Evaluating Network Communities based on Ground-truth
-
Community detection: effective evaluation on large social networks
-
A revisit to evaluating accuracy of community detection using the normalized mutual information
^1: Palla, Gergely; Farkas, Illés J; Pollner, Péter; Derényi, Imre; Vicsek, Tamás (2008). "Fundamental statistical features and self-similar properties of tagged networks". New Journal of Physics 10 (12): 123026. doi:10.1088/1367-2630/10/12/123026. ISSN 1367-2630
[^2]: Pollner, Péter; Palla, Gergely; Vicsek, Tamás (2010). "Clustering of tag-induced subgraphs in complex networks". Physica A: Statistical Mechanics and its Applications 389 (24): 5887–5894. doi:10.1016/j.physa.2010.09.012. ISSN 03784371.
[^3]: Tibély, Gergely; Pollner, Péter; Vicsek, Tamás; Palla, Gergely (2012). "Ontologies and tag-statistics". New Journal of Physics 14 (5): 053009. doi:10.1088/1367-2630/14/5/053009. ISSN 1367-2630.
[^4]: Lancichinetti, Andrea; Fortunato, Santo; Kertész, János (2009). "Detecting the overlapping and hierarchical community structure in complex networks". New Journal of Physics 11 (3): 033015. doi:10.1088/1367-2630/11/3/033015. ISSN 1367-2630.
[^5]: "Template:Citation error". http://arxiv.org/abs/1202.0425.
[^6]: "Evaluating Overlapping Communities with the Conductance of their Boundary Nodes". http://arxiv.org/abs/1206.3992.
[^7]: "Benchmarking community detection methods on social media data". http://arxiv.org/abs/1302.0739.
Incoming Links #
Related Articles (Article 0) #
Suggested Pages #
- 0.369 Clustering comparison
- 0.132 Network similarity
- 0.076 Dynamic community structure
- 0.062 Community
- 0.048 Information diffusion and communities
- 0.028 Community evolution
- 0.025 Pajek
- 0.025 Clustering
- 0.021 Stochastic block model
- 0.019 Temporal community structure
- More suggestions...