Community detection algorithms: a comparative analysis

Community detection algorithms: a comparative analysis

16 Sep 2010 | Andrea Lancichinetti1,2 and Santo Fortunato1
A comparative analysis of community detection algorithms is presented, evaluating their performance on benchmark graphs and random graphs. The study highlights the importance of testing algorithms on realistic networks with heterogeneous degree and community size distributions, as opposed to simplified models. The LFR benchmark, which generalizes the GN benchmark by introducing power law distributions for degree and community size, is used to assess the algorithms. The GN benchmark, which assumes equal community sizes and degrees, is also tested. The analysis reveals that three recent algorithms—Rosvall and Bergstrom's Infomod, Blondel et al.'s method, and Ronhovde and Nussinov's RN—perform exceptionally well, with low computational complexity, making them suitable for large networks. The study evaluates various community detection methods, including Girvan and Newman's hierarchical divisive algorithm, fast greedy modularity optimization, simulated annealing for modularity optimization, and others. The performance of these algorithms is assessed using normalized mutual information, a measure that quantifies the similarity between partitions. The results show that modularity-based methods generally perform well on the GN benchmark but struggle with the LFR benchmark, which has more realistic properties. The Infomap algorithm, based on information theory, performs well on both benchmarks, especially on the LFR benchmark, which is more challenging. The RN method also performs well, while the EM method is effective but requires the number of communities as input. The study also considers directed and weighted graphs, as well as graphs with overlapping communities. The LFR benchmark is extended to these cases, allowing for a more comprehensive evaluation. The Cfinder algorithm is the only one capable of detecting overlapping communities. The analysis shows that the Infomap algorithm performs well on large graphs, while the RN and Blondel et al. methods are also effective. However, none of the algorithms can account for overlapping communities, requiring further refinement. The study concludes that the Infomap algorithm is the best performing on the benchmarks tested, with excellent performance on both the GN and LFR benchmarks. It is also effective on directed and weighted graphs and has low computational complexity, making it suitable for large networks. The RN and Blondel et al. methods are also promising, but they require further refinement to handle overlapping communities. The study emphasizes the importance of using realistic benchmarks to evaluate community detection algorithms, as they provide a more accurate representation of real-world networks.A comparative analysis of community detection algorithms is presented, evaluating their performance on benchmark graphs and random graphs. The study highlights the importance of testing algorithms on realistic networks with heterogeneous degree and community size distributions, as opposed to simplified models. The LFR benchmark, which generalizes the GN benchmark by introducing power law distributions for degree and community size, is used to assess the algorithms. The GN benchmark, which assumes equal community sizes and degrees, is also tested. The analysis reveals that three recent algorithms—Rosvall and Bergstrom's Infomod, Blondel et al.'s method, and Ronhovde and Nussinov's RN—perform exceptionally well, with low computational complexity, making them suitable for large networks. The study evaluates various community detection methods, including Girvan and Newman's hierarchical divisive algorithm, fast greedy modularity optimization, simulated annealing for modularity optimization, and others. The performance of these algorithms is assessed using normalized mutual information, a measure that quantifies the similarity between partitions. The results show that modularity-based methods generally perform well on the GN benchmark but struggle with the LFR benchmark, which has more realistic properties. The Infomap algorithm, based on information theory, performs well on both benchmarks, especially on the LFR benchmark, which is more challenging. The RN method also performs well, while the EM method is effective but requires the number of communities as input. The study also considers directed and weighted graphs, as well as graphs with overlapping communities. The LFR benchmark is extended to these cases, allowing for a more comprehensive evaluation. The Cfinder algorithm is the only one capable of detecting overlapping communities. The analysis shows that the Infomap algorithm performs well on large graphs, while the RN and Blondel et al. methods are also effective. However, none of the algorithms can account for overlapping communities, requiring further refinement. The study concludes that the Infomap algorithm is the best performing on the benchmarks tested, with excellent performance on both the GN and LFR benchmarks. It is also effective on directed and weighted graphs and has low computational complexity, making it suitable for large networks. The RN and Blondel et al. methods are also promising, but they require further refinement to handle overlapping communities. The study emphasizes the importance of using realistic benchmarks to evaluate community detection algorithms, as they provide a more accurate representation of real-world networks.
Reach us at info@study.space