[slides and audio] Fast Random Walk with Restart and Its Applications

The paper "Fast Random Walk with Restart and Its Applications" by Hanghang Tong addresses the challenge of efficiently computing the relevance score between nodes in a large, disk-resident graph using Random Walk with Restart (RWR). Traditional RWR methods either require quadratic space and cubic pre-computation time or are slow on query responses. To overcome these limitations, the authors propose a novel approach called B.LIN, which leverages the linear correlations and block-wise community-like structure of real graphs. The key contributions include: 1. **B.LIN Algorithm**: This algorithm uses low-rank matrix approximation and graph partitioning to pre-compute and store only a few small matrices, significantly reducing pre-computation and storage costs. 2. **Theoretical Justification**: The paper provides theoretical analysis and error bounds for the simplified version of B.LIN, NB.LIN, demonstrating its accuracy. 3. **Experimental Results**: Extensive experiments on real datasets (CoR, CoMMG, AP, AC) show that B.LIN and NB.LIN achieve significant speedups (up to 150x) while preserving 90%+ accuracy in various applications such as automatic image captioning, neighborhood formulation, and cross-modal correlation discovery. 4. **Special Case for Bipartite Graphs**: A fast algorithm (BB_LIN) is proposed for a specific class of bipartite graphs, achieving up to 1,800x speedup with minimal pre-computation and storage costs. The paper also discusses related work in random walk methods, graph partitioning, and low-rank approximation, and concludes with future directions for improving error bounds and comparing with other solutions.The paper "Fast Random Walk with Restart and Its Applications" by Hanghang Tong addresses the challenge of efficiently computing the relevance score between nodes in a large, disk-resident graph using Random Walk with Restart (RWR). Traditional RWR methods either require quadratic space and cubic pre-computation time or are slow on query responses. To overcome these limitations, the authors propose a novel approach called B.LIN, which leverages the linear correlations and block-wise community-like structure of real graphs. The key contributions include: 1. **B.LIN Algorithm**: This algorithm uses low-rank matrix approximation and graph partitioning to pre-compute and store only a few small matrices, significantly reducing pre-computation and storage costs. 2. **Theoretical Justification**: The paper provides theoretical analysis and error bounds for the simplified version of B.LIN, NB.LIN, demonstrating its accuracy. 3. **Experimental Results**: Extensive experiments on real datasets (CoR, CoMMG, AP, AC) show that B.LIN and NB.LIN achieve significant speedups (up to 150x) while preserving 90%+ accuracy in various applications such as automatic image captioning, neighborhood formulation, and cross-modal correlation discovery. 4. **Special Case for Bipartite Graphs**: A fast algorithm (BB_LIN) is proposed for a specific class of bipartite graphs, achieving up to 1,800x speedup with minimal pre-computation and storage costs. The paper also discusses related work in random walk methods, graph partitioning, and low-rank approximation, and concludes with future directions for improving error bounds and comparing with other solutions.

Fast Random Walk with Restart and Its Applications

| Hanghang Tong, Christos Faloutsos, Jia-Yu Pan