運用Python3實現(xiàn)Two-Pass算法檢測區(qū)域連通性

更新時間：2021年08月18日 17:16:38 作者：DECHIN

如何高效的檢測出連通區(qū)域的流動性是大家一直關注的話題，這篇文章主要介紹了運用Python3實現(xiàn)Two-Pass算法檢測區(qū)域連通性，感興趣的朋友可以一起來看看

技術背景

連通性檢測是圖論中常常遇到的一個問題，我們可以用五子棋的思路來理解這個問題五子棋中，橫、豎、斜相鄰的兩個棋子，被認為是相連接的，而一樣的道理，在一個二維的圖中，只要在橫、豎、斜三個方向中的一個存在相鄰的情況，就可以認為圖上相連通的。比如以下案例中的python數(shù)組，3號元素和5號元素就是相連接的，5號元素和6號元素也是相連接的，因此這三個元素實際上是屬于同一個區(qū)域的：

array([[0, 3, 0],
       [0, 5, 0],
       [6, 0, 0]])

而再如下面這個例子，其中的1、2、3三個元素是相連的，4、5、6三個元素也是相連的，但是這兩個區(qū)域不存在連接性，因此這個網(wǎng)格被分成了兩個區(qū)域：

array([[1, 0, 4],
       [2, 0, 5],
       [3, 0, 6]])

那么如何高效的檢測一張圖片或者一個矩陣中的所有連通區(qū)域并打上標簽，就是我們所關注的一個問題。

Two-Pass算法

一個典型的連通性檢測的方案是Two-Pass算法，該算法可以用如下的一張動態(tài)圖來演示：

該算法的核心在于用兩次的遍歷，為所有的節(jié)點打上分區(qū)的標簽，如果是不同的分區(qū)，就會打上不同的標簽。其基本的算法步驟可以用如下語言進行概述：

遍歷網(wǎng)格節(jié)點，如果網(wǎng)格的上、左、左上三個格點不存在元素，則為當前網(wǎng)格打上新的標簽，同時標簽編號加一；
當上、左、左上的網(wǎng)格中存在一個元素時，將該元素值賦值給當前的網(wǎng)格作為標簽；
當上、左、左上的網(wǎng)格中有多個元素時，取最低值作為當前網(wǎng)格的標簽；
在標簽賦值時，留意標簽上邊和左邊已經(jīng)被遍歷過的4個元素，將4個元素中的最低值與這四個元素分別添加到Union的數(shù)據(jù)結構中（參考鏈接1）；
再次遍歷網(wǎng)格節(jié)點，根據(jù)Union數(shù)據(jù)結構中的值刷新網(wǎng)格中的標簽值，最終得到劃分好區(qū)域和標簽的元素矩陣。

測試數(shù)據(jù)的生成

這里我們以Python3為例，可以用Numpy來產(chǎn)生一系列隨機的0-1矩陣，這里我們產(chǎn)生一個20*20大小的矩陣：

# two_pass.py

import numpy as np
import matplotlib.pyplot as plt

if __name__ == "__main__":
    np.random.seed(1)
    graph = np.random.choice([0,1],size=(20,20))
    print (graph)

    plt.figure()
    plt.imshow(graph)
    plt.savefig('random_bin_graph.png')

執(zhí)行的輸出結果如下：

$ python3 two_pass.py 
[[1 1 0 0 1 1 1 1 1 0 0 1 0 1 1 0 0 1 0 0]
 [0 1 0 0 1 0 0 0 1 0 0 0 1 1 1 1 1 0 0 0]
 [1 1 1 1 1 1 0 1 1 0 0 1 0 0 1 1 1 0 1 0]
 [0 1 1 0 1 1 1 1 0 0 1 1 0 0 0 0 1 1 1 0]
 [1 0 0 1 1 0 1 1 0 1 0 0 1 1 1 0 1 1 0 1]
 [1 1 1 0 0 0 0 0 1 1 1 1 1 1 1 0 0 0 0 0]
 [0 1 1 1 1 1 1 0 0 1 1 0 0 1 0 0 0 1 1 1]
 [1 1 0 1 0 1 0 0 0 1 1 1 0 1 0 0 0 0 1 0]
 [1 0 1 1 1 0 0 0 0 0 0 1 0 0 1 0 0 1 1 0]
 [0 0 1 0 0 0 0 1 0 0 0 0 1 1 0 0 1 1 1 0]
 [0 0 0 0 1 1 1 0 1 1 0 0 0 1 1 0 1 1 1 0]
 [1 1 1 1 0 1 0 0 1 0 1 0 1 1 0 1 1 0 1 1]
 [1 0 1 0 1 0 1 1 1 1 1 1 0 0 1 1 0 0 0 1]
 [1 0 0 0 0 0 1 1 1 1 1 1 1 0 0 1 0 0 0 1]
 [0 1 0 1 0 0 0 0 1 1 0 0 0 1 0 1 1 0 0 1]
 [0 1 0 0 0 1 0 1 0 1 1 1 0 1 0 1 1 1 1 0]
 [0 1 0 0 0 0 1 1 0 1 1 0 0 1 1 1 1 1 1 1]
 [0 0 0 0 0 0 0 1 0 0 0 0 0 1 1 1 1 0 0 0]
 [1 0 1 0 1 0 0 0 0 0 0 1 0 0 0 1 0 1 1 0]
 [0 1 1 0 1 0 1 0 1 1 0 0 1 0 0 0 0 0 1 1]]

同時會生成一張網(wǎng)格的圖片：

其實從這個圖片中我們可以看出，圖片的上面部分幾乎都是連接在一起的，只有最下面存在幾個獨立的區(qū)域。

Two-Pass算法的實現(xiàn)

這里需要說明的是，因為我們并沒有使用Union的數(shù)據(jù)結構，而是只使用了Python的字典數(shù)據(jù)結構，因此代碼寫起來會比較冗余而且不是那么美觀，但是這里我們主要的目的是先用代解決這一實際問題，因此代碼亂就亂一點吧。

# two_pass.py

import numpy as np
import matplotlib.pyplot as plt
from copy import deepcopy

def first_pass(g) -> list:
    graph = deepcopy(g)
    height = len(graph)
    width = len(graph[0])
    label = 1
    index_dict = {}
    for h in range(height):
        for w in range(width):
            if graph[h][w] == 0:
                continue
            if h == 0 and w == 0:
                graph[h][w] = label
                label += 1
                continue
            if h == 0 and graph[h][w-1] > 0:
                graph[h][w] = graph[h][w-1]
                continue
            if w == 0 and graph[h-1][w] > 0:
                if graph[h-1][w] <= graph[h-1][min(w+1, width-1)]:
                    graph[h][w] = graph[h-1][w]
                    index_dict[graph[h-1][min(w+1, width-1)]] = graph[h-1][w]
                elif graph[h-1][min(w+1, width-1)] > 0:
                    graph[h][w] = graph[h-1][min(w+1, width-1)]
                    index_dict[graph[h-1][w]] = graph[h-1][min(w+1, width-1)]
                continue
            if h == 0 or w == 0:
                graph[h][w] = label
                label += 1
                continue
            neighbors = [graph[h-1][w], graph[h][w-1], graph[h-1][w-1], graph[h-1][min(w+1, width-1)]]
            neighbors = list(filter(lambda x:x>0, neighbors))
            if len(neighbors) > 0:
                graph[h][w] = min(neighbors)
                for n in neighbors:
                    if n in index_dict:
                        index_dict[n] = min(index_dict[n], min(neighbors))
                    else:
                        index_dict[n] = min(neighbors)
                continue
            graph[h][w] = label
            label += 1
    return graph, index_dict

def remap(idx_dict) -> dict:
    index_dict = deepcopy(idx_dict)
    for id in idx_dict:
        idv = idx_dict[id]
        while idv in idx_dict:
            if idv == idx_dict[idv]:
                break
            idv = idx_dict[idv]
        index_dict[id] = idv
    return index_dict

def second_pass(g, index_dict) -> list:
    graph = deepcopy(g)
    height = len(graph)
    width = len(graph[0])
    for h in range(height):
        for w in range(width):
            if graph[h][w] == 0:
                continue
            if graph[h][w] in index_dict:
                graph[h][w] = index_dict[graph[h][w]]
    return graph

def flatten(g) -> list:
    graph = deepcopy(g)
    fgraph = sorted(set(list(graph.flatten())))
    flatten_dict = {}
    for i in range(len(fgraph)):
        flatten_dict[fgraph[i]] = i
    graph = second_pass(graph, flatten_dict)
    return graph

if __name__ == "__main__":
    np.random.seed(1)
    graph = np.random.choice([0,1],size=(20,20))
    graph_1, idx_dict = first_pass(graph)
    idx_dict = remap(idx_dict)
    graph_2 = second_pass(graph_1, idx_dict)
    graph_3 = flatten(graph_2)
    print (graph_3)

    plt.subplot(131)
    plt.imshow(graph)
    plt.subplot(132)
    plt.imshow(graph_3)
    plt.subplot(133)
    plt.imshow(graph_3>0)
    plt.savefig('random_bin_graph.png')

完整代碼的輸出如下所示：

$ python3 two_pass.py 
[[1 1 0 0 1 1 1 1 1 0 0 1 0 1 1 0 0 1 0 0]
 [0 1 0 0 1 0 0 0 1 0 0 0 1 1 1 1 1 0 0 0]
 [1 1 1 1 1 1 0 1 1 0 0 1 0 0 1 1 1 0 1 0]
 [0 1 1 0 1 1 1 1 0 0 1 1 0 0 0 0 1 1 1 0]
 [1 0 0 1 1 0 1 1 0 1 0 0 1 1 1 0 1 1 0 1]
 [1 1 1 0 0 0 0 0 1 1 1 1 1 1 1 0 0 0 0 0]
 [0 1 1 1 1 1 1 0 0 1 1 0 0 1 0 0 0 1 1 1]
 [1 1 0 1 0 1 0 0 0 1 1 1 0 1 0 0 0 0 1 0]
 [1 0 1 1 1 0 0 0 0 0 0 1 0 0 1 0 0 1 1 0]
 [0 0 1 0 0 0 0 1 0 0 0 0 1 1 0 0 1 1 1 0]
 [0 0 0 0 1 1 1 0 1 1 0 0 0 1 1 0 1 1 1 0]
 [1 1 1 1 0 1 0 0 1 0 1 0 1 1 0 1 1 0 1 1]
 [1 0 1 0 1 0 1 1 1 1 1 1 0 0 1 1 0 0 0 1]
 [1 0 0 0 0 0 1 1 1 1 1 1 1 0 0 1 0 0 0 1]
 [0 1 0 2 0 0 0 0 1 1 0 0 0 1 0 1 1 0 0 1]
 [0 1 0 0 0 1 0 1 0 1 1 1 0 1 0 1 1 1 1 0]
 [0 1 0 0 0 0 1 1 0 1 1 0 0 1 1 1 1 1 1 1]
 [0 0 0 0 0 0 0 1 0 0 0 0 0 1 1 1 1 0 0 0]
 [3 0 3 0 4 0 0 0 0 0 0 5 0 0 0 1 0 1 1 0]
 [0 3 3 0 4 0 6 0 7 7 0 0 5 0 0 0 0 0 1 1]]

同樣的我們可以看看此時得到的新的圖像：

這里我們并列的畫了三張圖，第一張圖是原圖，第二張圖是劃分好區(qū)域和標簽的圖，第三張是對第二張圖進行二元化的結果，以確保在運算過程中沒有丟失原本的信息。經(jīng)過確認這個標簽的結果劃分是正確的，但是因為涉及到一些算法實現(xiàn)的細節(jié)，這里我們還是需要展開來介紹一下。

算法的執(zhí)行流程

if __name__ == "__main__":
    np.random.seed(1)
    graph = np.random.choice([0,1],size=(20,20))
    graph_1, idx_dict = first_pass(graph)
    idx_dict = remap(idx_dict)
    graph_2 = second_pass(graph_1, idx_dict)
    graph_3 = flatten(graph_2)

這個部分是算法的核心框架，在本文中的算法實現(xiàn)流程為：先用first_pass遍歷一遍網(wǎng)格節(jié)點，按照上一個章節(jié)中介紹的Two-Pass算法打上標簽，并獲得一個映射關系；然后用remap將上面得到的映射關系做一個重映射，確保每一個級別的映射都對應到了最根部（可以聯(lián)系參考鏈接1的內(nèi)容進行理解，雖然這里沒有使用Union的數(shù)據(jù)結構，但是本質(zhì)上還是一個樹形的結構，需要做一個重映射）；然后用second_pass執(zhí)行Two-Pass算法的第二次遍歷，得到一組打上了新的獨立標簽的網(wǎng)格節(jié)點；最后需要用flatten將標簽進行壓平，因為前面映射的關系，有可能導致標簽不連續(xù)，所以我們這里又做了一次映射，確保標簽是連續(xù)變化的，實際應用中可以不使用這一步。

標簽的重映射

關于節(jié)點的遍歷，大家可以直接看算法代碼，這里需要額外講解的是標簽的重映射模塊的代碼：

def remap(idx_dict) -> dict:
    index_dict = deepcopy(idx_dict)
    for id in idx_dict:
        idv = idx_dict[id]
        while idv in idx_dict:
            if idv == idx_dict[idv]:
                break
            idv = idx_dict[idv]
        index_dict[id] = idv
    return index_dict

這里的算法是先對得到的標簽進行遍歷，在字典中獲取當前標索引所對應的值，作為新的索引，直到鍵跟值一致為止，相當于在一個樹形的數(shù)據(jù)結構中重復尋找父節(jié)點直到找到根節(jié)點。

其他的測試用例

這里我們可以再額外測試一些案例，比如增加幾個0元素使得網(wǎng)格節(jié)點更加稀疏：

graph = np.random.choice([0,0,0,1],size=(20,20))

得到的結果圖片如下所示：

還可以再稀疏一些：

graph = np.random.choice([0,0,0,0,0,1],size=(20,20))

得到的結果如下圖所示：

越是稀疏的圖，得到的分組結果就越分散。

總結概要

在本文中我們主要介紹了利用Two-Pass的算法來檢測區(qū)域連通性，并給出了Python3的代碼實現(xiàn)，當然在實現(xiàn)的過程中因為沒有使用到Union這樣的數(shù)據(jù)結構，僅僅用了字典來存儲標簽之間的關系，因此效率和代碼可讀性都會低一些，單純作為用例的演示和小規(guī)模區(qū)域劃分的計算是足夠用了。在該代碼實現(xiàn)方案中，還有一點與原始算法不一致的是，本實現(xiàn)方案中打新的標簽是讀取上、上左和左三個方向的格點，但是存儲標簽的映射關系時，是讀取了上、上左、上右和左這四個方向的格點。

參考鏈接

https://blog.csdn.net/lichengyu/article/details/13986521
https://www.cnblogs.com/riddick/p/8280883.html

到此這篇關于運用Python3實現(xiàn)Two-Pass算法檢測區(qū)域連通性的文章就介紹到這了,更多相關Python3實現(xiàn)Two-Pass算法檢測區(qū)域流通內(nèi)容請搜索腳本之家以前的文章或繼續(xù)瀏覽下面的相關文章希望大家以后多多支持腳本之家！

您可能感興趣的文章:

python 添加用戶設置密碼并發(fā)郵件給root用戶
這篇文章主要介紹了python 添加用戶設置密碼并發(fā)郵件給root用戶的相關資料,非常不錯，具有參考借鑒價值，需要的朋友可以參考下
2016-07-07
python遞歸調(diào)用中的坑:打印有值, 返回卻None
這篇文章主要介紹了python遞歸調(diào)用中的坑:打印有值, 返回卻None,本文通過問題分析給出解決方法，對大家的學習或工作具有一定的參考借鑒價值，需要的朋友可以參考下
2020-03-03
學好python基本數(shù)據(jù)類型
這篇文章主要介紹了學好python基本數(shù)據(jù)類型,學習python基本數(shù)據(jù)類型我們需要了解基本數(shù)據(jù)類型有數(shù)字int、布爾值bool、字符串str、列表list、元組tuple、字典dict等，其中包括他們的基本用法和其常用的方法，下面來看看文章的具體介紹吧
2021-12-12
python爬蟲框架Scrapy基本應用學習教程
這篇文章主要為大家介紹了python爬蟲框架Scrapy的基本應用學習教程，有需要的朋友可以借鑒參考下，希望能夠有所幫助，祝大家多多進步
2021-11-11
python sorted函數(shù)原理解析及練習
這篇文章主要介紹了python sorted函數(shù)原理解析及練習,文中通過示例代碼介紹的非常詳細，對大家的學習或者工作具有一定的參考學習價值,需要的朋友可以參考下
2020-02-02
python cv2圖像質(zhì)量壓縮的算法示例
使用opencv對圖像進行編碼，一方面是圖像二進制傳輸?shù)男枰?，另一方面對圖像壓縮。本文主要介紹了python cv2圖像質(zhì)量壓縮的算法示例，感興趣的可以了解一下
2021-06-06
Python中的迭代和列表生成式
這篇文章主要介紹了Python中的迭代和列表生成式,具有很好的參考價值,希望對大家有所幫助,如有錯誤或未考慮完全的地方,望不吝賜教<BR>
2024-02-02
500行代碼使用python寫個微信小游戲飛機大戰(zhàn)游戲
這篇文章主要介紹了500行代碼使用python寫個微信小游戲飛機大戰(zhàn)游戲，本文通過實例代碼給大家介紹的非常詳細，具有一定的參考借鑒價值,需要的朋友可以參考下
2019-10-10
django中模板繼承與ModelForm實例詳解
ModelForm類是form是組件中Form的一個子類,所以也是處理表單的,下面這篇文章主要給大家介紹了關于django中模板繼承與ModelForm的相關資料,文中通過實例代碼介紹的非常詳細,需要的朋友可以參考下
2022-04-04
關于初始種子自動選取的區(qū)域生長實例（python+opencv)
今天小編就為大家分享一篇關于初始種子自動選取的區(qū)域生長實例（python+opencv)，具有很好的參考價值，希望對大家有所幫助。一起跟隨小編過來看看吧
2020-01-01