String Distance Matrix in Python


How do I calculate a Levenshtein distance matrix of strings in Python?

        str1    str2    str3    str4    ...     strn
str1    0.8     0.4     0.6     0.1     ...     0.2
str2    0.4     0.7     0.5     0.1     ...     0.1
str3    0.6     0.5     0.6     0.1     ...     0.1
str4    0.1     0.1     0.1     0.5     ...     0.6
.       .       .       .       .       ...     .
.       .       .       .       .       ...     .
.       .       .       .       .       ...     .
strn    0.2     0.1     0.1     0.6     ...     0.7

Using a distance function, I can calculate the distance between 2 words. Here I have one list containing n strings. I want to calculate the distance matrix, and afterwards I want to cluster the words.

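Just use the version of pdist (from scipy.spatial.distance) that accepts a custom metric.

from scipy.spatial.distance import pdist
y = pdist(x, levenshtein)

For levenshtein, you can use the Rosetta Code implementation, as suggested by Tanu.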
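A minimal sketch of the idea, under a few assumptions: the levenshtein function below is a plain dynamic-programming version written for illustration (not the Rosetta Code one), and the words list is made-up sample data. Since pdist requires a 2-D array, the sketch passes row indices and looks the strings up inside the metric rather than passing the strings directly.

```python
import numpy as np
from scipy.spatial.distance import pdist


def levenshtein(a, b):
    """Edit distance between strings a and b (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


words = ["kitten", "sitting", "mitten", "flaw"]  # hypothetical sample data

# One row per string, holding only that string's index in `words`.
idx = np.arange(len(words)).reshape(-1, 1)

# Condensed distance vector: one entry per unordered pair of words.
y = pdist(idx, lambda u, v: levenshtein(words[int(u[0])], words[int(v[0])]))
```

The result y holds n*(n-1)/2 values, ordered by pair (0,1), (0,2), ..., (n-2,n-1).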

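If you want the full square matrix, use squareform (also in scipy.spatial.distance, not in NumPy) on the result:

from scipy.spatial.distance import squareform
y = squareform(y)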
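As a small illustration of what squareform does, here is a sketch that uses made-up condensed distances for three items; the values are assumptions chosen only to show the layout.

```python
import numpy as np
from scipy.spatial.distance import squareform

# Condensed distances for three items, in pdist order: (0,1), (0,2), (1,2).
y = np.array([3.0, 1.0, 3.0])  # made-up values for illustration

# Expand to a symmetric 3x3 matrix with zeros on the diagonal.
full = squareform(y)

# squareform also converts back from the square form to the condensed form.
condensed_again = squareform(full)
```

Note that the diagonal of the square matrix is always zero, since it holds the distance of each string to itself.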
