Fuzzyjoin columns with string distance

PHOTO EMBED

Wed Jun 01 2022 12:46:14 GMT+0000 (Coordinated Universal Time)

Saved by @Treenose #r

library(fuzzyjoin) 

df1 %>% # Left join based on stringdist using city and city_actual cols
    stringdist_left_join(df2, by = c("df1_col" = "df2_col")) 
content_copyCOPY

Performs Damerau-Levenshtein distance and merges names with smallest distance