Question

Sivaraman Pillai (Sivaraman)
Northbridge Financial
Senior Software Engineer PEGA
CA
Sivaraman Member since 2009 2 posts
Posted: 20 Nov 2020 9:48 EST
Last activity: 20 Nov 2020 12:05 EST

Comparing two strings where the order of the words in the query string is not relevant

Our problem is to produce a match score for two strings, a query string and a term string, that does not depend on the order of the words in the query string. The score should be high whenever the words of the query string are also found in the term string, regardless of where they appear.
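To make the requirement concrete, here is a minimal sketch in plain Java of one score that ignores word order: Jaccard similarity computed over the sets of words in each string. It is an illustration only, not a Commons Text API; the class name WordSetSimilarity and the methods words and score are hypothetical, and splitting on whitespace after lower-casing is an assumption about how words should be tokenized.

```java
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

final class WordSetSimilarity {

    // Lower-case the text and split it on whitespace; blank input yields an empty set.
    static Set<String> words(String text) {
        Set<String> result = new HashSet<>();
        for (String w : text.toLowerCase(Locale.ROOT).trim().split("\\s+")) {
            if (!w.isEmpty()) {
                result.add(w);
            }
        }
        return result;
    }

    // Jaccard similarity on word sets: |A intersect B| / |A union B|, in [0, 1].
    // Word order and repeated words do not affect the result.
    static double score(String query, String term) {
        Set<String> a = words(query);
        Set<String> b = words(term);
        if (a.isEmpty() && b.isEmpty()) {
            return 1.0; // assumption: two empty strings count as identical
        }
        Set<String> intersection = new HashSet<>(a);
        intersection.retainAll(b);
        Set<String> union = new HashSet<>(a);
        union.addAll(b);
        return (double) intersection.size() / union.size();
    }

    public static void main(String[] args) {
        // Same words, different order: prints 1.0
        System.out.println(score("motor vehicle insurance", "insurance motor vehicle"));
        // One shared word out of four distinct words: prints 0.25
        System.out.println(score("motor vehicle insurance", "home insurance"));
    }
}
```

A set-based score like this treats misspelled words as entirely different words, so it only covers the word-order part of the requirement.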

We have tried the various similarity functions (LevenshteinDistance, FuzzyScore, JaroWinklerDistance, JaccardDistance, CosineDistance, HammingDistance) from the Apache Commons Text package documented at https://commons.apache.org/proper/commons-text/apidocs/org/apache/commons/text/similarity/package-summary.html
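Character-based measures such as edit distance see a reordered string as a very different string. One common workaround, sketched below under stated assumptions, is to sort the words of both strings before comparing them, so that word order no longer contributes to the distance. The class SortedTokenDistance and its methods are hypothetical, and the Levenshtein routine is a hand-written stand-in so the example runs without any library; it is not the Commons Text implementation.

```java
import java.util.Arrays;
import java.util.Locale;

final class SortedTokenDistance {

    // Lower-case, split on whitespace, sort the words, and join them with single spaces.
    static String normalize(String text) {
        String[] words = text.toLowerCase(Locale.ROOT).trim().split("\\s+");
        Arrays.sort(words);
        return String.join(" ", words);
    }

    // Dynamic-programming Levenshtein distance using two rolling rows.
    static int levenshtein(String s, String t) {
        int[] prev = new int[t.length() + 1];
        int[] curr = new int[t.length() + 1];
        for (int j = 0; j <= t.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= s.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= t.length(); j++) {
                int cost = s.charAt(i - 1) == t.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[t.length()];
    }

    // Edit distance after word order has been normalized away.
    static int distance(String query, String term) {
        return levenshtein(normalize(query), normalize(term));
    }

    public static void main(String[] args) {
        // Same words in a different order: prints 0
        System.out.println(distance("smith john", "john smith"));
        // One missing letter, different order: prints 1
        System.out.println(distance("smith jon", "john smith"));
    }
}
```

Unlike a pure word-set comparison, this keeps some tolerance for spelling differences within words, at the cost of being sensitive to extra or missing words.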

However, we are still unsure which of these scoring algorithms is the best fit for our use case.

Could you please suggest which similarity matching algorithm, in your view, would be the best one to use here?

***Edited by Moderator Marissa to update Content Type from Discussion to Question; to update Support Case Details***
Pega Platform 8.4.2 Case Management Insurance Lead System Architect Support Case Exists