I run a monthly safety slogan competition, and every entry has to be checked against a list of slogans that have already been submitted. This takes forever to do by hand. I have 2 lists: this month's slogans and a master list of all slogans.
Here is an example:
S1Master: An unsafe behavior can bring you down
S1Submitted: Unsafe behaviors can bring you down
My idea is to remove the plural "s" from each word, count the words that appear in both sentences, and express that count as a percentage of the total number of words in the two sentences. A pair scoring above, say, 90% would be listed as a match.
In the example, 6 words are the same in both sentences, so 12 of the 13 words in total are shared: 12/13 = 92.31%.
Therefore this is a matched pair.
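To make the calculation concrete, here is a minimal sketch of the idea in Python. The language, the function names (normalize, similarity, find_matches) and the plural rule are my own assumptions for illustration, and the plural stripping is deliberately crude: it just drops one trailing "s" from longer words. It is applied to both sentences the same way, so it should not matter much that it is not proper English grammar.

```python
from collections import Counter


def normalize(slogan):
    """Lowercase, strip punctuation, and drop a trailing plural "s" from each word."""
    words = []
    for raw in slogan.lower().split():
        word = raw.strip(".,!?;:'\"")
        # Crude plural rule (an assumption): drop one trailing "s",
        # but leave short words and words ending in "ss" alone.
        if len(word) > 3 and word.endswith("s") and not word.endswith("ss"):
            word = word[:-1]
        if word:
            words.append(word)
    return words


def similarity(a, b):
    """Shared words, counted once per sentence, divided by total words in both."""
    words_a, words_b = normalize(a), normalize(b)
    total = len(words_a) + len(words_b)
    if total == 0:
        return 0.0
    # Counter intersection keeps each word at its lower count in the two sentences.
    shared = sum((Counter(words_a) & Counter(words_b)).values())
    return 2 * shared / total


def find_matches(submitted_list, master_list, threshold=0.90):
    """Return (submitted, master, score) for every pair scoring above the threshold."""
    matches = []
    for submitted in submitted_list:
        for master in master_list:
            score = similarity(submitted, master)
            if score > threshold:
                matches.append((submitted, master, score))
    return matches


master = "An unsafe behavior can bring you down"
submitted = "Unsafe behaviors can bring you down"
print(round(similarity(master, submitted) * 100, 2))  # 92.31
```

Every new slogan is compared with every master slogan here, which should be fine for a few thousand slogans but would get slow on very large lists.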
Is this the best way, and if so, how do I go about comparing the two lists? Please guide me.
Thanks
Steve