Word Manager
The Word Manager can be used to correct, replace, extract, count or delete any values/words within columns.
You can create your own custom dictionaries and spelling checkers and use any language you wish, and then save the settings to be used for other WinPure projects. This is one of the most powerful cleaning modules within Clean & Match and will help to ensure your data is more accurate.
Some examples/ideas on how you can use the Word Manager:
- For Company names: Replacing all "Ltd" to "Limited", replacing misspelled words like "Corpration" with "Corporation"
- For Countries and States: Replacing "NY" with "New York", correcting "Engleand" with "England"
- Addresses: Replacing all "Rd" to "Road", replacing misspelled words like "TERACE" with "TERRACE"
- Persons Names: Replacing all "VICTORIA" to "VICKY", replacing misspelled names like "CLAIRE" with "CLAIR"
You can create separate Word Manager files if you wish, there is no limit to how many you can create. Not only will this module help to correct and harmonize your data but it will also help to find more true matches when performing a de-duplication. Included are a set of Word Manager files to get you started within C:\ProgramData\WinPure\Clean & Match\Word Manager Files
From the Matrix, click Word Manager, then against the column you wish to use, simply click "Edit" to open up Word Manager, as shown below:
In the left section (Display) are all the unique Values or Words from the chosen column and the right section (Word Manager List) is your Word Manager file.
Display - Switch between Values & Words
- Values - This will provide a list of all the values (together with count) from the chosen column
- Words - This will provide a list of all the separate words (together with count) from the chosen column
Using the arrow buttons, you can transfer the values from the Display section onto your Word Manger List (right section).
Alternatively, you can manually enter Words & Replacements on the Word Manager List
.
- Words - This will provide a list of all the values (together with count) from the chosen column.
- Replacement - This will provide a list of all the separate words (together with count) from the chosen column.
- To delete - This is selected if you wish to just remove the word from the column, rather than replace it.
- Option
- WholeWord - This will replace any specific word and replace it with the ‘Replacement’ value.
- (eg. Ltd > Limited - this will replace all the Ltd words within each value and replace it with the word Limited)
- ("The Write Way Ltd" becomes "The Write Way Limited")
- WholeValue - This is to replace the exact word and replace it with the "Replacement" value. This is mainly used to correct mistakes for individual values rather than a group.
- (eg. Alter Image > The Alter Images Plc - this will replace all the values that contain Alter Image and replace it with the words The Alter Images Plc)
- ("Alter Image" becomes "The Alter Images Plc")
- AnyPart - This will replace any part of the word and replace it with the ‘Replacement’ value. Use this with caution as it could replace parts of words that you might not wish to change.
- (eg, king > kings - This will replace all values containing king with the work kings
- ("12 Kington pl" becomes "12 Kingston pl", "132 Kingly" becomes "132 Kingsly" etc")
- AnyPartEntire - This will replace any part of the word and replace the entire cell value with the ‘Replacement’ value.
- (eg. Maidstone > Hotel Grande - this will completely replace any value containing Maidstone and replace the entire cell value with Hotel Grande.
- ("Maidstone hotels" become "Hotel Grande", "New Maidstone Hostal" becomes "Hotel Grande")
Load - To load a previously saved Word Manager file
Save - To save a Word Manager File
Clear - Clear all values from the Word Manager List