client login download contact us help/faq’s
 
4. Word Smith ™

Data Match's trademarked Word Smith ™ is essential for cleaning up common text issues and harmonizing your data.
  • Groups and counts the all words in your data set.
  • Replace or remove unwanted text. Ideal for fixing spelling errors and harmonizing your data without having to touch every occurrence of a misspelling. Example: Change "Streeet" or "STrt" to "Street".
  • Comes with several standard Word Smith ™ files saving you the time of coming up with them yourself.
    • Common company suffixes "LLC, Corporation, Corp, etc."
    • Common misspellings "thhe", "nineteeen", etc.
    • Save your Word Smith ™ files for future use across projects.
  • Place keywords into new fields. Example identify the keywords "Blue" and "Purple" and place into a new field called color. Ideal for creating new fields to match and harmonize your data.
  • Extract numbers with common prefixes or suffixes. Example: extract the text "1 Gallon" or "3 Gallons" to a new field called "Volume". Ideal for creating new fields to match and harmonize your data.
  • Preview your Word Smith actions in the dynamic data window. Apply actions to all tables, or selected tables/fields.

Pattern Extraction Example


Free Trial



 

 

Data Match 2008 Screenshots
1. Data In
Learn More
 

 

 

 

 
2. Data Quality
Learn More

 

 

 

 
3. Clean
Learn More

 

 

 

 
4. Word Smith™
Learn More

 

 

 

 
5. Match Definition
Learn More

 

 

 

 
6. Results
Learn More

 

 

 

 
7. Deduplicate Preview
Learn More