Introducing the WinPure™
Data Profiling / Statistics Module
Data Profiling Made Easy
The Data Profiling / Statistics module within WinPure™ Clean & Match is a user-friendly and powerful data profiling tool that can help your business to discover patterns and meaning in your data and to check the quality of your data by analyzing formats, types, completeness and value counts.
It presents you with a complete set of statistics which you can use to help clean and correct your data, and to prepare it better for data matching.
- One-Click Operation
- Fully customize
- View Real-time changes
- Save and re-use Data Cleaning Matrix designs on other datasets
- Understand data quality issues clearly and quickly
The Data Profiling / Statistics Module provides over 30 different statistics
- Column Name – Name of column from the selected data table
- Type – The declared data type for the column
- Filled – The count of records that contain any data
- Empty – The % of records that are blankble
- Distinct – The count of all unique values
- Trailing Spaces – Number of records that have a trailing spaces (e.g. “John Smith “)
- Commas – Number of records that contain a comma (e.g. “10, Main Street”)
- Dots – Number of records that contain dots (e.g. “New.York”)
- Hyphens – Number of records that contain hyphens (e.g. “0986-5652”)
- Apostrophes – Number of records that contain apostrophes (e.g. “John’s Business”)
- Leading Spaces – Number of records that have a leading spaces (e.g. ” John Smith”)
- Letters – Number of records that only contain letters
- Numbers – Number of records that only contain numbers
- Non Printables – Number of records that contain non-printable characters. Non-printable characters are parts of a character set that do not represent a written symbol or part of the text within a document or code, but rather are there in the context of signal and control in character encoding. Non-printable characters are used to indicate certain formatting actions, such as: White spaces (considered an invisible graphic), Carriage Returns, Tabs, Line Breaks, Page Breaks and Null characters
- With Spaces – Number of records that have any space
- Multiple Spaces – Number of records that have more than one spaces (e.g. ” John Smith “)
- New Line Char – Number of records that contain a new line character
- Tab Char – Number of records that contain a tab character
- Punctuation – Number of records that contain punctuation marks. Punctuation marks are: period, comma, question mark, hyphen, dash, parentheses, apostrophe, ellipsis, quotation mark, colon, semicolon, exclamation point
- Upper Only – Number of records that contain Upper case only characters (e.g. “JOHN SMITH”)
- Lower Only – Number of records that contain Lower case only characters (e.g. “john smith”)
- Proper Case -Number of records that contain both Upper and Lower case in a standardized format (e.g. “John Smith”)
- Mixed Case – Number of records that contain both Upper and Lower case which are mixed together (e.g. “JoHN SmiTH)
- Most Common – The most common value within the column
- Most Common Count – The most common count within the column
- Min Number – The lowest number within that column
- Max Number – The highest number within that column
- Max Words – The maximum number of words
- Average Words – The average count of words
- Max Length – The maximum length of words
- Average Length – The average length of words
The Data Profiling / Statistics Module is available on all Clean & Match Editions
“Once you start using it, you can no longer imagine life without it! The easier and more familiar you become the more you will want to use it over and over! It will definitely become a part of your every workday life when dealing with data!”
“This software was faster and easier to use than the competitor products we tested“
– G2 Crowd Review
Always Recommending WinPureA very powerful but easy to use tool to for cleansing and removing duplicates from databases. I have used Clean & Match for many of my clients, and I am regular recommending this product to other companies.
Excellent Software & SupportWinPure is a really great product, we've been using it with excellent results for many years now, for finding and removing duplicate records and to keep our lists and database more accurate.
Excellent Product & Customer Service.We perform multiple matching projects for our clients and WinPure has filled the bill for these. The product is easy to use and we can complete a large matches in a very short time.
Great Data Quality SoftwareWinPure Clean & Match works great to analyze data and find duplicates. It saves us tons of money when mailing catalogs. This is a great product for the money and easy to use.
Best Data Cleansing SoftwareNot only does it execute its job with ease, but also provides ease of use and extreme comfort in doing so. This is the kind of product that once you start using you will not be able to drop down! I would highly recommend any business or user who has any data cleansing or matching needs to use this program!