Data profiling made easy

The Data Profiling / Statistics module within WinPure™ Clean & Match is a user-friendly, powerful data profiling tool. It helps your business discover patterns and meaning in your data and check its quality by analyzing formats, types, completeness and value counts.

It presents a complete set of statistics that you can use to clean and correct your data and to prepare it for data matching.

One-Click Operation
Fully customizable
View real-time changes
Save and re-use Data Cleaning Matrix designs on other datasets
Understand data quality issues clearly and quickly

The Data Profiling / Statistics module provides over 30 different statistics:

Column Name: Name of the column from the selected data table
Type: The declared data type for the column
Empty: The percentage of records that are blank
Distinct: The count of all unique values
Trailing Spaces: Number of records that have trailing spaces (e.g. “John Smith ”)
Commas: Number of records that contain a comma (e.g. “10, Main Street”)
Dots: Number of records that contain dots (e.g. “New.York”)
Hyphens: Number of records that contain hyphens (e.g. “0986-5652”)
Apostrophes: Number of records that contain apostrophes (e.g. “John’s Business”)
Leading Spaces: Number of records that have leading spaces (e.g. “ John Smith”)
Letters: Number of records that contain only letters
Numbers: Number of records that contain only numbers
With Spaces: Number of records that contain any space
Multiple Spaces: Number of records that contain more than one space (e.g. “ John Smith ”)
New Line Char: Number of records that contain a new line character
Tab Char: Number of records that contain a tab character
Upper Only: Number of records that contain only upper case characters (e.g. “JOHN SMITH”)
Lower Only: Number of records that contain only lower case characters (e.g. “john smith”)
Proper Case: Number of records that contain both upper and lower case in a standardized format (e.g. “John Smith”)
Mixed Case: Number of records that contain upper and lower case mixed together (e.g. “JoHN SmiTH”)
Most Common: The most common value within the column
Most Common Count: The number of times the most common value occurs within the column
Min Number: The lowest number within the column
Max Number: The highest number within the column
Max Words: The maximum number of words
Average Words: The average count of words
Max Length: The maximum length of words
Punctuation: Number of records that contain punctuation marks. Punctuation marks are: period, comma, question mark, hyphen, dash, parentheses, apostrophe, ellipsis, quotation mark, colon, semicolon, exclamation point
Average Length: The average length of words
Non Printables: Number of records that contain non-printable characters. Non-printable characters are parts of a character set that do not represent a written symbol or part of the text within a document or code; they serve as signal and control characters in character encoding. They are used to indicate certain formatting actions, such as: White spaces (considered an invisible graphic), Carriage Returns, Tabs, Line Breaks, Page Breaks and Null characters
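
To make a few of the statistics above concrete, here is a minimal Python sketch that computes a handful of them for a single column. It is not WinPure's implementation: the function name `profile_column`, the result keys, and the exact counting rules (for example, treating whitespace-only values as empty, and computing Distinct over non-empty values only) are assumptions made for illustration.

```python
from collections import Counter


def profile_column(values):
    """Return a few illustrative profiling statistics for one column.

    `values` is a list of strings; None is treated as an empty value.
    All counting rules here are assumptions, not WinPure's definitions.
    """
    cleaned = ["" if v is None else str(v) for v in values]
    total = len(cleaned)
    # Assumption: a value made only of whitespace counts as empty.
    non_empty = [v for v in cleaned if v.strip() != ""]
    word_counts = [len(v.split()) for v in non_empty]
    most_common = Counter(non_empty).most_common(1)

    return {
        "empty_pct": 100.0 * (total - len(non_empty)) / total if total else 0.0,
        "distinct": len(set(non_empty)),
        "leading_spaces": sum(v.startswith(" ") for v in non_empty),
        "trailing_spaces": sum(v.endswith(" ") for v in non_empty),
        # "More than one space" is read literally: two or more spaces anywhere.
        "multiple_spaces": sum(v.count(" ") > 1 for v in non_empty),
        "commas": sum("," in v for v in non_empty),
        "letters_only": sum(v.isalpha() for v in non_empty),
        "numbers_only": sum(v.isdigit() for v in non_empty),
        "upper_only": sum(v.isupper() for v in non_empty),
        "lower_only": sum(v.islower() for v in non_empty),
        "most_common": most_common[0][0] if most_common else None,
        "most_common_count": most_common[0][1] if most_common else 0,
        "max_words": max(word_counts, default=0),
        "average_words": sum(word_counts) / len(word_counts) if word_counts else 0.0,
    }


# Hypothetical sample column, for illustration only.
names = ["John Smith", "JOHN SMITH", "john smith", " John Smith",
         "John Smith ", "Smith, John", "", None, "John Smith"]
stats = profile_column(names)
for key, value in stats.items():
    print(f"{key}: {value}")
```

A report like this makes it easy to spot which cleaning steps a column needs (trimming spaces, standardizing case, removing punctuation) before it is used for data matching.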

The Data Profiling / Statistics module is available on all Clean & Match editions.

Read more about Why Data Profiling is Important.

Download WinPure Clean & Match Desktop today for free.

WinPure Clean & Match delivers everything you need to clean, profile, deduplicate and enrich your data. It’s plug and play, and a trial version is available, so in just a few minutes you can start importing and profiling data from a variety of file formats, databases, or CRMs.

Edward B - Company Owner

Excellent Product & Customer Service

We perform multiple matching projects for our clients and WinPure has filled the bill for these. The product is easy to use and we can complete large matches in a very short time.

Richard F - Company Owner

Excellent Software & Support

WinPure is a really great product, we've been using it with excellent results for many years now, for finding and removing duplicate records and to keep our lists and database more accurate.

G2 Crowd Review

Best Data Cleaning Software

Not only does it execute its job with ease, but also provides ease of use and extreme comfort in doing so. This is the kind of product that once you start using you will not be able to drop down! I would highly recommend any business or user who has any data cleansing or matching needs to use this program!

Cynthia T - Director of Information Technology

Great Data Quality Software

WinPure Clean & Match works great to analyze data and find duplicates. It saves us tons of money when mailing catalogs. This is a great product for the money and easy to use.

Naveed B - IT Consultant

Always Recommending WinPure

A very powerful but easy to use tool for cleansing and removing duplicates from databases. I have used Clean & Match for many of my clients, and I am regularly recommending this product to other companies.

SUHA ALPARSLAN

Fantastic Software with Exceptional Support

I cannot emphasise enough how valuable this data cleansing and dedupe software has been for us and I would recommend this to any business that requires their database to be cleaned and corrected.

Trustpilot Review

9 Year User - Still Happy!

I've used WinPure for 9 years now (since 2007) and have found it to be the perfect companion to the many data projects I do for marketing and sales campaigns. Having started my own firm since then, I now have every client-facing team member get WinPure on their machine to benefit from its friendly UI, efficient speed, and dependability.

WinPure, a trusted innovator in Data Quality and Master Data Management Tools.
Join the thousands of customers who rely on WinPure to grow faster with better data.
