DataCleaner is on-premises data cleaning software for small, midsize and large enterprises. It allows users to discover and analyze data quality issues, detect duplicates, standardize data and monitor data health.
The data profiling module helps users find missing values, patterns, character sets and other data characteristics. The duplicate detection feature finds duplicate records and removes them. The data standardization module checks values and confirms that they are realistic, and it allows users to create custom cleansing rules. DataCleaner works with Excel and CSV files, relational databases (RDBMS) and NoSQL databases.
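To make the profiling and duplicate detection ideas concrete, here is a minimal Python sketch of the general techniques. It is not DataCleaner's API or implementation: the function names (`pattern_of`, `profile_column`, `find_duplicates`) and the sample rows are hypothetical, and the duplicate check assumes that two records match when their key fields are equal after lowercasing and collapsing whitespace.

```python
from collections import Counter


def pattern_of(value):
    """Reduce a value to a shape: uppercase -> 'A', lowercase -> 'a', digit -> '9'."""
    out = []
    for ch in value:
        if ch.isdigit():
            out.append("9")
        elif ch.isalpha():
            out.append("A" if ch.isupper() else "a")
        else:
            out.append(ch)  # keep punctuation and spaces as-is
    return "".join(out)


def profile_column(values):
    """Count missing values and tally the patterns of the non-missing ones."""
    missing = sum(1 for v in values if v is None or v.strip() == "")
    patterns = Counter(pattern_of(v) for v in values if v and v.strip())
    return {"missing": missing, "patterns": patterns}


def find_duplicates(rows, key_fields):
    """Group row indices whose normalized key fields are identical."""
    groups = {}
    for i, row in enumerate(rows):
        # Assumption: lowercase + collapsed whitespace is enough to match records.
        key = tuple(" ".join(row[f].lower().split()) for f in key_fields)
        groups.setdefault(key, []).append(i)
    return [idxs for idxs in groups.values() if len(idxs) > 1]


# Hypothetical sample data for illustration only.
rows = [
    {"name": "Ann Lee", "zip": "12345"},
    {"name": "ann  lee", "zip": "12345"},
    {"name": "Bob Ray", "zip": ""},
]

print(profile_column([r["zip"] for r in rows]))
print(find_duplicates(rows, ["name"]))
```

A production tool would typically use fuzzy matching rather than exact comparison of normalized keys; the exact-match rule here is only the simplest case.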
The data health monitoring module allows users to check data health in real time and to schedule periodic data quality checks and notifications. DataCleaner integrates with Spark, Hadoop, Hive, Microsoft SQL Server, Oracle, IBM DB2 and MySQL, among others.
DataCleaner is offered in two editions, Commercial and Community. The Commercial edition includes support via email and phone, whereas the Community edition is supported through discussion forums.