Friday, December 25, 2009

What is data cleansing?

aound in application control computer packages.What is data cleansing?
(m)





1) Also referred to as data scrubbing, the act of detecting and removing and/or correcting a database鈥檚 dirty data (i.e., data that is incorrect, out-of-date, redundant, incomplete, or formatted incorrectly). The goal of data cleansing is not just to clean up the data in a database but also to bring consistency to different sets of data that have been merged from separate databases. Sophisticated software applications are available to clean a database鈥檚 data using algorithms, rules and look-up tables, a task that was once done manually and therefore still subject to human error.


(2) In a RAID system, the act of correcting parity bit errors so that drives remain synchronized.What is data cleansing?
Data cleansing is the act of detecting and correcting (or removing) corrupt or inaccurate records from a record set.





After cleansing, a data set will be consistent with other similar data sets in the system. The inconsistencies detected or removed may have been originally caused by different data dictionary definitions of similar entities in different stores, may have been caused by user entry errors, or may have been corrupted in transmission or storage.





Preprocessing the data will also guarantee that it is unambiguous, correct, and complete.





The actual process of data cleansing may involve removing typos or validating and correcting values against a known list of entities. The validation may be strict (such as rejecting any address that does not have a valid ZIP code) or fuzzy (such as correcting records that partially match existing, known records).





Data cleansing is synonymous with the less frequently-used term data scrubbing. Data cleansing differs from data validation in that validation almost invariably means data is rejected from the system at entry and is performed at entry time, rather than on batches of data.
Sorry but I don't know, but it's sound good, isn't it?

No comments:

Post a Comment