Study on the Extensible Data Cleaning Software Platform
-
Graphical Abstract
-
Abstract
An extensible data cleaning software platform is proposed, which has open rules library and algorithms library. Rules library is used to store rules and algorithms library is used to store algorithms. Algorithms library has many algorithms and can be extended. Through defining rules in rules library and choosing proper cleaning algorithms from algorithms library, the software platform can be used to various data sources, which makes it universal and adaptive. The synthetic result is improved through data cleaning with many algorithms. Finally, the effect and feasibility of this extensible data cleaning software platform is proved through an example.
-
-