what is data extraction

1 year ago 42
Nature

Data extraction is the process of retrieving data from data sources for further processing or storage. This process involves collecting or retrieving disparate types of data from a variety of sources, many of which may be poorly organized or completely unstructured. Data extraction makes it possible to consolidate, process, and refine data so that it can be stored in a centralized location in order to be transformed. The term data extraction is usually applied when experimental data is first imported into a computer from primary sources, like measuring or recording devices. The process of extracting data includes locating and identifying the relevant data, then preparing it for processing or transformation. Data extraction can be used to migrate data from outside sources into a companys own databases. There are various strategies employed to extract data, including structured and unstructured data extraction. Data extraction software enables extracts to be made from many different data source types, both structured and unstructured.