Difference between revisions of "Data Lake"

From Clinfowiki
Jump to: navigation, search
Line 1: Line 1:
 
A Data Lake is similar to that of a data warehouse, but it allows for the flow and storage of unstructured data sources in addition to the structured data in an enterprise data warehouse or data mart. The idea of a lake is such that water flows from various paths into the reservoir and then flows out.
 
A Data Lake is similar to that of a data warehouse, but it allows for the flow and storage of unstructured data sources in addition to the structured data in an enterprise data warehouse or data mart. The idea of a lake is such that water flows from various paths into the reservoir and then flows out.
  
==Functions of a Data Lake==
+
=Functions of a Data Lake=
* Data Ingestion
+
==Data Ingestion==
* Data Storage and Retention
+
==Data Storage and Retention==
* Data Processing
+
==Data Processing==
* Data Access
+
==Data Access==
  
  
  
==Difference from Data Warehouse==
+
=Difference from Data Warehouse=
  
  
==Data Swamp==
+
=Data Swamp=
  
 
This is when a data lake can become unruly and become a data swamp.
 
This is when a data lake can become unruly and become a data swamp.
  
  
==References==
+
=References=
  
  
 
Submitted by Tom Nahass
 
Submitted by Tom Nahass
 
[[Category:BMI512-FALL-20]]
 
[[Category:BMI512-FALL-20]]

Revision as of 18:24, 26 October 2020

A Data Lake is similar to that of a data warehouse, but it allows for the flow and storage of unstructured data sources in addition to the structured data in an enterprise data warehouse or data mart. The idea of a lake is such that water flows from various paths into the reservoir and then flows out.

Functions of a Data Lake

Data Ingestion

Data Storage and Retention

Data Processing

Data Access

Difference from Data Warehouse

Data Swamp

This is when a data lake can become unruly and become a data swamp.


References

Submitted by Tom Nahass