The National Data Lake of NDB is a national-scale central storage repository that holds vast amounts of raw and standardized data collected from various government agencies. The Data Lake servers as the backbone for the other national-level services and platforms to ensure that the consistent and trusted data is cascaded to the end users in the form of a single source of truth.
The Data Lake is equipped with flexibility of acquiring multi-speed and structure data through in-house developed integration suite that automatically and reliably ingests, curates and provisions data in an industrialized manner. It is also supported with a group of ancillary services to ensure that it is properly managed, governed, and secured, giving full trust and confidence to the consumers.
60+
Entities have been onboarded as data providers
300+
Systems have been integrated to provide a single source of truth
85+ TB
Total volume of data stored in the lake
80,000+
Attributes are managed inside the lake in tables