Archiv der Kategorie: Data Vault

Data Vault: Possibilities to add a new descriptive Attribute

Problem: The Source-System added a new descriptive attribute to a source-table.
We need to include the new data in our Raw-Data-Vault(-Satellite)-World.

We have two possibilities:

  1. Add a new Hub-Satellite which contains besides the metadata only the new attribute from the time on the attribute appeared.
  2. Stop filling the currrent Hub-Satellite and fill a new Satellite which is a structural copy from the old Satellite plus the new attribute from the time on the attribute appeared

Case 1.: I think this is the normal case. The existing dataflow based on the data without the new Attribute is untouched and makes still sense for the business so there is no need to change or anounce anything concerning the dataflow or any report based on the information without the new Attribute.
We can develope an extended dataflow containing the new Attribute and a new report which is very similar to the ‚old‘ report without the new Attribute.
After successfully completion we can set the old report to ‚deprecated‘, if there is no need to use the old report because the new report, containing the new Attribute, covers the old situation, too. After a given period of time the business should use only the new report and the old one can be removed.

Case 2.: These situation is a complete different one: The old satellite will be not filled anymore after appearance of the new attribute. So the old dataflow and reports based on the old satellite shows data only till the arrival of the new attribute-data.
This could be necessary if the new attriute is a game-changer concerning the old characteristic of the data so that the old report does not make sense anymore by knowing the new attribute.
So the old report shows only the data before and a an new report shows only the data from the date on the new attribute-values are available.

It is of course possible to connect the old an the new satellite by a view for example, but here we have the problem of the values for the new attribute in the history…

Any thought on this topic is appreciated!

Data Vault Modeling

Datamodelling the core-dwh

Bill Inmon: Corporate Information Factory,
Ralph Kimball: Dimensional Modeling
Dan Linstedt: Data Vault

Data Vault Modeling:
New Modeling Method for the Core Enterprise Data Warehouse including the complete history.
This technique allows to follow a complete agile approach.

Here a few links for more information about Data Vault:

English:
Data Vault Modeling specification (DanLinstedt)
Data Vault Loading Specification v1.2 (DanLinstedt)

Data Vault Modeling guide (Hans Hultgren 2012)

A very good overview and real-life-example from Source to Raw/Information-Mart by Kent Graziano:
Part 1: Agile Modeling: Not an Option Anymore
Part 2: Data Vault 2.0 Modeling Basics
Part 3: The Business Data Vault
Part 4: Building an Information Mart With Your Data Vault

Data Matters (Unseen)

Modeling One Model for persistence and access with Data Vault

Web Archive for MakingDataMeaningful: Data Vault: Hubs, Links, and Satellites With Associated Loading Patterns

Forum:

Forum: dvforums.danlinstedt.com (not active anymore)

German:
Core DWH Modellierung: Vergleich Data Vault und konsolidierte Ankermodellierung (nicht Anchormodelling!)

Anker – Modellierung im Core Data Warehouse

Additional resources:
www.datavaultmodeling.de

danlinstedt.com

Visual Data Vault – DV-extension for Microsoft Visio

www.dwh42.de

roelantvos.com: DWH-Automatisation

kentgraziano.com: The Data Warrior

Scalefree: BIG DATA CONSULTING