Instantly identify the impacts of a regression

January 12, 2023

Eliminate regressions between IT environments

Insight #3:
Instantly identify the impacts of a regression!

Multiple environments, the norm

To secure development work and, by extension, their information systems, companies replicate those systems across several environments:

dev, test, UAT, QA, Int, pre-prod, prod, and others!

Time, a regression accelerator

The differences between environments are generally minimal at first.

But each piece of information follows a different life cycle in each environment, which steadily widens the disparities.

The impacts of such regressions

Growing differences end up causing damaging regressions. Data manipulation tools propagate these differences along entire processing chains, in both the feed layers and the dataviz layers.

A dashboard can have some of its answers missing or wrong if a field is missing from a table.
The results of aggregation functions (and therefore of certain analyses) may be invalid if there is an additional field.
If a field does not have the same type (String, Integer, Date, etc.), a database function will not return the same result, or may not work at all.
...

Structural modifications are not neutral: they can have significant and sometimes unexpected impacts on the rest of the system.
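
The type mismatch case can be illustrated with a minimal Python sketch. The field values and variable names below are hypothetical, and Python's built-in functions stand in for database functions: the point is only that the same data gives a different maximum when stored as String rather than Integer, and that a sum fails outright.

```python
# Hypothetical values for the same "amount" field in two environments.
prod_amounts = [9, 10, 100]           # stored as Integer in prod
preprod_amounts = ["9", "10", "100"]  # stored as String in pre-prod

# Same function, different result: numeric vs. lexicographic comparison.
max_prod = max(prod_amounts)
max_preprod = max(preprod_amounts)

# Same function, no result at all: summing strings raises a TypeError.
sum_prod = sum(prod_amounts)
try:
    sum_preprod = sum(preprod_amounts)
except TypeError:
    sum_preprod = None

print(max_prod, max_preprod, sum_prod, sum_preprod)
```

An aggregate that feeds a dashboard would therefore show a different figure in each environment, without any error being raised in the first case.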

Data lineage to identify regression impacts

1- Identify regressions: compare hashes

{openAudit} scans the database structures in the different environments daily, i.e. schemas, tables, fields, and views.

A "hash" summarizes the fields, their types, their order, and the standardized script (in the case of views).

The same table must point to the same "hash" in two distinct environments. Thus, even extremely limited structural changes are immediately identified.
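
The hash comparison described above can be sketched as follows. This is not {openAudit}'s actual implementation: the function names, the SHA-256 choice, the whitespace and case normalization of view scripts, and the sample schemas are all assumptions made for illustration.

```python
import hashlib
import re


def normalize_script(script):
    """Collapse whitespace and lowercase a view script (a simplistic, assumed normalization)."""
    return re.sub(r"\s+", " ", script).strip().lower()


def structure_hash(columns, view_script=None):
    """Hash the ordered (name, type) pairs, plus the normalized view script if given."""
    parts = [f"{name}:{dtype}" for name, dtype in columns]
    if view_script is not None:
        parts.append(normalize_script(view_script))
    return hashlib.sha256("|".join(parts).encode("utf-8")).hexdigest()


def diff_environments(env_a, env_b):
    """Return sorted names of tables that are missing on one side or whose hashes differ."""
    names = set(env_a) | set(env_b)
    return sorted(
        name for name in names
        if name not in env_a
        or name not in env_b
        or structure_hash(env_a[name]) != structure_hash(env_b[name])
    )


# Hypothetical structures: "amount" has a different type in pre-prod.
prod = {
    "sales": [("id", "INTEGER"), ("amount", "DECIMAL"), ("sold_on", "DATE")],
    "customers": [("id", "INTEGER"), ("name", "STRING")],
}
preprod = {
    "sales": [("id", "INTEGER"), ("amount", "STRING"), ("sold_on", "DATE")],
    "customers": [("id", "INTEGER"), ("name", "STRING")],
}

changed = diff_environments(prod, preprod)
print(changed)
```

Because the column list is hashed in order, swapping two fields changes the hash just as a type change does, which matches the criteria listed above.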

2- Identify the impacts of regression through technical data lineage

Once you know that some fields differ (between prod and pre-prod, for example), {openAudit} lets you determine the impacts:

If {openAudit} detects that the field does not produce data used in a dashboard or a query, the regression is minor. This is an opportunity to decommission what is unused and lighten the system.

If, on the other hand, {openAudit} detects usage in a dashboard or a query at the end of a flow, then the regression has a real impact, and the necessary corrective measures must be taken.
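
The two cases above amount to a reachability question on the lineage graph. The sketch below assumes a hypothetical lineage stored as a dictionary of edges, with consumers named using invented "dashboard:" and "query:" prefixes; it does not reflect how {openAudit} stores or queries lineage.

```python
from collections import deque

# Hypothetical lineage edges: each node maps to the nodes that read from it.
lineage = {
    "sales.amount": ["stg_sales.amount"],
    "stg_sales.amount": ["agg_sales.total"],
    "agg_sales.total": ["dashboard:Revenue"],
    "sales.legacy_code": ["stg_sales.legacy_code"],
    "stg_sales.legacy_code": [],
}


def downstream_usages(field, edges):
    """Breadth-first walk of the lineage graph; return reachable dashboards and queries."""
    seen = {field}
    queue = deque([field])
    usages = []
    while queue:
        node = queue.popleft()
        for child in edges.get(node, []):
            if child in seen:
                continue
            seen.add(child)
            if child.startswith(("dashboard:", "query:")):
                usages.append(child)
            queue.append(child)
    return sorted(usages)


def classify(field, edges):
    """Label a differing field 'impacting' if something consumes it, else 'minor'."""
    return "impacting" if downstream_usages(field, edges) else "minor"


print(classify("sales.amount", lineage))
print(classify("sales.legacy_code", lineage))
```

A field classified as "minor" here is one whose flow ends without any consumer, which is the decommissioning candidate mentioned above.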

Conclusion:

Regressions between environments are numerous, far too numerous.

The idea is therefore to continuously check whether these regressions are really harmful.

To find out, technical data lineage is essential, provided it is exhaustive and dynamic, i.e. an exact reflection of the information system.
