The first action to modernize an Information System: Eliminate unnecessary pipelines?


A data system could be compared to a vast road network, made up of various paths, each created to meet a specific need at a given time.


As this network expands and ages, some paths (data pipelines) become underutilized or unused (duplicated, obsolete).

The financial and organizational impacts are numerous: 

  • According to NTT, 60% of data in the Cloud is not used (source: IT Social).
  • According to Civo, for almost half of companies with more than 500 employees, the annual cost of the Cloud exceeds a million dollars, with growth rates that are difficult to sustain (source: ChannelNews).

 

The origin of these useless data pipelines, these “ghost paths”?

Over time, Information Systems accumulate pipelines that have become useless:

  • Pipelines created for projects that have since been abandoned.
  • Duplicate pipelines built for lack of coordination between departments; the advent of "data mesh" architectures appears to be a major accelerator of this trend.
  • Obsolete pipelines kept as a precaution ("you never know!") or to cover a perceived risk.


These “ghost paths” consume significant Cloud resources (storage, processing, bandwidth) that could be put to better use!

We have made progress on a software response that makes it possible to remedy this natural drift, a drift as old as physics itself: entropy, i.e. the "degree of disorder reflecting the natural tendency of things to evolve towards a state of chaos".

 

This drift is not inevitable. However, it is a race against time, because systems have such a strong inclination towards entropy that only industrialized mechanisms can cope with it.

 

This response is one of the features of {openAudit}, built on two mechanisms:

 

Technically and continuously identify pipelines to be decommissioned

It is possible to precisely map these complex tangles and identify unused pipelines. 

This approach requires two coordinated technical actions, both provided by our {openAudit} software:

Analysis of data usage: identifying “informational dead ends”.

 

  • {openAudit} analyzes the main technical stack to identify all the data consumed inside and outside the batch chains.
  • Data consumed by satellite applications (those that are not parsed) is also analyzed, to capture the full set of useful information.
  • This dual analysis can be subtle and is configured to reflect the business target: regulatory information, for example, may be consumed only periodically while still carrying significant added value.

 

Through a "mirror analysis", informational dead ends are identified factually and continuously.
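Conceptually, this mirror analysis amounts to comparing the inventory of datasets produced by the batch chains with the datasets actually observed as consumed, whether by downstream jobs or by satellite applications. The sketch below is a minimal illustration of that idea, assuming the inventory and observed reads have already been extracted (for example from a catalog and from query logs); the dataset names, fields and helper function are hypothetical, not {openAudit}'s internals.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Dataset:
    name: str            # e.g. "sales.daily_agg"
    domain: str          # business line, reused later for prioritization
    regulatory: bool     # regulatory data may be read rarely yet must be kept

def find_dead_ends(produced, reads_from_jobs, reads_from_satellites):
    """Return datasets that are produced but never observed as consumed."""
    consumed = reads_from_jobs | reads_from_satellites
    dead_ends = []
    for ds in produced:
        if ds.name in consumed:
            continue            # still read somewhere: keep it
        if ds.regulatory:
            continue            # business rule: keep regulatory data regardless
        dead_ends.append(ds)
    return dead_ends

# Hypothetical inventory and observed reads.
produced = [
    Dataset("sales.daily_agg", "sales", regulatory=False),
    Dataset("risk.bcbs239_report", "risk", regulatory=True),
    Dataset("legacy.customer_export", "crm", regulatory=False),
]
batch_reads = {"sales.daily_agg"}
satellite_reads = set()

for ds in find_dead_ends(produced, batch_reads, satellite_reads):
    print("candidate dead end:", ds.name)   # -> legacy.customer_export
```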

Data Lineage: trace flows to isolate unnecessary chains

 

  • Data lineage makes it possible to trace a pipeline upstream from unused data to the first table that also feeds information consumed in another branch.
  • From that point, the unnecessary fraction of the chain can be deleted without impact (see the sketch below).
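Here is a minimal sketch of that upstream walk, assuming the lineage has already been extracted as a directed graph of "source feeds target" edges; the tables and graph below are illustrative, not the actual {openAudit} model. Starting from an unused terminal table, the walk climbs upstream and stops at the first node that also feeds a branch still being consumed; everything below that node is a candidate for decommissioning.

```python
from collections import defaultdict

# Hypothetical lineage: each edge means "source feeds target".
edges = [
    ("raw.orders", "staging.orders"),
    ("staging.orders", "mart.orders_live"),     # consumed branch
    ("staging.orders", "mart.orders_legacy"),   # unused branch
    ("mart.orders_legacy", "export.orders_old"),
]
consumed = {"mart.orders_live"}  # tables observed as read by users or apps

parents = defaultdict(set)   # target -> its sources
children = defaultdict(set)  # source -> its targets
for src, dst in edges:
    parents[dst].add(src)
    children[src].add(dst)

def feeds_consumption(node, seen=None):
    """True if this node, or anything downstream of it, is still consumed."""
    seen = seen or set()
    if node in consumed:
        return True
    seen.add(node)
    return any(feeds_consumption(c, seen) for c in children[node] - seen)

def removable_chain(unused_leaf):
    """Walk upstream from an unused leaf; collect nodes that are safe to remove."""
    removable, stack = set(), [unused_leaf]
    while stack:
        node = stack.pop()
        if feeds_consumption(node):
            continue  # this node also serves a live branch: stop here
        removable.add(node)
        stack.extend(parents[node] - removable)
    return removable

print(removable_chain("export.orders_old"))
# -> {'export.orders_old', 'mart.orders_legacy'}; staging.orders is kept
```

In a real system the same walk would of course run over the full, automatically extracted lineage rather than a hand-written edge list.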

Clean the Information System

 

{openAudit} runs continuously, which makes it possible for internal teams to organize the decommissioning of all unnecessary flows over an extended period.

Candidate pipelines can also be classified by business line, tool, or other criteria, to prioritize the process.
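As a simple illustration of such a classification, the candidates identified earlier can be grouped by business line (or by tool) and ranked, for example by estimated monthly cost, so that internal teams tackle the most expensive "ghost paths" first; the fields and figures below are hypothetical.

```python
from collections import defaultdict

# Hypothetical decommissioning candidates produced by the previous steps.
candidates = [
    {"pipeline": "legacy.customer_export", "domain": "crm",   "tool": "ETL-A", "monthly_cost": 1200},
    {"pipeline": "mart.orders_legacy",     "domain": "sales", "tool": "ETL-A", "monthly_cost": 300},
    {"pipeline": "export.orders_old",      "domain": "sales", "tool": "ETL-B", "monthly_cost": 150},
]

# Group by business line, then rank groups by total estimated cost.
by_domain = defaultdict(list)
for c in candidates:
    by_domain[c["domain"]].append(c)

ranking = sorted(by_domain.items(),
                 key=lambda kv: sum(c["monthly_cost"] for c in kv[1]),
                 reverse=True)

for domain, items in ranking:
    total = sum(c["monthly_cost"] for c in items)
    print(f"{domain}: {total} $/month across {len(items)} pipeline(s)")
```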

 

 

Modeling a harmonious system

We are currently developing an algorithm, which we have named "Harmony", that will automatically model a system so that it is as rational and efficient as possible, even when many proprietary technologies are at work (ETL, dataviz tools). More news to come!

And if you would like to discuss automated migration topics around these or other themes, we will be happy to do so.

 

 
