The Web Analyst (or Digital Analyst) analyses the objectives set and the results obtained on the web or through applications. He/she generally works on websites, mobile applications, social networks, or other online platforms, either at an agency or in-house at the advertiser.

He/she must understand the global…

Customer Relationship Management (CRM) is a process in which a business or other organization administers its interactions with customers, typically using data analysis to study large amounts of information.

The role of Analytical CRM systems is to analyse customer data collected through multiple sources and present it so that business…

Data mining is a process of extracting and discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems.

Data mining is the analysis step of the Knowledge Discovery in Databases process or KDD.

Why mine data?

Computerization and automated data gathering…

This tutorial describes a way to automate cloud infrastructure using Cloud Composer. The example shows how to schedule automated backups of Compute Engine virtual machine (VM) instances.

Cloud Composer is a fully managed workflow orchestration service on Google Cloud. Cloud Composer allows you to create workflows using a Python…

Kubernetes is an open source tool for orchestrating containers. It packages isolated microservices into loosely-coupled containers that can be deployed and scaled anywhere. While traditional, monolithic architectures can be difficult to adapt, containers make it possible for applications to become more scalable, portable, and resilient (i.e. cloud native).

Google created…

You’ve heard of Docker, but you don’t really know what it’s for? Do you want to set up an efficient and scalable deployment of your application on any server? Or do you want to prepare easy-to-deploy development environments using containers?

Are you ready to take on the performance, lightness, and…

Hadoop is a framework for distributed processing of large datasets across clusters of commodity computers.

Core Hadoop:

  • HDFS: reliable shared storage
  • MapReduce: distributed computation
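
To make the MapReduce idea concrete, here is a minimal single-process sketch in plain Python. It does not use Hadoop or its API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are illustrative assumptions, and a real cluster would run the map and reduce steps in parallel on different machines.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine the values collected for one key."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on a list of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
```

Calling `word_count` on a list of text lines returns a dictionary that maps each word to its number of occurrences. In Hadoop, the grouping step is handled by the framework between the map and reduce phases.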

Distributions that include Hadoop:

Cloudera, Hortonworks, MapR

HDFS (Hadoop Distributed File System):

  • NameNodes: metadata management and bandwidth optimization by determining…

What is e-reputation?

E-reputation is simply the reputation of a company, a brand, an individual or a product on the Internet. In other words, e-reputation is the image of an entity on the Internet. This image can be positive or negative.

74% of consumers check on Google before buying a…

Serigne DIAW

Data Engineer / Data Architect / Data Scientist
