Profil professionnel
Vue d'ensemble
Expérience
Formation
Compétences
Informations complémentaires
Langues
Personnalisé
Certificats
Chronologie
Generic
Yassine Jouini

Yassine Jouini

Senior Big Data Engineer
Noisy-le-Grand

Profil professionnel

I am a Senior Data Engineer and Big Data Architect with nearly 10 years of experience, specializing in the design and industrialization of complex data solutions. My core achievement is leading the migration of major Cloudera-based platforms to modern Cloud architectures (AWS/GCP), ensuring scalability and performance while acting as a Tech Lead to uplift team capabilities. I bring a unique blend of deep expertise in Spark, Kafka, Nifi, and DevOps practices, making me the ideal candidate to drive your modernization projects from architecture to reliable production operation (Build & Run).

Vue d'ensemble

11
11
years of professional experience
5
5
years of post-secondary education
1
1
Certification

Expérience

Senior Big Data

SFR
Paris
12.2017 - Current

As Architect:

  • Defined integration and data processing patterns in batch and streaming modes.
  • Conducted a feasibility study on switching to a dataflow solution (Nifi) for existing processing.
  • Implemented Spark best practices.
  • Conducted impact studies for the deployment of new solutions.
  • Served as the primary point of contact for Cloudera support.
  • Analyzed the application and system impact of migrating from Cloudera 5 to CDP 7.
  • Set up a Pre-Prod cluster from scratch.
  • Studied the migration of Spark Streaming jobs to Nifi or Flink.
  • Conducted a study on migrating projects from Cloudera to GCP.

As Data Tech Lead:

  • Upskilled two data engineers on Big Data technologies.
  • Optimized existing codebases.
  • Performed code audits.

As Data Engineer:

  • Migrated a Cloudera-based project (HDFS ingestion and Impala webservice) to an AWS Cloud architecture (Ingestion to S3, processing with Glue and Lambda, and a webservice using Athena).
  • Developed a webservice using Spring Boot.
  • Developed Spark batch jobs orchestrated by Oozie.
  • Developed Spark streaming jobs.
  • Orchestrated jobs using Oozie.
  • Designed a dynamic dashboard.
  • Migrated a project using Flume to a Nifi solution (developed a specific processor in Nifi).

DevOps and Administration:

  • Provided administration and support for CDH5 and CDP clusters.
  • Automated installations via Ansible.
  • Migrated the cluster from CDH 5.12 to CDH 5.16, and subsequently to CDP 7.1.7.
  • Technical Environment: AWS (S3, Athena, Lambda), Spring, Scala, Cloudera 5.12/5.16/7.1.7, Ansible, Python, Linux, Kerberos, HDFS, Hive, Spark, Oozie, Kafka, Tableau, Zeppelin.

Consultant Big Data

AXA
Suresnes
12.2016 - 11.2017

As Architect:

  • Defined new architecture for the AXA Germany entity; designed a specific Kafka Confluent solution tailored to their needs.
  • Managed security aspects (Kerberos and LDAP with specific Cloudera development).
  • Implemented Spark and Impala best practices.
  • Conducted architecture studies for new applications.
  • Acted as the point of contact for Cloudera support.

DevOps and Administration:

  • Administered and supported Cloudera and Elasticsearch clusters.
  • Aligned different solutions across various environments.
  • Automated R package deployment and administration tasks using Ansible and Python.
  • Provided support for Hadoop, HDFS, Hive, Spark, Oozie, Flume.
  • Security: Provided support on LDAP and Kerberos.
  • Performed code audits.
  • Technical Environment: Elasticsearch, Kafka, Cloudera CDH 5.8, Ansible, Python, Cloudera Manager, Linux, Kerberos, HDFS, Hive, Spark, Oozie.

Consultant Big Data

Orange Afrique
Tunis
11.2015 - 11.2016

POC Phase:

  • Installed Cloudera and Elasticsearch clusters.
  • Developed a Beats/Logstash job to ingest data from various sources into Kafka.
  • Developed Spark Streaming jobs (Java, Python) to import data from Kafka to Elasticsearch.
  • Implemented a consolidation job (OrientDB).
  • Implemented the Search page and 360 View (ReactJS and NodeJS).

Project Phase:

  • Implemented the project using the Exalead solution (selected for data processing).
  • Visualized data using Exalead.

Cross-Channel POC (Tunisia Entity):

  • Imported customer actions via Google Tag Manager from digital channels (web portals, e-shops) and social networks (Twitter, Facebook).
  • Integrated data into Kafka.
  • Processed data from Kafka using Spark Streaming for customer reconciliation into Hive.
  • Reporting using Zeppelin.
  • Technical Environment: Elasticsearch, Kafka, CDH 5.7.0, Hive, HDFS, Spark, ReactJS, Zeppelin, Scala, Python, Java, Javascript, MongoDB, NodeJS, ELK Suite, OrientDB.

Consultant Big Data

Nouvel Air
Tunis
11.2015 - 04.2016
  • Analyzed server logs from websites to detect customer behaviors and adapt commercial offers (POC context).
  • Imported and parsed server data via Logstash to Kafka.
  • Developed a Spark Streaming job for data cleaning, followed by integration into Elasticsearch for analysis and Hadoop for storage.
  • Created KPI presentations and reporting using Zeppelin via Hive.
  • Technical Environment: Spark, Hive, Zeppelin, Logstash, Kafka, Java.

Big Data Engineer

TCB consulting
Tunis
11.2014 - 10.2015
  • Imported social media data (Twitter, Facebook, YouTube) into Kafka.
  • Developed using Spark Java (Batch).
  • Used Spark Streaming for real-time data extraction and enrichment from Kafka to Elasticsearch.
  • utilized Machine Learning with Spark MLlib (Scala) for real-time text sentiment analysis.
  • Used Elasticsearch and Kibana for visualization.
  • Coordinated batch and streaming workflows with Oozie.
  • Technical Environment: Elasticsearch, Spark, Hive, Storm, Oozie, Logstash, Kafka, Scala, Python, Java.

Formation

Engineering Degree - Statistics and Information Analysis

Engineer's Degree in Statistics and Information Analysis
Tunis
01.2009 - 01.2014

Compétences

  • Python

  • Java

  • Scala

  • Cloudera 71X/5X

  • Impala

  • Spark

  • Hive

  • Oozie

  • Base NoSql: Redis,Hbase,MongoDB,ElasticSearch

  • Zeppelin

  • Tableau/QlikView

  • Ansible

Informations complémentaires

  • Passion for cinema
  • Music
  • Community activities

Langues

English
Fluent
French
Bilingual
Arabic
Mother Tongue

Personnalisé

  • Âge: 34
  • Genre: Masculin

Certificats

  • Cloud Digital Leader (GCP)
  • Academy Accreditation - Databricks Fundamentals
  • AWS Certified Cloud Practitioner
  • Talend BigData for Developers 6.0
  • Oracle Certified Professional, Java SE 6 Programmer

Chronologie

Senior Big Data

SFR
12.2017 - Current

Consultant Big Data

AXA
12.2016 - 11.2017

Consultant Big Data

Orange Afrique
11.2015 - 11.2016

Consultant Big Data

Nouvel Air
11.2015 - 04.2016

Big Data Engineer

TCB consulting
11.2014 - 10.2015

Engineering Degree - Statistics and Information Analysis

Engineer's Degree in Statistics and Information Analysis
01.2009 - 01.2014
Yassine JouiniSenior Big Data Engineer