Data Engineer

Description

The aim is to develop an enterprise-grade fraud detection product in the Hadoop ecosystem (Cloudera Distribution) from an existing ML-based prototype. All data sources are SQL databases, and the algorithm is also implemented in SQL.

Tasks

  • Participate in the development and deployment of ETL processes and SQL / NoSQL repository systems (data warehouses, data lakes, functional data marts, data cubes, analytical data sets) in both traditional BI and big data systems (Hadoop ecosystem), in order to build central storage for complex structured and unstructured data

  • Participate in the professional execution of solution design, implementation, and unit testing activities; ensure support of key users and handover of the solution

Skills

  • 3+ years' knowledge of Hadoop concepts; experience working in a Hadoop environment and/or with the Cloudera Distribution (Oozie, Sqoop, Spark, Hive, Impala) is an advantage

  • Experience with Scala and/or Spark would be appreciated

  • Strong database/SQL skills and experience in writing ETL processes

  • Experience in running applications that handle large volumes of data in a production environment (scheduling, monitoring, maintenance, troubleshooting, debugging, etc.)

Send your CV to us!
