We develop Big Data, AI, Enterprise-, Web- and Mobile applications
The team: professional workshop-atmosphere, highly-qualified colleagues
We offer the stability of a multinational company and the flexibility of a start-up
Not just unique but a wide range of projects, as we work for the american and european market
Unique projects, that you won’t be bored of
Why do our colleagues love to work here?
Flexible work schedule, home office, remote possibilities, no dress code
Regular meetups, workshops and conferences, further training opportunities, continuous internal knowledge sharing, corporate library and account for online courses
Medical insurance, corporate telephone fleet even for your family members, Discount bank account package, +1 day off for your birthday
Game room in the office, unlimited tea/coffee, fruit days, massage
Monthly table soccer and sandwich parties, teambuildings, fortnightly monster forums: where we discuss our ongoing happenings
The aim is to develop an enterprise-grade fraud detection product in Hadoop ecosystem (Cloudera Distribution) out of an existing, ML-based prototype. All the Data sources are SQL DB, the algorithm is programmed booth in SQL.
Participate in development and deployment of ETL processes and SQL / No SQL repository systems such as data warehouses, data lakes, functional data marts, data cubes, analytical data sets in both traditional BI and Big data systems (Hadoop ecosystem) in order to build a central storage for complex structured and unstructured data
Participate in professional execution of solution design, implementation, and unit testing activities, ensure support of key users and handover of solution
3+ years knowledge of hadoop concepts, experience in working with Hadoop environment and/or Cloudera Distribution (oozie, sqoop, spark, hive, impala) is an advantage
Experience with Scala and/or Spark would be appreciated
Strong Database/SQL skills and experience in writing ETL processes
Experience in running applications handling large volumes of data in production environment (scheduling, monitoring, maintenance, troubleshooting, debugging, etc.).