SemanticBits: Senior Data Engineer #51

Closed
remote-job-board opened this issue Jul 4, 2020 · 0 comments
Labels
level-senior remoteok.io Imported from RemoteOK

remote-job-board commented Jul 4, 2020

Tags: #senior #engineer

Published on: July 02, 2020

Original Job Post: https://remoteok.io/remote-jobs/96779

SemanticBits is looking for a talented Senior Data Engineer who is eager to apply computer science, software engineering, databases, and distributed/parallel processing frameworks to prepare big data for use by data analysts and data scientists. You will mentor junior engineers and deliver data acquisition, transformation, cleansing, conversion, compression, and loading into data and analytics models. You will work in partnership with data scientists and analysts to understand use cases, data needs, and outcome objectives. You are a practitioner of advanced data modeling and of optimizing data and analytics solutions at scale. You are an expert in data management, data access (big data, data marts, etc.), programming, and data modeling, and you are familiar with analytic algorithms and applications (such as machine learning).

Requirements

  • Bachelor’s degree in computer science (or related) and eight years of professional experience

  • Strong knowledge of computer science fundamentals: object-oriented design and programming, data structures, algorithms, databases (SQL and relational design), networking

  • Demonstrable experience engineering scalable data processing pipelines.

  • Demonstrable expertise with Python, Spark, and wrangling of various data formats - Parquet, CSV, XML, JSON.

  • Experience with the following technologies is highly desirable: Redshift (w/Spectrum), Hadoop, Apache NiFi, Airflow, Apache Kafka, Apache Superset, Flask, Node.js, Express, AWS EMR, Scala, Tableau, Looker, Dremio

  • Experience with Agile methodology, using test-driven development.

  • Excellent command of written and spoken English

  • Self-driven problem solver
