diff --git a/talent/job-descriptions/offers/data-processing.md b/talent/job-descriptions/offers/data-processing.md
new file mode 100644
index 00000000..994b0ac0
--- /dev/null
+++ b/talent/job-descriptions/offers/data-processing.md
@@ -0,0 +1,30 @@
+Software Engineer in the Data Processing team
+------------------------------------------
+
+### [Location & Seniority](../location-seniority-section.md)
+
+**Compensation:** 49-53,000 Euros + 0.095% [stock options](https://github.com/src-d/guide/blob/master/talent/esop.md) for senior engineers
+
+### [Company](../company-section.md)
+
+### [Engineering at source{d}](../engineering-section.md)
+
+### Role
+
+The Data Processing team is developing source{d}'s high-level code analysis applications for running scalable processing pipelines for source code analysis. Written mostly in Go, with some Scala components, it aims to be robust, friendly, flexible and capable of running on large-scale distributed clusters over petabytes of data.
+
+The team projects are:
+
+* [go-mysql-server](https://github.com/src-d/go-mysql-server/): a SQL engine with a MySQL interface, written in Go.
+* [gitbase](https://github.com/src-d/gitbase): a SQL interface to Git repositories, written in Go.
+* [gitbase-spark-connector](https://github.com/src-d/gitbase-spark-connector) gitbase connector for Apache Spark, written in Scala.
+
+You will be expected to have strong backend coding skills in at least two languages, and very good algorithmic problem-solving skills. Skill at programming in Scala and Go, and knowledge about Apache Spark, are not required but will be highly appreciated. For Go in particular we strongly believe it can be learned by any skilled developer, and we care much more about our team's mindset and prior experience than specific language skills.
+
+Good knowledge of distributed systems and parallel processing is important. Previous experience with SQL engines or similar systems is a plus.
+
+### [Culture](../culture-section.md)
+
+### [Perks](../perks-section.md)
+
+### [Other](../other-section.md)
\ No newline at end of file
diff --git a/talent/job-descriptions/offers/data-retrieval.md b/talent/job-descriptions/offers/data-retrieval.md
index 2b9364c2..7cbedce7 100644
--- a/talent/job-descriptions/offers/data-retrieval.md
+++ b/talent/job-descriptions/offers/data-retrieval.md
@@ -11,13 +11,13 @@ Software Engineer in the Data Retrieval team
 
 ### Role
 
-The Data Retrieval team is developing source{d}'s high-level code analysis applications for running scalable data retrieval pipelines that process and manipulate any number of code repositories for source code analysis. Written mostly in Go, it aims to be robust, friendly, flexible and capable of running on large-scale distributed clusters over petabytes of data.
+The Data Retrieval team is developing source{d}'s data retrieval pipelines that synchronize and provide access to any number of code repositories. Written mostly in Go, it aims to be robust, friendly, flexible and capable of running on large-scale distributed clusters over petabytes of data.
 
-- We at source{d} seek to be at the heart of any project related to source code. Thus, this core tool will be used both in-house for building source{d}'s unique global scale open dataset of +60M code repositories for cutting-edge Machine Learning research, as well as used externally by empowering a wide community of developers, researchers and companies worldwide when doing vanguard research or building the next generation of developer tools and experiences.
+We at source{d} seek to be at the heart of any project related to source code. Thus, this core tool is used both in-house for building source{d}'s unique global scale open dataset of +60M code repositories for cutting-edge Machine Learning research, as well as used by our customers to ingest git repositories to their source code processing clusters.
 
-- Good knowledge of distributed computing and parallel processing is important.
+Good knowledge of Linux, storage, networking and distributed computing is important.
 
-- You will be expected to have strong backend coding skills in at least two languages and very good algorithmic skills. Scala coding skills and knowledge about Apache Spark aren't required but will be highly appreciated, on the other hand Go is not a strict requirement; we strongly believe that it can be learned easily by any skilled developer and care a lot more about our team's mindset and prior experience than any specific skills.
+You will be expected to have strong backend coding skills in at least two languages, and very good algorithmic problem-solving skills. Skill at programming in Go is not required but will be highly appreciated. We strongly believe it can be learned by any skilled developer, and we care much more about our team's mindset and prior experience than specific language skills.
 
 ### [Culture](../culture-section.md)