Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
30 changes: 30 additions & 0 deletions talent/job-descriptions/offers/data-processing.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
Software Engineer in the Data Processing team
------------------------------------------

### [Location & Seniority](../location-seniority-section.md)

**Compensation:** 49-53,000 Euros + 0.095% [stock options](https://github.com/src-d/guide/blob/master/talent/esop.md) for senior engineers

### [Company](../company-section.md)

### [Engineering at source{d}](../engineering-section.md)

### Role

The Data Processing team is developing source{d}'s high-level code analysis applications for running scalable processing pipelines for source code analysis. Written mostly in Go, with some Scala components, it aims to be robust, friendly, flexible and capable of running on large-scale distributed clusters over petabytes of data.

The team projects are:

* [go-mysql-server](https://github.com/src-d/go-mysql-server/): a SQL engine with a MySQL interface, written in Go.
* [gitbase](https://github.com/src-d/gitbase): a SQL interface to Git repositories, written in Go.
* [gitbase-spark-connector](https://github.com/src-d/gitbase-spark-connector) gitbase connector for Apache Spark, written in Scala.

You will be expected to have strong backend coding skills in at least two languages, and very good algorithmic problem-solving skills. Skill at programming in Scala and Go, and knowledge about Apache Spark, are not required but will be highly appreciated. For Go in particular we strongly believe it can be learned by any skilled developer, and we care much more about our team's mindset and prior experience than specific language skills.

Good knowledge of distributed systems and parallel processing is important. Previous experience with SQL engines or similar systems is a plus.

### [Culture](../culture-section.md)

### [Perks](../perks-section.md)

### [Other](../other-section.md)
8 changes: 4 additions & 4 deletions talent/job-descriptions/offers/data-retrieval.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,13 +11,13 @@ Software Engineer in the Data Retrieval team

### Role

The Data Retrieval team is developing source{d}'s high-level code analysis applications for running scalable data retrieval pipelines that process and manipulate any number of code repositories for source code analysis. Written mostly in Go, it aims to be robust, friendly, flexible and capable of running on large-scale distributed clusters over petabytes of data.
The Data Retrieval team is developing source{d}'s data retrieval pipelines that synchronize and provide access to any number of code repositories. Written mostly in Go, it aims to be robust, friendly, flexible and capable of running on large-scale distributed clusters over petabytes of data.

- We at source{d} seek to be at the heart of any project related to source code. Thus, this core tool will be used both in-house for building source{d}'s unique global scale open dataset of +60M code repositories for cutting-edge Machine Learning research, as well as used externally by empowering a wide community of developers, researchers and companies worldwide when doing vanguard research or building the next generation of developer tools and experiences.
We at source{d} seek to be at the heart of any project related to source code. Thus, this core tool is used both in-house for building source{d}'s unique global scale open dataset of +60M code repositories for cutting-edge Machine Learning research, as well as used by our customers to ingest git repositories to their source code processing clusters.

- Good knowledge of distributed computing and parallel processing is important.
Good knowledge of Linux, storage, networking and distributed computing is important.

- You will be expected to have strong backend coding skills in at least two languages and very good algorithmic skills. Scala coding skills and knowledge about Apache Spark aren't required but will be highly appreciated, on the other hand Go is not a strict requirement; we strongly believe that it can be learned easily by any skilled developer and care a lot more about our team's mindset and prior experience than any specific skills.
You will be expected to have strong backend coding skills in at least two languages, and very good algorithmic problem-solving skills. Skill at programming in Go is not required but will be highly appreciated. We strongly believe it can be learned by any skilled developer, and we care much more about our team's mindset and prior experience than specific language skills.

### [Culture](../culture-section.md)

Expand Down