{
  "alias": "video/1109/python-mapreduce-programming-with-pydoop",
  "category": "EuroPython 2011",
  "copyright_text": "Standard YouTube License",
  "description": "Hadoop is the leading open source implementation of MapReduce, Google's\nlarge-scale distributed computing paradigm. Hadoop's native API is in\nJava, and its built-in options for Python programming - Streaming and\nJython - have several drawbacks: the former allows access to only a\nsmall subset of Hadoop's features, while the latter carries with it all\nof the limitations of Jython with respect to CPython.\n\n`Pydoop <http://pydoop.sourceforge.net>`__ is an API for Hadoop that\nmakes most of its features available to Python programmers while\nallowing CPython development. Its core consists of Boost.Python wrappers\nfor Hadoop's C/C++ interface.\n\nThe talk consists of a MapReduce/Hadoop tutorial and a presentation of\nthe Pydoop API, with the main goal of bridging the gap between the\nHadoop and Python communities. A basic knowledge of distributed\nprogramming is helpful but not strictly required.\n",
  "duration": null,
  "id": 1109,
  "language": "eng",
  "quality_notes": "",
  "recorded": "2011-07-13",
  "related_urls": [
    "http://pydoop.sourceforge.net"
  ],
  "slug": "python-mapreduce-programming-with-pydoop",
  "speakers": [
    "Simone Leo"
  ],
  "summary": "[EuroPython 2011] Simone Leo - 24 June 2011 in \"Track Lasagne\"\n",
  "tags": [
    "api",
    "cpython",
    "distributed",
    "hadoop",
    "jython",
    "mapreduce",
    "tutorial"
  ],
  "thumbnail_url": "https://i.ytimg.com/vi/IyXOP7SJqKQ/hqdefault.jpg",
  "title": "Python MapReduce Programming with Pydoop",
  "videos": [
    {
      "length": 0,
      "type": "youtube",
      "url": "https://www.youtube.com/watch?v=IyXOP7SJqKQ"
    }
  ]
}