Added badges and LICENSE; Upgraded to lintools-datatypes 1.1.1 (#17)

lintool committed Jun 12, 2018
1 parent fbc3a96 commit 30c8dd037a5fd4d62613881eff284744e604af8e
Showing with 15 additions and 2 deletions.
  1. +11 −0 LICENSE
  2. +3 −1 README.md
  3. +1 −1 pom.xml
11 LICENSE
@@ -0,0 +1,11 @@
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
4 README.md
@@ -1,6 +1,8 @@
# Bespin
[![Build Status](https://travis-ci.org/lintool/bespin.svg?branch=master)](https://travis-ci.org/lintool/bespin)
[![Maven Central](https://maven-badges.herokuapp.com/maven-central/io.bespin/bespin/badge.svg)](https://maven-badges.herokuapp.com/maven-central/io.bespin/bespin)
[![LICENSE](https://img.shields.io/badge/license-Apache-blue.svg?style=flat-square)](../LICENSE)
Bespin is a library that contains reference implementations of "big data" algorithms in MapReduce and Spark.
@@ -24,7 +26,7 @@ The datasets are stored in the [Bespin data repo](https://github.com/lintool/bes
+ The file `Shakespeare.txt` contains the [The Complete Works of William Shakespeare](http://www.gutenberg.org/ebooks/100) from [Project Gutenberg](http://www.gutenberg.org/).
+ The file `p2p-Gnutella08-adj.txt` contains a [snapshot of the Gnutella peer-to-peer file sharing network from August 2002](http://snap.stanford.edu/data/p2p-Gnutella08.html), where nodes represent hosts in the Gnutella network topology and edges represent connections between the Gnutella hosts. This dataset is available from the [Stanford Network Analysis Project](http://snap.stanford.edu/).
+ The tarball `taxi-data.tar.gz` contains a [one-day slice NY taxi data](http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml), chopped into one file per minute. See analyses in Todd Schneider's blog post [Analyzing 1.1 Billion NYC Taxi and Uber Trips, with a Vengeance](http://toddwschneider.com/posts/analyzing-1-1-billion-nyc-taxi-and-uber-trips-with-a-vengeance/).
## Word Count in MapReduce and Spark
2 pom.xml
@@ -207,7 +207,7 @@
<dependency>
<groupId>tl.lin</groupId>
<artifactId>lintools-datatypes</artifactId>
-<version>1.0.0</version>
+<version>1.1.1</version>
</dependency>
<dependency>
<groupId>net.sf.jung</groupId>
