caiomsouza/microsoft-cosmos-db-playground

Microsoft Cosmos DB Playground

Microsoft Labs

https://cosmosdb.github.io/labs/
https://github.com/CosmosDB/labs
https://www.linkedin.com/in/aliuy/

Cosmos DB - Query Playground
https://www.documentdb.com/sql/demo

Azure Cosmos DB Python examples
https://docs.microsoft.com/en-gb/azure/cosmos-db/sql-api-python-samples
https://github.com/Azure/azure-documentdb-python
https://github.com/CosmosDB

Price
A Request Unit (RU) abstracts the % Memory, % CPU, and % IOPS an operation consumes. RU is a rate-based currency.
1 RU = 1 read of a 1 KB record
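
As a rough illustration of the "1 RU = 1 read of a 1 KB record" rule, the sketch below estimates the throughput (RU/s) to provision for a read-only workload of 1 KB records. The function name and the `ru_per_read` parameter are hypothetical, and the model deliberately ignores writes, queries, and larger items, whose RU charges differ.

```python
import math


def required_ru_per_second(reads_per_second: float, ru_per_read: float = 1.0) -> int:
    """Estimate RU/s for a workload made only of 1 KB reads.

    Assumption: each read costs `ru_per_read` RUs (1 RU per 1 KB read,
    as stated in the notes above). Real charges depend on the operation.
    """
    if reads_per_second < 0 or ru_per_read < 0:
        raise ValueError("inputs must be non-negative")
    # Round up, because throughput is provisioned in whole request units.
    return math.ceil(reads_per_second * ru_per_read)


if __name__ == "__main__":
    # Hypothetical workload: 500 reads of 1 KB records per second.
    print(required_ru_per_second(500))
```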

Pricing Example

  • Storage cost
  • Throughput

Big = $100,000 / month
Medium = $1,000 to $3,000 / month

SQL is the native API (Microsoft).

Types:

  • Table API = Key-value
  • Cassandra = Column-family
  • MongoDB = Document
  • Gremlin = Graph

Cosmos DB Customers

  • Domino's Pizza = Globally Distributed Apps - Single online website to order pizza for all restaurants

  • IoT (Internet of Things) - Telemetry & Sensor Data = Toyota, Honeywell, LG CNS, Johnson Controls
      ○ Business Needs:
          § High scalability to ingest a large number of events coming from many devices
          § Low-latency queries and change feeds for responding quickly to anomalies
          § Schema-agnostic storage and automatic indexing to support dynamic data coming from many different generations of devices
          § High availability across multiple data centers
      ○ Real-time Recommendations
          § Use Azure Databricks + Cosmos DB + Azure Container Services (Recommendations API)
              □ Online Recommendations Service
              □ Order Transactions
      ○ Customer 360 - Operational Analytics = Real Madrid
          § Azure Databricks on top of Cosmos DB
              □ Scale-out Computation = Azure Databricks
              □ Scale-out Database = Cosmos DB
          § Spark Connector using SQL API, Apache Spark on Databricks

Walmart Cosmos DB - Use Case
From startups to big-business: Using functional programming techniques to transform line of
https://www.youtube.com/watch?v=dSCzCaiWgLM

Where to Store Your Data in Azure? Understand Azure Data Storage Options
https://stackify.com/store-data-azure-understand-azure-data-storage-options/

Partition - Consistent hashing

https://en.wikipedia.org/wiki/Consistent_hashing

The Simple Magic of Consistent Hashing
https://dzone.com/articles/simple-magic-consistent

Distributed Web Caching System with Consistent Hashing
https://dspace.mit.edu/bitstream/handle/1721.1/80121/43563161-MIT.pdf?sequence=2

Consistent hashing and random trees : algorithms for caching in distributed networks
https://dspace.mit.edu/handle/1721.1/9947
https://www.cs.princeton.edu/courses/archive/fall09/cos518/papers/chash.pdf

David Karger
http://people.csail.mit.edu/karger/
http://people.csail.mit.edu/karger/Papers/thesis.pdf
https://scholar.google.com/citations?user=2vQRGrYAAAAJ

Consistent Hashing in Cassandra
https://blog.imaginea.com/consistent-hashing-in-cassandra/
https://docs.datastax.com/en/cassandra/3.0/cassandra/architecture/archDataDistributeHashing.html
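
To make the idea behind the consistent hashing links above concrete, here is a minimal, generic sketch of a hash ring with virtual nodes. It is not Cosmos DB's or Cassandra's actual partitioning code. The class name `HashRing`, the use of MD5, and the default of 100 virtual nodes are all assumptions chosen for illustration.

```python
import bisect
import hashlib


class HashRing:
    """Toy consistent-hash ring with virtual nodes (illustrative only)."""

    def __init__(self, nodes=(), vnodes=100):
        self._vnodes = vnodes
        self._positions = []  # sorted hash positions on the ring
        self._owner = {}      # hash position -> node name
        for node in nodes:
            self.add_node(node)

    @staticmethod
    def _hash(value: str) -> int:
        # MD5 is used only to spread values evenly, not for security.
        return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)

    def add_node(self, node: str) -> None:
        for i in range(self._vnodes):
            pos = self._hash(f"{node}#{i}")
            if pos in self._owner:  # extremely unlikely collision: skip it
                continue
            self._owner[pos] = node
            bisect.insort(self._positions, pos)

    def remove_node(self, node: str) -> None:
        for i in range(self._vnodes):
            pos = self._hash(f"{node}#{i}")
            if self._owner.get(pos) == node:
                del self._owner[pos]
                self._positions.remove(pos)

    def get_node(self, key: str) -> str:
        if not self._positions:
            raise LookupError("ring is empty")
        # First position clockwise from the key's hash, wrapping around.
        idx = bisect.bisect_right(self._positions, self._hash(key))
        return self._owner[self._positions[idx % len(self._positions)]]


if __name__ == "__main__":
    ring = HashRing(["node-a", "node-b", "node-c"])
    print(ring.get_node("customer-42"))
```

The property worth noticing is that adding a node only takes keys away from existing nodes. Keys never move between the nodes that were already on the ring.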

Indexing

How does Azure Cosmos DB index data?
https://docs.microsoft.com/en-us/azure/cosmos-db/indexing-policies

System Properties Comparison Amazon DynamoDB vs. Microsoft Azure Cosmos DB vs. Titan
https://db-engines.com/en/system/Amazon+DynamoDB%3BMicrosoft+Azure+Cosmos+DB%3BTitan

Cost Comparison: Azure Cosmos DB vs. DynamoDB vs. Neptune
https://dzone.com/articles/azure-cosmos-db-costs-vs-dynamo-db-and-neptune

Azure Databricks + Cosmos DB

https://github.com/Azure/azure-cosmosdb-spark
https://docs.azuredatabricks.net/spark/latest/data-sources/azure/cosmosdb-connector.html
https://docs.databricks.com/spark/latest/data-sources/azure/cosmosdb-connector.html#use-the-azure-cosmos-db-spark-connector
https://github.com/MicrosoftDocs/azure-docs/blob/master/articles/cosmos-db/spark-connector.md
https://github.com/Azure/azure-cosmosdb-spark/tree/2.3/samples/notebooks

Accelerate big data analytics by using the Apache Spark to Azure Cosmos DB connector
https://docs.microsoft.com/en-us/azure/cosmos-db/spark-connector
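
As a hedged sketch of what reading a collection through the azure-cosmosdb-spark connector can look like from a PySpark notebook: the helper names below are hypothetical, and the endpoint, key, database, and collection values are placeholders. The option keys and the format string follow the connector's documented read configuration as I understand it, so check them against the connector version you install.

```python
def build_cosmos_read_config(endpoint, master_key, database, collection, query=None):
    """Build the options dict passed to the Spark connector (sketch)."""
    config = {
        "Endpoint": endpoint,      # e.g. https://<account>.documents.azure.com:443/
        "Masterkey": master_key,   # account key; keep it out of source control
        "Database": database,
        "Collection": collection,
    }
    if query is not None:
        # Optional SQL API query pushed down to Cosmos DB.
        config["query_custom"] = query
    return config


def read_collection(spark, config):
    """Load a Cosmos DB collection as a DataFrame.

    `spark` is an existing SparkSession with the connector library attached,
    which is an assumption about the cluster setup.
    """
    return (
        spark.read.format("com.microsoft.azure.cosmosdb.spark")
        .options(**config)
        .load()
    )


if __name__ == "__main__":
    # Placeholder values only; nothing here connects to a real account.
    cfg = build_cosmos_read_config(
        "https://example.documents.azure.com:443/",
        "YOUR-KEY-HERE",
        "mydb",
        "mycollection",
        query="SELECT * FROM c",
    )
    print(sorted(cfg))
```

In a Databricks notebook, `read_collection(spark, cfg)` would be called with the session the notebook provides.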

Structured streaming with Azure Databricks into Power BI & Cosmos DB
http://www.mnazureusergroup.com/2018/06/25/structured-streaming-with-azure-databricks-into-power-bi-cosmos-db/

Streaming Live Tweets from Twitter to CosmosDB
http://sajeetharan.blogspot.com/2018/05/streaming-live-tweets-from-twitter-to.html
https://github.com/sajeetharan/CosmosdbTweetsStream

Structured streaming with Azure Databricks into Power BI & Cosmos DB
https://azure.microsoft.com/en-us/blog/structured-streaming-with-databricks-into-power-bi-cosmos-db/

Choosing a big data storage technology in Azure
https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/data-storage

Configuring Power BI Direct Query to Azure Cosmos DB via Apache Spark (HDI)
https://github.com/Azure/azure-cosmosdb-spark/wiki/Configuring-Power-BI-Direct-Query-to-Azure-Cosmos-DB-via-Apache-Spark-(HDI)

Lambda Architecture with Azure Cosmos DB and HDInsight (Apache Spark)
https://github.com/Azure/azure-cosmosdb-spark/blob/master/samples/lambda/readme.md

On-Time Flight Performance with Spark and Cosmos DB (Seattle)
https://github.com/Azure/azure-cosmosdb-spark/blob/master/samples/notebooks/On-Time%20Flight%20Performance%20with%20Spark%20and%20Cosmos%20DB%20-%20Seattle.ipynb

Twitter Source with Apache Spark and Azure Cosmos DB Change Feed
https://github.com/Azure/azure-cosmosdb-spark/blob/master/samples/notebooks/Twitter%20with%20Spark%20and%20Azure%20Cosmos%20DB%20Change%20Feed.ipynb

Cosmos DB + R
https://github.com/aaron2012r2/cosmosR
https://www.erikhoward.net/azure-cosmos-db-tutorial-using-r/

Replicate data globally

Easy to add new regions and enable automatic failover

More links:

https://www.jamesserra.com/
https://www.linkedin.com/in/jamesserra/
https://www.slideshare.net/jamserra/introduction-to-azure-databricks-83448539
https://www.slideshare.net/jamserra/power-bi-for-big-data-and-the-new-look-of-big-data-solutions
https://www.jamesserra.com/archive/2017/10/use-cases-of-various-products/
https://www.jamesserra.com/archive/2018/06/understanding-cosmos-db/
