https://cosmosdb.github.io/labs/
https://github.com/CosmosDB/labs
https://www.linkedin.com/in/aliuy/
Cosmos DB - Query Playground
https://www.documentdb.com/sql/demo
Azure Cosmos DB Python examples
https://docs.microsoft.com/en-gb/azure/cosmos-db/sql-api-python-samples
https://github.com/Azure/azure-documentdb-python
https://github.com/CosmosDB
Price
Request Units (RU) = % Memory, % CPU, % IOPS = RU is a rate-based currency.
1 RU = 1 read of 1 KB record
Pricing Example
- Storage cost
- Throughput
Big = $ 100.000 / month
Medium = Range from $ 1000 to $ 3000 / month
SQL is the native API (Microsoft).
Types:
- Key-value = Table API
- Cassandra = Column-family
- MongoDB = Document
- Gremlin = Graph
Cosmos DB Customers
-
Domino's Pizza = Global Distributed Apps -Single online website to order pizza for all restaurants
-
IoT (Internet of Things) - Telemetry & Sensor Data = Toyota, Honeywell, LG CNS, Johnson Controls ○ Business Needs: § High scalability to ingest large # of events coming from many devices § Low latency queries and changes feeds for responding quickly to anomalies § Schema-agnostic storage and automatic indexing to support dynamic data coming from many different generations of devices § High availability across multiple data centers ○ Real-time Recommendations § Use Azure Databricks + Cosmos DB + Azure Container Services (Recommendations API) □ Online Recommendations Service □ Order Transactions ○ Customer 360 - Operational Analytics = Real Madrid § Azure Databricks on top of Cosmos DB □ Scale-out Computation = Azure Databricks □ Scale-out Database = Cosmos DB § Spark Connector using SQL API Apache Spark on Databricks
Walmart Cosmos DB - Use Case From startups to big-business: Using functional programming techniques to transform line of
https://www.youtube.com/watch?v=dSCzCaiWgLM
Where to Store Your Data in Azure? Understand Azure Data Storage Options
https://stackify.com/store-data-azure-understand-azure-data-storage-options/
https://en.wikipedia.org/wiki/Consistent_hashing
The Simple Magic of Consistent Hashing
https://dzone.com/articles/simple-magic-consistent
Distributed Web Caching System with Consistent Hashing
https://dspace.mit.edu/bitstream/handle/1721.1/80121/43563161-MIT.pdf?sequence=2
Consistent hashing and random trees : algorithms for caching in distributed networks
https://dspace.mit.edu/handle/1721.1/9947
Consistent hashing and random trees : algorithms for caching in distributed networks
https://www.cs.princeton.edu/courses/archive/fall09/cos518/papers/chash.pdf
David Karger
http://people.csail.mit.edu/karger/
http://people.csail.mit.edu/karger/Papers/thesis.pdf
https://scholar.google.com/citations?user=2vQRGrYAAAAJ
Consistent Hashing in Cassandra
https://blog.imaginea.com/consistent-hashing-in-cassandra/
https://docs.datastax.com/en/cassandra/3.0/cassandra/architecture/archDataDistributeHashing.html
How does Azure Cosmos DB index data?
https://docs.microsoft.com/en-us/azure/cosmos-db/indexing-policies
System Properties Comparison Amazon DynamoDB vs. Microsoft Azure Cosmos DB vs. Titan
https://db-engines.com/en/system/Amazon+DynamoDB%3BMicrosoft+Azure+Cosmos+DB%3BTitan
Cost Comparison: Azure Cosmos DB vs. DynamoDB vs. Neptune
https://dzone.com/articles/azure-cosmos-db-costs-vs-dynamo-db-and-neptune
https://github.com/Azure/azure-cosmosdb-spark
https://docs.azuredatabricks.net/spark/latest/data-sources/azure/cosmosdb-connector.html
https://docs.databricks.com/spark/latest/data-sources/azure/cosmosdb-connector.html#use-the-azure-cosmos-db-spark-connector
https://github.com/MicrosoftDocs/azure-docs/blob/master/articles/cosmos-db/spark-connector.md
https://github.com/Azure/azure-cosmosdb-spark/tree/2.3/samples/notebooks
Accelerate big data analytics by using the Apache Spark to Azure Cosmos DB connector
https://docs.microsoft.com/en-us/azure/cosmos-db/spark-connector
Structured streaming with Azure Databricks into Power BI & Cosmos DB
http://www.mnazureusergroup.com/2018/06/25/structured-streaming-with-azure-databricks-into-power-bi-cosmos-db/
Streaming Live Tweets from Twitter to CosmosDB
http://sajeetharan.blogspot.com/2018/05/streaming-live-tweets-from-twitter-to.html
https://github.com/sajeetharan/CosmosdbTweetsStream
Structured streaming with Azure Databricks into Power BI & Cosmos DB
https://azure.microsoft.com/en-us/blog/structured-streaming-with-databricks-into-power-bi-cosmos-db/
Choosing a big data storage technology in Azure
https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/data-storage
Configuring Power BI Direct Query to Azure Cosmos DB via Apache Spark (HDI)
https://github.com/Azure/azure-cosmosdb-spark/wiki/Configuring-Power-BI-Direct-Query-to-Azure-Cosmos-DB-via-Apache-Spark-(HDI)
Lambda Architecture with Azure Cosmos DB and HDInsight (Apache Spark)
https://github.com/Azure/azure-cosmosdb-spark/blob/master/samples/lambda/readme.md
On-Time Flight Performance with Spark and Cosmos DB (Seattle)
https://github.com/Azure/azure-cosmosdb-spark/blob/master/samples/notebooks/On-Time%20Flight%20Performance%20with%20Spark%20and%20Cosmos%20DB%20-%20Seattle.ipynb
Twitter Source with Apache Spark and Azure Cosmos DB Change Feed
https://github.com/Azure/azure-cosmosdb-spark/blob/master/samples/notebooks/Twitter%20with%20Spark%20and%20Azure%20Cosmos%20DB%20Change%20Feed.ipynb
Cosmos DB + R
https://github.com/aaron2012r2/cosmosR
https://www.erikhoward.net/azure-cosmos-db-tutorial-using-r/
Easy to add new regions and enable automatic failover
https://www.jamesserra.com/
https://www.linkedin.com/in/jamesserra/
https://www.slideshare.net/jamserra/introduction-to-azure-databricks-83448539
https://www.slideshare.net/jamserra/power-bi-for-big-data-and-the-new-look-of-big-data-solutions
https://www.jamesserra.com/archive/2017/10/use-cases-of-various-products/
https://www.jamesserra.com/archive/2018/06/understanding-cosmos-db/