In this article we will learn what is Cosmos DB, review general concepts, features and key points for capacity calculation when creating new Database.

If you are already familiar with concepts and architecture feel free to explore other articles from the series:

Three Pillars: Account, Database, Container

Cosmos DB (CDB) — a document store which is an evolution of a previously relatively popular Azure Document DB. Microsoft product team managed to differentiate Cosmos DB from competitors by offering “API endpoint of your choice”, “infinite scale”, “unlimited DB size” with “minimum configuration”. …

In a previous article, we reviewed how to set up logs and troubleshoot Cosmos DB issues using Azure Log Analytics.


You can read the full story by navigating to the link above. In a nutshell, I pushed 31 million Invoices into Cosmos DB with a total size reaching 47GBs. The database had containers with different indexing and logical partition key configuration. The test's purpose wasn’t related to costs analysis but an unexpected spike in charges for my subscription resulted in this article.

So here we are now.

For reference, a code-snippet that I used for data ingestion (utilizing parallelism of…

In this, another part of the series, we already know how to set up an account, what’s the difference between the Serverless and Provisioned throughput.

We will learn types of monitoring available in Cosmos DB and then set up the Log Analytics workspace and troubleshoot the physical partition split event.

This article is a part of the series. Check the below articles before proceeding to log analysis:


Cosmos DB as a first-class citizen of Microsoft Azure provides excellent tooling for monitoring and troubleshooting. Search for account-level “monitoring” and you will see few options.

Among others, monitoring offers features familiar to…


This article is a part of the series. Check others for more details around Cosmos DB:

What is a partition key and why is it important?

Cosmos DB is designed to scale horizontally based on the distribution of data between physical partitions (PP). Think of it as separately deployable self-sufficient nodes which are synchronized and coordinated by a central gateway.

The second essential part of the architecture is the logical partition (LP) — it’s bucket of documents which share the same characteristic (partition key) and are supposed…

Anatolii Gabuza

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store