Skip to content

ananfito/case-study_amazon-bestsellers

Repository files navigation

Case Study: Trends from Amazon Bestsellers

Introduction

The following is a case study investigating bestelling books on Amazon between the years 2009 and 2019. We will look at how authors/publishers can use the trends in this data to set long-term goals.

Contents

Summary

It's no secret that Amazon has become a behemoth of an organization, but it started out by starting selling books. And by the end of the time period for this data set (2019), Amazon enjoys a 50% share of all book distribution and an eBook market share around 67% (source). For writers and publishers around the world, it's hard to ignore Amazon's success and it seems unwise to consider not publishing to the platform.

This case study will investgiate the top 50 bestselling authors and books over the ten year period from 2009-2019. The goal is to provide insights so writers and publishers can make data-informed decisions about long-term goals.

Guiding Questions

For this case study, our main guiding question is:

How can trends from the Top 50 Amazon Bestsellers over a 10-year period inform long-term goals for authors and publishers?

To address this question, we will be exploring the answering the following sub questions:

  • What author(s) published the most between 2009-2019?
  • What percentage of books were fiction vs. non-fiction?
  • What book appears the most between 2009-2019?

Tools and Techniques

This analysis made use of the following tools and techniques:

  • Python programming language and libraries; pandas, seaborn, numpy, and matplotlib
  • Data transformations: extraction, visualizations, summary statistics
  • Data inspection: removal of duplicate/unnessary data, change format/datatype, verify unique values

Recommendations

The analysis yielded the following key observations:

  • Most authors (75%) only published twice during this 10-year period
  • A majority (73%) of the expensive books (books priced greater than 16 USD) were non-fiction
  • most books (75%) published during this 10-year period were under 16 USD and 50% of them were priced between 7 USD and 16 USD

These observations led to the following recommendations outlined below.

Focus on high quality content

Most authors (75%) only published twice during this 10-year period. It takes a lot to publish a book -- drafting, writing, editing, and revising. This would seem to indicate that for those authors who made it into the Top 50 bestsellers list their book was most likely well-received because it was well-written.

For writers/publishers, seeking to make it on these lists in the next ten years this means focusing on writing less and writing better. By slowing down to focus on higher quality content it increases your chance of your book being received well and popular.

For higher priced books, focus on non-fiction

A majority (73%) of the expensive books (books priced greater than 16 USD) were non-fiction. Being that non-fiction books generally require more research and effort to write, it make sense that they would be priced higher to reflect that. For authors/publishers seeking earn more, they might consider shifting their output to non-fiction books. This could be more lucrative if combined with other sources of incomes such speaking engagements, workshops, courses, etc.

Set price between 7 and 16 USD

This is likely to change due to inflation and increase production costs, but it should be noted that most books (75%) published during this 10-year period were under 16 USD and 50% of them were priced in this range. In other words, when choosing the book price ensure that it is within the current range of book prices -- not too high, but not too low.

Full Case Study

The full case study can be viewed at the following places:

Contact