Goodreads Data Analysis Project

Posted by Swarup Malli

Updated: May 18, 2020

 

The data set comes from Goodreads and contains around 11k records on books in many different languages. The target audience for this project is book lovers. The fields used for the analysis are book title, authors, average rating, language code, number of reviews per book, and publication date.

Goals of the project

  • Find a trend in the number of books published by year
  • List the top 30 authors by their average user ratings
  • Count the number of books by language code
  • List the top 50 books by the number of user reviews
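The four goals above are all simple group-and-rank aggregations. As a rough illustration only, here is a minimal sketch in plain Python (standard library, not the R Shiny code behind the original dashboard). The column names, the month/day/year date format, and the sample rows are all assumptions made up for the example; they are not taken from the actual Goodreads file. The sketch also treats the whole authors field as a single key, which is a simplification if a book lists several authors.

```python
import csv
import io
from collections import Counter, defaultdict

# Made-up illustrative rows. The column names are assumptions,
# not necessarily the headers of the real Goodreads data set.
SAMPLE_CSV = """title,authors,average_rating,language_code,text_reviews_count,publication_date
Book A,Author One,4.50,eng,120,9/16/2006
Book B,Author One,4.10,eng,45,11/1/2004
Book C,Author Two,3.80,spa,300,5/5/2006
Book D,Author Three,4.90,fre,10,1/1/1999
"""


def load_books(text):
    """Parse CSV text into a list of dicts with typed fields."""
    books = []
    for row in csv.DictReader(io.StringIO(text)):
        books.append({
            "title": row["title"],
            "authors": row["authors"],
            "rating": float(row["average_rating"]),
            "language": row["language_code"],
            "reviews": int(row["text_reviews_count"]),
            # Assumes month/day/year dates, so the year is the last part.
            "year": int(row["publication_date"].split("/")[-1]),
        })
    return books


def books_per_year(books):
    """Number of books published in each year, ordered by year."""
    return dict(sorted(Counter(b["year"] for b in books).items()))


def top_authors(books, n=30):
    """Top n authors by the mean of their books' average ratings."""
    ratings = defaultdict(list)
    for b in books:
        ratings[b["authors"]].append(b["rating"])
    means = {author: sum(r) / len(r) for author, r in ratings.items()}
    return sorted(means.items(), key=lambda kv: kv[1], reverse=True)[:n]


def books_per_language(books):
    """Number of books per language code, most common first."""
    return Counter(b["language"] for b in books).most_common()


def top_reviewed(books, n=50):
    """Top n books by number of user reviews, as (title, reviews) pairs."""
    ranked = sorted(books, key=lambda b: b["reviews"], reverse=True)[:n]
    return [(b["title"], b["reviews"]) for b in ranked]


books = load_books(SAMPLE_CSV)
```

To run this against a real export, one would read the file contents into a string and pass it to `load_books`, after adjusting the column names to match the file. Calling `top_authors(books)` and `top_reviewed(books)` with their defaults gives the top 30 and top 50 lists described in the goals.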

Number of books by year

   

Top 30 Authors by Average User Ratings

Number of Books by language code

 

Top 50 books by the number of user reviews

Swarup Malli

Swarup has a Bachelor's degree in Information Technology. He started his career as an ETL developer and eventually transitioned into the Business Intelligence space. He has been consulting as a Business Intelligence professional, with 10+ years of experience in the banking, pharma, and manufacturing industries. He believes data science is a logical extension of the field of BI and that its future is quite promising.


Topics from this blog: Student Works R Shiny Shiny Dashboard
