Skip to content

This is visualisation of the character from the tv show "The Office" - "Michael Scott"

Notifications You must be signed in to change notification settings

Sveta151/MichaelScottvisualization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

MichaelScottvisualization

This is visualisation of the character from the tv show "The Office" - "Michael Scott"

I've decided to analyze main character of the great show - ‘Michael Scott”
For the next visualizations I have used few datasets, that I get from kaggle.
First dataset providing spoken text lines during the show for all characters. There are 5 attributes:

  • index
  • character
  • line
  • season
  • episode_number
While working with this dataset I have found that some eposides have been missing, but amount of the lost episodes is small enough for continue working with this dataset.
Second dataset gibe us more general information, such as title of the episode, director, writers, original air date.The attributes for this dataset are:
  • season
  • episode number of season
  • episode number in series
  • title of the episode
  • director
  • writers
  • original air date
  • production coded
  • US viewers on orignal air date
As for the third dataset it's pretty much the same as the second one this dataset providing general information. The only attributes, that we are looking for is Average IMDd rating. For connection of all of those dataset I joined using two attributes -'season' and 'episode number'
Because data, that have been provided can’t be directly used for the questions,that I want to answer,I had to preprocessed it. Because I wanted to see the appearance of the famous Michael Scott line ‘That’s what she said’ during the show, I had to store in which episodes are this line appeared and who said that with use of first dataset. As a result we have dataset that provide to us every appearance of this line during the show with next attributes:
  • season
  • episode
  • character
. Also using first dataset, I have counted amount of lines from Michael Scott during the show. As a result I had dataset with next attributes:
  • season
  • episode
  • Michael count (count of Michaels line in this episode)
  • As for my first visualisation “‘that’s what she said’ line” I used ‘season’ as column and ‘episode’ and ‘character’ as rows. To separate Michael from other characters, I’ve made a set with this character and used it as a marks. The visualisation is a Gantt one. Screenshot 2022-08-04 at 17 30 42

    For the second visualisation - “Michael appearance during the show” I used ‘Season’ as a column, and sum of Michael lines for the seasons with sum of US viewers as a rows. Also I’m marking median IMBd rating for the each season. The plot type is dual lines.

    Screenshot 2022-08-04 at 17 32 04

    For the last visualisation I’ve made text visualisations using “character” attribute from the ‘the-office-lines’ dataset. Screenshot 2022-08-04 at 17 32 30

About

This is visualisation of the character from the tv show "The Office" - "Michael Scott"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published