Introduction to working with data

Objectives

  • Greetings/Announcements - Please make sure you have submitted all of your assignments
  • Review Assignment 2 and upload to GitHub
  • Pair programming
  • Introduction to using packages with dice function
  • Work on Dice Charting Function and Visualization (plot)
  • 15 minute break
  • Introduction to built-in datasets and types of charts
    data <- data.frame(x = LETTERS[1:5],    # Create example data
                       y = 5:1)
    data                                    # Print example data
    data_bar <- data$y                      # Extract values
    names(data_bar) <- data$x               # Assign names to values
    data_bar                                # Print the named vector, ready for barplot()
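The snippet above is the usual fix for the `'height' must be a vector or a matrix` error: `barplot()` expects a numeric vector or matrix, not a whole data frame. A minimal sketch reusing the same example data; the `result` variable and the `tryCatch()` wrapper are only there for illustration:

```r
# Example data: one label column and one value column
data <- data.frame(x = LETTERS[1:5], y = 5:1)

# Passing the whole data frame fails; tryCatch() captures the error message
result <- tryCatch(barplot(data), error = function(e) conditionMessage(e))
print(result)

# Fix: pull out the numeric column and name each value
data_bar <- data$y
names(data_bar) <- data$x
barplot(data_bar, main = "Example data")
```

The names attached to `data_bar` become the bar labels, so no `names.arg` is needed here.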
    

Create a, b, c, d variables

    a <- c(10, 20, 30, 40)
    b <- c('book', 'pen', 'textbook', 'pencil_case')
    c <- c(TRUE, FALSE, TRUE, FALSE)
    d <- c(2.5, 8, 10, 7)
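Each of these vectors holds a single type of value. A short sketch to check this with `class()`; the `mixed` vector is a made-up extra to show what happens when types are combined:

```r
a <- c(10, 20, 30, 40)
b <- c('book', 'pen', 'textbook', 'pencil_case')
c <- c(TRUE, FALSE, TRUE, FALSE)   # a variable named c does not stop c(...) from working as a function
d <- c(2.5, 8, 10, 7)

class(a)   # "numeric"
class(b)   # "character"
class(c)   # "logical"
class(d)   # "numeric"

# Hypothetical example: mixing types in one vector converts everything to character
mixed <- c(10, 'book', TRUE)
class(mixed)   # "character"
```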

Join the variables to create a data frame

    df <- data.frame(a, b, c, d)
    df

    # Code for datasets and charts
    data()             # shows all built-in datasets
    View(airquality)   # view a dataset in a table format
    str(airquality)    # prints the structure of a dataset to the console
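A quick way to see what a data frame contains is to check its dimensions, column names, and structure. A sketch, assuming the four vectors from the previous step and the built-in `airquality` dataset as the example to explore:

```r
# Build the data frame directly from four equal-length columns
df <- data.frame(a = c(10, 20, 30, 40),
                 b = c('book', 'pen', 'textbook', 'pencil_case'),
                 c = c(TRUE, FALSE, TRUE, FALSE),
                 d = c(2.5, 8, 10, 7))

dim(df)     # number of rows, then number of columns
names(df)   # column names
str(df)     # type and first values of each column

# The same functions work on built-in datasets
str(airquality)
head(airquality, 3)   # first three rows
```

`View()` is left out of this sketch because it opens a viewer window and is best run interactively in RStudio.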

    # Create a barplot on this data
    max.temp <- c(22, 27, 26, 24, 23, 26, 28)

    barplot(max.temp,
            main = "Maximum Temperatures in a Week",
            xlab = "Degree Celsius",
            ylab = "Day",
            names.arg = c("Sun", "Mon", "Tue", "Wed", "Thu", "Fri", "Sat"),
            col = "darkred",   # you can also pass a vector of colors: col = c("red", "green", "blue", "beige")
            horiz = TRUE)

    View(airquality)
    barplot(airquality$Temp)
    barplot(airquality$Temp[order(airquality$Temp)])    # orders bars in ascending order
    barplot(airquality$Temp[order(-airquality$Temp)])   # orders bars in descending order
    barplot(airquality$Temp[order(airquality$Temp)], horiz = TRUE)                # displays bar chart in horizontal view
    barplot(airquality$Temp[order(airquality$Temp)], horiz = TRUE, col = "red")   # changes all bar colors to red
    barplot(airquality$Temp[order(airquality$Temp)], horiz = TRUE,
            col = c("red", "green", "blue", "beige"))   # cycles through the four colors, bar by bar
    colors()   # shows list of available colors in console

    # atomic vector - a one-dimensional data object created by c() or vector()
    # if you get the '$ operator is invalid for atomic vectors' error, convert the object to a data frame first
    x <- as.data.frame(Titanic)
    barplot(x$Freq)   # barplot() needs a numeric vector or matrix, so pass a column rather than x itself
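The `$ operator is invalid for atomic vectors` error shows up when `$` is used on something that is not a data frame or list. The built-in `Titanic` dataset is one such case: it is stored as a table, not a data frame. A minimal sketch; the `msg` variable and the `tryCatch()` wrapper are illustrative additions:

```r
# Titanic is a 4-dimensional table, so it has no columns to pick with $
class(Titanic)   # "table"
msg <- tryCatch(Titanic$Age, error = function(e) conditionMessage(e))
print(msg)

# Converting to a data frame gives named columns, so $ works
x <- as.data.frame(Titanic)
names(x)   # "Class" "Sex" "Age" "Survived" "Freq"

# barplot() still needs a numeric vector, so pass one column
barplot(x$Freq, main = "Titanic counts per category", col = "darkred")
```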

    # Count occurrences of data in a column: table() counts how often each value appears
    hist(table(x$Age))
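One thing to watch with `table()` on the Titanic data frame: it counts rows, not people, because each row is a category with its own `Freq` value. A sketch comparing the two counts; using `xtabs()` to add up `Freq` is a suggested alternative here, not something from the class notes:

```r
x <- as.data.frame(Titanic)

# table() counts how many rows have each Age value
row_counts <- table(x$Age)
row_counts   # Child 16, Adult 16

# xtabs() adds up the Freq column within each Age group instead
people <- xtabs(Freq ~ Age, data = x)
people       # Child 109, Adult 2092

barplot(people, main = "Passengers and crew by age group", col = c("beige", "darkred"))
```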

    hist(Customer$Age)               # histogram of how many customers fall in each age range
    hist(Customer$Age, breaks = 5)   # breaks = 5 is only a suggestion; here R creates 6 bars

    hist(Customer$Age, breaks = c(0, 40, 60, 100))   # setting strict categories, 3 bars: (0-40) (40-60) (60-100)

    hist(Customer$Age, breaks = c(0, 40, 60, 100), freq = TRUE)   # freq = TRUE plots counts instead of densities

    hist(Customer$Age, breaks = c(0, 40, 60, 100), freq = TRUE, col = "blue", main = "Histogram of Age")   # colors the bars blue and sets the chart title to "Histogram of Age"
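The `Customer` data is not a built-in dataset, so the sketch below swaps in `airquality$Temp` to show how `breaks` behaves. The cut points 50, 70, 85, 100 are made up for this example. Calling `hist()` with `plot = FALSE` returns the bins without drawing, which makes it easy to inspect them:

```r
temp <- airquality$Temp

# A single number is only a suggestion; R chooses "pretty" cut points near it
h1 <- hist(temp, breaks = 5, plot = FALSE)
h1$breaks           # the cut points R actually used
length(h1$counts)   # the number of bars that would be drawn

# A vector of breaks sets the exact bin edges
h2 <- hist(temp, breaks = c(50, 70, 85, 100), plot = FALSE)
h2$counts     # observations in each of the 3 bins
h2$equidist   # FALSE, because the bins have different widths

# With unequal bins R plots densities by default, so bar areas add up to 1
hist(temp, breaks = c(50, 70, 85, 100), col = "blue", main = "Histogram of Temp")
```

With unequal bin widths, `freq = TRUE` still works but R warns that the bar areas can be misleading, since a wide bin looks larger than its count alone would suggest.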

* Team activity - choose a built-in dataset and build a chart showing the data
* Present findings



https://www.programmingr.com/count-occurrences-in-column/


Example 2: Fix the Error in barplot.default() : ‘height’ must be a vector or a matrix