Thetazero Pubs
wei zhaonotes response plot plots data friends friend user users think

Lesson 3


What to Do First?

Notes:



Pseudo-Facebook User Data

Notes:

list.files()
## [1] "Analytics Challenge Data 2.xlsx"                                   
## [2] "lesson3_student.html"                                              
## [3] "lesson3_student.rmd"                                               
## [4] "pseudo_facebook.tsv"                                               
## [5] "[Steven_Chapra,_Steven_C._Chapra,_Raymond_Canale,_(BookFi.org).pdf"

Histogram of Users? Birthdays

Notes:

#install.packages('ggplot2')
library(ggplot2)

What are some things that you notice about this histogram?

Response:


Moira?s Investigation

Notes:


Estimating Your Audience Size

Notes:



Think about a time when you posted a specific message or shared a photo on Facebook. What was it?

Response:

How many of your friends do you think saw that post?

Response:

Think about what percent of your friends on Facebook see any posts or comments that you make in a month. What percent do you think that is?

Response:


Perceived Audience Size

Notes:


Faceting

Notes:

Let?s take another look at our plot. What stands out to you here?

Response:


Be Skeptical - Outliers and Anomalies

Notes:


Moira?s Outlier

Notes: #### Which case do you think applies to Moira?s outlier? Response:


Friend Count

Notes:

What code would you enter to create a histogram of friend counts?

How is this plot similar to Moira?s first plot?

Response:


Limiting the Axes

Notes:

Exploring with Bin Width

Notes:


Adjusting the Bin Width

Notes:

Faceting Friend Count

# What code would you add to create a facet the histogram by gender?
# Add it to the code below.
#qplot(x = friend_count, data = pf, binwidth = 10) +
#  scale_x_continuous(limits = c(0, 1000),
#                     breaks = seq(0, 1000, 50))

Omitting NA Values

Notes:


Statistics ?by? Gender

Notes:

Who on average has more friends: men or women?

Response:

What?s the difference between the median friend count for women and men?

Response:

Why would the median be a better measure than the mean?

Response:


Tenure

Notes:


How would you create a histogram of tenure by year?


Labeling Plots

Notes:


User Ages

Notes:

What do you notice?

Response:


The Spread of Memes

Notes:


Lada?s Money Bag Meme

Notes:


Transforming Data

Notes:


Add a Scaling Layer

Notes:


Frequency Polygons


Likes on the Web

Notes:


Box Plots

Notes:

Adjust the code to focus on users who have friend counts between 0 and 1000.


Box Plots, Quartiles, and Friendships

Notes:

On average, who initiated more friendships in our sample: men or women?

Response: #### Write about some ways that you can verify your answer. Response:

Response:


Getting Logical

Notes:

Response:


Analyzing One Variable

Reflection:


Click KnitHTML to see all of your hard work and to have an html page of this lesson, your answers, and your notes!

Copyright © 2016 thetazero.com All Rights Reserved. Privacy Policy