-
Notifications
You must be signed in to change notification settings - Fork 0
/
ac209_finalproj_notes.txt
66 lines (45 loc) · 1.57 KB
/
ac209_finalproj_notes.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
AC209
Final Project Notes
A. Meisner & M. Williams
Meeting Notes
_______________
_______________
10.24.2014
__________
** Possible Questions/Directions
- Predict the positve/negative score of a game based on the text of the associated review
- Develop a regression model to predict the numerical score of a game given text in the review
- Use reddit or other social networking sites plus gaming sites to predict scores for future games
** Possible Data Sets to use
(Gaming)
- Giant Bomb (has an API): http://www.giantbomb.com/api/
- IGN
- Gamespot
(Social Sites)
- Twitter
- Reddit
** Main Tasks for next meeting
- Get Giant Bomb API and try to get any form of data
- If Fail: Make preparations to Data Scrape
11.08.2014
__________
** Results since previous meeting
- Success in accessing Giant Bomb data through API
** Main Question 1: Can we predict the score of a review based on the text in the review blurb?
** Main Question 2: (Need additional questions for analysis)
** Exploratory Analysis Questions (look at "gp_expl_analysis.ipynb" for details)
- Giant Bomb score distribution
- Correlation between game score and company
- Types of games which are the most popular
- Giant Bomb scores over the years
- User score distribution
- User scores and critics (i.e. staff) scores
- Staff score distribution
** Main Tasks for next meeting
- Scrape reviews from Game Spot if possible
- Rough Draft of Proposal?
- Complete Exploratory Analyses on Giant Bomb Data Set
- Create a stable data set:
- Review information
- Company information
- User Reviews