EMSE_DevInt

By Cassandra, Nimmi, Yiming, and Jory.

This research was conducted as part of SENG 480A @ UVic (EMSE).

The included PDF presents the motivation, methodology, results, and conclusions of our work and findings.

Dependencies

Download the following packages needed for the included python modules and Jupyter notebooks:

pip install stackapi sklearn numpy nltk pandas seaborn wordcloud pyLDAvis

Alternatively, try

pip install -r requirements.txt

(Rough) Procedural Overview

Use StackAPI to grab SO data.

a. Grab maximum questions & answers daily. Do over couple days.

b. Collate JSONs into single data file.

c. Remove duplicates

d. Format into input file for LDA.
Use LDA to process data.
- LDA does not label topics. This will need to be done manually.
Additional statistics on questions, answers, and users.

Usage

Ad-Hoc Python Scripts

Grabbing Data

Resources

StackAPI

JGibbLabeledLDA

Refactored JGibbLabeledLDA

Preprocess

LDA

Name		Name	Last commit message	Last commit date
Latest commit History 132 Commits
notebook		notebook
python		python
.gitignore		.gitignore
A_Replication_of_An_Empirical_Study_on_Developer_Interactions_in_Stack_Overflow.pdf		A_Replication_of_An_Empirical_Study_on_Developer_Interactions_in_Stack_Overflow.pdf
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EMSE_DevInt

Dependencies

(Rough) Procedural Overview

Usage

Resources

About

Releases

Packages

Contributors 3

Languages

JoryAnderson/EMSE_DevInt

Folders and files

Latest commit

History

Repository files navigation

EMSE_DevInt

Dependencies

(Rough) Procedural Overview

Usage

Resources

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages