
Description of files on GitHub

JoonL edited this page May 15, 2013 · 2 revisions

Python Files

Tweepy.py - Twitter streaming script for extracting live tweets that contain predefined keywords (for example, 'medical', 'hospital', etc.).

medical_tweet_streamer.py - Modification of Tweepy.py that filters the Twitter feed using the Symplur medical hashtags stored in medical_hashtags.json.
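The hashtag filtering idea can be sketched roughly as follows. This is not the code from medical_tweet_streamer.py; it is a minimal illustration that assumes medical_hashtags.json holds a flat JSON list of hashtag strings (the real file layout may differ). The helper names parse_hashtags and matching_hashtags are hypothetical.

```python
import json
import re


def normalize(tag):
    """Lower-case a hashtag and drop a leading '#', so '#HCSM' and 'hcsm' compare equal."""
    return tag.lstrip("#").lower()


def parse_hashtags(json_text):
    """Parse JSON text into a set of normalized hashtags.

    Assumption: the JSON is a flat list of strings, e.g. ["#hcsm", "MedEd"].
    """
    return {normalize(tag) for tag in json.loads(json_text)}


def matching_hashtags(tweet_text, hashtags):
    """Return the hashtags in tweet_text that also appear in the given set."""
    found = {normalize(tag) for tag in re.findall(r"#\w+", tweet_text)}
    return found & hashtags


# Illustrative input only; these tags are not taken from the real file.
tags = parse_hashtags('["#hcsm", "MedEd"]')
print(matching_hashtags("Great talk #HCSM today", tags))
```

To use it with the actual file, the JSON text could be read with open("medical_hashtags.json").read(), provided the file matches the assumed layout.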

Results

Hashtag2_results.txt - Deprecated output from the now-obsolete script Tweepy_Hashtags_2.py (from a run of approximately 8 hours). Similar output can now be generated by running medical_tweet_streamer.py.

(Let's plan not to commit output text files to the Git repo; maybe use Google Drive or Dropbox instead?)

Data folder

Please upload all tweet files to the data folder, using the naming conventions below.

naming: hashtags-from(hhmmssmonthday)-to-myname.txt

(e.g. hashtags-033546may15-041200may15-joon.txt)

naming: mword-from(hhmmssmonthday)-to-myname.txt

(e.g. mword-033546may15-041200may15-joon.txt)
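As an illustration of the naming conventions above, a small helper could build the file names from start and end times. This is only a sketch: the function names stamp and data_filename are hypothetical, and zero-padding the day of the month is an assumption, since both examples use a two-digit day.

```python
from datetime import datetime

# Lower-case month abbreviations, hard-coded so the result does not depend on locale.
MONTHS = ["jan", "feb", "mar", "apr", "may", "jun",
          "jul", "aug", "sep", "oct", "nov", "dec"]


def stamp(moment):
    """Format a datetime as hhmmssmonthday, e.g. 033546may15."""
    # Assumption: the day is zero-padded to two digits.
    return f"{moment:%H%M%S}{MONTHS[moment.month - 1]}{moment.day:02d}"


def data_filename(kind, start, end, name):
    """Build '<kind>-<from>-<to>-<name>.txt' for kind 'hashtags' or 'mword'."""
    if kind not in ("hashtags", "mword"):
        raise ValueError("kind must be 'hashtags' or 'mword'")
    return f"{kind}-{stamp(start)}-{stamp(end)}-{name}.txt"


# Reproduces the first example above.
print(data_filename("hashtags",
                    datetime(2013, 5, 15, 3, 35, 46),
                    datetime(2013, 5, 15, 4, 12, 0),
                    "joon"))
# prints hashtags-033546may15-041200may15-joon.txt
```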

Others

config_template.json - Template for config.json, which is required to run the Tweepy*.py scripts.

medical_hashtags.json - Collection of medical hashtags from Symplur, ...
