APSA Workshop 2015

Scroll over to find links to underlying files

##############

1. List of R packages to install before workshop. To install them, open R on your machine, then type – install.packages(“name of package”) . You will be asked to choose a mirror site. It doesn’t matter which you use.

2, Files to download before workshop

Labeled reviews for R twitter exercise

Three FOMC files for R sample script one two three

##############

 

OTHER MATERIALS

Original email to participants

Overview slides

Intro to R slides

Intro to Python (no slides)

Regular Expressions slides (they are the bomb!)

 

R SCRIPTS

R Sample Script tm (scraping urls (website pages), data prep, and wordcloud)

R Sample Script quanteda (same script but using the new quanteda package rather than tm)

R NYT API Script (searching NYT articles for keywords, using NLP package to investigate ‘parts of speech” (POS) in selected articles)

R Twitter API Script     labeled reviews data file (includes data cleaning and supervised machine learning (rtexttools) application)

R_LDA_Script (topic modeling)    qb_race.zip data file

 

Python Intro (iPython notebook version)

Python Intro (ordinary)

 

PYTHON SCRIPTS

Python basics

Importing and preparing text

Scraping URLS

Scraping PDFs

Preprocessing and summarizing results

NLTK (NLP) Tokenizing

Text Reuse

The syllabus for Wilkerson’s course (always a work in progress),  can be found here

Many additional Python scripts (accomplishing different tasks) can be found here

Comments are closed.