HOME  |  PAPERS |  BLOG |  DATA |  SOFTWARE               

Software
  1. Infer Race and Ethnicity From Names:

  2. Infer Gender From Names:

  3. Search a long list of names (patterns) in a large text corpus systematically and quickly
    Software

  4. Categorize the Content of Domains:

  5. Know Your IP
    Python Package

  6. AutoSum: Summarize Publications Automatically and Discover Miscitations
    Software

  7. Adjust Naive Estimates of Learning for Guessing
    R package | Related Paper

  8. Get Weather Data:
    Please read this before downloading any of the following scripts.

    • Find nearest zip codes given a list of weather stations (COOP and GHCND) via
      GeoNames: Data & Scripts
    • Find nearest weather stations given a list of zip codes: Data & Scripts
    • Get data from the nearest weather station given a list of zip codes and date range
      Script
    • Get data from the nearest weather station given a list of zip codes and date range
      using the NOAA web-service:  Script

  9. Image to Text:
    Please read this before downloading any of the following scripts.

  10. Edit Distance Based Search and Replace
    Software | Related Note

  11. Text as Data:

    • Normalize text, remove stop words, punctuation, numbers, stem, lemmatize
      Script
    • Subset, Randomly Sample, Summarize: Script
    • Create TDM with various weighting schemes: Script
    • Sentiment Analysis: Script
    • Supervised Learning: Classification, Regression

  12. Clarifai: Understand (Moving) Images
    R package | Analysis of Politicians' Instagrams | Infer Gender Based on First Name

  13. tuber: Access YouTube from R
    R package
    REVIEW: 'Thank you very much for the package ... it has made my life easy ....'

  14. tubern: R Client for the YouTube Analytics and Reporting API
    R package

  15. virustotal: R Client for the Virustotal Public API 2.0
    R package

  16. aws.alexa: Access Amazon Alexa from R
    R package

  17. Collecting Data from the Streets: