Links & Software

Predictive Analytics Links & Software:

A growing list of analytical software and reference links covering various areas of interest within predictive analytics.

Analytical Software:

Ventana Research has an in-depth software value index that evaluates software on business goal and purpose, requirements, user community and usage requirements, functional requirements and capabilities, organizational maturity, technology approach, master list, and business and technology evaluation criteria. Only the older reports can be retrieved for free, and they are out of date, but their methodical approach and insight are excellent.  Software that I have used in predictive analytics applications includes:

R and RStudio:  https://www.rstudio.com/

Package popularity for R:  https://dgrtwo.shinyapps.io/cranview/

Enthought Scientific Computing Solutions:  https://www.enthought.com/

SAS:  https://www.sas.com/en_ca/home.html

Anaconda Python from Continuum Analytics:  https://www.continuum.io/

Jupyter:   http://jupyter.org/

SQLite:  http://sqlite.org/download.html

Cyberduck for FTP:  https://cyberduck.io/

Of course, all of the databases and servers courtesy of Northwestern University!  Awesome resources!

Frameworks:

Python Crawl and Scrape:  https://scrapy.org/

Flask: http://flask.pocoo.org/

Cloudera:  https://www.cloudera.com/

Fabulous Links:

Notebooks:  https://github.com/ipython/ipython/wiki/A-gallery-of-interesting-IPython-Notebooks

Web Commons:  http://webdatacommons.org/

Data School:  Fabulous resource for learning pandas, including a blog and super helpful videos:  http://www.dataschool.io/

R datasets list:  https://vincentarelbundock.github.io/Rdatasets/datasets.html

Reddit with search capabilities:  https://www.reddit.com/r/datasets/

StatSci:  http://www.statsci.org/datasets.html

Text/Word Reference:  http://www.wordfrequency.info/

Graphics:  https://www.r-graph-gallery.com/

Agent-based Modeling:  http://www.intro-to-abm.com/#models2

Forecasting: Principles and Practice (book):  http://otexts.org/fpp2/

Regression Trees:  http://blog.revolutionanalytics.com/2013/06/plotting-classification-and-regression-trees-with-plotrpart.html

Text Analytics Tools:  http://tapor.ca/tools

Matrix Multiplication:  http://matrixmultiplication.xyz/

Papers:  www.paperswithcode.com

IP Tools:  https://hackertarget.com/ip-tools/

Predictive Analytics – General:

Predictive Analytics:  http://www.kdnuggets.com/

Text Analytics:

Personality insights from your Twitter account:  https://personality-insights-livedemo.mybluemix.net/

Natural Language Processing:  http://www.nltk.org/

Word Frequency Data:  http://www.wordfrequency.info/100k_samples.asp

Hermeneutica:  http://hermeneuti.ca/

Datasets:

Tensors:  research.google.com/seedbank – style transfer on images, text, etc., on Google Colab

TensorFlow:  https://www.tensorflow.org/datasets/catalog/overview#all_datasets

Unusual and downright interesting:  http://rs.io/100-interesting-data-sets-for-statistics/

NASA:  https://software.nasa.gov/data_and_image_processing/1

Data World:  https://data.world/

Data Market:  https://datamarket.com

Favorite R Packages:

Data Descriptions:

psych:  https://cran.r-project.org/web/packages/psych/index.html

Hmisc:  https://cran.r-project.org/web/packages/Hmisc/Hmisc.pdf

Missing Data:

mice:  https://cran.r-project.org/web/packages/mice/mice.pdf

Amelia:  https://gking.harvard.edu/files/gking/files/amelia_jss.pdf

mi:  https://cran.r-project.org/web/packages/mi/mi.pdf

missForest:  https://cran.r-project.org/web/packages/missForest/missForest.pdf

Other:

Certified Analytics Professional:  https://www.certifiedanalytics.org/for_professionals.php

Northwestern University Bookstore:  http://abbotthall.bncollege.com/  (’cause I can never find this link)