Predictive Analytics Links & Software:
A growing list of analytical links and software basics, plus useful reference links to various areas of interest within predictive analytics.
Analytical Software:
Ventana Research has an in-depth software value index that evaluates software on the business goal and purpose, requirements, user community and usage requirements, functional requirements and capabilities, organizational maturity, technology approach and master list, and business and technology evaluation criteria. The old reports you can retrieve for free are out-of-date, but their methodical approach and insight are excellent. Software that I have used in predictive analytics applications includes:
R and RStudio: https://www.rstudio.com/
Package popularity for R: https://dgrtwo.shinyapps.io/cranview/
Enthought Scientific Computing Solutions: https://www.enthought.com/
SAS: https://www.sas.com/en_ca/home.html
Anaconda Python from Continuum Analytics: https://www.continuum.io/
Jupyter: http://jupyter.org/
SQLite: http://sqlite.org/download.html
Cyberduck for FTP: https://cyberduck.io/
Of course, all of the databases and servers courtesy of Northwestern University! Awesome resources!
Frameworks:
Python Crawl and Scrape: https://scrapy.org/
Flask: http://flask.pocoo.org/
Cloudera: https://www.cloudera.com/
Fabulous Links:
Notebooks: https://github.com/ipython/ipython/wiki/A-gallery-of-interesting-IPython-Notebooks
Web Commons: http://webdatacommons.org/
Data School: fabulous resource for learning pandas, including a blog and super helpful videos: http://www.dataschool.io/
R datasets list: https://vincentarelbundock.github.io/Rdatasets/datasets.html
Reddit with search capabilities: https://www.reddit.com/r/datasets/
StatSci: http://www.statsci.org/datasets.html
Text/Word Reference: http://www.wordfrequency.info/
Graphics: https://www.r-graph-gallery.com/
Agent-based Modeling: http://www.intro-to-abm.com/#models2
Forecasting: Principles and Practice (book): http://otexts.org/fpp2/
Regression Trees: http://blog.revolutionanalytics.com/2013/06/plotting-classification-and-regression-trees-with-plotrpart.html
Text Analytics Tools: http://tapor.ca/tools
Matrix Multiplication: http://matrixmultiplication.xyz/
Papers: www.paperswithcode.com
IP Tools: https://hackertarget.com/ip-tools/
Predictive Analytics – General:
Predictive Analytics: http://www.kdnuggets.com/
Text Analytics:
Personality theory on your Twitter account: https://personality-insights-livedemo.mybluemix.net/
Natural Language Processing: http://www.nltk.org/
Word Frequency Data: http://www.wordfrequency.info/100k_samples.asp
Hermeneutica: http://hermeneuti.ca/
Datasets:
Tensors: research.google.com/seedbank – transfer styles on images, text, etc. on Google Colab
TensorFlow: https://www.tensorflow.org/datasets/catalog/overview#all_datasets
Unusual and darn-right interesting: http://rs.io/100-interesting-data-sets-for-statistics/
NASA: https://software.nasa.gov/data_and_image_processing/1
Data World: https://data.world/
Data Market: https://datamarket.com
Favorite R Packages:
Data Descriptions:
psych: https://cran.r-project.org/web/packages/psych/index.html
Hmisc: https://cran.r-project.org/web/packages/Hmisc/Hmisc.pdf
Missing Data:
mice: https://cran.r-project.org/web/packages/mice/mice.pdf
Amelia: https://gking.harvard.edu/files/gking/files/amelia_jss.pdf
mi: https://cran.r-project.org/web/packages/mi/mi.pdf
missForest: https://cran.r-project.org/web/packages/missForest/missForest.pdf
Other:
Certified Analytics Professional: https://www.certifiedanalytics.org/for_professionals.php
Northwestern University Bookstore: http://abbotthall.bncollege.com/ (’cause I can never find this link)