Apache Spark - A unified analytics engine for large-scale data processing
Getting started with machine learning
Today, machine learning—the study of algorithms that make data-based predictions—has found a new audience and a new set of possibilities.
Apache Hadoop
A curated list of awesome computer vision resources
Assorted data from the General Services Administration.
An index of all open-source data
An unofficial repository of National Park Service data.
Data and code behind the articles and graphics at FiveThirtyEight
Cool links & research papers related to Machine Learning applied to source code (MLonCode)
ID3-based implementation of the ML Decision Tree algorithm
A toolkit for developing and comparing reinforcement learning algorithms.
Reinforcement learning resources curated
Principal Component Analysis on music loops
Ruby gem to calculate the similarity between texts using tf*idf
Large-scale linear classification, regression and ranking in Python
scikit-learn: machine learning in Python
An Open Source Machine Learning Framework for Everyone
Dataset format for AI. Build, manage, query & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai