Here are
7 public repositories
matching this topic...
A command-line tool for launching Apache Spark clusters.
-
Updated
Aug 3, 2020
-
Python
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
-
Updated
Nov 3, 2017
-
Jupyter Notebook
This package contains the code for calculating external clustering validity indices in Spark. The package includes Chi Index among others.
-
Updated
Jun 21, 2019
-
Scala
This project has customization likes custom data sources, plugins written for the distributed systems like Apache Spark, Apache Ignite etc
Implementations of Markov Clustrer Algorithm (MCL) and Regularized Markov Cluster Algorithm (R-MCL) in Apache Spark
-
Updated
Jul 18, 2017
-
Scala
Apache Spark cluster lab.
Analysis performed on data from the Steam platform using Apache Spark and Cloud services such as Amazon Web Services.
-
Updated
Dec 11, 2019
-
Python
Improve this page
Add a description, image, and links to the
apache-spark-cluster
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
apache-spark-cluster
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.