This is a MATLAB implementation of the algorithm LGD presented in the following paper. Local gap density for clustering high-dimensional data. Ruijia Li, Xiaofei Yang, Xiaolong Qin, William Zhu.
Clustering Analysis of all available research data on the Iowa Gambling Task(list of sources in readme) using R. The Scripts produce the output for the most common archetypes among the dataset of one researcher using PCA.
load and visualize data and clusters with scatter plots; prepare data for cluster analysis; perform centroid clustering with k-means; interpret clustering results and determine the optimal number of clusters for a given dataset.
A clustering exercise of global currencies on three common financial market features using data from 2017 through 2019, as published in Towards Data Science on Medium.com
This is the home of my public efforts in data analytics, research and data science/software development dedicated to various aspects of COVID-19 impact analysis
It is One of the Easiest Problems in Data Science to Detect the MNIST Numbers, Using a Classification Algorithm, Here I have used a csv File which contains the Pixels of the Numbers from 0 to 9 and we have to Classify the Numbers Accordingly. I have Used K-Means Classification Algorithm.
Using single-unit/single-family residential properties sold in the Southern California area, predict Zillow's 2017 log error drivers. Use KMeans clustering techniques to uncover drivers of log error.
This project was done together with my colleague Noam Shmuel during the class of "Unsupervised Learning" @ University of Warsaw taught by professor Jacek Lewkowicz, PhD