All Questions
Tagged with imbalanced-data classification
53 questions
0
votes
0
answers
64
views
Imbalanced Dataset Correlation in Machine Learning
If there is an imbalanced dataset, I cannot figure out the correlation or dependency of the target column on different features. How can I check that?
I am using countplot but with that, I cannot ...
0
votes
0
answers
65
views
Weighted F1-score
I'm training and validating models for a binary classification problem in a dataset that has great class imbalance.
When searching for metrics for evaluating the performance of the models, I found ...
0
votes
0
answers
29
views
Questions of handling imbalance dataset classification
I am trying to predict number of members who will discontinue their membership. The whole dataset is about 12 millions rows of data with about 40 columns. A member status can be “Continue”, “Voluntary ...
0
votes
0
answers
103
views
Generate synthetic data for majority and minority classes
I am working on a classification problem where I try to generate synthetic data for both the Majority and Minority classes,as i want to train my model on synthetic data and test on actual data, i am ...
0
votes
3
answers
587
views
Optimize metrics for Fraud Detection Imbalanced Data
I would need your help to improve my model performance. As mostly happens for fraud detection, I have an imbalanced dataset (0.1/0.9). I would like to optimize the recall for my target 1 and 0, ...
-1
votes
1
answer
182
views
How to choose the best technique for handling imbalanced data for binary classification?
I am working on my thesis on imbalanced dataset for binary classification problem. I need to handle the imbalance on data before make the classification, but I am not sure what technique is better to ...
0
votes
1
answer
7k
views
How to Handle Imbalanced Data in a Classification Problem?
I am working on a binary classification problem using machine learning, where my target classes are imbalanced. I have approximately 80% of data points in Class A and only 20% in Class B.
I have tried ...
1
vote
0
answers
55
views
Sample Size Inconsistency Error with imblearn's classification_report_imbalanced
I'm encountering an error when using classification_report_imbalanced from imblearn.metrics on a classification task. The code runs smoothly until I add the classification_report_imbalanced function, ...
0
votes
1
answer
105
views
Are mlr3 class weights applied to validation score calculations?
I have previously used mlr3 for imbalanced classification problems, and used PipeOpClassWeights to apply class weights to learners during training. This pipe op adds a column of observation weights to ...
1
vote
0
answers
119
views
Using Class_weights for imbalance dataset in Mask RCNN
I have added Class_Weights to be used while training Mask RCNN on custome dataset. It is showing error :
ValueError: Unknown entries in class_weight dictionary: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, ...
0
votes
0
answers
51
views
Error in checkMeasures(measures, learner) : object 'fbeta' not found
I am doing an imbalanced classification task, so I want to use f-beta as performance measure. I used the library(mlr) to set measures=fbeta, which follows:
library(mlr)
#create tasks
## Create ...
0
votes
0
answers
338
views
There were missing values in resampled performance measures
I need to do a classification task on this dataset. As the following code shows, I tried to implement xgboost using caret package. Since my dataset is imbalanced, I prefer to use Fscore as performance ...
1
vote
0
answers
238
views
WeightedRandomSampler with multi-dimensional batch
I'm working on a classification problem (100 classes) and my dataset has a huge class imbalance. To tackle this, I'm considering using torch's WeightedRandomSampler to oversample the minority class. I ...
0
votes
1
answer
406
views
Poorly calibrated probabilities but good classification in confusion matrix
I have an imbalanced data set. My goal is to balance sensitivity and specificity via the confusion matrix. I used glmnet in r with class weights. The model does well at balancing the sensitivity/...
1
vote
1
answer
243
views
Undersampling/Oversampling issues with onehotencoded categorical data
I am trying to fit a classification problem which has a (40000 vs 400) split between 0 and 1 class. I am trying to play around with oversampling and undersampling (not preferred) but keep running into ...