Skip to main content

All Questions

Filter by
Sorted by
Tagged with
0 votes
0 answers
64 views

Imbalanced Dataset Correlation in Machine Learning

If there is an imbalanced dataset, I cannot figure out the correlation or dependency of the target column on different features. How can I check that? I am using countplot but with that, I cannot ...
Maisara Waseem's user avatar
0 votes
0 answers
65 views

Weighted F1-score

I'm training and validating models for a binary classification problem in a dataset that has great class imbalance. When searching for metrics for evaluating the performance of the models, I found ...
JS_ps's user avatar
  • 1
0 votes
0 answers
29 views

Questions of handling imbalance dataset classification

I am trying to predict number of members who will discontinue their membership. The whole dataset is about 12 millions rows of data with about 40 columns. A member status can be “Continue”, “Voluntary ...
Anson's user avatar
  • 1
0 votes
0 answers
103 views

Generate synthetic data for majority and minority classes

I am working on a classification problem where I try to generate synthetic data for both the Majority and Minority classes,as i want to train my model on synthetic data and test on actual data, i am ...
user286076's user avatar
0 votes
3 answers
587 views

Optimize metrics for Fraud Detection Imbalanced Data

I would need your help to improve my model performance. As mostly happens for fraud detection, I have an imbalanced dataset (0.1/0.9). I would like to optimize the recall for my target 1 and 0, ...
Sandro231's user avatar
-1 votes
1 answer
182 views

How to choose the best technique for handling imbalanced data for binary classification?

I am working on my thesis on imbalanced dataset for binary classification problem. I need to handle the imbalance on data before make the classification, but I am not sure what technique is better to ...
Shada Hamed's user avatar
0 votes
1 answer
7k views

How to Handle Imbalanced Data in a Classification Problem?

I am working on a binary classification problem using machine learning, where my target classes are imbalanced. I have approximately 80% of data points in Class A and only 20% in Class B. I have tried ...
Viper's user avatar
  • 11
1 vote
0 answers
55 views

Sample Size Inconsistency Error with imblearn's classification_report_imbalanced

I'm encountering an error when using classification_report_imbalanced from imblearn.metrics on a classification task. The code runs smoothly until I add the classification_report_imbalanced function, ...
yuyudss's user avatar
  • 11
0 votes
1 answer
105 views

Are mlr3 class weights applied to validation score calculations?

I have previously used mlr3 for imbalanced classification problems, and used PipeOpClassWeights to apply class weights to learners during training. This pipe op adds a column of observation weights to ...
AhmetZamanis's user avatar
1 vote
0 answers
119 views

Using Class_weights for imbalance dataset in Mask RCNN

I have added Class_Weights to be used while training Mask RCNN on custome dataset. It is showing error : ValueError: Unknown entries in class_weight dictionary: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, ...
Tima's user avatar
  • 11
0 votes
0 answers
51 views

Error in checkMeasures(measures, learner) : object 'fbeta' not found

I am doing an imbalanced classification task, so I want to use f-beta as performance measure. I used the library(mlr) to set measures=fbeta, which follows: library(mlr) #create tasks ## Create ...
ebrahimi's user avatar
  • 926
0 votes
0 answers
338 views

There were missing values in resampled performance measures

I need to do a classification task on this dataset. As the following code shows, I tried to implement xgboost using caret package. Since my dataset is imbalanced, I prefer to use Fscore as performance ...
ebrahimi's user avatar
  • 926
1 vote
0 answers
238 views

WeightedRandomSampler with multi-dimensional batch

I'm working on a classification problem (100 classes) and my dataset has a huge class imbalance. To tackle this, I'm considering using torch's WeightedRandomSampler to oversample the minority class. I ...
theodre7's user avatar
  • 108
0 votes
1 answer
406 views

Poorly calibrated probabilities but good classification in confusion matrix

I have an imbalanced data set. My goal is to balance sensitivity and specificity via the confusion matrix. I used glmnet in r with class weights. The model does well at balancing the sensitivity/...
mapleleaf's user avatar
  • 764
1 vote
1 answer
243 views

Undersampling/Oversampling issues with onehotencoded categorical data

I am trying to fit a classification problem which has a (40000 vs 400) split between 0 and 1 class. I am trying to play around with oversampling and undersampling (not preferred) but keep running into ...
ricardo's user avatar
  • 186

15 30 50 per page