Skip to main content
Filter by
Sorted by
Tagged with
1 vote
0 answers
24 views

How can I apply SMOTE for multiple binary responses with NAs in R and produce a single balanced dataset? [migrated]

I have a dataset with several predictor variables and multiple binary response columns. Each response is highly imbalanced (many more 0’s than 1’s) and also contains some NAs. I am trying to balance ...
Electrino's user avatar
  • 2,910
0 votes
0 answers
67 views

Loan Default Prediction - Kaggle

I am working on the loan default prediction data set available on Kaggle which has a highly skewed class distribution. The best model I have gotten so far is as follows using ExtraTreesClassifier: ...
RenamedUser7008's user avatar
0 votes
1 answer
120 views

Why is my BERT model producing NaN loss during training for multi-label classification on imbalanced data?

I’m running into a frustrating issue while training a BERT-based multi-label text classification model on an imbalanced dataset. After a few epochs, the training loss suddenly becomes NaN, and I can’t ...
Erhan Arslan's user avatar
0 votes
0 answers
21 views

How to Build a Neural Network for Predicting Loan Status Using Multi-Table Data from the Berka Dataset

I am working on a project using the Berka dataset, and I want to build a neural network to predict the loan status for accounts. The dataset contains multiple tables, and I want to avoid flattening ...
Dmitrii Ponomarev's user avatar
-1 votes
1 answer
40 views

Improving Accuracy [closed]

I am working on testing accuracy and performance using deep learning models on a complex dataset but I have reached a good accuracy but I need to improve it so any suggestions other than what I did(...
Menna's user avatar
  • 7
0 votes
0 answers
87 views

Use StratifiedGroupKFold with GridSearchCV in an XGBoost Model

Dataset: I have a very imbalanced binary dataset of Groups; approximately 362 "yes" Groups and 47000 "no" Groups. Each Group has time series data updated at minute intervals. For ...
Tom's user avatar
  • 33
0 votes
0 answers
60 views

Understanding the `model.fit` function in keras and imbalanced datasets

As an exercise, I'm trying to translate a model written in Keras (https://github.com/CVxTz/ECG_Heartbeat_Classification/blob/master/code/baseline_mitbih.py) into Pytorch code. I realize in Keras much ...
user26579046's user avatar
1 vote
2 answers
402 views

Problem with Keras class weights and KeyError

I anticipate that I have seen the question: Keras class_weight error dictionary keys/values referring to the same problem, but the solution does not seem to help me. With this code, where I just added ...
Pinguiz's user avatar
  • 123
0 votes
0 answers
265 views

How to use skmultilearn.iterative_train_test_split to split to test, val and test?

The goal is to split an imbalanced dataset to train, val and test sets in order to train a multilabel model. Following this simple example from HuggingFace and the documentation it is clear to me how ...
learningtocode's user avatar
0 votes
0 answers
27 views

ROSE package in R not reading variable correctly; does not read updated value contained in variable

I'm hoping to receive some help here as I've struggled for a while now and I cannot figure out the problem. I am using the ROSE package in R, attempting to make use of the function for random over/...
studentinneed's user avatar
0 votes
0 answers
64 views

Imbalanced Dataset Correlation in Machine Learning

If there is an imbalanced dataset, I cannot figure out the correlation or dependency of the target column on different features. How can I check that? I am using countplot but with that, I cannot ...
Maisara Waseem's user avatar
0 votes
0 answers
195 views

Managing problems of class imbalance in machine learning models using spatial data in R

I am trying to simultaneously perform feature selection and hyperparameter tuning on stacked learners (glmnet and rpart). However, I am encountering the following error message with the classif.glmnet ...
Marine's user avatar
  • 89
0 votes
0 answers
37 views

I faced an error when I used PCA with LSTM model

I have a time series dataset with 20 classes, but they are imbalanced; when I tried a method like "RandomOverSampler", I got an error because of the 3D of our data so could you suggest a ...
Zineb Adaika's user avatar
0 votes
0 answers
65 views

Weighted F1-score

I'm training and validating models for a binary classification problem in a dataset that has great class imbalance. When searching for metrics for evaluating the performance of the models, I found ...
JS_ps's user avatar
  • 1
0 votes
0 answers
117 views

Class imbalance calculation for each class in a dataset

I am trying to compute class imbalance in each dataset and my approach was to check average and standard deviation of the counts. The average is the total number of samples in class 1 / total number ...
Aparna Bhat's user avatar

15 30 50 per page
1
2 3 4 5
23