Skip to content
#

feature-engineering

Here are 1,225 public repositories matching this topic...

nni
featuretools
gsheni
gsheni commented Sep 9, 2021
  • With Featuretools 1.0.0 we add a dataframe to an EntitySet with the following:
es = ft.EntitySet('new_es')

es.add_dataframe(dataframe=orders_df,
                 dataframe_name='orders',
                 index='order_id',
                 time_index='order_date')

Improvement

  • However, you could also change the EntitySet setter to add it with this approach:
es = ft.Ent
adchia
adchia commented Nov 4, 2021

When specifying on demand feature views at retrieval time (e.g. get_X_features), the output feature vectors include e.g. request data or dependent feature vectors, even if users did not specify said features.

Expected Behavior

Non-specified dependent feature values are not returned in output

Current Behavior

Non-specified dependent feature values are in output

Steps to reprodu

mljar-supervised
JustinKurland
JustinKurland commented Nov 27, 2021

There are several evaluation metrics that would be particularly beneficial for (binary) imbalanced classification problems and would be greatly appreciated additions. In terms of prioritizing implementation (and likely ease of implementation I will rank-order these):

  1. AUCPR - helpful in the event that class labels are needed and the positive class is of greater importance.
  2. **F2 Scor
feature_engine
solegalli
solegalli commented Dec 5, 2020

At the moment, in the categorical tree encoder and the tree discretiser, we have an argument is_regression that the user needs to fill in in order to detect if the user is aiming to perform classification or regression.

Sklearn has an automated process with the is_classification (see Decision tree source code).

Can we bring this functionality to feature-engine?

I think we can :p

evalml
Hyperactive

Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.

  • Updated Nov 29, 2020
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the feature-engineering topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the feature-engineering topic, visit your repo's landing page and select "manage topics."

Learn more