Skip to content
#

feature-engineering

Here are 1,451 public repositories matching this topic...

nni
pkubik
pkubik commented Mar 14, 2022

Describe the issue:
During computing Channel Dependencies reshape_break_channel_dependency does following code to ensure that the number of input channels equals the number of output channels:

in_shape = op_node.auxiliary['in_shape']
out_shape = op_node.auxiliary['out_shape']
in_channel = in_shape[1]
out_channel = out_shape[1]
return in_channel != out_channel

This is correct

bug help wanted good first issue model compression
featuretools
natsoule
natsoule commented May 17, 2022

Is your feature request related to a problem? Please describe.
The current Feast online store for GCP implementation requires Firestore in Datastore mode. Firestore can only be in one mode at a time per GCP account. You cannot use native mode for some applications and Datastore mode for others within the same account. Adding a feature store to an existing GCP account that uses native mode wou

kind/feature good first issue priority/p2 Community Contribution Needed
mljar-supervised
ViacheslavDanilov
ViacheslavDanilov commented May 19, 2022

I trained models on Windows, then I tried to use them on Linux, however, I could not load them due to an incorrect path joining. During model loading, I got learner_path in the following format experiments_dir/model_1/100_LightGBM\\learner_fold_0.lightgbm. The last two slashes were incorrectly concatenated with the rest part of the path. In this regard, I would suggest adding something like `l

bug help wanted good first issue
feature_engine
hangfei
hangfei commented May 23, 2022

This is reported by Manjunath from Slack channel.
For work_dir: "dbfs:/feathr_getting_started"
The user was running jupyter notebook from windows.
without forward slash('dbfs:/feathr_getting_started') the job configs were coming as '--join-config', 'dbfs:/feathr_getting_started\feature_join.conf'

To fix this, the user need to add foward slash: `work

enhancement good first issue
EvenOldridge
EvenOldridge commented Jun 8, 2021

Current version of bucketize uses fixed boundaries. If the user doesn't know these boundaries they need to calculate them using cudf.

We should support splitting continuous variables into buckets based on quantile and uniform splits of the data.

For uniform splits the statistics gathering phase needs to compute the min and max of the column and figure out the boundaries to create N buckets.

enhancement good first issue
evalml
skrawcz
skrawcz commented May 11, 2022

Is your feature request related to a problem? Please describe.
The friction to getting the examples up and running is installing the dependencies. A docker container with them already provided would reduce friction for people to get started with Hamilton.

Describe the solution you'd like

  1. A docker container, that has different python virtual environments, that has the dependencies t
documentation good first issue help wanted

Improve this page

Add a description, image, and links to the feature-engineering topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the feature-engineering topic, visit your repo's landing page and select "manage topics."

Learn more