feature-engineering
Here are 1,366 public repositories matching this topic...
-
Updated
Mar 3, 2022 - Python
Once Woodwork implements this issue, we can clean up the Woodwork initialization in add_last_time_indexes
to pass in the previous dataframe's table schema to keep that typing information but also perform inference on the new last time index column.
-
Updated
Mar 18, 2022 - Java
Is your feature request related to a problem? Please describe.
Currently in feature_store.yaml, we can only specify a region for DynamoDB provider. As a result, it requires an actual DynamoDB to be available when we want to do local development/testing or integration testing in a sandbox environment.
Describe the solution you'd like
A way to solve this is to let user pass an endpoint
Problem
Some of our transformers & estimators are not thoroughly tested or not tested at all.
Solution
Use OpTransformerSpec
and OpEstimatorSpec
base test specs to provide tests for all existing transformers & estimators.
When using r2 as eval metric for regression task (with 'Explain' mode) the metric values reported in Leaderboard (at README.md file) are multiplied by -1.
For instance, the metric value for some model shown in the Leaderboard is -0.41, while when clicking the model name leads to the detailed results page - and there the value of r2 is 0.41.
I've noticed that when one of R2 metric values in the L
-
Updated
Feb 10, 2021 - Python
If select
closure in select into
result is empty, we just skip saving, and the job is succeed. But users can't find the output file. It's not intuitive.
We should throw an exception.
-
Updated
Feb 11, 2022 - Python
-
Updated
Feb 14, 2017 - Jupyter Notebook
The transformer should create computations over windows of past values of the features, and populate them at time t, t being the time of the forecast.
It uses pandas rolling, outputs several comptutations, mean, max, std, etc, and pandas shift to move the computations to the right row.
tmp = (data[variables]
.rolling(window='3H').mean() # Average the last 3 hr values.
.
-
Updated
Mar 18, 2022 - Python
-
Updated
Feb 4, 2021 - Jupyter Notebook
-
Updated
Dec 20, 2017 - Python
-
Updated
Dec 15, 2018 - Jupyter Notebook
-
Updated
Mar 6, 2022 - Python
-
Updated
Jan 20, 2021 - Python
-
Updated
May 8, 2019 - Python
-
Updated
Mar 18, 2022 - Jupyter Notebook
-
Updated
Oct 26, 2018
-
Updated
Mar 16, 2022 - Python
In #3324 , we had to mark some tests as expected to fail since XGBoost was throwing a FutureWarning. The warning has been addressed in XGBoost, so we're just waiting for the PR merged to be released. This issue is discussed in the #3275 issue.
evalml/tests/component_tests/test_xgboost_classifier.py
needs to have the @pytest.mark.xfail
removed f
-
Updated
Feb 16, 2022 - Jupyter Notebook
-
Updated
Mar 16, 2022 - Python
-
Updated
Mar 1, 2022 - Jupyter Notebook
-
Updated
Jul 1, 2019 - Python
what
Ray workflows seems like something we could easily add too https://docs.ray.io/en/latest/workflows/concepts.html given that we now have GraphAdapters
Task
- Implement something very similar to the RayGraphAdapter, i.e. RayWorkflowGraphAdapter. The hypothesis is that then we just need to use workflow step function to wrap hamilton functions.
- implement an integration test f
-
Updated
Mar 16, 2022 - Python
Improve this page
Add a description, image, and links to the feature-engineering topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the feature-engineering topic, visit your repo's landing page and select "manage topics."
Describe the issue:
During computing Channel Dependencies
reshape_break_channel_dependency
does following code to ensure that the number of input channels equals the number of output channels:This is correct