-
Updated
Sep 30, 2020
scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
Here are 3,708 public repositories matching this topic...
-
Updated
Sep 30, 2020 - Python
-
Updated
Oct 4, 2020 - Jupyter Notebook
-
Updated
Sep 25, 2020 - Jupyter Notebook
-
Updated
Oct 1, 2020 - Python
-
Updated
Oct 3, 2020 - Jupyter Notebook
-
Updated
Oct 1, 2020 - Jupyter Notebook
-
Updated
Jul 31, 2020
-
Updated
Oct 3, 2020 - Python
When merging a dask dataframe, the resulting index is duplicated - seems to be because of the number of partitions. See example below:
import pandas as pd
import dask.dataframe as dd
a = dd.from_pandas(pd.DataFrame({'a': [1,2,3,4]}), npartitions=2)
b = pd.DataFrame({'a': [1,2,3,4], 'b': [2,3,4,5]})
a.merge(b, on='a').compute()
Returns
a | b |
---|
|
-
Updated
Oct 1, 2020 - Python
-
Updated
Oct 1, 2020 - Jupyter Notebook
For example, if there is a relationship transaction.session_id -> sessions.id
and we are calculating a feature transactions: sessions.SUM(transactions.value)
any rows for which there is no corresponding session should be given the default value of 0
instead of NaN
.
Of course this should not normally occur, but when it does it seems more reasonable to use the default_value
.
`DirectF
with the Power Transformer.
-
Updated
Jul 12, 2019 - Jupyter Notebook
-
Updated
Apr 24, 2020 - Jsonnet
-
Updated
Sep 30, 2020 - CSS
I see the code
device = ‘cuda’ if torch.cuda.is_available() else ‘cpu’
repeated often in user code. Maybe we should introduce device='auto'
exactly for this case?
-
Updated
Oct 4, 2020 - C++
Interpret
Yes
-
Updated
Nov 12, 2019 - Jupyter Notebook
-
Updated
Oct 3, 2020 - Python
resuming training
How do i resume training for text classification?
-
Updated
Oct 3, 2020 - Python
Describe the solution you'd like
We already have AutoARIMA, but it would be nice to also interface ARIMA, either from statsmodels or pmdarima.
-
Updated
Jul 23, 2020 - Jupyter Notebook
I think it could be useful, when one wants to plot only e.g. class 1, to have an option to produce consistent plots for both plot_cumulative_gain and plot_roc
At the moment, instead, only plot_roc supports such option.
Thanks a lot
Support Series.median()
Created by David Cournapeau
Released January 05, 2010
Latest release 2 months ago
- Repository
- scikit-learn/scikit-learn
- Website
- scikit-learn.org
- Wikipedia
- Wikipedia
Bug Report
These tests were run on s390x. s390x is big-endian architecture.
Failure log for helper_test.py