-
Updated
Sep 30, 2020
scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
Here are 3,745 public repositories matching this topic...
-
Updated
Oct 7, 2020 - Python
-
Updated
Oct 11, 2020 - Jupyter Notebook
-
Updated
Oct 5, 2020 - Jupyter Notebook
-
Updated
Oct 1, 2020 - Python
-
Updated
Oct 12, 2020 - Jupyter Notebook
-
Updated
Oct 1, 2020 - Jupyter Notebook
-
Updated
Jul 31, 2020
-
Updated
Oct 5, 2020 - Python
Like xarray, pandas supports attaching arbitrary metadata to DataFrames and persisting it across operations. https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.attrs.html
Dask could pretty easily implement this as well. We'd have a
_Frame.attrs
property. This would likely returnself._meta.attrs
.- We'd verify that
dd.from_pandas(data)
correctly extractsattrs
from `da
-
Updated
Oct 1, 2020 - Python
-
Updated
Oct 1, 2020 - Jupyter Notebook
For example, if there is a relationship transaction.session_id -> sessions.id
and we are calculating a feature transactions: sessions.SUM(transactions.value)
any rows for which there is no corresponding session should be given the default value of 0
instead of NaN
.
Of course this should not normally occur, but when it does it seems more reasonable to use the default_value
.
`DirectF
with the Power Transformer.
-
Updated
Jul 12, 2019 - Jupyter Notebook
-
Updated
Apr 24, 2020 - Jsonnet
-
Updated
Oct 6, 2020 - CSS
I see the code
device = ‘cuda’ if torch.cuda.is_available() else ‘cpu’
repeated often in user code. Maybe we should introduce device='auto'
exactly for this case?
-
Updated
Oct 14, 2020 - C++
Interpret
Yes
-
Updated
Nov 12, 2019 - Jupyter Notebook
-
Updated
Oct 11, 2020 - Python
resuming training
How do i resume training for text classification?
-
Updated
Oct 13, 2020 - Python
Is your feature request related to a problem? Please describe.
it would be useful for users to be able to download the classification/regression problems directly from timeseriesclassification.com in code. Apparently tslearn does this, and it seems like a good idea
Describe the solution you'd like
I'm not sure where it should be located, perhaps with the load data tools
**Describ
-
Updated
Jul 23, 2020 - Jupyter Notebook
Description
In the future, igel should support multiple dataset format other than csv. Maybe add support for excel, json and sql tables.
It would be great if users have the flexibility of providing their datasets in other formats and not only csv.
I think it could be useful, when one wants to plot only e.g. class 1, to have an option to produce consistent plots for both plot_cumulative_gain and plot_roc
At the moment, instead, only plot_roc supports such option.
Thanks a lot
Created by David Cournapeau
Released January 05, 2010
Latest release 2 months ago
- Repository
- scikit-learn/scikit-learn
- Website
- scikit-learn.org
- Wikipedia
- Wikipedia
Bug Report
These tests were run on s390x. s390x is big-endian architecture.
Failure log for helper_test.py