Skip to content
#

scikit-learn

scikit-learn logo

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

Here are 4,872 public repositories matching this topic...

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated May 13, 2021
  • Python
chan4cc
chan4cc commented Apr 26, 2021

New Operator

Describe the operator

Why is this operator necessary? What does it accomplish?

This is a frequently used operator in tensorflow/keras

Can this operator be constructed using existing onnx operators?

If so, why not add it as a function?

I don't know.

Is this operator used by any model currently? Which one?

Are you willing to contribute it?

TomAugspurger
TomAugspurger commented Aug 25, 2021

dask.data

In [12]: import dask.dataframe as dd, pandas as pd

In [13]: df = dd.from_pandas(pd.DataFrame({"A": [1, 2]}), npartitions=1)

In [14]: df.head()
/home/taugspurger/miniconda3/envs/stac-table/lib/python3.9/site-packages/dask/dataframe/core.py:6778: UserWarning: Insufficient elements for `head`. 5 elements requested, only 2 elements available. Try passing larger `npartiti
eddiebergman
eddiebergman commented Jul 29, 2021

Building the doc fails for example 40_advanced/example_single_configurations on the current development branch

Logs here

...
generating gallery for examples/40_advanced... [ 50%] example_debug_logging.py

Warning, treated as error:
/home/runner/work/auto-sklearn/auto-sklearn/examples/40_advanced/example_single_configu
sktime
Lovkush-A
Lovkush-A commented Aug 13, 2021

Describe the bug
If you load arrow_head data with default split settings, the resulting dataframe has indices that repeat. this is because there is a concatenation of train and test data

To Reproduce

from sktime.datasets import load_arrow_head
X, y = load_arrow_head(return_X_y=True)
X.index.values

Output:

array([  0,   1,   2,   3,   4,   5,   6,   7,   8,
igel
anjali-rgpt
anjali-rgpt commented Aug 27, 2021
  • igel version: 0.3.1
  • Python version: 3.6.9
  • Operating System: Ubuntu 18.04 LTS running as a Linux Subsystem on WSL2

Description

Adding support for K-Medoids Clustering from the sklearn_extra library.
This clustering method would be useful for median-based distance metrics in clustering, because it reduces the impact of outliers on finding new central points, and calculates dissimil

A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.

  • Updated Aug 25, 2021
  • Python

Created by David Cournapeau

Released January 05, 2010

Latest release 4 months ago

Repository
scikit-learn/scikit-learn
Website
scikit-learn.org
Wikipedia
Wikipedia

Related Topics

python scikit