Skip to content
#

scikit-learn

scikit-learn logo

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

Here are 5,400 public repositories matching this topic...

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Nov 4, 2021
  • Python
chan4cc
chan4cc commented Apr 26, 2021

New Operator

Describe the operator

Why is this operator necessary? What does it accomplish?

This is a frequently used operator in tensorflow/keras

Can this operator be constructed using existing onnx operators?

If so, why not add it as a function?

I don't know.

Is this operator used by any model currently? Which one?

Are you willing to contribute it?

jrbourbeau
jrbourbeau commented Dec 10, 2021

I noticed our release version anchor links in the changelog don't actually reference a specific released version. If I go to the changelog and click on the 2021.12.0 link, I'm redirected to https://docs.dask.org/en/stable/changelog.html#id1 when, naively, I would have expected this link to look like https://docs.dask.org/en/stable/changelog.html#2021.12.0 (or something similar). As you move down

eddiebergman
eddiebergman commented Dec 20, 2021

The components part of our codebase was written sometime ago, with older sklearn versions and before python typing was production ready.

In general, some of these files need to be cleaned up. Mostly typing of parameters and functions, adding documentation a bout these parameters and finally double checking with scikit learn that there aren't some new or deprecated parameters we still use.

To

featuretools
gsheni
gsheni commented Sep 9, 2021
  • With Featuretools 1.0.0 we add a dataframe to an EntitySet with the following:
es = ft.EntitySet('new_es')

es.add_dataframe(dataframe=orders_df,
                 dataframe_name='orders',
                 index='order_id',
                 time_index='order_date')

Improvement

  • However, you could also change the EntitySet setter to add it with this approach:
es = ft.Ent
sktime
fkiraly
fkiraly commented Dec 31, 2021

Names of private functions in datatypes check modules should be changed to lower_snake_case.

Private means: it's used only internally in datatypes and is not imported anywhere else.
For public functions, we will have to go through deprecation. I don't think there are any, but don't know for sure - may be also worth collecting non-compliant ones here.

TheAutumnOfRice
TheAutumnOfRice commented Sep 28, 2021

The current History class has some limitations: (ver 0.10.0)

  1. Currently the history is saved as JSON, as a result, those recorded values are limited to simple numbers and strings. Other objects can not be saved in history files directly.
  2. Saving as JSON takes lots of time and space because numbers are stored in decimal. It's getting worse when the training epoch is increasing.
  3. In some
willsmithorg
willsmithorg commented Dec 26, 2021

Could FeatureTools be implemented as an automated preprocessor to Autogluon, adding the ability to handle multi-entity problems (i.e. Data split across multiple normalised database tables)? So if you supply Autogluon with a list of Dataframes instead of a single Dataframe it would first invoke FeatureTools:

  • take the multiple Dataframes (entities) and try to auto-infer the relationship betwee

A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.

  • Updated Nov 10, 2021
  • Python

Created by David Cournapeau

Released January 05, 2010

Latest release 8 days ago

Repository
scikit-learn/scikit-learn
Website
scikit-learn.org
Wikipedia
Wikipedia

Related Topics

python scikit