Skip to content
#

scikit-learn

scikit-learn logo

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

Here are 3,239 public repositories matching this topic...

pranavsharma
pranavsharma commented Feb 27, 2020

Several parts of the op sec like the main op description, attributes, input and output descriptions become part of the binary that consumes ONNX e.g. onnxruntime causing an increase in its size due to strings that take no part in the execution of the model or its verification.

Setting __ONNX_NO_DOC_STRINGS doesn't really help here since (1) it's not used in the SetDoc(string) overload (s

eugeneh101
eugeneh101 commented Apr 24, 2020

If you join Dask DataFrame on a categorical column, then the outputted Dask DataFrame column is still category dtype. However, the moment you .compute() the outputted Dask DataFrame, then the column is the wrong dtype, not categorical.

Tested on Dask 2.14.0 and Pandas 1.0.3
This example where the category type looks like a float, so after .compute(), the dtype is float.

import dask.d
rickiepark
rickiepark commented Dec 31, 2018

(p380) In last sentence of 1st paragraph and first sentence of 2nd paragraph, "Pitt" should be "Pitts".

(p384) In a picture, "3nd Layer" should be "3rd Layer".

(p385) In 1st paragraph, "the o superscript" should be "the out superscript".

(p406) In 1st code block, "print('Training accuracy: ..)" should be "print('Test accuracy: ..)"

(p410) In 1st paragraph,

CJStadler
CJStadler commented Jul 23, 2019

For example, if there is a relationship transaction.session_id -> sessions.id and we are calculating a feature transactions: sessions.SUM(transactions.value) any rows for which there is no corresponding session should be given the default value of 0 instead of NaN.

Of course this should not normally occur, but when it does it seems more reasonable to use the default_value.

`DirectF

cfregly
cfregly commented Apr 17, 2019
  File "/root/miniconda3/bin/pipeline", line 11, in <module>
    sys.exit(_main())
  File "/root/miniconda3/lib/python3.7/site-packages/cli_pipeline/cli_pipeline.py", line 5734, in _main
    _fire.Fire()
  File "/root/miniconda3/lib/python3.7/site-packages/fire/core.py", line 127, in Fire
    component_trace = _Fire(component, args, context, name)
  Fil
cli
yellowbrick
aaronmarkham
aaronmarkham commented Dec 6, 2019

I tried building the docs, but was met with a graphviz error. Typically this means I can spend a few hours pecking away at the dependencies until I get stable build... or someone that has it working can export their environment, and publish an environment.yml that we can use with the build instructions.
I was going off of the d2l book since that's a dep here, but their [environment.yml](https://g

hermidalc
hermidalc commented Jan 10, 2020

RFE/RFECV are not only feature selectors (SelectorMixin) but also classifiers/regressors (MetaEstimatorMixin), though ELI5 explain_weights doesn't support them as classifiers/regressors. The final fit of an RFE/RFECV object is a fitted estimator with either rfe.estimator_.coef_ or rfe.estimator_.feature_importances_ and in sklearn you do not usually follow up RFE/RFECV with another classifier

scikit-plot
StrikerRUS
StrikerRUS commented Oct 18, 2019

I'm sorry if I missed this functionality, but CLI version hasn't it for sure (I saw the related code only in generate_code_examples.py). I guess it will be very useful to eliminate copy-paste phase, especially for large models.

Of course, piping is a solution, but not for development in Jupyter Notebook, for example.

Created by David Cournapeau

Released January 05, 2010

Latest release 20 days ago

Repository
scikit-learn/scikit-learn
Website
scikit-learn.org
Wikipedia
Wikipedia

Related Topics

python scikit
You can’t perform that action at this time.