Skip to content
#

R

r logo

R is a free programming language and software environment for statistical computing and graphics. R has a wide variety of statistical linear and non-linear modeling and provides numerous graphical techniques.

Here are 26,310 public repositories matching this topic...

dash
sameermahajan
sameermahajan commented Nov 15, 2021

make_future_dataframe doesn't support regressors currently. So code like:

m = Prophet()
m.add_regressor('var')
m.fit(df)
forecasts = m.predict(m.make_future_dataframe(periods=7))

gives an error like:

ValueError: Regressor 'var' missing from dataframe when attempting to generate forecasts

I know prophet may not know what exact values to put for var in each of the rows a

enhancement good first issue
LightGBM
jameslamb
jameslamb commented Jan 27, 2021

Summary

mypy shows some issues in LightGBM's Python package.

mypy \
    --exclude='python-package/compile/|python-package/build' \
    --ignore-missing-imports \
    python-package/
18 errors in 4 files (click me)
python-package/lightgbm/compat.py:12: error: Name 'Series' already defined (possibly by an import)
python-package
fingoldo
fingoldo commented Mar 24, 2022

Problem:

_catboost.pyx in _catboost._set_features_order_data_pd_data_frame()

_catboost.pyx in _catboost.get_cat_factor_bytes_representation()

CatBoostError: Invalid type for cat_feature[non-default value idx=1,feature_idx=336]=2.0 : cat_features must be integer or string, real number values and NaN values should be converted to string.

Could you also print a feature name, not o

wowchemy-hugo-themes

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

  • Updated May 14, 2022
  • Jupyter Notebook
metaflow
data-science-at-the-command-line
jeroenjanssens
jeroenjanssens commented Jun 10, 2020

I'm happy to announce that I'll be writing the second edition of Data Science at the Command Line (O'Reilly, 2014). This issue explains why I think a second edition is needed, lists what changes I plan to make, and presents a tentative outline. Finally, I have a few words about the process and giving feedback.

Why a second edition?

While the command line as a technology and as a way of w

wave
ehoppmann
ehoppmann commented Aug 23, 2019

Our xgboost models use the binary:logistic' objective function, however the m2cgen converted version of the models return raw scores instead of the transformed scores.

This is fine as long as the user knows this is happening! I didn't, so it took a while to figure out what was going on. I'm wondering if perhaps a useful warning could be raised for users to alert them of this issue? A warning

bug help wanted good first issue

Created by Ross Ihaka, Robert Gentleman

Released August 1993

Website
www.r-project.org
Wikipedia
Wikipedia

Related Topics

language