Skip to content
#

data-mining

Here are 3,848 public repositories matching this topic...

LightGBM
jameslamb
jameslamb commented Jan 27, 2021

Summary

mypy shows some issues in LightGBM's Python package.

mypy \
    --exclude='python-package/compile/|python-package/build' \
    --ignore-missing-imports \
    python-package/
18 errors in 4 files (click me)
python-package/lightgbm/compat.py:12: error: Name 'Series' already defined (possibly by an import)
python-package
gensim
mpenkov
mpenkov commented Jun 22, 2021

In gensim/models/fasttext.py:

    model = FastText(
        vector_size=m.dim,
        vector_size=m.dim,
        window=m.ws,
        window=m.ws,
        epochs=m.epoch,
        epochs=m.epoch,
        negative=m.neg,
        negative=m.neg,
        # FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
        # or model=3 supervi
bug difficulty easy good first issue fasttext
fingoldo
fingoldo commented Mar 24, 2022

Problem:

_catboost.pyx in _catboost._set_features_order_data_pd_data_frame()

_catboost.pyx in _catboost.get_cat_factor_bytes_representation()

CatBoostError: Invalid type for cat_feature[non-default value idx=1,feature_idx=336]=2.0 : cat_features must be integer or string, real number values and NaN values should be converted to string.

Could you also print a feature name, not o

sktime
fkiraly
fkiraly commented Apr 10, 2022

The Aggregator incorrectly assumes that all hierarchy levels are named, in contrast to the specification of pd_multiindex_hier which does not. The estimator breaks when receiving input where at least one level is not named.

#2432 proposes a temp workaround, but this should be:

  • cleaned up so the inner logic of Aggregator no longer assumes it
  • tested, so we ensure in all estimators t
bug good first issue
ferret
ngirard
ngirard commented Mar 31, 2021

Unless I missed something, the documentation doesn't explain how to query document metadata (searching "site:montferret.dev metadata" through Google returned nothing, neither did grepping the source code).

As an example, I tried to query the og:url metadata.
I tried variations of //meta[property='og:url']::attr(content), with or without the leading //, and with or without the `attr(conte

help wanted good first issue type/documentation

人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域

  • Updated Feb 6, 2020

Improve this page

Add a description, image, and links to the data-mining topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-mining topic, visit your repo's landing page and select "manage topics."

Learn more