bigdata

Here are 1,219 public repositories matching this topic...

argenisleon commented Feb 22, 2020

Hi,
I am trying to load a CSV with no header using

df = vaex.open('data/star0000-1.csv',sep=",", header=None, error_bad_lines=False)

but I get

could not convert column 0, error: TypeError('getattr(): attribute name must be string'), will try to convert it to string
Giving up column 0, error: TypeError('getattr(): attribute name must be string')
could not convert column 
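One possible workaround, sketched here under an assumption the issue does not confirm: if the error comes from the integer column labels that `header=None` produces, giving the file a header row of string names before loading avoids them. The helper name `add_string_header` and the `col_` prefix are hypothetical, and the sketch uses only the Python standard library, not the vaex API.

```python
import csv
import io


def add_string_header(csv_text: str, prefix: str = "col_") -> str:
    """Return csv_text with a generated header row (col_0, col_1, ...) prepended.

    Hypothetical helper: the names and prefix are illustrative only.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    if not rows:
        # Nothing to label; return the input unchanged.
        return csv_text
    # Use the widest row so every column gets a name.
    width = max(len(row) for row in rows)
    header = [f"{prefix}{i}" for i in range(width)]
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return out.getvalue()
```

The resulting text can be written back to disk and opened as a regular CSV with a header, so every column name is a string.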
imback82 commented Mar 19, 2019

We create multiple jars during our builds to accommodate multiple versions of Apache Spark. In the current approach, the implementation is copied from one version to another and then necessary changes are made.

An ideal approach could create a common directory and extract common classes from duplicate code. Note that even if class/code is exactly the same, you cannot pull out to a common clas

ansorre commented Jul 24, 2019

On the home page of the website, https://nlp.johnsnowlabs.com/, I read "Full Python, Scala, and Java support".

Unfortunately, I have been trying to use Spark NLP in Java for 3 days now without any success.

  • I cannot find the Java API (JavaDoc) of the framework.
  • Not even a single example in Java is available.
  • I do not know Scala, and I do not know how to convert things like:
    val testData = spark.createDataFrame(
allwefantasy commented Aug 29, 2019

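MLSQL currently provides a set of Chinese documentation.
English documentation also exists, but it is very incomplete and much of it is outdated. We sincerely invite anyone with good English
to translate the Chinese documentation into English.

Translation workflow

  1. Fork the project https://github.com/allwefantasy/streamingpro
  2. Clone the project you just forked
  3. The Chinese documentation is in <project root>/docs/gitbook/zh/
  4. The English documentation is in <project root>/docs/gitbook/en/
  5. Claim files under the Chinese documentation
  6. Put the translated drafts in the English documentation directory
  7. Push your work, then choose to submit a new PR on the web

The English must stay consistent with the Chinese. Anyone who claims a task should list the claimed Chinese chapters in this issue so that others do not duplicate the work.

For example, as user A, I can go by the table-of-contents file to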

andrewjw1995 commented May 16, 2018

The documentation file appears to have been generated with no space between the hashes and the header text. As a result, the headers do not render correctly and the file is difficult to read. See below for an example with and without the space:

##

Mobius API Documentation


###Microsoft.Spark.CSharp.Core.Accumulator</
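A fix along the lines the issue describes could be sketched as a small post-processing step that inserts the missing space after the leading hashes. This is a minimal sketch, not the project's generator; the function name `add_header_space` is hypothetical, and it assumes headers use one to six `#` characters at the start of a line.

```python
import re

# One to six hashes at the start of a line, immediately followed by a
# character that is neither another hash nor whitespace.
_HEADER_WITHOUT_SPACE = re.compile(r"^(#{1,6})(?=[^#\s])", re.MULTILINE)


def add_header_space(markdown: str) -> str:
    """Insert a space between leading hashes and the header text."""
    return _HEADER_WITHOUT_SPACE.sub(r"\1 ", markdown)
```

Note that this simple pattern would also touch lines inside code blocks that begin with `#` (for example `#include`), so a real fix would probably belong in the documentation generator itself.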

KVasya commented Jul 16, 2019

When using artm.SmoothSparseThetaRegularizer(tau=tau_val) with tau_val<0, we get some \Theta matrix columns filled entirely with zeros. Judging by the perplexity score, the optimization converges. The number of documents with all zeros in their \Theta columns grows as $\text{tau\_val} \to -\infty$.
How is it possible that the optimization constraint on the \Theta columns is violated?

iamtmrobinson
iamtmrobinson commented May 30, 2018

The installation section currently says to use npm install --save circos with no further instruction.

Doing just this and then using Browserify to require circos results in the following error:
Error: Cannot find module '!!../../node_modules/css-loader/index.js!./tooltip.css' from '/.../node_modules/circos/dist'

Is there an assumption that Webpack is being used by those who try to impo
