#

text-classification

Here are 1,327 public repositories matching this topic...

dipanjan77 commented Aug 12, 2019

Description

Add a ReadMe file in the GitHub folder.
Explain usage of the Templates

Other Comments

Principles of NLP Documentation
Each landing page at the folder level should have a ReadMe which explains -
○ Summary of what this folder offers.
○ Why and how it benefits users
○ As applicable - Documentation of using it, brief description etc
Scenarios folder:

Shotgun167 commented May 1, 2019

I have some values in slots that are surrounded by curly braces and are meant to be returned as is. Instead, the trailing brace is being stripped. "${website}" becomes "${website". I have training examples where the whole "${website}" is included. Is there a way to change this behavior?

prabhakar267 commented Feb 16, 2018

If I have a word, how do I get the top k words closest to it? As far as I understand, there is a way to do this from the C++ code, but I can't find anything in the Python library.
Something similar to what gensim word2vec implementation has:

model.most_similar(positive=[your_word_vector], topn=1)
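
The top-k lookup asked about above can be sketched independently of any particular library. This is a minimal illustration that assumes the word vectors are already available as a plain dictionary. The names `cosine`, `top_k_neighbors`, and the toy `embeddings` are hypothetical and do not belong to the library in question.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # Treat a zero vector as having no similarity to anything.
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def top_k_neighbors(word, embeddings, k=3):
    """Return the k (word, similarity) pairs closest to `word`, best first."""
    query = embeddings[word]
    scored = [
        (other, cosine(query, vec))
        for other, vec in embeddings.items()
        if other != word  # exclude the query word itself
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]

# Toy vectors, invented purely for illustration.
embeddings = {
    "king": [1.0, 0.9, 0.0],
    "queen": [0.9, 1.0, 0.0],
    "apple": [0.0, 0.1, 1.0],
    "pear": [0.1, 0.0, 0.9],
}
neighbors = top_k_neighbors("king", embeddings, k=2)
```

A brute-force scan like this is linear in the vocabulary size, which is usually acceptable for a one-off query but slow for large vocabularies queried repeatedly.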
simpletransformers
ThilinaRajapakse commented May 2, 2020

As Simple Transformers grows, the single-page README documentation has become quite bloated and difficult to use. Because of this, I've decided that it's time (if not a little late already) to move the documentation to a more user-friendly website hosted on GitHub Pages at the link below.

https://thilinarajapakse.github.io/simpletransformers/

As of now, only the text classification section is

dongcy-AHU commented Sep 22, 2019

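I think the line else:row.append(i + vocab_size) here should be changed to else:row.append(i + train_size+vocab_size), is that right?
This code builds the sparse adjacency matrix for the doc nodes and word nodes, and the adjacency matrix has size train_size + vocab_size + test_size. When the document index i is greater than train_size, aren't the remaining documents the test_size documents being connected to words? The test_size block is preceded by train_size+vocab_size in the adjacency matrix, so shouldn't those documents be connected to words one by one starting from train_size+vocab_size?

The source code is as follows: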

for i in range(len(shuffle_doc_words_list)):
    doc_words = shuffle_doc_words_
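
The row layout under discussion can be checked with a small sketch. It assumes that i is a global index over all documents, with the train_size training documents first and the test documents after them, as the loop over the whole document list suggests. The helper `doc_row_index` and the sizes below are hypothetical and are not taken from the repository.

```python
def doc_row_index(i, train_size, vocab_size):
    """Map a global document index i to its row in the adjacency matrix.

    Assumed layout: [train docs | word nodes | test docs].
    """
    if i < train_size:
        return i  # training documents occupy the first rows
    # i already counts the train_size documents before it, so adding
    # vocab_size skips exactly the word-node block.
    return i + vocab_size

# Toy sizes, invented for illustration.
train_size, vocab_size, test_size = 3, 4, 2
total_nodes = train_size + vocab_size + test_size
rows = [doc_row_index(i, train_size, vocab_size)
        for i in range(train_size + test_size)]
```

Under that assumption the first test document (i equal to train_size) lands on row train_size + vocab_size, and the last one lands on the final row of the matrix. Adding train_size a second time would push the test rows past the end of the matrix.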

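Natural language processing (NLP), Xiaojiang robot (retrieval-based chit-chat chatbot), BERT sentence vectors and similarity (Sentence Similarity), XLNET sentence vectors and similarity (text xlnet embedding), text classification, entity extraction (ner, bert+bilstm+crf), data augmentation (text augment, data enhance), paraphrase and synonym generation, sentence main-part extraction (mainpart), Chinese short-text similarity, text feature engineering, calling keras-http-service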

  • Updated May 6, 2020
  • Python

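A deep learning framework for natural language processing based on PyTorch and torchtext, with five functional modules: sequence labeling, text classification, sentence relations, text generation, and structure analysis. It implements named entity recognition, Chinese word segmentation, part-of-speech tagging, semantic role labeling, sentiment analysis, relation extraction, language modeling, text similarity, textual entailment, dependency parsing, word vector training, a chatbot, machine translation, text summarization, and more. The framework is feature-rich, works out of the box, and is very easy to pick up! Most of it was learned from other people's implementations and then modified and merged into the framework; the hyperparameters were not carefully tuned, and there are quite a few bugs~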

  • Updated Jan 10, 2020
  • Python
icecity96 commented Mar 6, 2018

OS: MacOS 10.13.3
I installed MeTA as instructed in the setup guide. When I run the unit tests, I get:

describe [ranker regression]
libc++abi.dylib: terminating with uncaught exception of type meta::corpus::corpus_exception: corpus configuration file (../data//cranfield/line.toml) not present

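Chinese long-text classification, short-sentence classification, multi-label classification, and two-sentence similarity (Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short); base classes for building character, word, and sentence embedding layers (embeddings) and network layers (graph); FastText, TextCNN, CharCNN, TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, capsule network (CapsuleNet), Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN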

  • Updated Apr 3, 2020
  • Python
EagleYing commented Nov 10, 2018

(tensorflow) F:\Postgraduate\KaggleLearning\multi-class-text-classification-cnn-rnn-master\multi-class-text-classification-cnn-rnn-master>python predict.py ./trained_results_1541818386/ ./data2/samples.csv
D:\Anaconda\anaconda\envs\tensorflow\lib\site-packages\gensim\utils.py:1212: UserWarning: detected Windows; aliasing chunkize to chunkize_serial
warnings.warn("detected Windows; aliasing c
