Updated Jun 15, 2020 - Python
# document-classification
Here are 93 public repositories matching this topic...
Text Classification Algorithms: A Survey
deep-learning
random-forest
text-classification
recurrent-neural-networks
naive-bayes-classifier
dimensionality-reduction
logistic-regression
document-classification
convolutional-neural-networks
text-processing
decision-trees
boosting-algorithms
support-vector-machines
hierarchical-attention-networks
nlp-machine-learning
conditional-random-fields
k-nearest-neighbours
deep-belief-network
rocchio-algorithm
deep-neural-network
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
nlp
text-classification
question-answering
document-classification
transfer-learning
fasttext
language-model
textcnn
attention-is-all-you-need
self-attention
transformer-encoder
bert-model
pre-training
language-understanding
Updated Jan 1, 2019 - Python
Hierarchical Attention Networks for Document Classification in PyTorch
Updated Mar 4, 2020 - Jupyter Notebook
Open issue: Add type hints
daemon commented on Mar 26, 2019
Hierarchical Attention Networks for document classification
python
nlp
deep-neural-networks
deep-learning
text-classification
cnn
python3
pytorch
document-classification
deeplearning
hierarchical-attention-networks
nlp-machine-learning
han
Updated Jun 16, 2020 - Python
HDLTex: Hierarchical Deep Learning for Text Classification
information-retrieval
text-mining
deep-neural-networks
deep-learning
text-classification
tensorflow
gpu
recurrent-neural-networks
dataset
document-classification
convolutional-neural-networks
hierarchical-deep-learning
science-dataset
Updated May 29, 2020 - Python
NickYi1990 commented on Apr 1, 2019:
Your tutorials are too good to be true, will u finish this text classification tutorial? I really enjoy reading your repo!
ML data annotations made super easy for teams. Just upload data, add your team and build training/evaluation dataset in hours.
java
image-processing
image-classification
image-captioning
document-classification
image-segmentation
ner
annotation-tool
document-annotate
Updated Jun 7, 2020 - JavaScript
A Python package that implements a novel text classifier (SS3) with visualization tools for Explainable Artificial Intelligence (XAI)

nlp
machine-learning
natural-language-processing
text-mining
data-mining
text-classification
machine-learning-algorithms
artificial-intelligence
document-classification
sentence-classification
interpretability
multilabel-classification
explainable-artificial-intelligence
interpretable-ml
xai
interpretable-machine-learning
document-categorization
early-classification
text-labeling
ss3-classifier
Updated May 27, 2020 - Python
lc1915 commented on Mar 3, 2020:
This is definitely a good implementation of the paper. As shown in the paper, it provides the visualization of the importance (attention) of each word in a document. I'm wondering, can we get the attention score for each word from the codes? Thank you so much!
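The question above is about per-word attention weights. As a minimal sketch of the idea (not code from the repository, which may expose these values differently), the word-level attention in a hierarchical attention network can be written as a projection through tanh, a dot product with a context vector, and a softmax. The function name `word_attention`, the parameters `W`, `b`, `context`, and the toy numbers are all hypothetical placeholders, not trained values.

```python
import math


def word_attention(hidden_states, W, b, context):
    """Return (weights, sentence_vector) for one sentence.

    hidden_states: one vector per word (e.g. encoder outputs)
    W, b:          projection parameters, u_t = tanh(W h_t + b)
    context:       word-level context vector u_w
    All values passed in below are illustrative assumptions.
    """
    scores = []
    for h in hidden_states:
        # Project the hidden state: u_t = tanh(W h_t + b)
        u = [
            math.tanh(sum(w_ij * h_j for w_ij, h_j in zip(row, h)) + b_i)
            for row, b_i in zip(W, b)
        ]
        # Unnormalised score: dot product of u_t with the context vector
        scores.append(sum(u_i * c_i for u_i, c_i in zip(u, context)))

    # Softmax over the words, shifted by the max for numerical stability
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Sentence vector: attention-weighted sum of the hidden states
    dim = len(hidden_states[0])
    sentence_vector = [
        sum(a * h[k] for a, h in zip(weights, hidden_states))
        for k in range(dim)
    ]
    return weights, sentence_vector


# Toy example with made-up hidden states for a four-word sentence
words = ["the", "food", "was", "delicious"]
hidden = [[0.1, 0.0], [0.5, 0.2], [0.0, 0.1], [0.9, 0.8]]
W = [[1.0, 0.0], [0.0, 1.0]]
b = [0.0, 0.0]
context = [1.0, 1.0]

weights, sentence_vector = word_attention(hidden, W, b, context)
for word, weight in zip(words, weights):
    print(f"{word}: {weight:.3f}")
```

The `weights` list is the per-word attention score; in a real model the same quantity would be returned from the attention layer alongside the sentence vector so it can be plotted over the input text.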
TensorFlow implementation of Hierarchical Attention Networks for Document Classification, with some extensions
python
font
natural-language-processing
deep-learning
tensorflow
document-classification
ideogram
logogram
Updated Apr 14, 2017 - Python
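TextClf: a text classification framework based on PyTorch/Sklearn that includes logistic regression, SVM, TextCNN, TextRNN, TextRCNN, DRNN, DPCNN, BERT, and other models. Data processing, model training, and testing can all be run through simple configuration.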
sentiment-analysis
label
svm
word2vec
pytorch
logistic-regression
document-classification
glove
configurable
bert
sklearn-classify
drnn
textcnn
textrnn
cnn-text-classification
dpcnn
lstm-text-classification
neuralclassifier
Updated Mar 8, 2020 - Python
TensorFlow implementation of the paper "Hierarchical Attention Networks for Document Classification"
sentiment-analysis
text-classification
tensorflow
document-classification
attention-mechanism
hierarchical-attention-networks
Updated May 28, 2019 - Python
nlp
library
framework
deep-learning
sentiment-analysis
text-classification
keras
lstm
attention
document-classification
sentence-classification
nlp-machine-learning
keras-tensorflow
cnn-text-classification
stacked-lstm
Updated Apr 28, 2019 - Python
Helps you organize your paperwork
Updated Mar 31, 2020 - PHP
Character-level CNN for text classification
nlp
natural-language-processing
deep-neural-networks
deep-learning
text-classification
pytorch
document-classification
nlp-machine-learning
character-level-cnn
Updated Jan 31, 2019 - Python
GroupDocs.Classification-for-.NET samples and showcase (text and document classification and sentiment analysis)
sentiment-analysis
analysis
examples
sentiment
classification
document
document-classification
iab
showcases
Updated May 19, 2020 - C#
Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks
deep-learning
image-classification
document-classification
transfer-learning
structure-learning
deep-convolutional-neural-networks
document-image-classification
training-strategies
Updated Nov 16, 2019 - Python
TensorFlow implementation of the ACL 2018 paper "A deep relevance model for zero-shot document filtering" by Chenliang Li, Wei Zhou, Feng Ji, Yu Duan, and Haiqing Chen: http://aclweb.org/anthology/P18-1214
Updated Apr 29, 2019 - Python
3HAN: A Deep Neural Network for Fake News Detection: https://link.springer.com/chapter/10.1007%2F978-3-319-70096-0_59
Updated Jun 21, 2018 - Python
Very deep CNN for text classification
nlp
natural-language-processing
deep-neural-networks
deep-learning
text-classification
pytorch
document-classification
deeplearning
very-deep-cnn
vdcnn
Updated Jan 31, 2019 - Python
Character-level CNN for text classification
nlp
natural-language-processing
deep-neural-networks
deep-learning
text-classification
tensorflow
document-classification
nlp-machine-learning
character-level-cnn
Updated Jan 31, 2019 - Python
Python implementation of bag-of-concepts
machine-learning
text-mining
clustering
word2vec
concept
document-classification
representation-learning
unsupervised-learning
datamining
bag-of-concepts
document-representation
Updated Apr 5, 2019 - Python
Very deep CNN for text classification
nlp
natural-language-processing
deep-neural-networks
deep-learning
text-classification
tensorflow
document-classification
deeplearning
nlp-machine-learning
very-deep-cnn
vdcnn
Updated Jan 31, 2019 - Python
NLP in Python: vector space modelling and document classification
Updated Mar 19, 2017 - Jupyter Notebook
A set of tools for leveraging pre-trained embeddings, active learning, and model explainability for efficient document classification
python
sklearn
word-embeddings
document-classification
flair
active-learning
model-interpretation
eli5
document-classifier
model-interpretability
Updated Sep 9, 2019 - HTML
Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles"
Updated Mar 24, 2020 - Python
Simple command-line scripts for document classification
Updated Jun 26, 2019 - Python
Document classification using latent semantic analysis in Python
python
natural-language-processing
deep-learning
tensorflow
keras
document
document-classification
tf-idf
lsa
latent-semantic-analysis
Updated Oct 17, 2017 - Jupyter Notebook
In the file data_util.py, the code is as follows:
`def batch(inputs):
    batch_size = len(inputs)
    # Number of sentences in each document; this differs from batch to batch.
    document_sizes = np.array([len(doc) for doc in inputs], dtype=np.int32)
    # Largest number of sentences in any document of this batch.
    document_size = document_sizes.max()
    sentenc