natural-language-processing
Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article that proposed a measure of machine intelligence, now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.
Here are 7,818 public repositories matching this topic...
Change tensor.data to tensor.detach() due to pytorch/pytorch#6990 (comment): tensor.detach() is more robust than tensor.data.
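The robustness difference can be illustrated with a small sketch. It assumes PyTorch is installed; the helper name `grad_after_inplace_zero` is made up for this example and is not taken from the linked issue. A tensor obtained through `.data` can be modified in place without autograd noticing, so the gradient may come out silently wrong, whereas the same modification through `.detach()` is detected during the backward pass.

```python
import torch


def grad_after_inplace_zero(use_detach: bool) -> torch.Tensor:
    """Zero the output of sigmoid in place, then backpropagate.

    Hypothetical helper for illustration only.
    """
    a = torch.tensor([1.0, 2.0, 3.0], requires_grad=True)
    out = a.sigmoid()  # sigmoid's backward pass reuses `out`
    alias = out.detach() if use_detach else out.data
    alias.zero_()  # in-place edit of the storage shared with `out`
    out.sum().backward()
    return a.grad


# Through .data the edit goes unnoticed and the gradient is computed
# from the zeroed values, so it is all zeros instead of the true gradient.
grad_via_data = grad_after_inplace_zero(use_detach=False)

# Through .detach() autograd sees that a saved tensor changed and raises.
try:
    grad_after_inplace_zero(use_detach=True)
    detach_raised = False
except RuntimeError:
    detach_raised = True

print(grad_via_data, detach_raised)
```

In both cases the alias shares storage with `out`; the difference is only whether autograd is able to detect the in-place change.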
(Triggered by SO question: https://stackoverflow.com/questions/67944732/using-my-own-stopword-list-with-gensim-corpora-textcorpus-textcorpus/67951592#67951592)
Gensim has two remove_stopwords() functions with similar but slightly different behavior, which risks confusing users.
gensim.parsing.preprocessing.remove_stopwords takes a space-delimited string, and always consults the current
Is your feature request related to a problem? Please describe.
I typically use compressed datasets (e.g. gzipped) to save disk space. This works fine with AllenNLP during training because I can write my dataset reader to load the compressed data. However, the predict command opens the file and reads lines for the Predictor. This fails when it tries to load data from my compressed files.
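One way to approach the request is a line reader that accepts both plain and gzip-compressed input. The sketch below uses only the Python standard library; `iter_lines` is a hypothetical helper, not part of AllenNLP, and it assumes UTF-8 text with one input per line.

```python
import gzip
from pathlib import Path
from typing import Iterator, Union

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream


def iter_lines(path: Union[str, Path]) -> Iterator[str]:
    """Yield stripped, non-empty lines from a plain or gzipped text file.

    Hypothetical helper for illustration; not an AllenNLP function.
    """
    path = Path(path)
    # Sniff the magic bytes rather than trusting the file extension.
    with open(path, "rb") as raw:
        is_gzip = raw.read(2) == GZIP_MAGIC
    opener = gzip.open if is_gzip else open
    with opener(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield line
```

Checking the magic bytes means a compressed file is still read correctly if it lacks a `.gz` suffix; other compression formats would need their own branch.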
Hello spoooopyyy hackers
This is a Hacktoberfest only issue!
This is also data-sciency!
The Problem
Our English dictionary contains words that aren't English, and does not contain common English words.
Examples of non-common words in the dictionary:
"hlithskjalf",
"hlorrithi",
"hlqn",
"hm",
"hny",
"ho",
"hoactzin",
"hoactzine
Created by Alan Turing
Let's use this Issue to track performance issues and enhancement requests, so it's easier to prioritize the work.
This is for pytorch transformers.
Also, I will label it as a Good Difficult Issue in case someone is ready for a challenging but rewarding experience of figuring things out. If you do want to take the challenge, comment in the corresponding Issue/PR that resonates with you s