natural-language-processing
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Here are 8,385 public repositories matching this topic...
-
Updated
Sep 11, 2021 - Python
-
Updated
Sep 2, 2021 - Jupyter Notebook
-
Updated
Sep 11, 2021 - Python
-
Updated
Sep 5, 2021 - Python
-
Updated
Sep 11, 2021 - Python
-
Updated
Sep 10, 2021 - Python
-
Updated
Jun 12, 2017
-
Updated
Sep 11, 2021
Change tensor.data
to tensor.detach()
due to
pytorch/pytorch#6990 (comment)
tensor.detach()
is more robust than tensor.data
.
-
Updated
Sep 10, 2021 - Python
In gensim/models/fasttext.py:
model = FastText(
vector_size=m.dim,
vector_size=m.dim,
window=m.ws,
window=m.ws,
epochs=m.epoch,
epochs=m.epoch,
negative=m.neg,
negative=m.neg,
# FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
# or model=3 supervi
-
Updated
Sep 2, 2021
-
Updated
May 2, 2021
-
Updated
Sep 10, 2021 - Python
-
Updated
Sep 11, 2021 - Python
Is your feature request related to a problem? Please describe.
I typically used compressed datasets (e.g. gzipped) to save disk space. This works fine with AllenNLP during training because I can write my dataset reader to load the compressed data. However, the predict
command opens the file and reads lines for the Predictor
. This fails when it tries to load data from my compressed files.
The below input should remove all user handles which start with "@".
input: @remy:This is waaaaayyyy too much for you!!!!!!@adam
output : [':', 'This', 'is', 'waaayyy', 'too', 'much', 'for', 'you', '!', '!', '!', '@adam']
The TweetTokenizer fail to remove the user handle of Adam.
I would like to open a pull request that solves the following issues:-
- Improve the regular expression
-
Updated
Dec 22, 2020 - Python
-
Updated
Sep 10, 2021
-
Updated
Sep 11, 2021 - Python
-
Updated
Jul 1, 2021 - Python
-
Updated
Jul 25, 2021 - Jupyter Notebook
-
Updated
Sep 10, 2021 - Java
Hello spoooopyyy hackers
This is a Hacktoberfest only issue!
This is also data-sciency!
The Problem
Our English dictionary contains words that aren't English, and does not contain common English words.
Examples of non-common words in the dictionary:
"hlithskjalf",
"hlorrithi",
"hlqn",
"hm",
"hny",
"ho",
"hoactzin",
"hoactzine
-
Updated
Jul 9, 2021 - Python
-
Updated
Sep 11, 2021 - Python
-
Updated
Aug 9, 2021 - Python
-
Updated
Aug 31, 2021 - Python
Created by Alan Turing
- Wikipedia
- Wikipedia
https://github.com/huggingface/transformers/blob/546dc24e0883e5e9f5eb06ec8060e3e6ccc5f6d7/src/transformers/models/gpt2/modeling_gpt2.py#L698
Assertions can't be relied upon for control flow because they can be disabled, as per the following: