-
Updated
Jun 17, 2020 - Python
#
computational-linguistics
Here are 241 public repositories matching this topic...
Curated List: Practical Natural Language Processing done in Ruby
ruby
nlp
list
machine-learning
natural-language-processing
awesome
sentiment-analysis
awesome-list
computational-linguistics
pos-tag
rubynlp
rubyml
-
Updated
Jul 16, 2020 - Ruby
Python Keyphrase Extraction module
python
natural-language-processing
information-retrieval
keyword
computational-linguistics
keyword-extraction
keyphrase-extraction
keyphrase
-
Updated
Sep 24, 2020 - Python
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
python
nlp
machine-learning
natural-language-processing
library
linguistics
computational-linguistics
text-processing
nlp-library
search-algorithms
evaluation-metrics
folia
language-modelling
-
Updated
Mar 13, 2019 - Python
Statistics and accepted paper list of ACL 2020 with arXiv link
-
Updated
Jun 18, 2020 - Jupyter Notebook
-
Updated
Nov 16, 2019
BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser) See http://pypi.python.org/pypi/bllipparser/ for Python module.
nlp
machine-learning
natural-language-processing
ai
parsing
artificial-intelligence
computational-linguistics
nlp-library
-
Updated
Sep 30, 2017 - GAP
Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
nlp
twitter
deep-learning
sentiment-analysis
neural-network
word-embeddings
keras
embeddings
lstm
attention
deeplearning
glove
computational-linguistics
semeval
attention-mechanism
keras-models
nlp-machine-learning
twitter-messages
semeval-sentiment
-
Updated
Jun 8, 2018 - Python
Statistical NLG for spoken dialogue systems
python
dialogue
seq2seq
computational-linguistics
natural-language-generation
dialogue-systems
tgen
seq2seq-generation
-
Updated
Jul 2, 2020 - Python
Data and software for building the ACL Anthology.
rails
natural-language-processing
library
solr
acl
computational-linguistics
solr-server
library-management-system
rails-server
acl-anthology
acl-rails
-
Updated
Oct 1, 2020 - Python
Cantonese Linguistics and NLP in Python
-
Updated
Jul 25, 2020 - Python
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
python
nlp
c-plus-plus
library
corpus
linguistics
pattern-recognition
computational-linguistics
text-processing
ngram
ngrams
skipgram
-
Updated
May 6, 2020 - C++
A curated list of NLP resources for Hungarian
nlp
parser
natural-language-processing
information-retrieval
text-mining
awesome
nlu
corpus
information-extraction
dataset
named-entity-recognition
awesome-list
tagger
computational-linguistics
opinion-mining
corpus-linguistics
nlp-resources
hungarian
natural-language-understanding
hungarian-language
-
Updated
Sep 18, 2020
FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm.
javascript
python
nlp
web-application
linguistics
computational-linguistics
folia
annotation-tool
linguistic-annotation-framework
clarin
clariah
-
Updated
Mar 17, 2020 - JavaScript
Abstract Meaning Representation (AMR) tutorial slides
-
Updated
Mar 9, 2016 - TeX
Материалы курса по компьютерной лингвистике Школы Лингвистики НИУ ВШЭ
-
Updated
Jun 18, 2020 - Jupyter Notebook
Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German
nlp
natural-language-processing
text-mining
computational-linguistics
corpus-linguistics
german-language
-
Updated
Jul 20, 2020
python
docker
linguistics
automatic-speech-recognition
computational-linguistics
kaldi
transcription
-
Updated
Oct 4, 2020 - Python
kylebgorman
commented
Aug 6, 2020
/ɛ, ɔ/ appear in a few-dozen phonemic transcriptions, but they either are allophones of, or incorrect transcriptions, of, /e, o/ in the modern language.
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
nlp
syntax
natural-language-processing
morphology
named-entity-recognition
computational-linguistics
text-processing
dutch
dependency-parser
pos-tagger
folia
lemmatiser
morphological-analyser
-
Updated
Sep 28, 2020 - C++
This repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
python
nlp
natural-language-processing
text-mining
research
spacy
nltk
computational-linguistics
textblob
textual-analysis
-
Updated
Jun 5, 2020 - Jupyter Notebook
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --
-
Updated
Jun 4, 2020 - C++
LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilation/installation script
python
nlp
linux
vagrant
natural-language-processing
virtual-machine
docker-image
installer
flat
linux-distribution
frog
computational-linguistics
webservices
folia
software-distribution
clam
-
Updated
Oct 2, 2020 - Shell
Python tutorials as Jupyter Notebooks for NLP, ML, AI
python
natural-language-processing
deep-learning
parsing
neural-network
wordnet
nltk
deeplearning
computational-linguistics
hidden-markov-model
flair
part-of-speech-tagger
natural-language-understanding
framenet
verbnet
spacy-nlp
propbank
-
Updated
Oct 4, 2020 - Jupyter Notebook
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for processing FoLiA is implemented as part of PyNLPl, this contains higher-level tools that use the library as well as the full documentation, validation schemas, and set definitions
python
nlp
language
library
xml
corpus
linguistics
file-format
computational-linguistics
folia
linguistic-annotation-framework
-
Updated
Sep 2, 2020 - Python
Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.
nlp
machine-learning
tweets
sentiment-analysis
word2vec
word-embeddings
keras
jupyter-notebook
cnn
embeddings
machinelearning
computational-linguistics
convolutional-neural-network
nlp-machine-learning
word2vec-ru
-
Updated
Dec 12, 2019 - Jupyter Notebook
Google News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE.
visualization
nlp
machine-learning
word2vec
word-embeddings
embeddings
machinelearning
computational-linguistics
tsne
nlp-machine-learning
google-news
leo-tolstoy
-
Updated
Nov 15, 2018 - Jupyter Notebook
Yet Another (natural language) Parser
nlp
go
golang
natural-language-processing
disambiguation
computational-linguistics
dependency-parser
nlp-dependency-parsing
nlp-parsing
hebrew
transition-systems
universal-dependencies
morphological-analysis
hebrew-analytical-lexicon
morphological-disambiguator
-
Updated
May 15, 2019 - Go
Improve this page
Add a description, image, and links to the computational-linguistics topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the computational-linguistics topic, visit your repo's landing page and select "manage topics."
Add departments column and values for PIs at Stanford University