#
text-segmentation
Here are 43 public repositories matching this topic...
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
spellcheck
fuzzy-search
fuzzy-matching
edit-distance
levenshtein
levenshtein-distance
spelling
spell-check
chinese-text-segmentation
word-segmentation
approximate-string-matching
spelling-correction
damerau-levenshtein
text-segmentation
chinese-word-segmentation
symspell
-
Updated
Oct 31, 2021 - C#
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
nlp
sentiment-analysis
unsupervised
named-entity-recognition
text-summarization
dependency-parser
keyword-extraction
text-segmentation
text-cleaning
gitee
new-word-discovery
pyhanlp
harvesttext
-
Updated
May 10, 2021 - Python
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
nlp
tokenizer
text-processing
semeval
nlp-library
word-segmentation
spelling-correction
tokenization
text-segmentation
spell-corrector
word-normalization
-
Updated
Feb 8, 2021 - Python
Python port of SymSpell
python
spellcheck
fuzzy-search
fuzzy-matching
edit-distance
levenshtein
levenshtein-distance
spelling
spell-check
chinese-text-segmentation
word-segmentation
approximate-string-matching
spelling-correction
damerau-levenshtein
text-segmentation
chinese-word-segmentation
symspell
-
Updated
Sep 1, 2021 - Python
A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).
machine-learning
awesome
ocr
computer-vision
deep-learning
text-recognition
text-detection
text-segmentation
end-to-end-ocr
video-ocr
-
Updated
Sep 15, 2019
A sentence segmenter that actually works!
-
Updated
Aug 18, 2020 - Python
Implementation of the paper: Text Segmentation as a Supervised Learning Task
-
Updated
Oct 2, 2019 - Python
(yet another not really) awesome topic/text segmentation list
-
Updated
Nov 26, 2018
Fast Word Segmentation with Triangular Matrix
spellcheck
spell-check
spelling-checker
spell-checker
word-segmentation
spelling-correction
spelling-corrector
spellchecker
text-segmentation
spell-corrector
symspell
-
Updated
Oct 31, 2021 - C#
Mandarin Chinese text segmentation and mobile dictionary Android app (中文分词)
-
Updated
Nov 2, 2020 - Java
Fast SymSpell written in c++ and exposes to python via pybind11
python
spellcheck
fuzzy-search
fuzzy-matching
spelling
spell-check
word-segmentation
spelling-correction
spelling-corrector
text-segmentation
pybind11
compound-words
symspell
-
Updated
Jun 2, 2021 - C++
Printed and handwritten text segmentation using fully convolutional networks and CRF post-processing
machine-learning
segmentation
conditional-random-fields
fully-convolutional-networks
text-segmentation
printed-handwritten-text
-
Updated
Jan 14, 2021 - Python
Uses GloVe embeddings and greedy sequence segmentation to semantically segment a text document into any number of k segments.
-
Updated
Feb 17, 2019 - Jupyter Notebook
Spelling correction and string segmentation written in Go
golang
spellcheck
spelling
spell-check
word-segmentation
spelling-correction
text-segmentation
symspell
string-segmentation
-
Updated
Sep 19, 2021 - Go
Word Segmentation with Dynamic Programming
spellcheck
spell-check
spell-checker
word-segmentation
spelling-correction
spelling-corrector
spellchecker
text-segmentation
spell-corrector
symspell
-
Updated
Oct 31, 2021 - C#
Text segmentation into separate words using a simple unigram model and the Viterbi algorithm
-
Updated
Sep 12, 2021 - C#
This project aimed to perform text segmentation in images using AutoEncoders.
-
Updated
Jul 7, 2020 - Jupyter Notebook
Neural and nonneural text segmentation methods.
-
Updated
Oct 8, 2018 - Jupyter Notebook
Data for the ACL 2020 paper - Improving Segmentation for Technical Support Problems
natural-language-processing
nlp-machine-learning
text-segmentation
nlp-datasets
ibm-research-ai
acl2020
-
Updated
Jun 23, 2020
Image Analysis Toolkit for text document Binarization & Segmentation written in TypeScript.
angular
typescript
angular-material
image-processing
image-analysis
text-segmentation
binarization
web-workers
ostu-threshold
sauvola-threshold
gpp-threshold
arlsa-segmentation
-
Updated
Aug 12, 2021 - TypeScript
"WBSUBNdb_text: Bangla handwritten text document dataset" is a Bangla text dataset containing 1352 offline handwritten text documents contributed by 188 writers. The dataset is composed of both simple and compound characters.
computer-vision
image-processing
text-segmentation
bangla-dataset
bangla-ocr
bangla-handwritten-text-dataset
-
Updated
Oct 13, 2021
Transcript segmentation using the average semantic encodings of cue sentences.
-
Updated
Sep 28, 2021 - Python
Language processing interface: some tools to process different natural languages
natural-language-processing
wordnet
segmentation
preprocessing
tokenization
stemming
text-segmentation
word-tokenizing
-
Updated
Jul 28, 2017 - Java
Text segmentation solution using natural language processing.
nlp
viterbi-algorithm
machine-learning
natural-language-processing
word
dynamic-programming
word-segmentation
nlp-machine-learning
dp
viterbi
machine-intelligence
text-segmentation
unigram
bigram-model
bigram
unigram-model
-
Updated
Jun 30, 2021 - Jupyter Notebook
Perl wrapper for CppJieba (Chinese text segmentation)
-
Updated
Jun 27, 2021 - SWIG
一些容易被分词工具被分错的句子。
natural-language-processing
text-analysis
segmentation
chinese-nlp
chinese-text-segmentation
text-segmentation
chinese-word-segmentation
-
Updated
Apr 11, 2021 - Jupyter Notebook
Improve this page
Add a description, image, and links to the text-segmentation topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the text-segmentation topic, visit your repo's landing page and select "manage topics."
ENUMs for the catalyst-based conventions.
As an example, I will use
scope
.Motivation
We have a lot of callbacks that used
scope
and maybe some other parameter that has only in catalyst-based meaning.And every time we need to do something like that: