-
Updated
Apr 9, 2022 - Python
#
information-retrieval
Here are 1,633 public repositories matching this topic...
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
python
machine-learning
information-retrieval
data-mining
ocr
deep-learning
image-processing
cnn
pytorch
lstm
optical-character-recognition
crnn
scene-text
scene-text-recognition
easyocr
bug
Issue described a bug
difficulty easy
Easy issue: required small fix
good first issue
Issue for new contributors (not required gensim understanding + very simple)
fasttext
Issues related to the FastText model
ZanSara
commented
Mar 16, 2022
Problem
Currently FARMReader
will ask users to raise max_seq_length
every time some samples are longer than the value set to it. However, this can be confusing if max_seq_length
is already set to the maximum value allowed by the model, because raising it further will cause hard-to-read CUDA errors.
See #2177.
Solution
We should find a way to query the model for the maximum va
type:feature
New feature or request
good first issue
Good for newcomers
topic:models
journey:intermediate
e.g. model training, api, evaluation...
Apache Lucene and Solr open-source search software
-
Updated
Apr 8, 2022
Fetches system/theme information in terminal for Linux desktop screenshots.
-
Updated
Feb 23, 2022 - Shell
Accelerated deep learning R&D
python
infrastructure
machine-learning
natural-language-processing
information-retrieval
research
reinforcement-learning
computer-vision
deep-learning
text-classification
distributed-computing
image-processing
pytorch
image-classification
metric-learning
recommender-system
object-detection
image-segmentation
reproducibility
text-segmentation
-
Updated
Apr 11, 2022 - Python
Learning to Rank in TensorFlow
-
Updated
Feb 21, 2022 - Python
Deep neural network to extract intelligent information from invoice documents.
information-retrieval
deep-neural-networks
deep-learning
invoices
keras
information-extraction
classification
invoice
billing
deeplearning
keras-neural-networks
invoice-pdf
invoice-management
keras-tensorflow
invoice-software
invoice-insight
invoice-parser
-
Updated
Jul 8, 2021 - Python
A collection of research on knowledge graphs
natural-language-processing
information-retrieval
paper
survey
knowledge-graph
question-answering
representation-learning
cross-modal
knowledge-graph-completion
ner
dialogue-systems
reasoning
relation-extraction
commonsense
temporal-knowledge-graph
recommendation-systems
meta-relational-learning
-
Updated
Mar 24, 2022 - JavaScript
Python Keyphrase Extraction module
python
natural-language-processing
information-retrieval
keyword
computational-linguistics
keyword-extraction
keyphrase-extraction
keyphrase
-
Updated
Apr 11, 2022 - Python
Track any ip address with IP-Tracer. IP-Tracer is developed for Linux and Termux. you can retrieve any ip address information using IP-Tracer.
linux
information-retrieval
ip-location
ip-geolocation
termux
hacking-tool
linux-tools
information-gathering
hacking-tools
termux-tool
termux-hacking
ip-tracer
gnuroot-debian
-
Updated
Feb 12, 2022 - PHP
Apache Lucene open-source search software
-
Updated
Apr 12, 2022 - Java
Resources to learn more about Machine Learning and Artificial Intelligence
machine-learning
natural-language-processing
information-retrieval
reinforcement-learning
deep-learning
artificial-intelligence
knowledge-graph
question-answering
probabilistic-programming
bayesian-inference
recommender-systems
causal-inference
knowledge-representation
reasoning
-
Updated
Feb 8, 2022
telegram group scraper tool. fetch all information about group members
linux
information-retrieval
telegram
python3
promotion
termux
information-gathering
smsbomber
termux-tool
telegram-scraper-bot
telegram-scraper
-
Updated
Jun 20, 2021 - Python
Anserini is a Lucene toolkit for reproducible information retrieval research
-
Updated
Apr 12, 2022 - Java
A curated list of papers dedicated to neural text (semantic) matching.
-
Updated
Dec 6, 2020 - HTML
Information Gathering Instagram.
python
linux
instagram
information-retrieval
scraper
osint
python3
instagram-scraper
termux
information-gathering
termux-tool
-
Updated
Apr 3, 2022 - Python
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
machine-learning
natural-language-processing
information-retrieval
clustering
record-linkage
fuzzy-matching
deduplication
-
Updated
May 5, 2021 - JavaScript
PISA: Performant Indexes and Search for Academia
-
Updated
Apr 3, 2022 - C++
nlp
natural-language-processing
information-retrieval
deep-learning
transformers
pytorch
artificial-intelligence
question-answering
reading-comprehension
bert
-
Updated
Apr 30, 2020 - Python
Hardware-accelerated vector database and search engine. Available as a HTTP service or as an embedded library.
search
search-engine
machine-learning
information-retrieval
nlu
vector-space-model
vector-space
search-algorithms
resin
nlu-engine
-
Updated
Apr 12, 2022 - C#
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
natural-language-processing
information-retrieval
corpus
language-detection
embeddings
named-entity-recognition
normalizer
spell-check
persian-language
stemmer
dependency-parser
persian-nlp
part-of-speech-tagger
morphological-analysis
persian-stemmer
shallow-parser
-
Updated
Apr 2, 2022
My Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
-
Updated
Jun 5, 2017 - Python
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
nlp
elasticsearch
benchmark
information-retrieval
deep-learning
retrieval
pytorch
dataset
bert
dpr
passage-retrieval
question-generation
sentence-transformers
sbert
zero-shot-retrieval
colbert
retrieval-models
ance
use-qa
-
Updated
Apr 7, 2022 - Python
allRank is a framework for training learning-to-rank neural models based on PyTorch.
python
machine-learning
information-retrieval
deep-learning
pytorch
transformer
ranking
learning-to-rank
ndcg
click-model
-
Updated
Aug 16, 2021 - Python
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
-
Updated
Apr 12, 2022 - Python
Tools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering etc.
-
Updated
Dec 24, 2018 - Python
ACL'2021: Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too
-
Updated
Apr 8, 2022 - Python
word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding
information-retrieval
text-classification
word2vec
text-generation
information-extraction
knowledge-graph
network-embedding
sequence-labeling
dialogue-systems
sentence2vec
machine-reading-comprehension
pretrained-language-model
-
Updated
Jan 11, 2021 - OpenEdge ABL
Improve this page
Add a description, image, and links to the information-retrieval topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the information-retrieval topic, visit your repo's landing page and select "manage topics."
In gensim/models/fasttext.py: