#
embedding
Here are 275 public repositories matching this topic...
100+ Chinese Word Vectors (over a hundred pre-trained Chinese word vectors)
-
Updated
Mar 20, 2021 - Python
Open
Suggestion: add tokenizer categories and examples to the documentation
1
akari0216
commented
Sep 2, 2021
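Thank you for reporting issues with PaddleNLP; we greatly appreciate your contribution to PaddleNLP!
When filing your issue, please also provide the following information:
- Version and environment information
1) PaddleNLP and PaddlePaddle versions: please provide your PaddleNLP and PaddlePaddle version numbers, e.g. PaddleNLP 2.0.4, PaddlePaddle 2.1.1
2) System environment: please describe the system type, e.g. Linux/Windows/MacOS, and the Python version
- Reproduction information: if reporting an error, please give the reproduction environment and reproduction steps
paddle version 2.0.8, paddlenlp version 2.1.0
Suggestion: could the PaddleNLP documentation list which category each model's tokenizer is based on (for example, the BERT tokenizer is WordPiece-based and the XLNet tokenizer is SentencePiece-based), along with corresponding input and output examples?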
good first issue
Good for newcomers
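As a rough illustration of the kind of input/output example the issue above asks for, here is a minimal sketch of WordPiece-style greedy longest-match-first splitting. It is not PaddleNLP's API: the function name `wordpiece_tokenize`, the `##` continuation prefix default, and the toy vocabulary are assumptions made for this example only. SentencePiece-based tokenizers work on raw text including whitespace and are not shown here.

```python
def wordpiece_tokenize(word, vocab, unk_token="[UNK]", prefix="##"):
    """Greedy longest-match-first split of one word into subword pieces."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Try the longest remaining substring first, then shrink it.
        while start < end:
            candidate = word[start:end]
            if start > 0:
                candidate = prefix + candidate  # mark word-internal pieces
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            return [unk_token]  # no piece fits: treat the whole word as unknown
        pieces.append(match)
        start = end
    return pieces


# Hypothetical toy vocabulary, for illustration only.
toy_vocab = {"em", "##bed", "##ding", "##s", "token"}

print(wordpiece_tokenize("embeddings", toy_vocab))  # ['em', '##bed', '##ding', '##s']
print(wordpiece_tokenize("tokens", toy_vocab))      # ['token', '##s']
print(wordpiece_tokenize("xlnet", toy_vocab))       # ['[UNK]']
```

A real tokenizer would also normalize and pre-split the text into words before this step; the sketch covers only the per-word matching.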
Open
Some specific suggestions
3
1
Siamese and triplet networks with online pair/triplet mining in PyTorch
machine-learning
deep-learning
pytorch
embedding
triplet-loss
siamese-network
contrastive-loss
triplet-network
learning-embeddings
-
Updated
Sep 7, 2021 - Python
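For readers unfamiliar with online triplet mining, the following is a minimal pure-Python sketch of one common strategy, often called batch-hard mining: for each anchor in a batch, take its farthest same-label sample and its nearest different-label sample. It is not taken from the repository above; the function names, the Euclidean distance, and the margin value are assumptions for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def batch_hard_triplet_loss(embeddings, labels, margin=1.0):
    """Mean triplet loss using, per anchor, its hardest positive and negative."""
    losses = []
    for i, (anchor, label) in enumerate(zip(embeddings, labels)):
        # Distances to other samples with the same label (positives).
        pos = [euclidean(anchor, e)
               for j, (e, l) in enumerate(zip(embeddings, labels))
               if l == label and j != i]
        # Distances to samples with a different label (negatives).
        neg = [euclidean(anchor, e)
               for e, l in zip(embeddings, labels) if l != label]
        if not pos or not neg:
            continue  # this anchor cannot form a triplet within the batch
        # Hardest positive is the farthest; hardest negative is the nearest.
        losses.append(max(0.0, max(pos) - min(neg) + margin))
    return sum(losses) / len(losses) if losses else 0.0


# Toy batch: two well-separated classes in 2-D.
batch = [[0.0, 0.0], [0.0, 1.0], [3.0, 0.0], [3.0, 1.0]]
batch_labels = [0, 0, 1, 1]

print(batch_hard_triplet_loss(batch, batch_labels, margin=1.0))  # 0.0
print(batch_hard_triplet_loss(batch, batch_labels, margin=3.0))  # 1.0
```

With the smaller margin the classes are already separated well enough that no triplet contributes; raising the margin makes every anchor incur a loss.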
A curated list of community detection research papers with implementations.
data-science
machine-learning
deep-learning
social-network
clustering
community-detection
network-science
deepwalk
matrix-factorization
networkx
dimensionality-reduction
factorization
network-analysis
unsupervised-learning
igraph
embedding
graph-clustering
node2vec
network-clustering
bigclam
-
Updated
Mar 2, 2022 - Python
Documents, papers, and code related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classification, Text Generation, Text Similarity, Machine Translation, etc. All code is implemented in TensorFlow 2.0.
tensorflow
svm
word2vec
crf
keras
similarity
classification
attention
gensim
lda
fasttext
ner
embedding
bert
elmo
-
Updated
Jul 5, 2021 - Python
Extensible, parallel implementations of t-SNE
-
Updated
Mar 18, 2022 - Python
Embedding, NMT, Text_Classification, Text_Generation, NER etc.
-
Updated
Apr 10, 2022 - Python
Minimum-distortion embedding with PyTorch
visualization
machine-learning
gpu
cuda
pytorch
dimensionality-reduction
embedding
graph-embedding
feature-vectors
-
Updated
Mar 28, 2022 - Python
Angular penalty loss functions in PyTorch (ArcFace, SphereFace, Additive Margin, CosFace)
pytorch
face-recognition
metric-learning
speaker-recognition
embedding
loss-functions
face-verification
sphereface
normface
fashion-mnist
arcface
am-softmax
fmnist-dataset
loss-function
-
Updated
Oct 5, 2020 - Python
2
freakeinstein
commented
Jul 13, 2019
The current Docker image is far too large at 2.55 GB. Reduce it to under 1 GB by applying the changes from this reference: https://hackernoon.com/tips-to-reduce-docker-image-sizes-876095da3b34
enhancement
New feature or request
help wanted
Extra attention is needed
good first issue
Good for newcomers
low priority
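Macadam is a natural language processing toolkit built on TensorFlow (Keras) and bert4keras, focused on text classification, sequence labeling, and relation extraction. It supports embeddings such as RANDOM, WORD2VEC, FASTTEXT, BERT, ALBERT, ROBERTA, NEZHA, XLNET, ELECTRA, and GPT-2; text classification algorithms such as FineTune, FastText, TextCNN, CharCNN, BiRNN, RCNN, DCNN, CRNN, DeepMoji, SelfAttention, HAN, and Capsule; and sequence labeling algorithms such as CRF, Bi-LSTM-CRF, CNN-LSTM, DGCNN, Bi-LSTM-LAN, Lattice-LSTM-Batch, and MRC.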
text-classification
tensorflow
keras
python3
ner
embedding
bert
sequence-labeling
relation-extraction
-
Updated
Feb 9, 2022 - Python
OpenL3: Open-source deep audio and image embeddings
-
Updated
Apr 13, 2022 - Jupyter Notebook
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
machine-learning
facebook
neural-network
clustering
tensorflow
word2vec
community-detection
deezer
deepwalk
matrix-factorization
gensim
unsupervised-learning
embedding
network-embedding
node2vec
graph-embedding
gemsec
implicit-factorization
semisupervised-learning
m-nmf
-
Updated
Mar 2, 2022 - Python
This repository contains the source code and data for reproducing the results of the Deep Continuous Clustering paper.
clustering
pytorch
robust-optimization
embedding
dcc
rcc
autoencoder-clustering
robust-continuous-clustering
-
Updated
Jul 14, 2021 - Python
A sparsity-aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
data-science
machine-learning
deep-learning
clustering
word2vec
sklearn
community-detection
deepwalk
autoencoder
dimensionality-reduction
unsupervised-learning
cikm
embedding
nmf
coordinate-descent
node2vec
node-embedding
gemsec
mnmf
danmf
-
Updated
Mar 2, 2022 - Python
One-stop solution for encoding sentences into fixed-length vectors using various embedding techniques
nlp
ai
deep-learning
tensorflow
encoder
word2vec
embeddings
transformer
glove
fasttext
albert
embedding
bert
word-embedding
roberta
ulmfit
sentence-encoding
bert-as-service
xlnet
embedding-as-service
-
Updated
Feb 10, 2022 - Python
A scalable implementation of "Learning Structural Node Embeddings Via Diffusion Wavelets" (KDD 2018).
machine-learning
word2vec
spectral
factorization
kdd
wavelet
unsupervised-learning
embedding
diffusion
laplacian
refex
node2vec
graph-embedding
graphwave
graph-wavelet
struc2vec
rolx
structural-embedding
structural-role
heat-kernel
-
Updated
Mar 2, 2022 - Python
Multi-Hop Logical Reasoning in Knowledge Graphs
-
Updated
Mar 27, 2022 - Python
EMNLP 2018: HyTE: Hyperplane-based Temporally aware Knowledge Graph Embedding
machine-learning
tensorflow
wikidata
knowledge-graph
representation-learning
knowledge-base
embedding
yago
temporal-information
-
Updated
May 6, 2019 - Python
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
neural-network
pytorch
speech-recognition
neural-networks
kaldi
speaker-recognition
speaker-verification
embedding
speaker-diarization
tdnn
acoustic-model
acoustic-models
x-vector
tdnn-f
factorized-tdnn
-
Updated
Jan 6, 2020 - Python
Some Cool NLP and CV Repositories and Solutions (a collection of open-source solutions, datasets, tools, and learning materials for common NLP tasks)
-
Updated
Mar 23, 2021
Reference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.
machine-learning
deep-learning
neural-network
tensorflow
embeddings
deepwalk
gensim
complex-networks
factorization
unsupervised-learning
embedding
network-embedding
diffusion
node2vec
graph-embedding
node-embedding
struc2vec
diff2vec
implicit-factorization
semisupervised-learning
-
Updated
Mar 2, 2022 - Python
The reference implementation of "Multi-scale Attributed Node Embedding" (Journal of Complex Networks 2021).
deep-learning
word2vec
deepwalk
gensim
network-analysis
embedding
network-embedding
node2vec
graph-embedding
node-embedding
gemsec
implicit-factorization
asonam
tadw
attributed-embedding
aane
asne
musae
walklets
graph-neural-network
-
Updated
Mar 2, 2022 - Python
Python Wrapper for t-SNE Visualization
-
Updated
Jan 12, 2018 - Python
Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, and clause indexing
machine-learning
regression
classification
convolution
propositional-logic
bandit-learning
embedding
frequent-pattern-mining
rule-based
interpretable
tsetlin-machine
-
Updated
Jan 4, 2022 - C
A lightweight implementation of Walklets from "Don't Walk, Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
machine-learning
deep-learning
word2vec
deepwalk
dimensionality-reduction
gensim
edge-prediction
multiscale
graph-mining
embedding
node2vec
word-embedding
graph-embedding
node-classification
graph-neural-networks
node-embedding
walklet
graphlet
dont-walk-skip
graph-convolution
-
Updated
Mar 2, 2022 - Python
Is there an existing issue for this?
Is your feature request related to a problem? Please describe.
When I start the docker container
Set automatic backup condition parameters:
1) Backup condition by date (year, month, week, day)
2) Number of backup files retained
Recover the data
1) Identification by backup file,rest