
Questions tagged [bert]

BERT stands for Bidirectional Encoder Representations from Transformers and is designed to pre-train deep bidirectional representations by jointly conditioning on both left and right context in all layers.

2 votes · 2 answers · 53 views

BERT + CNN Model Underfitting for Binary Text Classification: How to Improve?

I'm working on a binary text classification task using a BERT + CNN model. However, based on the loss and accuracy graphs during training, it seems that the model is underfitting, and I'm not seeing ...
DMabulage · 121
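
The question doesn't include code, so here is a minimal sketch of a typical BERT + CNN setup (PyTorch + Hugging Face transformers; the model name, filter sizes, and dropout are illustrative assumptions). When such a model underfits, the usual first checks are that the BERT weights aren't frozen, the learning rate isn't too small, and training runs long enough.

```python
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class BertCnnClassifier(nn.Module):
    def __init__(self, model_name="bert-base-uncased", n_filters=100):
        super().__init__()
        self.bert = AutoModel.from_pretrained(model_name)  # keep unfrozen
        hidden = self.bert.config.hidden_size
        # Convolutions slide over the token dimension of the last hidden states.
        self.convs = nn.ModuleList(
            [nn.Conv1d(hidden, n_filters, kernel_size=k) for k in (3, 4, 5)]
        )
        self.dropout = nn.Dropout(0.2)
        self.fc = nn.Linear(3 * n_filters, 1)  # one logit for the binary task

    def forward(self, input_ids, attention_mask):
        h = self.bert(input_ids=input_ids,
                      attention_mask=attention_mask).last_hidden_state
        h = h.transpose(1, 2)  # (batch, hidden, seq_len) for Conv1d
        pooled = [torch.relu(c(h)).max(dim=2).values for c in self.convs]
        return self.fc(self.dropout(torch.cat(pooled, dim=1))).squeeze(-1)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = BertCnnClassifier()
batch = tokenizer(["great product", "terrible service"],
                  padding="max_length", max_length=32, return_tensors="pt")
logits = model(batch["input_ids"], batch["attention_mask"])
loss = nn.BCEWithLogitsLoss()(logits, torch.tensor([1.0, 0.0]))
```
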
0 votes · 1 answer · 49 views

Find the correlation between two lists of texts

Let's say that I have some lists of texts such as: ...
Leon · 1
0 votes · 2 answers · 105 views

Calculate the correlation of two lists of embeddings

I have two lists of sentences ...
Leon · 1
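
These two adjacent questions (raw texts vs. precomputed embeddings) reduce to the same recipe: embed each text, then compare vectors. A sketch with sentence-transformers; the model name and sentences are placeholders, and whether cosine similarity or Pearson correlation is the right comparison depends on what "correlation" means here.

```python
import numpy as np
from sentence_transformers import SentenceTransformer

texts_a = ["The cat sat on the mat.", "Stocks fell sharply today."]
texts_b = ["A cat is resting on a rug.", "The market dropped this morning."]

model = SentenceTransformer("all-MiniLM-L6-v2")
emb_a = model.encode(texts_a, normalize_embeddings=True)
emb_b = model.encode(texts_b, normalize_embeddings=True)

# Cosine similarity of each aligned pair (rows of A vs. rows of B).
cosine = (emb_a * emb_b).sum(axis=1)

# Pearson correlation between the two embedding vectors of each pair.
pearson = [np.corrcoef(a, b)[0, 1] for a, b in zip(emb_a, emb_b)]
print(cosine, pearson)
```
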
0 votes · 1 answer · 35 views

Handle text column with PyTorch

I'm new to ML, so the question may be naive. I have a data set with multiple numeric columns and one text column; the text is just one sentence. I want to use all available data for classification, but I don'...
Kliver Max
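
A common pattern for this, sketched below: encode the text column into a fixed-size vector with BERT, concatenate it with the numeric columns, and classify the combined vector. The feature count, layer sizes, and model name are illustrative.

```python
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class MixedInputClassifier(nn.Module):
    def __init__(self, n_numeric, n_classes, model_name="bert-base-uncased"):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(model_name)
        self.head = nn.Sequential(
            nn.Linear(self.encoder.config.hidden_size + n_numeric, 128),
            nn.ReLU(),
            nn.Linear(128, n_classes),
        )

    def forward(self, input_ids, attention_mask, numeric):
        # [CLS] vector as a fixed-size sentence representation.
        cls = self.encoder(input_ids=input_ids,
                           attention_mask=attention_mask).last_hidden_state[:, 0]
        return self.head(torch.cat([cls, numeric], dim=1))

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = MixedInputClassifier(n_numeric=3, n_classes=2)
enc = tokenizer(["one short sentence"], return_tensors="pt")
logits = model(enc["input_ids"], enc["attention_mask"],
               torch.tensor([[0.1, 2.0, -1.0]]))  # the 3 numeric features
```
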
0 votes · 0 answers · 16 views

How does token_type_id affect self-attention and other mechanisms in BERT?

I know that BERT uses the NSP (Next Sentence Prediction) task for pre-training: two sentences are separated by a [SEP] token, and each sentence gets a different token type id. My downstream task is to build a ...
jupyter · 101
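
For context (this matches the BERT paper and the transformers implementation): token_type_ids select a learned segment embedding that is added to the token and position embeddings before the first layer; they do not mask or otherwise gate self-attention. A quick way to inspect this:

```python
from transformers import AutoModel, AutoTokenizer

tok = AutoTokenizer.from_pretrained("bert-base-uncased")
enc = tok("first sentence", "second sentence", return_tensors="pt")
print(enc["token_type_ids"])  # 0s for [CLS] + sentence A, 1s for sentence B

model = AutoModel.from_pretrained("bert-base-uncased")
# One learned embedding row per segment id, added to the input embeddings.
print(model.embeddings.token_type_embeddings)  # Embedding(2, 768)
```
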
1 vote · 0 answers · 29 views

Sampling multiple masked tokens through Metropolis–Hastings

I'm trying to replicate the findings of the publication "Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis-Hastings" for obtaining the joint distribution ...
Chris · 11
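
Lacking the asker's code, here is a stripped-down sampler in the spirit of that paper: repeatedly mask one position and redraw it from BERT's conditional. With the conditional used as both proposal and target, the MH acceptance ratio cancels to 1 (i.e. a Gibbs step); the paper's actual MH correction targets a pseudo-likelihood energy instead, which this sketch does not implement.

```python
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("bert-base-uncased")
mlm = AutoModelForMaskedLM.from_pretrained("bert-base-uncased").eval()

def conditional(ids, pos):
    """BERT's distribution over the vocabulary at `pos`, with `pos` masked."""
    masked = ids.clone()
    masked[0, pos] = tok.mask_token_id
    with torch.no_grad():
        logits = mlm(input_ids=masked).logits[0, pos]
    return torch.softmax(logits, dim=-1)

ids = tok("the capital of france is paris", return_tensors="pt")["input_ids"]
for _ in range(50):
    pos = int(torch.randint(1, ids.shape[1] - 1, (1,)))  # skip [CLS]/[SEP]
    probs = conditional(ids, pos)
    ids[0, pos] = int(torch.multinomial(probs, 1))  # resample this position
print(tok.decode(ids[0], skip_special_tokens=True))
```
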
0 votes · 0 answers · 66 views

Using multiple text inputs for one output with RoBERTa/DistilBERT

In a current project I want to fine-tune a RoBERTa/DistilBERT model for text classification. The model should take two text input features, each limited to a length of around 280 characters, and generate a ...
Mime · 101
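
The simplest route, sketched below, is to let the tokenizer encode the two fields as a text pair so the model sees both in one sequence; the model name and lengths are illustrative.

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tok = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForSequenceClassification.from_pretrained("roberta-base",
                                                           num_labels=2)
enc = tok(
    "first ~280-character text field",   # input A
    "second ~280-character text field",  # input B
    truncation=True, max_length=256, return_tensors="pt",
)
logits = model(**enc).logits  # RoBERTa joins the pair with </s></s>
```
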
0 votes · 0 answers · 33 views

Best model for enforcing corporate naming conventions

I'm working on a project (Python) to enforce the company naming convention for products on product lists provided by clients/suppliers. I have a list of company names (standardised names) and those ...
Secret Ambush
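
Before reaching for BERT, a dependency-free fuzzy-matching baseline against the standardised list is worth trying; the names below are made up.

```python
from difflib import get_close_matches

standard_names = ["Acme Widget 500", "Acme Widget 700", "Acme Gadget Pro"]
lookup = {s.lower(): s for s in standard_names}  # map back to original casing

supplier_name = "ACME widget-500"
hits = get_close_matches(supplier_name.lower(), list(lookup), n=1, cutoff=0.6)
print(lookup[hits[0]] if hits else "no match")  # -> Acme Widget 500
```
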
0 votes · 0 answers · 27 views

NLP model for word recovery (analogous to BERT, but at the letter level)

I am working on the problem of restoring words in text where some letters are missing. For example (restore words whose vowels were removed): Hll wrld -> Hello world; n ltrntv ssssmnt sggsts -...
SoH · 119
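
One plausible framing (an assumption, not the only option): treat restoration as character-level sequence-to-sequence, e.g. with ByT5, which operates on raw bytes so no devoweled word falls outside a subword vocabulary. A fine-tuning sketch with placeholder data:

```python
from transformers import AutoTokenizer, T5ForConditionalGeneration

tok = AutoTokenizer.from_pretrained("google/byt5-small")
model = T5ForConditionalGeneration.from_pretrained("google/byt5-small")

# (devoweled, original) training pairs -- replace with a real dataset
pairs = [("Hll wrld", "Hello world")]
enc = tok([src for src, _ in pairs], return_tensors="pt", padding=True)
labels = tok([tgt for _, tgt in pairs],
             return_tensors="pt", padding=True)["input_ids"]
loss = model(**enc, labels=labels).loss  # optimise this in a training loop
loss.backward()
```
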
0 votes · 0 answers · 32 views

How can I make my Hugging Face fine-tuned model's config.json file reference a specific revision/commit from the original pretrained model?

I uploaded this model: https://huggingface.co/pamessina/CXRFE, which is a fine-tuned version of this model: https://huggingface.co/microsoft/BiomedVLP-CXR-BERT-specialized Unfortunately, CXR-BERT-...
Pablo Messina
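
I'm not aware of a config.json field that pins the parent repo's commit; the usual workaround is to record the commit SHA and pass it via from_pretrained's revision argument, which pins an exact commit at load time:

```python
from transformers import AutoModel, AutoTokenizer

REV = "main"  # replace with the full commit SHA you want to freeze
model = AutoModel.from_pretrained(
    "microsoft/BiomedVLP-CXR-BERT-specialized",
    revision=REV,
    trust_remote_code=True,  # this repo ships custom model code
)
tok = AutoTokenizer.from_pretrained(
    "microsoft/BiomedVLP-CXR-BERT-specialized",
    revision=REV,
    trust_remote_code=True,
)
```
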
0 votes · 0 answers · 26 views

Fine-tuning a pretrained model on 2 tasks with 2 labeled datasets

I am having difficulty using BERT for a sentiment analysis task that handles both aspect-based sentiment analysis (ABSA) and comment sentiment analysis. I know that using two separate classification ...
ndycuong
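
A common multi-task pattern for this: one shared BERT encoder with a separate classification head per task, alternating batches from the two labeled datasets. A sketch with placeholder label counts:

```python
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class TwoTaskModel(nn.Module):
    def __init__(self, n_absa_labels=3, n_comment_labels=3):
        super().__init__()
        self.encoder = AutoModel.from_pretrained("bert-base-uncased")
        h = self.encoder.config.hidden_size
        self.absa_head = nn.Linear(h, n_absa_labels)
        self.comment_head = nn.Linear(h, n_comment_labels)

    def forward(self, task, **enc):
        cls = self.encoder(**enc).last_hidden_state[:, 0]  # [CLS] vector
        head = self.absa_head if task == "absa" else self.comment_head
        return head(cls)

tok = AutoTokenizer.from_pretrained("bert-base-uncased")
model = TwoTaskModel()
enc = tok(["the battery life is great"], return_tensors="pt")
absa_logits = model("absa", **enc)        # step on a batch from the ABSA set
comment_logits = model("comment", **enc)  # step on a batch from the comment set
```
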
3 votes · 0 answers · 69 views

Weird behaviour when using RoBERTa for text classification

I have a dataset with around 70 classes, largely balanced at ~150 samples per class. I am fine-tuning RoBERTa-base for 4 epochs with a ...
user1274878
2 votes · 0 answers · 162 views

Use text embeddings to map job descriptions to ESCO occupations

I'm trying to build a model to map job descriptions to ESCO occupations, which is a taxonomy of job titles. Every ESCO occupation has a title, a description, and some essential skills. Ideally I ...
GanaelD · 21
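
A retrieval baseline for this kind of taxonomy mapping: embed each ESCO occupation (title plus description) once, embed each job description, and take the nearest occupation by cosine similarity. The model name and data below are illustrative.

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")
occupations = [
    "data scientist: analyses large datasets to extract insights",
    "software developer: designs and implements software systems",
]
occ_emb = model.encode(occupations, convert_to_tensor=True,
                       normalize_embeddings=True)

job = "We are hiring someone to build and maintain our web backend."
job_emb = model.encode(job, convert_to_tensor=True, normalize_embeddings=True)
scores = util.cos_sim(job_emb, occ_emb)[0]
print(occupations[int(scores.argmax())])
```
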
1 vote · 1 answer · 121 views

Reducing email token counts: preprocessing large email datasets for feeding LLMs

I have a large email dataset in .txt format and want to feed LLMs (like Gemini and ChatGPT) to provide answers based on email content. The token count for my email data is very high (~1M for 1K emails)...
Rafael Borja
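
Typical first-pass reductions drop quoted reply chains and signatures before anything reaches the LLM. A regex-based sketch; the patterns are heuristics, not a full email parser.

```python
import re

def shrink_email(text: str) -> str:
    # Remove quoted replies ("> ..." lines and "On ... wrote:" blocks).
    text = re.sub(r"(?m)^>.*$", "", text)
    text = re.sub(r"(?s)On .{0,80} wrote:.*", "", text)
    # Cut a trailing signature introduced by the conventional "-- " marker.
    text = re.split(r"(?m)^-- $", text)[0]
    # Collapse runs of blank lines.
    return re.sub(r"\n{3,}", "\n\n", text).strip()

raw = "Thanks!\n\n-- \nJane Doe\n> earlier message\n> more quoting\n"
print(shrink_email(raw))  # -> "Thanks!"
```
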
1 vote · 1 answer · 413 views

How can I use contextual embeddings with BERT for sentiment analysis/classification

I have a BERT model which I want to use for sentiment analysis/classification, e.g. I have some tweets that need to get a POSITIVE, NEGATIVE, or NEUTRAL label. I can't understand how contextual ...
average_discrete_math_enjoyer
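
The usual recipe is that BERT's contextual embeddings are consumed implicitly: add a classification head and fine-tune, rather than extracting embeddings by hand. A sketch with an assumed 3-class label scheme:

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tok = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=3
)

enc = tok(["I love this!", "This is awful."], padding=True, return_tensors="pt")
labels = torch.tensor([2, 0])  # assumed mapping: 0=NEGATIVE, 1=NEUTRAL, 2=POSITIVE
out = model(**enc, labels=labels)
out.loss.backward()  # one fine-tuning step; wrap in an optimizer loop in practice
```
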
