451 questions
0
votes
0
answers
36
views
Truncated transcription in chirp model
I am using google.cloud.speech_v2 client library for python to get the
transcription of a short (< 1 min) audio in spanish. It works fine with model = "long", language code = "es-US&...
1
vote
3
answers
188
views
How do I split a sentence into words and punctuation?
I want to replace any word that follows a certain condition in a text, with another word, which varies, depending on the first word. The possibilities of the other word is stored in an array, and the ...
0
votes
0
answers
21
views
Solr 9 punctuation issue
I am migrating Solr 3 to Solr 9. However, the issues I have now are:
Solr 9 returns no results where punctuation is included within quotes (a phrase search), such as queries: "Electric Vehicles: ...
0
votes
1
answer
383
views
Existing list of all punctuation/whitespace chars in C#
I'm splitting a string on all punctuation and whitespace characters. Rather than build a complicated (?) regex to match what C# considers "punctuation" and "whitespace" characters, ...
0
votes
0
answers
100
views
TypeError: "float" object is not iterable
The first thing is that I import a .csv file and I´m working some tasks without any problem, but now I have this issue that I´m not able to solve.
After defining a function I expected to remove all ...
-2
votes
2
answers
67
views
Individually Add Punctuation (apostrophe) to a Printed Word?
I'm curious about adding punctuation without the need for a script/add-on, specifically for apostrophes since commas and ellipsis seem just fine by default.
So how would you individually add ...
0
votes
2
answers
88
views
Autocapitalize each sentence beginning after a period
I'm trying to make it so that when a user is entering text in an input field, their sentences will autocapitalize. Some people are already bad enough with proper grammar and punctualization, so this ...
0
votes
2
answers
60
views
R {quanteda}: remove accents in a dictionary
I want to remove accents and punctuation from a dictionary. For example, I want to transform "à l'épreuve" into "a l epreuve". The dictionary is this one: https://www.poltext.org/...
1
vote
1
answer
402
views
How to add a line break after every period in a .txt file
I want to add line break after every .?! but not in decimals. However, I'm not very skilled at programming and all that (I only learned really basic python).
So I wanted to know if there's a way to do ...
1
vote
1
answer
224
views
Add a space between dot or comma and a letter but with some exceptions
I need to do some grammar validation, for example add spaces after dots. Problem is that it shouldn't be done everywhere like in e.g. or www.example.co. or some more advanced exceptions like 999.77.SA....
0
votes
2
answers
111
views
Remove punctuations from pandas column but keep original list of lists structure
I know how to do it for a single list in a cell but I need to keep the structure of multiple list of lists as in [["I","need","to","remove","punctuations&...
1
vote
1
answer
485
views
Bert Tokenizer punctuation for named entity recognition task
I'm working on a named entity recognition task, where I need to identify person names, books etc.
I am using Huggingface Transformer package and BERT with PyTorch. Generally it works very good, ...
1
vote
1
answer
89
views
How to remove punctuation within a string?
I am doing text cleaning for my pandas dataframe
This is a string from my description column before punctuation is removed:
['dedicated', 'to', 'support', 'the', 'fast-paced', 'technology',
'...
-1
votes
2
answers
94
views
How to find punctuation marks in str python [duplicate]
its a kattis problem "Canadians, eh?"
if the sentence ends with "eh?" the person is canadian and if not its an imposter
this is my code rn
a = input()
found = a.find('e')
found2= a[...
2
votes
1
answer
354
views
how do i solve AttributeError: 'float' object has no attribute 'encode'
this is the code
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
plt.style.use('ggplot')
import nltk
df = pd.read_csv('/kaggle/input/starbucks-review-...