python (65.1k questions)
javascript (44.2k questions)
reactjs (22.7k questions)
java (20.8k questions)
c# (17.4k questions)
html (16.3k questions)
r (13.7k questions)
android (12.9k questions)
How to count word similarity between two pandas dataframe
Here's my first dataframe df1
Id Text
1 Asoy Geboy Ngebut
2 Asoy kita Geboy
3 Bersatu kita Teguh
Here's my second dataframe df2
Id Text
1 Bersatu Kita
2 Asoy Geboy Jalanan
Simila...
Nabih Bawazir
Votes: 0
Answers: 1
Efficiently compute cosine similarity
I have a bank of about 100k strings and when I get a new string, I want to match it to the most similar string.
My thoughts were to use tf-idf (makes sense as keywords are quite important), then match...
user112633
Votes: 0
Answers: 1
How calculate clusters coherence/quality?
I did embeddings with fasttext and I have clusters thanks to KMeans.
I would like to calculate similarities inside each cluster to check if the sentences inside are well clustered. I want to keep sent...
codelifevcd
Votes: 0
Answers: 1
Determining cosine similarity for large datasets
I am currently using a dataset of over 2.5 million images, of which I use the image itself as a comparison to eachother, for use in a content-based recommendation engine.
I use the following code to c...
pippinpaddleopsicopolis
Votes: 0
Answers: 2