python (65.1k questions)
javascript (44.2k questions)
reactjs (22.7k questions)
java (20.8k questions)
c# (17.4k questions)
html (16.3k questions)
r (13.7k questions)
android (12.9k questions)
Computing similarity between duplicated variables using unique identifier
I have a data set that looks like this, where id is supposed to be the unique identifier. There are duplicates, for example, lines 1 and 4, but not lines 1 and 6 or 3 and 6 due to the year difference....
Rivie Cham
Votes: 0
Answers: 0
How can I organize a list keeping the minimum similarity between adjacent elements from a similarity matrix?
I have a list of 24 samples. For each sample I have a vector with the distances to the other samples. This constitutes a 24 by 24 similarity matrix. In this matrix the similarity values range from 0 t...
a_j_linan
Votes: 0
Answers: 1
create a fast custom similarity matrix in python
I am trying to build a similarity matrix while using a custom similarity function.
The problem is that the code runs very slow.
I have a dataframe which looks like this:
col1 col2 col3
'car' ...
Carbo
Votes: 0
Answers: 1
Efficient way for Computing the Similarity of Multiple Documents using Spacy
I have around 10k docs (mostly 1-2 sentences) and want for each of these docs find the ten most simliar docs of a collection of 60k docs. Therefore, I want to use the spacy library. Due to the large a...
LaLeLo
Votes: 0
Answers: 2