Questions tagged [cluster-analysis]
Cluster analysis is the process of grouping "similar" objects into groups known as "clusters", along with the analysis of these results.
cluster-analysis
6,239
questions
0
votes
0
answers
15
views
How to calculate k-means clustering with presence of double-zeroes? [migrated]
I've tried to find species assemblages with a data set that has double-zeroes. When I run the tests, the results are inconclusive and no groups seem to be significant.
I used hellinger's ...
-1
votes
0
answers
11
views
How to Implement NLP for Text Analysis in Evaluating Business Projects?
I need to evaluate business activities (projects) for eligibility based on specific criteria. We gather data through interviews with stakeholders, capturing details like project names, descriptions, ...
0
votes
0
answers
12
views
How to show only one class of the clusters in Google Earth Engine where NDVI gt (0.2)
How to show only one class of the clusters in Google Earth Engine where NDVI gt (0.2).
Suppose we have the following codes for unsupervised classification. My goal is to mask out other classes (...
1
vote
2
answers
34
views
Fast way of detecting outliers in 2D space
I have hundreds of millions of point clouds like the following:
I want to remove outliers 1, 2, 4, 5, 6, 7. The safest bet is to build a minimum spanning tree connecting all the points and remove ...
-2
votes
0
answers
13
views
Is there any updated python library for clustering using Markov Clustering Algorithm (MCL algorithm) that works well? [closed]
I'm trying to cluster an undirected weighted graph dataset using in Python.
I have used several libraries but, as all of them are dated, they dont work. For instance, I have used the library ...
0
votes
0
answers
4
views
Text clustering and classification model
What I want to do - I have a dataset of different phrases, I want to train a model which would set a few clusters depending on the dataset and then cluster the data for that clusters. i have an ...
-1
votes
0
answers
6
views
Does DBCV score for density based clustering algorithms reward more granular clusters?
I am trying to run a hyperparameter search for HDBSCAN based on the DBCV scores. From what I observe, the DBCV score is generally higher for more granular clusters. Is it because DBCV rewards granular ...
-1
votes
0
answers
35
views
Group similar vectors in high-dimensional vector space into "spaces/partitions" with unique IDs being assigned per similar group
Clarifying Example
As a contrived example, let's say I have vectors in some R^3 vector space:
A: [1, 2, 3]
B: [1.02, 2.5, 3]
C: [1512, 123, 51]
I'd like to partition this space into N "slices/...
0
votes
0
answers
33
views
How to "cluster" the predominant HUEs in an image in PHP
I have a catalog with images where there's a text overlay. The overlay has 3 texts, one of them is colored. The current color is equal for all pages. This results in hues of the text mismatching the ...
0
votes
2
answers
39
views
matplotlib detect and isolate in circles different groups of points
I would like to automatically detect and isolate the different groups of points placed on a graph. I drew 3 groups, I would like to detect and isolate each of them in a circle.
This is what I ...
0
votes
0
answers
25
views
Kmedoids clustering assigns nodes to far away clusters?
I am currently trying to set up kmedoids clustering on a project I am working on in matlab. The basic approach works fine, but I have the additional constraint that there are obstacles on my map that ...
-2
votes
0
answers
22
views
Tableau Cluster Analysis
I am new to Tableau world and have a requirement to perform a cluster analysis, I have order data as below along with "Issue_Number" associated to each order, my requirement is to build a ...
0
votes
0
answers
18
views
Adjusted Rand Index and Adjusted Mutual Information Score the same for BIRCH and Agglomerative Clustering?
I have a randomly generated dataset of synthetic clusters, with 25 files (each can contain up to 6 clusters with up to 15 points each) for each dimension from 2 to 100. My issue is that it seems that ...
0
votes
2
answers
61
views
Julia - AssertionError in K-medoids algorithm
I am using K-medoids to reduce my data (potential future scenarios of a stock's price) from 400 scenarios to n = 25.
I computed the distance amtrix based on the Eucledian distance:
C = pairwise(...
-2
votes
0
answers
45
views
Connected components across gaps?
I recently got a dataset of water storage in Europe, which is inherently full of spatial gaps because of the waterbodies:
data
The goal is to study drought propagation, and I want to find dry area &...