Skip to main content

Questions tagged [cluster-analysis]

Cluster analysis is the process of grouping "similar" objects into groups known as "clusters", along with the analysis of these results.

0 votes
0 answers
15 views

How to calculate k-means clustering with presence of double-zeroes? [migrated]

I've tried to find species assemblages with a data set that has double-zeroes. When I run the tests, the results are inconclusive and no groups seem to be significant. I used hellinger's ...
capthom_obvious's user avatar
-1 votes
0 answers
11 views

How to Implement NLP for Text Analysis in Evaluating Business Projects?

I need to evaluate business activities (projects) for eligibility based on specific criteria. We gather data through interviews with stakeholders, capturing details like project names, descriptions, ...
quadratic's user avatar
0 votes
0 answers
12 views

How to show only one class of the clusters in Google Earth Engine where NDVI gt (0.2)

How to show only one class of the clusters in Google Earth Engine where NDVI gt (0.2). Suppose we have the following codes for unsupervised classification. My goal is to mask out other classes (...
djebbouri mohammed's user avatar
1 vote
2 answers
34 views

Fast way of detecting outliers in 2D space

I have hundreds of millions of point clouds like the following: I want to remove outliers 1, 2, 4, 5, 6, 7. The safest bet is to build a minimum spanning tree connecting all the points and remove ...
user2961927's user avatar
  • 1,654
-2 votes
0 answers
13 views

Is there any updated python library for clustering using Markov Clustering Algorithm (MCL algorithm) that works well? [closed]

I'm trying to cluster an undirected weighted graph dataset using in Python. I have used several libraries but, as all of them are dated, they dont work. For instance, I have used the library ...
JALC's user avatar
  • 1
0 votes
0 answers
4 views

Text clustering and classification model

What I want to do - I have a dataset of different phrases, I want to train a model which would set a few clusters depending on the dataset and then cluster the data for that clusters. i have an ...
dea's user avatar
  • 1
-1 votes
0 answers
6 views

Does DBCV score for density based clustering algorithms reward more granular clusters?

I am trying to run a hyperparameter search for HDBSCAN based on the DBCV scores. From what I observe, the DBCV score is generally higher for more granular clusters. Is it because DBCV rewards granular ...
Tanay's user avatar
  • 179
-1 votes
0 answers
35 views

Group similar vectors in high-dimensional vector space into "spaces/partitions" with unique IDs being assigned per similar group

Clarifying Example As a contrived example, let's say I have vectors in some R^3 vector space: A: [1, 2, 3] B: [1.02, 2.5, 3] C: [1512, 123, 51] I'd like to partition this space into N "slices/...
Matthew Trent's user avatar
0 votes
0 answers
33 views

How to "cluster" the predominant HUEs in an image in PHP

I have a catalog with images where there's a text overlay. The overlay has 3 texts, one of them is colored. The current color is equal for all pages. This results in hues of the text mismatching the ...
Xavi Montero's user avatar
  • 10.3k
0 votes
2 answers
39 views

matplotlib detect and isolate in circles different groups of points

I would like to automatically detect and isolate the different groups of points placed on a graph. I drew 3 groups, I would like to detect and isolate each of them in a circle. This is what I ...
Bast38's user avatar
  • 81
0 votes
0 answers
25 views

Kmedoids clustering assigns nodes to far away clusters?

I am currently trying to set up kmedoids clustering on a project I am working on in matlab. The basic approach works fine, but I have the additional constraint that there are obstacles on my map that ...
kiwithebird's user avatar
-2 votes
0 answers
22 views

Tableau Cluster Analysis

I am new to Tableau world and have a requirement to perform a cluster analysis, I have order data as below along with "Issue_Number" associated to each order, my requirement is to build a ...
CMG23's user avatar
  • 1
0 votes
0 answers
18 views

Adjusted Rand Index and Adjusted Mutual Information Score the same for BIRCH and Agglomerative Clustering?

I have a randomly generated dataset of synthetic clusters, with 25 files (each can contain up to 6 clusters with up to 15 points each) for each dimension from 2 to 100. My issue is that it seems that ...
Regina Deri's user avatar
0 votes
2 answers
61 views

Julia - AssertionError in K-medoids algorithm

I am using K-medoids to reduce my data (potential future scenarios of a stock's price) from 400 scenarios to n = 25. I computed the distance amtrix based on the Eucledian distance: C = pairwise(...
Lyft's user avatar
  • 57
-2 votes
0 answers
45 views

Connected components across gaps?

I recently got a dataset of water storage in Europe, which is inherently full of spatial gaps because of the waterbodies: data The goal is to study drought propagation, and I want to find dry area &...
Brian Kao's user avatar

15 30 50 per page
1
2 3 4 5
416