2

I'm working on a project that seeks to identify clusters in urban development based on location (in lat/lon) and a categorical variable (what the particular site is zoned for). Ideally, the analysis would identify clusters of sites that are 1) near each other and 2) zoned the same. Below is a sample of what my data looks like:

 lat       lon      zone
 33.22320 -112.6741 R-43      
 33.45324 -113.0888 R-43      
 33.71800 -112.3885 R-43      
 33.45626 -111.9408 AG        
 33.45746 -111.9313 R-6       
 33.45747 -111.9309 R-6 

I've seen methods that define distance on just lat/lons using great circle distances, but I haven't seen any mixed clustering of the sort I'm trying to implement. I'm fairly new to cluster analysis so any guidance on implementing something like this in R or Python would be greatly appreciated!

Tasos
  • 3,920
  • 4
  • 23
  • 54
user27974
  • 21
  • 1

0 Answers0