It’s useful to take a look at data to quickly eyeball the patterns so that you can see what a good analysis should pick up on. The dataset I’ve been working with is a collection of information about houses sold in the southeast of the UK over the past 6 months or so.
Amongst other things, the dataset contains information about the number of bedrooms, property subtype (terraced, semi-detached, detached, etc) and price. I decided to use R with the Leaflet package to visualize this data by plotting it on a map, with one translucent dot plotted for each house.
Rather than plotting 6 or 7 different subtypes, each subtype is given a “score” between 0 and 1, with more desirable subtypes (such as detached) getting a higher score (and a more orange color) on this map:
Number of bedrooms
displays comparatively less variation, but this is expected.
These are not interactive maps because the interactive versions using the leaflet library and Shiny will cause one’s browser to either crash or respond extremely slowly – there are 44,000 datapoints plotted here. More on this to come.