React for Data Visualization [FULL COURSE] - Part 3

Andrew.Kaluba · 29.Октябрь.2019 21:56:23

s08e39 - 176,113 tech salaries visualized

A big example project - 176,113 tech salaries visualized

We’re going to build this:

An interactive visualization dashboard app with a choropleth map and a histogram comparing tech salaries to median household income in the area. Users can filter by year, job title, or US state to get a better view.

It’s going to be great.

At this point, I assume you’ve used create-react-app to set up your environment. Check the getting started section if you haven’t. I’m also assuming you’ve read the basics chapter. I’m still going to explain what we’re doing, but knowing the basics helps.

I suggest you follow along, keep npm start running, and watch your visualization change in real time as you code. It’s rewarding as hell.

If you get stuck, you can use my react-d3-walkthrough-livecode to jump between steps. The commits correspond to the code at the end of each step. Check out the first commit and run npm install to skip the initial setup.

If you want to see how this project evolved over 22 months, check the original repo. The modern-code branch has the code you’re about to build.

s09e40 - Show a Preloader

Show a Preloader

Preloader screenshot

Our preloader is a screenshot of the final result. Usually you’d have to wait until the end of the project to make that, but I’ll just give you mine. Starting with the preloader makes sense for two reasons:

It’s nicer than looking at a blank screen while data loads
It’s a good sanity check for our environment

We’re using a screenshot of the final result because the full dataset takes a few seconds to load, parse, and render. It looks better if visitors see something informative while they wait.

React Suspense is about to make building preloaders a whole lot better. Adapting to the user’s network speed, built-in preload functionality, stuff like that. More on that in the chapter on React Suspense and Time Slicing.

Make sure you’ve installed all dependencies and that npm start is running.

We’re building our preloader in 4 steps:

Get the image
Make the Preloader component
Update App
Load Bootstrap styles in index.js

s09e41 - Step1: get image

Step 1: Get the image

Download my screenshot from Github and save it in src/assets/preloading.png . It goes in the src/assets/ directory because we’re going to import it in JavaScript (which makes it part of our source code), and I like to put non-JavaScript files in assets . Keeps the project organized.

s09e42 - Step 2: make component

Step 2: Preloader component

Our Preloader is a small component that pretends it’s the App and renders a static title, description, and a screenshot of the end result. It goes in src/components/Preloader.js .

We’ll put all of our components in src/components/ .

We start the component off with some imports, an export, and a functional stateless component that returns an empty div element.

// src/components/Preloader.js

import React from "react";

import PreloaderImg from "../assets/preloading.png";

const Preloader = () => (
    <div className="App container">

    </div>
);

export default Preloader;

We import React (which we need to make JSX syntax work) and the PreloaderImg for our image. We can import images because of the Webpack configuration that comes with create-react-app . The webpack image loader returns a URL that we put in the PreloaderImg constant.

At the bottom, we export default Preloader so that we can use it in App.js as import Preloader . Default exports are great when your file exports a single object. Named exports when you want to export multiple items. You’ll see that play out in the rest of this project.

The Preloader function takes no props (because we don’t need any) and returns an empty div . Let’s fill it in.

// src/components/Preloader.js

const Preloader = () => (
    <div className="App container">
        <h1>The average H1B in tech pays $86,164/year</h1>
        <p className="lead">
            Since 2012 the US tech industry has sponsored 176,075 H1B work
            visas. Most of them paid <b>$60,660 to $111,668</b> per year (1
            standard deviation).{" "}
            <span>
                The best city for an H1B is <b>Kirkland, WA</b> with an average
                individual salary <b>$39,465 above local household median</b>.
                Median household salary is a good proxy for cost of living in an
                area.
            </span>
        </p>
        <img
            src={PreloaderImg}
            style={{ width: "100%" }}
            alt="Loading preview"
        />
        <h2 className="text-center">Loading data ...</h2>
    </div>
);

A little cheating with grabbing copy from the future, but that’s okay. In real life you’d use some temporary text, then fill it in later.

The code itself looks like HTML. We have the usual tags - h1 , p , b , img , and h2 . That’s what I like about JSX: it’s familiar. Even if you don’t know React, you can guess what’s going on here.

But look at the img tag: the src attribute is dynamic, defined by PreloaderImg , and the style attribute takes an object, not a string. That’s because JSX is more than HTML; it’s JavaScript. Think of props as function arguments – any valid JavaScript fits.

That will be a cornerstone of our project.

s09e43 - Step 3: Update App

Step 3: Update App

We use our new Preloader component in App – src/App.js . Let’s remove the create-react-app defaults and import our Preloader component.

// src/App.js

import React from 'react';
// Delete the line(s) between here...
import logo from './logo.svg';
import './App.css';
// ...and here.

// Insert the line(s) between here...
import Preloader from './components/Preloader';
// ...and here.

class App extends React.Component {
    // Delete the line(s) between here...
  render() {
    return (
      <div className="App">
        <div className="App-header">
          <img src={logo} className="App-logo" alt="logo" />
          <h2>Welcome to React</h2>
        </div>
        <p className="App-intro">
          To get started, edit <code>src/App.js</code> and save to reload.
        </p>
      </div>
    );
  }
    // ...and here.
}

export default App;

We removed the logo and style imports, added an import for Preloader , and gutted the App class. It’s great for a default app, but we don’t need that anymore.

Let’s define a default state and a render method that uses our Preloader component when there’s no data.

// src/App.js

class App extends React.Component {
    // Insert the line(s) between here...
    state = {
        techSalaries: []
    }

    render() {
        const { techSalaries } = this.state;

        if (techSalaries.length < 1) {
            return (
                <Preloader />
            );
        }

        return (
            <div className="App">

            </div>
        );
    }
    // ...and here.
}

Nowadays we can define properties directly in the class body without a constructor method. It’s not part of the official JavaScript standard yet, but most React codebases use this pattern.

Properties defined this way are bound to each instance of our components so they have the correct this value. Late you’ll see we can use this shorthand to neatly define event handlers.

We set techSalaries to an empty array by default. In render we use object destructuring to take techSalaries out of component state, this.state , and check whether it’s empty. When techSalaries is empty our component renders the preloader, otherwise an empty div.

If your npm start is running, the preloader should show up on your screen.

Preloader without Bootstrap styles

Hmm… that’s not very pretty. Let’s fix it.

s09e44 - Step 4: Load styles

Step 4: Load Bootstrap styles

We’re going to use Bootstrap styles to avoid reinventing the wheel. We’re ignoring their JavaScript widgets and the amazing integration built by the react-bootstrap team. Just the stylesheets please.

They’ll make our fonts look better, help with layouting, and make buttons look like buttons. We could use styled components, but writing our own styles detracts from this tutorial.

We load stylesheets in src/index.js .

// src/index.js

import React from 'react';
import ReactDOM from 'react-dom';
import App from './App';
// Insert the line(s) between here...
import 'bootstrap/dist/css/bootstrap.css';
// ...and here.

ReactDOM.render(
  <App />,
  document.getElementById('root')
);

Another benefit of Webpack: import -ing stylesheets. These imports turn into <style> tags with CSS in their body at runtime.

This is also a good opportunity to see how index.js works to render our app

loads App and React
loads styles
Uses ReactDOM to render <App /> into the DOM

That’s it.

Your preloader screen should look better now.

Preloader screenshot

If you don’t, try comparing your changes to this diff on Github.

s10e45 - Asynchronously load data

Asynchronously load data

Great! We have a preloader. Time to load some data.

We’ll use D3’s built-in data loading methods and tie their promises into React’s component lifecycle. You could talk to a REST API in the same way. Neither D3 nor React care what the datasource is.

First, you need some data.

Our dataset comes from a few sources. Tech salaries are from h1bdata.info, median household incomes come from the US census data, and I got US geo map info from Mike Bostock’s github repositories. Some elbow grease and python scripts tied them all together.

You can read about the scraping on my blog here, here, and here. But it’s not the subject of this book.

s10e46 - Step 0: Get data

Step 0: Get the data

Download the data files from my walkthrough repository on Github. Put them in your public/data directory.

s10e47 - Step 1: Prep App.js

Step 1: Prep App.js

Let’s set up our App component first. That way you’ll see results as soon data loading starts to work.

Start by importing our data loading method - loadAllData - and both D3 and Lodash. We’ll need them later.

// src/App.js
import React from 'react';
// Insert the line(s) between here...
import * as d3 from 'd3';
import _ from 'lodash';
// ...and here.

import Preloader from './components/Preloader';
// Insert the line(s) between here...
import { loadAllData } from './DataHandling';
// ...and here.

You already know about default imports. Importing with {} is how we import named exports. That lets us get multiple things from the same file. You’ll see the export side in Step 2.

Don’t worry about the missing DataHandling file. It’s coming soon.

// src/App.js
class App extends React.Component {
    state = {
        techSalaries: [],
        // Insert the line(s) between here...
        medianIncomes: [],
        countyNames: [],
    };

    componentDidMount() {
        loadAllData(data => this.setState(data));
    }
    // ...and here.

We initiate data loading inside the componentDidMount lifecycle hook. It fires when React first mounts our component into the DOM.

I like to tie data loading to component mounts because it means you aren’t making requests you’ll never use. In a bigger app, you’d use Redux, MobX, or similar to decouple loading from rendering. Many reasons why.

To load our data, we call the loadAllData function, then use this.setState in the callback. This updates App ’s state and triggers a re-render, which updates our entire visualization via props.

Yes, it would be better to make loadAllData return a promise instead of expecting a callback. That was giving me trouble for some reason and it doesn’t matter right now anyway.

We also add two more entries to our state : countyNames , and medianIncomes . Defining what’s in your component state in advance makes your code easier to read. People, including you, know what to expect.

Let’s change the render method to show a message when our data finishes loading.

// src/App.js
  render() {
     const { techSalaries } = this.state;

     if (techSalaries.length < 1) {
         return (
             <Preloader />
         );
      }

    return (
        // Delete the line(s) between here...
        <div className="App">
        // ...and here.
        // Insert the line(s) between here...
        <div className="App container">
            <h1>Loaded {techSalaries.length} salaries</h1>
        // ...and here.
        </div>
    );
}

We added a container class to the main <div> and an <h1> tag that shows how many datapoints there are. You can use any valid JavaScript in curly braces {} and JSX will evaluate it. By convention we only use that ability to calculate display values.

You should now get an error overlay.

DataHandling.js not found error overlay

These nice error overlays come with create-react-app and make your code easier to debug. No hunting around in the terminal to see compilation errors.

Let’s build that file and fill it with our data loading logic.

s10e48 - Step 2: Prep data parsing functions

Step 2: Prep data parsing functions

We’re putting data loading logic in a separate file from App.js because it’s a bunch of functions that work together and don’t have much to do with the App component itself.

We start with two imports and four data parsing functions:

cleanIncome , which parses each row of household income data
dateParse , which we use for parsing dates
cleanSalary , which parses each row of salary data
cleanUSStateName , which parses US state names

// src/DataHandling.js
import * as d3 from 'd3';
import _ from 'lodash';

const cleanIncome = (d) => ({
    countyName: d['Name'],
    USstate: d['State'],
    medianIncome: Number(d['Median Household Income']),
    lowerBound: Number(d['90% CI Lower Bound']),
    upperBound: Number(d['90% CI Upper Bound'])
});

const dateParse = d3.timeParse("%m/%d/%Y");

const cleanSalary = (d) => {
    if (!d['base salary'] || Number(d['base salary']) > 300000) {
        return null;
    }

    return {employer: d.employer,
            submit_date: dateParse(d['submit date']),
            start_date: dateParse(d['start date']),
            case_status: d['case status'],
            job_title: d['job title'],
            clean_job_title: d['job title'],
            base_salary: Number(d['base salary']),
            city: d['city'],
            USstate: d['state'],
            county: d['county'],
            countyID: d['countyID']
    };
}

const cleanUSStateName = (d) => ({
    code: d.code,
    id: Number(d.id),
    name: d.name
});

const cleanCounty = d => ({
    id: Number(d.id),
    name: name
});

You’ll see those d3 and lodash imports a lot.

Our data parsing functions all follow the same approach: Take a row of data as d , return a dictionary with nicer key names, cast any numbers or dates into appropriate formats. They all start as strings.

Doing all this parsing now, keeps the rest of our codebase clean. Handling data is always messy. You want to contain that mess as much as possible.

s10e49 - Step 3: Load the datasets

Step 3: Load the datasets

Now we can use D3 to load our data with fetch requests.

// src/DataHandling.js
export const loadAllData = (callback = _.noop) => {
    Promise.all([
        d3.json("data/us.json"),
        d3.csv("data/us-county-names-normalized.csv", cleanCounty),
        d3.csv("data/county-median-incomes.csv", cleanIncome),
        d3.csv("data/h1bs-2012-2016-shortened.csv", cleanSalary),
        d3.tsv("data/us-state-names.tsv", cleanUSStateName)
    ]).then(([us, countyNames, medianIncomes, techSalaries, USstateNames]) => {

    });
};

Here you can see another ES6 trick: default argument values. If callback is undefined, we set it to _.noop - a function that does nothing. This lets us later call callback() without worrying whether it’s defined.

In version 5, D3 updated its data loading methods to use promises instead of callbacks. You can load a single file using d3.csv("filename"😉.then(data => .... . The promise resolves with your data, or throws an error.

If you’re on D3v4 and can’t upgrade, you’ll have to import from d3-fetch .

Each d3.csv call makes a fetch request, parses the fetched CSV file into an array of JavaScript dictionaries, and passes each row through the provided cleanup function. We pass all median incomes through cleanIncome , salaries through cleanSalary , etc.

To load multiple files, we use Promise.all with a list of unresolved promises. Once resolved, our .then handler gets a list of results. We use array destructuring to expand that list into our respective datasets before running some more logic to tie them together.

D3 supports formats like json , csv , tsv , text , and xml out of the box. You can make it work with custom data sources through the underlying request API.

PS: we’re using the shortened salary dataset to make page reloads faster while building our project.

s10e50 - Step 4: Tie datasets together

Step 4: Tie the datasets together

If you add a console.log to the .then callback above, you’ll see a bunch of data. Each argument - us , countyNames , medianIncomes , techSalaries , USstateNames - holds a parsed dataset from the corresponding file.

To tie them together and prepare a dictionary for setState back in the App component, we need to add some logic. We’re building a dictionary of county household incomes and removing any empty salaries.

// src/DataHandling.js
]).then(([us, countyNames, medianIncomes, techSalaries, USstateNames]) => {
    let medianIncomesMap = {};

    medianIncomes.filter(d => _.find(countyNames,
                                     {name: d['countyName']}))
                 .forEach((d) => {
                     d['countyID'] = _.find(countyNames,
                                            {name: d['countyName']}).id;
                     medianIncomesMap[d.countyID] = d;
                 });

    techSalaries = techSalaries.filter(d => !_.isNull(d));

    callback({
        usTopoJson: us,
        countyNames: countyNames,
        medianIncomes: medianIncomesMap,
        medianIncomesByCounty: _.groupBy(medianIncomes, 'countyName'),
        medianIncomesByUSState: _.groupBy(medianIncomes, 'USstate'),
        techSalaries: techSalaries,
        USstateNames: USstateNames
    });
});

Building the income map looks weird because of indentation, but it’s not that bad. We filter the medianIncomes array to discard any incomes whose countyName we can’t find. I made sure they were all unique when I built the datasets.

We walk through the filtered array with a forEach , find the correct countyID , and add each entry to medianIncomesMap . When we’re done, we have a large dictionary that maps county ids to their household income data.

Then we filter techSalaries to remove any empty values where the cleanSalaries function returned null . That happens when a salary is either undefined or absurdly high.

When our data is ready, we call our callback with a dictionary of the new datasets. To make future access quicker, we use _.groupBy to build dictionary maps of counties by county name and by US state.

You should now see how many salary entries the shortened dataset contains.

Data loaded screenshot

If that didn’t work, try comparing your changes to this diff on Github.

s11e51 - Render a choropleth map of the US

Render a choropleth map of the US

With our data in hand, it’s time to draw some pictures. A choropleth map will show us the best places to be in tech.

We’re showing the delta between median household salary in a statistical county and the average salary of an individual tech worker on a visa. The darker the blue, the higher the difference.

The more a single salary can out-earn an entire household, the better off you are.

Choropleth map with shortened dataset

There’s a lot of gray on this map because the shortened dataset doesn’t have that many counties. Full dataset is going to look better, I promise.

Turns out immigration visa opportunities for techies aren’t evenly distributed throughout the country. Who knew?

Just like before, we’re going to start with changes in our App component, then build the new bit. This time, a CountyMap component spread over three files:

CountyMap/index.js , to make imports easier
CountyMap/CountyMap.js , for overall map logic
CountyMap/County.js , for individual county polygons

s11e52 - Step 1: Prep App.js

Step 1: Prep App.js

You might guess the pattern already: add an import, add a helper method or two, update render .

// src/App.js
import Preloader from './components/Preloader';
import { loadAllData } from './DataHandling';

// Insert the line(s) between here...
import CountyMap from './components/CountyMap';
// ...and here.

That imports the CountyMap component from components/CountyMap/ . Your browser will show an error overlay about some file or another until we’re done.

In the App class itself, we add a countyValue method. It takes a county entry and a map of tech salaries, and it returns the delta between median household income and a single tech salary.

    // src/App.js
    countyValue(county, techSalariesMap) {
        const medianHousehold = this.state.medianIncomes[county.id],
              salaries = techSalariesMap[county.name];

        if (!medianHousehold || !salaries) {
            return null;
        }

        const median = d3.median(salaries, d => d.base_salary);

        return {
            countyID: county.id,
            value: median - medianHousehold.medianIncome
        };
    }

We use this.state.medianIncomes to get the median household salary and the techSalariesMap input to get salaries for a specific census area. Then we use d3.median to calculate the median value for salaries and return a two-element dictionary with the result.

countyID specifies the county and value is the delta that our choropleth displays.

In the render method, we’ll:

prep a list of county values
remove the “data loaded” indicator
render the map

// src/App.js
render() {
    // Insert the line(s) between here...
    const { countyNames, usTopoJson, techSalaries, } = this.state;
    // ...and here.

    if (techSalaries.length < 1) {
        return (
            <Preloader />
        );
    }

    // Insert the line(s) between here...
    const filteredSalaries = techSalaries,
          filteredSalariesMap = _.groupBy(filteredSalaries, 'countyID'),
          countyValues = countyNames.map(
              county => this.countyValue(county, filteredSalariesMap)
          ).filter(d => !_.isNull(d));

    let zoom = null;
    // ...and here.

    return (
      <div className="App container">
        // Delete the line(s) between here...
        <h1>Loaded {techSalaries.length} salaries</h1>
        // ...and here.
        // Insert the line(s) between here...
        <svg width="1100" height="500">
            <CountyMap usTopoJson={usTopoJson}
                       USstateNames={USstateNames}
                       values={countyValues}
                       x={0}
                       y={0}
                       width={500}
                       height={500}
                       zoom={zoom} />
        </svg>
        // ...and here.
      </div>
    );
}

We call our dataset filteredTechSalaries because we’re going to add filtering in the subchapter about adding user controls. Using the proper name now means less code to change later. The magic of foresight

We use _.groupBy to build a dictionary mapping each countyID to an array of salaries, and we use our countyValue method to build an array of counties for our map.

We set zoom to null for now. We’ll use this later.

In the return statement, we remove our “data loaded” indicator, and add an <svg> element that’s 1100 pixels wide and 500 pixels high. Inside, we place the CountyMap component with a bunch of properties. Some dataset stuff, some sizing and positioning stuff.

s11e53 - Step 2: CountyMap index.js

Step 2: CountyMap/index.js

We make index.js for just reason: to make imports and debugging easier. I learned this lesson the hard way so you don’t have to.

// src/components/CountyMap/index.js

export { default } from './CountyMap';

Re-export the default import from ./CountyMap.js . That’s it.

This allows us to import CountyMap from the directory without knowing about internal file structure. We could put all the code in this index.js file, but that makes debugging harder. You’ll have plenty of index.js files in your project.

Having a lot of code in <directory>/index.js is a common pattern in new projects. But when you’re looking at a stack trace, source files in the browser, or filenames in your editor, you’ll wish every component lived in a file of the same name.

s11e54 - Step 3: CountyMap CountyMap.js

Step 3: CountyMap/CountyMap.js

Here comes the fun part - declaratively drawing a map. You’ll see why I love using React for dataviz.

We’re using the full-feature integration approach and a lot of D3 maps magic. Drawing a map with D3 I’m always surprised how little code it takes.

Start with the imports: React, D3, lodash, topojson, County component.

// src/components/CountyMap/CountyMap.js
import React, { Component } from 'react';
import * as d3 from 'd3';
import * as topojson from 'topojson';
import _ from 'lodash';

import County from './County';

Out of these, we haven’t built County yet, and you haven’t seen topojson before.

TopoJSON is a geographical data format based on JSON. We’re using the topojson library to translate our geographical datasets into GeoJSON, which is another way of defining geo data with JSON.

I don’t know why there are two, but TopoJSON produces smaller files, and GeoJSON can be fed directly into D3’s geo functions. ¯\ (ツ) /¯

Maybe it’s a case of competing standards.

Constructor

We stub out the CountyMap component then fill it in with logic.

// src/components/CountyMap/CountyMap.js
class CountyMap extends Component {
    constructor(props) {
        super(props);

        this.state = {
        }
    }

    static getDerivedStateFromProps(props, state) {

    }

    render() {
        const { usTopoJson } = this.props;

        if (!usTopoJson) {
            return null;
        }else{
            return (

            );
        }
    }
}

export default CountyMap;

We’ll set up default D3 state in constructor and keep it up to date with getDerivedStateFromProps .

We need three D3 objects to build a choropleth map: a geographical projection, a path generator, and a quantize scale for colors.

// src/components/CountyMap/CountyMap.js
class CountyMap extends React.Component {
    constructor(props) {
        super(props);

        const projection = d3.geoAlbersUsa().scale(1280);

        this.state = {
            geoPath: d3.geoPath().projection(projection),
            quantize: d3.scaleQuantize().range(d3.range(9)),
            projection
        };
    }

You might remember geographical projections from high school. They map a sphere to a flat surface. We use geoAlbersUsa because it’s made specifically for maps of the USA.

D3 offers many other projections. You can see them on d3-geo’s Github page.

A geoPath generator takes a projection and returns a function that generates the d attribute of <path> elements. This is the most general way to specify SVG shapes. I won’t go into explaining the d here, but it’s an entire language for describing shapes.

quantize is a D3 scale. We’ve talked about the basics of scales in the D3 Axis example. This one splits a domain into 9 quantiles and assigns them specific values from the range .

Let’s say our domain goes from 0 to 90. Calling the scale with any number between 0 and 9 would return 1. 10 to 19 returns 2 and so on. We’ll use it to pick colors from an array.

getDerivedStateFromProps

Keeping our geo path and quantize scale up to date is simple, but we’ll make it harder by adding a zoom feature. It won’t work until we build the filtering, but hey, we’ll already have it by then!

    // src/components/CountyMap/CountyMap.js
    static getDerivedStateFromProps(props, state) {
        let { projection, quantize, geoPath } = state;

        projection
            .translate([props.width / 2, props.height / 2])
            .scale(props.width * 1.3);

        if (props.zoom && props.usTopoJson) {
            const us = props.usTopoJson,
                USstatePaths = topojson.feature(us, us.objects.states).features,
                id = _.find(props.USstateNames, { code: props.zoom }).id;

            projection.scale(props.width * 4.5);

            const centroid = geoPath.centroid(_.find(USstatePaths, { id: id })),
                translate = projection.translate();

            projection.translate([
                translate[0] - centroid[0] + props.width / 2,
                translate[1] - centroid[1] + props.height / 2
            ]);
        }

        if (props.values) {
            quantize.domain([
                d3.quantile(props.values, 0.15, d => d.value),
                d3.quantile(props.values, 0.85, d => d.value)
            ]);
        }

        return {
            ...state,
            projection,
            quantize
        };
    }

There’s a lot going on here.

We destructure projection , quantize , and geoPath out of component state. These are the D3 object we’re about to update.

First up is the projection. We translate (move) it to the center of our drawing area and set the scale property. You have to play around with this value until you get a nice result because it’s different for every projection.

Then we do some weird stuff if zoom is defined.

We get the list of all US state features in our geo data, find the one we’re zoom -ing on, and use the geoPath.centroid method to calculate its center point. This gives us a new coordinate to translate our projection onto.

The calculation in .translate() helps us align the center point of our zoom US state with the center of the drawing area.

While all of this is going on, we also tweak the .scale property to make the map bigger. This creates a zooming effect.

After all that, we update the quantize scale’s domain with new values. Using d3.quantile lets us offset the scale to produce a more interesting map. Again, I discovered these values through experiment - they cut off the top and bottom of the range because there isn’t much there. This brings higher contrast to the richer middle of the range.

With our D3 objects updated, we return a new derived state. Technically you don’t have to do this but you should. Due to how JavaScript works, you already updated D3 objects in place, but you should pretend this.state is immutable and return a new copy.

Overall that makes your code easier to understand and closer to React principles.

render

After all that hard work, the render method is a breeze. We prep our data then loop through it and render a County element for each entry.

    render() {
        const { geoPath, quantize } = this.state,
            { usTopoJson, values, zoom } = this.props;

        if (!usTopoJson) {
            return null;
        } else {
            const us = usTopoJson,
                USstatesMesh = topojson.mesh(
                    us,
                    us.objects.states,
                    (a, b) => a !== b
                ),
                counties = topojson.feature(us, us.objects.counties).features;

            const countyValueMap = _.fromPairs(
                values.map(d => [d.countyID, d.value])
            );

            return (
                <g>
                    {counties.map(feature => (
                        <County
                            geoPath={geoPath}
                            feature={feature}
                            zoom={zoom}
                            key={feature.id}
                            quantize={quantize}
                            value={countyValueMap[feature.id]}
                        />
                    ))}

                    <path
                        d={geoPath(USstatesMesh)}
                        style={{
                            fill: "none",
                            stroke: "#fff",
                            strokeLinejoin: "round"
                        }}
                    />
                </g>
            );
        }
    }

We use destructuring to save all relevant props and state in constants, then use the TopoJSON library to grab data out of the usTopoJson dataset.

.mesh calculates a mesh for US states – a thin line around the edges. .feature calculates feature for each count – fill in with color.

Mesh and feature aren’t tied to US states or counties by the way. It’s just a matter of what you get back: borders or flat areas. What you need depends on what you’re building.

We use Lodash’s _.fromPairs to build a dictionary that maps county identifiers to their values. Building it beforehand makes our code faster. You can read some details about the speed optimizations here.

As promised, the return statement loops through the list of counties and renders County components. Each gets a bunch of attributes and returns a <path> element that looks like a specific county.

For US state borders, we render a single <path> element and use geoPath to generate the d attribute.

s11e55 - Step 4: CountyMap County.js

Step 4: CountyMap/County.js

The County component is built from two parts: imports and color constants, and a component that returns a <path> . All the hard calculation happens in CountyMap .

// src/components/CountyMap/County.js
import React, { Component } from 'react';
import _ from 'lodash';

const ChoroplethColors = _.reverse([
    'rgb(247,251,255)',
    'rgb(222,235,247)',
    'rgb(198,219,239)',
    'rgb(158,202,225)',
    'rgb(107,174,214)',
    'rgb(66,146,198)',
    'rgb(33,113,181)',
    'rgb(8,81,156)',
    'rgb(8,48,107)'
]);

const BlankColor = 'rgb(240,240,240)'

We import React and Lodash, and define some color constants. I got the ChoroplethColors from some example online, and BlankColor is a pleasant gray.

Now we need the County component.

// src/components/CountyMap/County.js
class County extends Component {
    shouldComponentUpdate(nextProps, nextState) {
        const { zoom, value } = this.props;

        return zoom !== nextProps.zoom
            || value !== nextProps.value;
    }

    render() {
        const { value, geoPath, feature, quantize } = this.props;

        let color = BlankColor;

        if (value) {
            color = ChoroplethColors[quantize(value)];
        }

        return (
            <path d={geoPath(feature)}
                  style={{fill: color}}
                  title={feature.id} />
        );
    }
}

export default County;

The render method uses a quantize scale to pick the right color and returns a <path> element. geoPath generates the d attribute, we set style to fill the color, and we give our path a title .

shouldComponentUpdate is more interesting. It’s a React lifecycle method that lets you specify which prop changes are relevant to component re-rendering.

CountyMap passes complex props - quantize , geoPath , and feature . They’re pass-by-reference instead of pass-by-value. That means React can’t see when their output changes unless you make completely new copies.

This can lead to all 3,220 counties re-rendering every time a user does anything. But they only have to re-render if we change the map zoom or if the county gets a new value.

Using shouldComponentUpdate like this we can go from 3,220 DOM updates to a few hundred. Big speed improvement

s11e56 - A map appears

Your browser should now show a map.

Choropleth map with shortened dataset

Tech work visas just aren’t that evenly distributed. Even with the full dataset most counties are gray.

If that didn’t work, consult this diff on Github.