This article was also featured on Hacker Noon.
The Surprising Solution - Make Less, But Use More
Understanding how the things that we buy and use affect our environment is not a simple exercise. Unless you're foraging for food and making tools by hand, everything you consume has a complex environmental impact that's probably larger than you think.
For example, that little piece of sashimi you're eating in SOHO might have been caught thousands of miles out in the Pacific and brought back to Japan (the boat burning fuel both ways), flown on an airplane (burning fuel), and delivered to the restaurant in a truck (burning fuel). Everyone involved - fisherman, airline crew, truck driver, chef, wait staff - had to get to and from work. And if you're eating an apex predator like a tuna, there's an additional set of downstream ecosystem impacts.
Computer chips are no different. In fact, manufacturing at such a staggering level of microscopic precision comes with a surprisingly large investment of energy and other resources, plus other ecological impacts. And beyond fabrication, chips must be distributed, installed, and ultimately powered inside computers - all activities that impact the environment.
As the juggernauts of the Metaverse, Machine Learning, Artificial Intelligence, and Gaming push forward, GPU acceleration will become even more critical and widespread - with corresponding environmental pressures mounting.
The environmental impacts of modern chip fabrication, or "fab,” can be categorized like so (note: I don't mean to pick on TSMC in particular, but they are the largest chip manufacturer):
This lake near a mine in Romania is… not supposed to be this colorful. (Jaanus Jagomägi, Unsplash)
Most of this impact is attributable to CPUs rather than discrete GPUs. But despite their smaller numbers, GPUs are much larger and heavier, thus carrying a higher per-unit impact - and with 41M discrete GPUs shipped in 2020, the impact is sizeable.
Water, power, fossil fuels, greenhouse gases, toxic chemicals - GPU manufacturing, delivery, and operation come with the whole suite of environmental impacts. And it's ironic, for example, that the massive computing capacity that goes into things like climate simulation is also a substantial contributor to climate change.
Imagine instead that we could do way more with way fewer GPUs. It might sound too good to be true.
GPU utilization today is woefully low, averaging less than 15%, as we've covered previously. Such dire underutilization is, without question, a colossal waste of resources - but it's also needlessly destructive to the environment.
If we can figure out a way to raise utilization to, say, 90%, we would be able to do six times the computing with the GPUs we have, do the same amount of computing with 1/6 the number of GPUs, or some balance on a spectrum in between. The amount of leverage we could create would be incredible.
Archimedes understood doing more with less. (Mechanics Magazine, 1824)
Let's imagine two futures five years from now where the world requires 10x the accelerated computing capacity it has today:
Let's predict a ~20% annual increase in computing capacity per GPU, based on the recent slowing of Moore's Law. Over five years, this gives us 2.5x of our 10x (1.2⁵=2.49). We still need 4x on top of that 2.5x to get our 10x.
So we will need 4x as many GPU cards, with a corresponding 4x in nasty environmental impacts. Let's be generous and say that innovation toward cleaner and more efficient manufacturing, mining, delivery systems, etc., reduce these impacts by 10% (a 1.1x effect).
10x compute increase / 2.5x "Moore's Law" / 1.1x cleantech innovation → 3.6x
In Scenario 1, we're still left with a ~3.6x environmental impact to get our 10x compute capacity.
10x compute increase / 2.5x "Moore's Law" / 1.1x cleantech innovation / 6x utilization / 0.5x operational impact → 1.2x
1.2x! In Scenario 2, incredibly, we can meet this 10x accelerated computing future while barely increasing our environmental impact.
This might sound too good to be true, but we don't need to bend the laws of physics to achieve this stunning 8x result (10x compute with 1.2x environmental impact) - we just need to unleash an additional 75% utilization in our GPUs that is already there.
This is not a trivial undertaking. If it were, someone would have already done it. But the breakthrough innovation has arrived.
By abstracting the PCIe connector between a GPU and its application host with software, the two fundamental changes to GPU usage - both necessary to drive the 6x increase in utilization - are now possible:
We've already outlined how our team has done this, deployed our solution with customers, and envisioned a world where remote GPU is widely adopted.
As we move into an uncertain environmental future, the last thing we need is rampant waste. We hope you'll join us in our mission to bring the world to 10x compute capacity with only a marginal increase in environmental impact - driven by the simple idea of doing more with what we already have.
Steve Golik is co-founder of Juice Labs, a startup with a vision to make computing power flow as easily as electricity.