This is the third in a series of blog posts about how I wrote and optimized VkColors, a small compute program written using Vulkan.
The previous post added a simple GPU implementation. This post will add several optimizations that improve performance massively.