Spotify: An Early Adopter of Containers, Spotify Is Migrating from Homegrown Orchestration to Kubernetes

Challenges Faced:

“Their goal is to empower creators and enable a really immersive listening experience for all of the consumers that they have today — and hopefully the consumers they'd have in the future,

An early adopter of micro services and Docker, Spotify had containerized micro services running across its fleet of VMs with a homegrown container orchestration system called Helios. By late 2017, it became clear that “having a small team working on the features was just not as efficient as adopting something that was supported by a much bigger community”.

Solution they came up with:

Kubernetes was more feature-rich than Helios. Plus, “they wanted to benefit from added velocity and reduced cost, and also align with the rest of the industry on best practices and tools”.

At the same time, the team wanted to contribute its expertise and influence in the flourishing Kubernetes community. The migration, which would happen in parallel with Helios running, could go smoothly because “Kubernetes fit very nicely as a complement and now as a replacement to Helios”.

Impact to the industry:

The biggest service currently running on Kubernetes takes about 10 million requests per second as an aggregate service and benefits greatly from auto scaling. Plus, “Before, teams would have to wait for an hour to create a new service and get an operational host to run it in production, but with Kubernetes, they can do that on the order of seconds and minutes.” In addition, with Kubernetes’s bin-packing and multi-tenancy capabilities, CPU utilization has improved on average two- to threefold.

Matt Brown, Staff Software Engineer of Infrastructure at Spotify, talks about how Kubernetes played a key role in the migration of back-end micro services to keep everything as seamless as possible for the 200+ teams involved.

Success story:

Technologies later added:

Both of those technologies are in early stages of adoption, but already “they have reason to believe that gRPC will have a more drastic impact during early development by helping with a lot of issues like schema management, API design, weird backward compatibility issues, things like that,”. “So they’re leaning heavily on gRPC to help in that space.”

As the team continues to fill out Spotify’s cloud native stack — tracing is up next — it is using the CNCF landscape as a helpful guide. “They look at things that needed to solve, and if there are a bunch of projects, they evaluate them equivalently, but there is definitely value to the project being a CNCF project,”.

Spotify’s experiences so far with Kubernetes bears this out. “The community has been extremely helpful in getting thier community to work through all the technology much faster and much easier,”. “It’s been surprisingly easy to get in touch with anybody they wanted to, to get expertise on any of the things they’re working with and it’s helped them validate all the things thet’re doing.”

This is it…

Thank You !

I am a forward-thinking individual with exceptional skills in problem-solving, adaptive thinking, automation, and development.