The Site Reliability Engineering (SRE) team is part of the Digital Distribution group with the mission to develop processes, tooling, and services to increase reliability through observability, proactive collaboration, and support.
We are responsible for providing preferred systems and solutions with the aim to be the accelerator for product teams, helping developers to build reliable products and services to achieve broadcast-level reliability. We do this through collaboration, our product stack, and best engineering practices.
Using cloud technologies we are part of the team responsible for publishing all of the BBC's online audio and video content which powers iPlayer, BBC Sounds, and News and Sport. We handle live streaming, on-demand content, and podcasts. Our services handle over 13 million plays per day, ingest and publish 1000s of hours of on-demand content, and publish 3-4 TB of new files daily. We strive to provide our global audiences the best possible playback experiences across numerous platforms and devices.
Our Main Responsibilities
Our engineers are advocates for SRE principles and practices and actively influence culture change in the BBC. SRE provides metrics visualisation, release and deployment tooling, automation, secrets storage, SLI/SLO & error budget facilitation and more.
We develop and share the concept of Site Reliability Engineering as a discipline; incorporating aspects of software engineering and applying them to infrastructure and operational problems at scale.
The team helps product teams to become production-ready, treating operational concerns as integral to the development process.
Our tooling monitors systems, generating observable metrics leading to ways to improve internal and audience facing systems.
We consult with product owners and development teams to support their goals.
Technical requirements and specifications are developed to achieve a better development and audience experience.
We seek to understand complex technical problems including contributing in blameless post mortems and swarming.
Our code is developed to be easily maintainable, supportable, and minimises risk.
How Do We Do This?
The Site Reliability Engineering principles are our daily work.
We have a growth mindset, continuously looking to learn new software skills and technologies.
Most of our work is with Linux/Unix operating systems, including containers.
We reduce repetitive work (toil) through the automation of tasks.
We respond to problems and failures with an open and blameless approach and treat these as chances to improve.
The Ideal Candidate
We are looking for a Software Engineer to join our team to help deliver our mission. The ideal candidate would enjoy working as part of a collaborative and supportive team with opportunities to grow and develop and work in a truly exciting space, delivering the best possible experience to our global audiences.
Contract type: Permanent
Location: London, Salford, Glasgow or Cardiff
We’re happy to discuss flexible working. Please indicate your choice under the flexible working question in the application. There is no obligation to raise this at the application stage but if you wish to do so, you are welcome to. Flexible working will be part of the discussion at offer stage.
- Excellent career progression – the BBC offers great opportunities for employees to seek new challenges and work in different areas of the organisation.
- Unrivalled training and development opportunities – our in-house Academy hosts a wide range of internal and external courses and certification.
- Benefits- We offer a competitive salary package, a flexible 35-hour working week for work-life balance and 26 days (1 of which is a corporation day) with the option to buy an extra 5 days, a defined pension scheme and discounted dental, health care, gym and much more.
The situation regarding the coronavirus outbreak is developing quickly and the BBC is keen to continue to ensure the safety and wellbeing of people across the BBC, while continuing to protect our services. To reduce the risk access to BBC buildings is limited to those essential to our broadcast output. From Wednesday 18th March until further notice all assessments and interviews will be conducted remotely. For more information go to https://www.bbc.co.uk/careers/
Mae’r sefyllfa gyda’r coronafeirws yn datblygu’n gyflym, ac mae’r BBC yn awyddus i barhau i sicrhau diogelwch a lles pobl ar draws y BBC, gan barhau i warchod ein gwasanaethau hefyd. I leihau’r risg, dim ond y bobl sy’n hanfodol i’n hallbwn darlledu fydd yn cael mynediad i adeiladau’r BBC. O ddydd Mercher 18fed Mawrth ymlaen, bydd pob asesiad a chyfweliad yn cael ei gynnal o bell, nes rhoddir gwybod yn wahanol. I gael mwy o wybodaeth, ewch i https://www.bbc.co.uk/careers/
About the Company
We don’t focus simply on what we do – we also care how we do it. Our values and the way we behave are important to us. Please make sure you’ve read about our values and behaviours in the document attached below.
Diversity matters at the BBC. We have a working environment where we value and respect every individual's unique contribution, enabling all of our employees to thrive and achieve their full potential.
We want to attract the broadest range of talented people to be part of the BBC – whether that’s to contribute to our programming or our wide range of non-production roles. The more diverse our workforce, the better able we are to respond to and reflect our audiences in all their diversity.
We are committed to equality of opportunity and welcome applications from individuals, regardless of age, gender, ethnicity, disability, sexual orientation, gender identity, socio-economic background, religion and/or belief. We will consider flexible working requests for all roles, unless operational requirements prevent otherwise.
To find out more about Diversity and Inclusion at the BBC, please click here