Principal Data Engineer – Knowledge Graph
As the Principal Data Engineer, you will be part of GSK’s ambitious programme to transform its commercial manufacturing and supply chain organisation helping to increase capacity and speed for transferring new products from our R&D organisation. Data and AI are an essential part in achieving this goal, which ultimately will help launch medicines quicker and have a positive impact on patients.
We are open for this role to be based at GSK HQ, Stevenage, Ware, Irvine, Poznan and Warsaw.
Job Purpose
The primary purpose of this role is to take technical accountability for the CMC Knowledge Graph, driving forward its design and implementation through being hands-on and by providing technical direction and oversight to rest of the development team, while also working closely with Product Management, business representatives and other Tech & Data experts to ensure that it meets the business requirements.
More broadly, the role will additionally:
-
Support the CMC Knowledge System Director, product managers, business leaders and other stakeholders to identify opportunities where Knowledge Graph and other Data & AI capabilities can have a transformative impact on GSK’s CMC and New Product Introduction (NPI) processes
-
Provide technical leadership for other Data & AI Products in the CMC/NPI portfolio
The immediate priority in the role is to drive the technical work needed to productionise an existing proof-of-concept CMC Knowledge Graph and its associated analytics use-cases into a full-fledged, sustainable, supportable Data & AI Product.
In this role you will…
-
Lead the ongoing technical design, development, testing and release of the CMC Knowledge Graph and other Data & AI solutions in CMC/NPI portfolio
-
Provide Data Engineering leadership, technical guidance, and mentorship to development team (both internal staff and contractors), driving performance and continuous improvement, including identifying and realising opportunities to increase velocity and product sustainability
-
Use hands-on technical problem-solving expertise to address technical challenges throughout product life-cycle
-
Support Product Management in creating and evolving a compelling Product vision and roadmap, using expertise and insights on the art-of-the-possible as well as in engaging users and other business stakeholders to gather feedback and identify system enhancements
-
Collaborate and influence Data & AI Architecture team and other stakeholders like Data Science teams to ensure alignment with established architecture standards, contributing to and evolving them to incorporate new technologies and patterns as needed
-
Ensure compliance with relevant policies and procedures (including Gx P validation where required)
-
Provide input to, reviewing and approving key technical documents (e.g. design spec, validation plan)
-
Stay updated with the latest advancements in Knowledge Graph and related technologies, recommending and implementing improvements to enhance system functionality and performance.
-
Lead discovery / proof-of-concept activities to establish early technical feasibility of new Products or Product Features
Why you?
Basic Qualifications/Experience:
We are looking for professionals with these required skills to achieve our goals:
- Proven track record in delivering complex data engineering projects in a cloud environment, preferably Azure.
-
Strong technical expertise in designing, developing, and supporting Knowledge Graphs, including proficiency in working with graph technologies such as:
-
Resource Description Framework (RDF), Web Ontology Language (OWL), SPARQL, and Cypher
-
Experience in leading and managing technical teams
-
Expertise in data modelling/ontologies, data integration, and data transformation techniques including experience with structured query languages and SQL and/or No SQL databases.
-
Strong programming skills and proficiency in utilising code repositories, such as Git, for efficient version control, collaboration, and code management.
-
Experience with Dev Ops principles and CI/CD practices to automate software delivery, streamline development processes, and ensure efficient deployment and monitoring
Preferred Qualifications/Experience:
If you have the following characteristics, it would be a plus:
-
Understanding of pharmaceutical industry data and domain knowledge within CMC.