Software Engineer Data Quality Framework (PySpark / Databricks)

Aanvraagnummer: 43309
Contractvorm: Detachering
Organisatie: APG
Locatie: Heerlen, Netherlands
Uren per week: 40 per week
Tarief: Geen maximum
Duur van de opdracht: Ik mis de invoer. Stuur de duur van de opdracht, dan geef ik alleen het resultaat in het formaat “X maanden” of “X,5 maanden”.
Regio: Limburg
Startdatum: N/A
Optie tot verlenging: Mogelijk, maar nog niet bekend
Sluitingsdatum: 01-01-2900 00:00
LET OP! Reacties dienen minimaal 1 werkdag voor de sluitingsdatum binnen te zijn voor tijdige verwerking.
Dit ga je doen:
Within our Data Capabilities team at APG Asset Management, you will develop and maintain reusable data quality framework components in Databricks. Your focus is on rule execution, result handling, contract-driven validation and the operational setup needed to run the framework reliably across environments.
As a senior developer, you are responsible for, among other things:
- Design, develop and improve the PySpark-based data quality framework and reusable validation components in Databricks;
- Build and maintain rule execution, result handling and contract-driven validation capabilities;
- Own engineering quality through refactoring, testing, CI/CD, packaging and release practices;
- Support Databricks operations including jobs, alerts, triggers, notifications and multi-environment setup;
- Work with stakeholders to refine requirements and contribute to documentation, standards and team coaching.
Our team is a multi-disciplinary team responsible for the end-to-end delivery of solutions. We work in an agile setting and combine engineering, functional and platform expertise to deliver reliable capabilities that support business processes and data products. We aim for flexible, high-performing teams with strong collaboration and T-shaped skills.
Your work location will be Amsterdam or Heerlen.
Dit breng je mee:
As a Senior Data Quality Framework Developer, you combine strong engineering skills with a quality mindset and platform ownership. You know how to build reusable solutions, work across development and operations, and collaborate effectively with both technical and non-technical stakeholders.
Additionally we ask for:
- At least five years of relevant experience in software engineering, data platform engineering or data engineering;
- Strong proficiency in Python and PySpark, with a good understanding of the Spark framework and optimization practices;
- Experience developing and operating Databricks jobs in a production environment;
- Good understanding of software design principles, including object-oriented programming, modular design, maintainable code structures and package management;
- Experience with testing, CI/CD, deployment practices and development on a major cloud provider (Azure preferred) in multi-environment setups;
- Background in Computer Science, Software Engineering, Data Engineering or a related field;
- Strong communication skills in English and the ability to work with a broad stakeholder group.
- You work independently, are proactive, and take ownership by picking up tasks on your own.
Nice to haves:
- Experience with data quality checks, data quality frameworks, rule engines or metadata-driven validation patterns;
- Experience building or maintaining reusable framework components, platform services or shared libraries;
- Experience with Databricks alerts, notifications and triggers;
- Experience with Azure DevOps and Artifactory;
- Familiarity with data contracts, rule versioning or governance-related requirements.
QUALIFICATION/LICENSURE
Preferred years of experience : 5 years
Travel required : 25% to 50%
Shift timings :
