DATA ENGINEER L3
Job description
We operate at global scale and we are expanding to Portugal! If you are passionate and want to make a difference, we want to get to know you! Join us and be part of this incredible adventure! Who are we looking for? YOU!
Position Summary.
We are looking for a Databricks Specialist Consultant with a deep focus on Unity Catalog and Microsoft Purview, responsible for migrating Parquet files stored in Azure Storage Account containers to Delta Table format and implementing a layered architecture (Bronze, Silver, Gold) in Databricks.
The consultant will also be responsible for integrating these Delta Tables into Unity Catalog, with centralised governance via Microsoft Purview, ensuring that the data is accessible and well governed while being used for dynamic reporting in Power BI.
Responsibilities.
Expertise in Unity Catalog.
Migrate Parquet files stored in Azure Storage Account to Delta Tables registered in Unity Catalog in Databricks.
Apply granular access control at table, column and schema level using RBAC in Unity Catalog, ensuring compliance and security.
Configure and optimise Unity Catalog to provide centralised governance over all data in Databricks, ensuring that permissions and data lineage are clearly defined.
Advanced integration with Microsoft Purview.
Integrate Unity Catalog with Microsoft Purview for automatic cataloguing, data lineage tracking and auditing.
Ensure that all data changes, permissions and metadata are visible and auditable via Purview, guaranteeing compliance with regulations such as GDPR and HIPAA.
Implementation of Layered Data Architecture.
Implement a Bronze, Silver, Gold architecture in Databricks, with separate data layers for ingestion, transformation and final exposure for reporting.
Create Databricks clusters suitable for each layer, optimising performance and guaranteeing scalability and security in data processing.
Notebook Conversion and Delta Table Optimisation.
Review and migrate existing notebooks that handle Parquet files to use Delta Tables registered in Unity Catalog.
Implement performance optimisations in Delta Tables using commands such as OPTIMIZE and VACUUM to improve query efficiency and free up space.
Reports and Visualizations with Power BI.
Ensure that data transformed and governed via Delta Tables in Databricks is accessible for real-time reporting in Power BI, using DirectQuery so that data is always up to date.
Technical skills required.
Expert in Unity Catalog.
Advanced experience in configuring, managing and optimizing Unity Catalog in Databricks, including access control, security policies and governance.
Microsoft Purview.
Proficiency in Microsoft Purview, with experience in integrating and maintaining data governance with Unity Catalog, ensuring traceability, auditing and compliance.
Databricks and Delta Lake.
Solid experience using Databricks for large-scale data manipulation, especially utilizing Delta Lake and Delta Tables.
Ability to implement and optimize data pipelines in Bronze, Silver and Gold tiers in Databricks.
Azure Storage.
In-depth knowledge of Azure Storage Accounts, Azure Blob Storage and Azure Data Lake Storage (ADLS), including the manipulation of Parquet files for storage and performance optimization.
Power BI.
Ability to integrate Databricks data into Power BI to create dynamic dashboards and reports, ensuring security permissions are respected.
Desirable Skills.
Data Governance and Security.
Deep understanding of data governance, compliance policies and security practices, especially in the context of sensitive data.
Pipeline Automation.
Experience in automating and orchestrating data pipelines in an Azure environment, with a focus on efficiency and resource optimization.
Mandatory Requirements.
Expertise in Unity Catalog in Databricks.
Advanced experience with Microsoft Purview for data governance and auditing.
Proven ability to migrate and optimize data in Delta Tables and register them in Unity Catalog.
In-depth knowledge of data manipulation in Azure Storage Accounts and Databricks.
Experience with data integration in Power BI for dynamic reports and visualizations.
Nice to Have.
Microsoft Azure certifications such as Azure Data Engineer or Azure Solutions Architect.
Experience with PySpark for task automation and optimization in Databricks.
Work Location.
Lisbon, 60% remote and 40% on-site.
Remote Work.
No
Job details
- Grupo Data
- All of Portugal
- Not specified - Not specified
- Not specified
- 27/10/2024
- 25/01/2025