Lead Cloud Data Engineer:
The role will be responsible for building new data pipelines and optimizing data flows using the Azure cloud stack.
Ideal candidate will be an experience data pipeline builder and data wrangler who enjoys building data products from scratch. The data engineer will need to support Business Analyst's and Data Architect's with discovery and best practices. He/She must be self-directed and comfortable supporting the data needs of multiple teams, systems and products.
Skills Needed:
- Advance Azure knowledge and experience working and migrating data products from on-prem to Azure.
- Experience building and optimizing big data' data pipelines, architectures and data sets using Py-Spark.
- Experience building Real Time data pipelines using Event-hub, storage queues and Azure stream analysis.
- Strong analytic skills related to working with unstructured datasets.
- Build processes supporting data transformation, data structures, metadata, dependency and workload management.
- Cloud Big Data Analytics in Azure Synapse Analytics, Azure Analysis Services
- Data Ingestion and Storage including Azure Data Factory, Azure Databricks, Azure Data Lake, Kafka and Spark Streaming, Azure EventHub/IoT Hub, and Azure Stream Analytics
- Experience with Bigdata Tools: Hadoop Spark, Kafka
- Experience with Stream processing systems: Spark-streaming, Kafka
- Experience with Object Oriented/object function Scripting languages: Python preferred.
- Understanding of Hadoop HDFS-Hive, Power BI and Basic Unix Scripting will be a bonus.