Candidate Information
Title: Azure Data Engineer
Target Location: US-IN-Fort Wayne
Candidate's Name
Azure Data Engineer
Email: EMAIL AVAILABLE
Mobile: PHONE NUMBER AVAILABLE

SHORT SUMMARY:
Experienced Azure Data Engineer with 10+ years in the IT industry, including 6+ years in Data Engineering and 4 years in Data Warehousing. Specialized in the integration, transformation, and consolidation of diverse datasets. Proficient in Python, SQL, and Scala programming, with strong expertise in Azure Databricks, Azure Data Factory, and Azure Synapse Analytics for batch and real-time data processing. Skilled in data streaming platforms such as Apache Kafka and Azure Event Hubs. Hands-on experience with Spark notebooks, Azure Data Lake Storage, Blob Storage, and relational databases. In-depth knowledge of ETL processes, Power BI for data visualization, and migration projects to Azure. Adept at leveraging Azure services and Snowflake to design and implement scalable data solutions, ensuring data accuracy and integrity throughout the pipeline.

PROFESSIONAL SUMMARY:
10+ years of experience in the IT industry contributing to and implementing data integration, transformation, and consolidation procedures; currently working as an Azure Data Engineer.
Proficient with Python, SQL, and Scala programming, with hands-on experience working with source systems.
Strong knowledge of and experience with Azure Data Factory, Azure Synapse Analytics, and Azure Databricks for batch data processing.
Developed a comprehensive strategy for leveraging ADF to streamline data integration processes and enhance reporting capabilities.
Strong knowledge of and experience with Azure Stream Analytics and Azure Databricks for real-time data processing.
Worked with real-time data streaming platforms Apache Kafka and Azure Event Hubs.
Good experience working with Azure Data Lake Storage Gen2, Azure Blob Storage, and Azure Data Lake Analytics.
Involved in creating security groups in Azure Active Directory and leveraged Terraform for Infrastructure as Code (IaC) to automate provisioning, deployment, and management of Azure resources.
Worked with Azure Key Vault secrets and certificates, ensuring security in line with organizational standards.
Worked with Azure Logic Apps to automate workflows connecting various services and systems.
Strong theoretical background and hands-on experience with Microsoft Azure services, including file storage, databases, incremental loads, multi-dependency trigger file pipelines, migrating data from on-premises to the cloud, and loading data from Snowflake and REST APIs.
Implemented pre-processing and transformations using Azure Databricks and Azure Data Factory (Data Flows).
Experience with migration projects to the Azure cloud and with Azure architecture decision-making to implement ETL and data movement solutions using Azure Data Factory (ADF).
Developed and optimized large-scale data processing pipelines using PySpark for ETL on Azure Databricks, and implemented data transformation and aggregation tasks using the PySpark DataFrame and SQL APIs (a sketch of this style follows this summary).
Developed and maintained data transformation workflows using DBT, enabling efficient and scalable data pipeline management.
Developed and optimized Kafka topics, partitions, and replication strategies, and built Spark applications using PySpark to process terabytes of data.
Proficient in Hadoop, Spark, Cloudera, Hive, Sqoop, Flume, and Kafka for customer behavioural data analysis.
Comprehensive experience with real-time Big Data technologies, including Hadoop, Spark, HBase, Hive, RDDs, DataFrames, and Cassandra migration.
Performed data quality checks using Data Flows, and implemented SCD Type 1 and Type 2 using Data Flows, external tables, Spark and Synapse notebooks, and logging web services for fast and efficient processing of Big Data.
Implemented integration from various data sources such as Oracle Cloud, file storage, RDBMS, spreadsheets, and data lakes.
Good experience working with data warehouses such as Snowflake and Dedicated SQL Pools, with strong knowledge and hands-on experience with both Serverless and Dedicated SQL Pools within Azure Synapse Analytics.
Demonstrated experience in enterprise data warehouse design using Star Schema and Snowflake Schema dimensional models.
Good experience working with relational databases such as MS SQL Server 2016, MySQL, PostgreSQL, and Azure SQL Database.
Expertise in performing ETL in Power Query Editor, data modelling, and report/dashboard creation in Power BI.
Utilized visualizations such as tree maps, area charts, and funnel charts, and imported custom visuals in Power BI for interactive data analysis.
Experienced in working with Databricks and different components of Spark, Git, SQL, and HDFS.
Experienced in working with the version control system Git and web-based GitHub.
In-depth knowledge of the Software Development Life Cycle (SDLC), with a thorough understanding of phases such as requirements analysis, design, and development.
Hands-on experience using Jira and Azure DevOps Boards, following Agile methodology (Scrum).
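A minimal, hypothetical PySpark sketch of the DataFrame/SQL style of transformation and aggregation referenced in the summary above. The paths, table name (orders), and columns (order_id, region, order_date, amount) are illustrative assumptions rather than artifacts of any engagement, and Delta Lake storage is assumed as it would be on Databricks.

```python
# Hypothetical batch aggregation expressed with both the DataFrame API and Spark SQL.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("etl-sketch").getOrCreate()

# Assumed Delta path and columns: order_id, region, order_date, amount.
orders = (spark.read.format("delta").load("/mnt/datalake/raw/orders")
          .filter(F.col("order_date") >= "2024-01-01")
          .withColumn("amount", F.col("amount").cast("decimal(18,2)")))

# DataFrame-API aggregation.
daily = (orders
         .groupBy("region", F.to_date("order_date").alias("order_day"))
         .agg(F.sum("amount").alias("total_amount"),
              F.countDistinct("order_id").alias("order_count")))

# The same aggregation through the SQL API.
orders.createOrReplaceTempView("orders")
daily_sql = spark.sql("""
    SELECT region,
           to_date(order_date)      AS order_day,
           SUM(amount)              AS total_amount,
           COUNT(DISTINCT order_id) AS order_count
    FROM orders
    GROUP BY region, to_date(order_date)
""")

# Land the curated result back in the lake (assumed curated-zone path).
daily.write.format("delta").mode("overwrite").save("/mnt/datalake/curated/daily_orders")
```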
CERTIFICATION:
Microsoft Certified: Azure Data Engineer Associate (DP-203)

TECHNICAL SKILLS:
Cloud Services: MS Azure (IaaS, PaaS, SaaS), Azure SQL, Azure Databricks, Azure Data Factory, Azure Key Vault, Azure Logic Apps, Azure Event Hubs, Azure Blob Storage, Snowflake
Big Data Technologies: PySpark, Scala, MapReduce, Apache Spark, Hive, Impala, Kafka, Zookeeper, Oozie, Cloudera, HBase
Hadoop Distributions: Cloudera, Hortonworks, Apache Hadoop
Programming Languages: Python, SQL, Scala, PL/SQL
Operating Systems: Windows (XP/7/8/10), UNIX, Linux, Ubuntu, CentOS
SDLC: Agile, Scrum, Waterfall, Trello
Source Control & Collaboration Tools: Jira, Confluence, SharePoint, Git, GitHub, Azure DevOps
Build Tools: Jenkins, Maven, DBT
Databases: MS SQL Server 2018/2016/2008, Azure SQL DB, Azure Synapse, MS Access, Oracle 11g/12c, Cosmos DB, PostgreSQL, MongoDB, T-SQL
Other Tools: Power BI, Tableau, Terraform

PROFESSIONAL EXPERIENCE:

Client: Penske Logistics, Tampa, FL (Sep 2022 to Present)
Role: Azure Data Engineer with Snowflake
Responsibilities:
Developed and implemented end-to-end data pipelines using Copy and Lookup activities in Azure Data Factory, seamlessly extracting, transforming, and loading diverse data sources into Azure Synapse and Snowflake to optimize logistics and transportation operations.
Implemented strategic data processing workflows in Azure Databricks, harnessing Spark's capabilities to execute large-scale data transformations and contributing significantly to operational efficiency in managing package tracking, shipment details, and supply chain data.
Implemented scalable and optimized Snowflake and Azure Synapse schemas, tables, and views using DBT, catering to complex reporting requirements and analytics queries and ultimately improving decision-making across the logistics and transportation landscape.
Contributed to a 25% improvement in data pipeline stability and integrity by actively collaborating on ETL tasks and implementing robust error-handling mechanisms.
Developed and maintained Terraform scripts to automate the creation and management of Azure resources such as virtual networks, storage accounts, and SQL databases.
Optimized data pipelines and Spark jobs in Azure Databricks using PySpark and Spark SQL for enhanced performance, including tuning Spark configurations, caching, and leveraging data partitioning techniques to support the efficient processing of large datasets.
Implemented data quality checks and validation processes in PySpark to ensure the integrity and accuracy of data, and implemented Scala and Python scripts for data extraction from various sources, including REST APIs and Azure SQL Database (a validation sketch appears at the end of this engagement's description).
Designed data ingestion pipelines utilizing Azure Event Hubs and Azure Functions, facilitating real-time data streaming into Azure Synapse for timely insights into package tracking and shipment information.
Used Azure Data Lake Storage for efficient storage of raw and processed data, applying data partitioning and retention strategies to enhance data management and support daily operations.
Leveraged Azure Blob Storage for streamlined storage and retrieval of data files, implementing compression and encryption techniques to optimize costs and enhance data security.
Integrated Azure Data Factory with Azure Logic Apps to orchestrate complex data workflows, triggering actions based on specific events and significantly improving customer service through enhanced package tracking and communication.
Implemented data replication and synchronization strategies between Azure Synapse, Snowflake, and other data platforms, utilizing Azure Data Factory and Change Data Capture (CDC) techniques to maintain data integrity.
Leveraged Snowpipe to process semi-structured data formats such as JSON and Parquet, enabling efficient storage and analysis of complex data structures in Snowflake.
Implemented Snowpipe's notification feature to trigger downstream data processing tasks in Azure Databricks and Azure Data Factory, enabling timely insights and decision-making in the logistics and transportation domain.
Developed and deployed Azure Functions for data preprocessing, enrichment, and validation tasks within data pipelines, contributing to increased operational efficiency in managing diverse datasets.
Executed data archiving and retention strategies utilizing Azure Blob Storage and Azure Synapse, optimizing operational efficiency in managing historical data, and troubleshot and resolved issues related to Terraform deployments.
Implemented Slowly Changing Dimension (SCD) and Change Data Capture (CDC) techniques within these workflows to manage historical and incremental changes in customer data effectively (a sketch of the SCD Type 2 merge pattern follows this list).
Established custom monitoring and alerting solutions using Azure Monitor and Azure Synapse Query Performance Monitoring (QPM), ensuring proactive identification and resolution of performance issues.
Integrated Azure Synapse and Snowflake with Power BI and Azure Analysis Services to create interactive dashboards and reports, using DBT for data transformations and empowering business users with self-service analytics that enhance operational efficiency, customer service, and decision-making.
Utilized JIRA for issue and project workflow management and employed Git as a version control tool to maintain the code repository.
Collaborated with DevOps engineers to establish automated CI/CD pipelines aligned with client requirements.
Exhibited excellent time management abilities, regularly meeting project deadlines through workflow optimization, Agile methodology application, and active participation in planning and daily stand-ups.
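A minimal sketch of the SCD Type 2 pattern mentioned above, written for a Databricks notebook with Delta Lake. The table names (dim_customer, stg_customer), the business key (customer_id), and the tracked columns are hypothetical placeholders, and the surrogate-key step is omitted for brevity; this illustrates the expire-then-insert approach rather than the project's actual code.

```python
# Hypothetical SCD Type 2 upsert on a Delta dimension table (Databricks).
from pyspark.sql import SparkSession, functions as F
from delta.tables import DeltaTable

spark = SparkSession.builder.getOrCreate()

incoming = spark.table("stg_customer")           # assumed daily extract
dim = DeltaTable.forName(spark, "dim_customer")  # assumed Delta dimension
tracked = ["name", "segment", "city"]            # assumed SCD2-tracked columns

current = dim.toDF().filter("is_current = true")

# Keep only rows that are new or whose tracked attributes changed.
joined = incoming.alias("s").join(
    current.alias("t"),
    F.col("s.customer_id") == F.col("t.customer_id"),
    "left",
)
change_cond = F.col("t.customer_id").isNull()
for c in tracked:
    change_cond = change_cond | ~F.col(f"s.{c}").eqNullSafe(F.col(f"t.{c}"))
changed = joined.where(change_cond).select("s.*")

# Step 1: expire the currently active versions of changed keys.
(dim.alias("t")
 .merge(changed.alias("s"),
        "t.customer_id = s.customer_id AND t.is_current = true")
 .whenMatchedUpdate(set={"is_current": "false", "end_date": "current_date()"})
 .execute())

# Step 2: append the changed/new rows as the active versions.
(changed
 .withColumn("is_current", F.lit(True))
 .withColumn("start_date", F.current_date())
 .withColumn("end_date", F.lit(None).cast("date"))
 .write.format("delta").mode("append").saveAsTable("dim_customer"))
```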
Environment: Azure Databricks, Azure Data Factory, Azure Synapse, Azure Event Hubs, Azure Logic Apps, Azure SQL Database, DBT, Terraform, Azure Data Lake Storage, Blob Storage, Snowflake, Azure Functions, MS SQL, Oracle, HDFS, MapReduce, YARN, Spark, Hive, SSIS, SQL, Python, Scala, PySpark, shell scripting, Git, JIRA, Jenkins, Kafka, ADF pipelines, Power BI.
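The data quality and validation checks noted for this engagement could look something like the following hypothetical PySpark sketch. The dataset, columns, and rules (shipments, shipment_id, weight_kg, status) are placeholders, and a real pipeline would route failures to a quarantine zone and an alert rather than simply raising an error.

```python
# Hypothetical row-level data quality checks on a shipments dataset.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()

shipments = spark.read.format("delta").load("/mnt/datalake/raw/shipments")  # assumed path

checks = {
    "null_shipment_id": F.col("shipment_id").isNull(),
    "negative_weight": F.col("weight_kg") < 0,
    "unknown_status": ~F.col("status").isin("CREATED", "IN_TRANSIT", "DELIVERED"),
}

# Count how many rows violate each rule.
violation_counts = shipments.agg(
    *[F.sum(F.when(cond, 1).otherwise(0)).alias(name) for name, cond in checks.items()]
).collect()[0].asDict()

bad_rules = {name: n for name, n in violation_counts.items() if n and n > 0}
if bad_rules:
    # In practice: write offending rows to quarantine storage and raise an alert.
    raise ValueError(f"Data quality checks failed: {bad_rules}")

# Only clean data proceeds to the curated zone.
shipments.write.format("delta").mode("overwrite").save("/mnt/datalake/curated/shipments")
```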
Client: Citigroup Inc, Dallas, TX (July 2021 to Aug 2022)
Role: Azure Data Engineer
Responsibilities:
Implemented end-to-end data pipelines for a fraud detection system, utilizing Azure Databricks, Azure Data Factory (ADF), and Logic Apps.
Utilized ETL processes and DBT to gather information from diverse sources and feed it into a centralized fraud detection platform.
Designed and implemented fraud detection workflows using a scalable data processing framework, incorporating optimized data models to support the complex analytics queries and reporting requirements specific to fraud analytics.
Leveraged MS SQL, Oracle, and other relevant databases for efficient data storage and retrieval.
Implemented a secure and compliant solution for storing raw and processed data, utilizing Azure Data Lake Storage and HDFS and incorporating partitioning and retention strategies aligned with fraud detection regulations.
Integrated ADF pipelines for data ingestion, enabling real-time data streaming into the fraud detection system.
Implemented functions within Lookup activity code for complex business logic and data transformations, including decoding, mapping, filtering, and reformatting data.
Integrated the fraud detection system with Azure Logic Apps for orchestration, managing complex data workflows and triggering actions based on specific fraud-related events.
Implemented robust data governance practices and quality checks using MS SQL, Oracle, and other relevant databases to ensure the accuracy and consistency of data within the fraud detection system.
Designed and deployed functions using Python, Scala, and PySpark specifically tailored for data preprocessing, enrichment, and validation tasks within the fraud detection pipelines.
Implemented a robust and scalable HDFS cluster architecture and data lake solutions supporting batch and real-time processing.
Optimized data pipelines for improved performance, incorporating tuning techniques specific to Azure Databricks and Spark.
Implemented Hive for data warehousing.
Developed ETL processes to synchronize changes from operational systems to the analytics store, supporting different Slowly Changing Dimension (SCD) strategies.
Implemented fact and dimension tables following the principles of dimensional modelling, enabling simpler queries and reporting for business users.
Loaded data into Snowflake from various sources, including structured and semi-structured data formats, using Snowflake's native loading utilities such as SnowSQL, Snowpipe, and the Snowflake connectors (a loading sketch follows this engagement's environment list).
Tuned SQL queries and optimized query execution plans in Snowflake to improve query performance and reduce latency.
Designed and implemented data security and compliance measures in Snowflake to ensure data protection and regulatory compliance.
Implemented monitoring and alerting solutions using Azure Monitor and JIRA for proactive identification and resolution of performance issues within the fraud detection pipelines, and utilized Jenkins for automated pipeline deployment.
Created interactive and visually compelling reports and dashboards using Power BI to visualize complex datasets and derive actionable insights.
Integrated Terraform with CI/CD pipelines to enable continuous delivery and deployment of infrastructure changes, and developed automated testing frameworks for validating infrastructure code before deployment.
Implemented data cataloging and lineage solutions using Azure Purview, incorporating Git for version control, to provide a comprehensive understanding of data assets and their relationships within the fraud detection system.
Environment: Azure Databricks, Azure Data Factory, DBT, Azure Logic Apps, Azure Functions, Terraform, Snowflake, MS SQL, MongoDB, Oracle, HDFS, MapReduce, Spark, Hive, SQL, Python, Scala, PySpark, Git, JIRA, Jenkins, Kafka, ADF pipelines, Power BI, Kubernetes, Azure Purview.
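As an illustration of the Snowflake loading work described above, the sketch below issues a COPY INTO through the snowflake-connector-python package. The account, credentials, stage name (@fraud_stage), and target table are hypothetical; in practice credentials would come from a secret store such as Azure Key Vault, and Snowpipe auto-ingest would replace the manual COPY for continuous loads.

```python
# Hypothetical bulk load of Parquet files from an external stage into Snowflake.
import snowflake.connector

conn = snowflake.connector.connect(
    account="my_account",         # placeholder
    user="etl_user",              # placeholder
    password="<from-key-vault>",  # supply via a secret store, not hard-coded
    warehouse="ETL_WH",
    database="FRAUD_DB",
    schema="RAW",
)

try:
    cur = conn.cursor()
    cur.execute("""
        COPY INTO RAW.TRANSACTIONS
        FROM @fraud_stage/transactions/
        FILE_FORMAT = (TYPE = PARQUET)
        MATCH_BY_COLUMN_NAME = CASE_INSENSITIVE
    """)
    for row in cur.fetchall():    # one result row per staged file
        print(row)
finally:
    conn.close()
```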
Client: CITRIX, Fort Lauderdale, FL (Oct 2018 to Dec 2020)
Role: Data Engineer
Responsibilities:
Designed and implemented complex data pipelines using Azure Data Factory, ensuring seamless data integration and transformation workflows.
Employed Azure Data Factory (ADF), Sqoop, Pig, and Hive to establish an ETL framework for extracting data from diverse sources and ensuring its availability for consumption.
Integrated Azure SQL Database with ADF for structured data storage and real-time data processing, enabling high-performance analytics.
Implemented Azure Data Lake Storage (ADLS) for secure, scalable, and cost-effective storage of large datasets, facilitating advanced analytics.
Leveraged Spark SQL and DataFrames for advanced data querying and manipulation, improving data accessibility and analytical capabilities.
Engineered a robust Spark Streaming application to handle real-time data analytics, providing timely insights and enhancing business decision-making processes (a streaming sketch follows this engagement's environment list).
Processed HDFS data and created external tables in Hive, developing reusable scripts for table ingestion and repair activities across the project.
Developed Spark and Scala-based ETL jobs for migrating data from Oracle to new MySQL tables, leveraging Spark's capabilities for efficient data processing.
Utilized Spark (RDDs, DataFrames, Spark SQL) and the Spark-Cassandra Connector APIs for tasks such as data migration and generating business reports, demonstrating versatility in Spark usage.
Pioneered data crunching, ingestion, and transformation activities, engineering a Spark Streaming application for real-time sales analytics and showcasing proficiency in real-time data processing.
Conducted thorough analysis of source data, managed data type modifications, and utilized various data formats (Excel sheets, flat files, CSV files) to generate ad-hoc reports in Power BI, supporting data-driven decision-making.
Implemented Slowly Changing Dimension (SCD) strategies, seamlessly handling updates and improving data accuracy by 25%, showcasing expertise in data quality management.
Automated deployments using YAML scripts, ensuring streamlined builds and releases and demonstrating proficiency in deployment practices.
Collaborated extensively on the creation of combiners, partitioning, and distributed cache to enhance the performance of MapReduce jobs, demonstrating teamwork and optimization skills.
Managed source code and enabled version control using Git and GitHub repositories, ensuring code integrity and collaboration among team members.
Environment: Azure Data Factory, Azure SQL Database, Azure Data Lake Storage, Sqoop, Pig, Hive, HDFS, Spark, Scala, MySQL, RDDs, DataFrames, Spark SQL, Spark-Cassandra Connector, Excel sheets, flat files, CSV files, Power BI, Azure Key Vault, Azure Function Apps, Azure Logic Apps, Apache HBase, Zookeeper, Flume, Kafka, Git, and GitHub.
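The real-time sales analytics application mentioned above can be sketched, under assumptions, with Spark Structured Streaming reading from a Kafka topic. The broker address, topic name (sales_events), and JSON schema are illustrative; the console sink stands in for whatever store the reports were actually served from, and the job requires the spark-sql-kafka package on the classpath.

```python
# Hypothetical real-time revenue aggregation over a stream of sales events.
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.types import StructType, StructField, StringType, DoubleType, TimestampType

spark = SparkSession.builder.appName("sales-stream-sketch").getOrCreate()

event_schema = StructType([
    StructField("store_id", StringType()),
    StructField("amount", DoubleType()),
    StructField("event_time", TimestampType()),
])

raw = (spark.readStream
       .format("kafka")
       .option("kafka.bootstrap.servers", "broker:9092")  # placeholder broker
       .option("subscribe", "sales_events")                # placeholder topic
       .load())

sales = (raw
         .select(F.from_json(F.col("value").cast("string"), event_schema).alias("e"))
         .select("e.*"))

# Rolling 5-minute revenue per store, tolerating 10 minutes of late data.
revenue = (sales
           .withWatermark("event_time", "10 minutes")
           .groupBy(F.window("event_time", "5 minutes"), "store_id")
           .agg(F.sum("amount").alias("revenue")))

(revenue.writeStream
 .outputMode("update")
 .format("console")           # swap for a Delta/JDBC sink in practice
 .option("truncate", "false")
 .start()
 .awaitTermination())
```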
Client: Nielsen Corporation, Tampa, FL (Aug 2017 to Sep 2018)
Role: ETL Developer
Responsibilities:
Created and maintained databases for server inventory and performance inventory.
Developed stored procedures and triggers to ensure consistent data entry into the database.
Generated drill-through and drill-down reports in Power BI, incorporating drop-down menu options, data sorting, and subtotals.
Utilized the data warehouse to develop a data mart feeding downstream reports, and developed a user access tool enabling users to create ad-hoc reports and run queries for data analysis against the proposed cube.
Created packages for transferring data from Oracle, MS Access, flat files, and Excel files to SQL Server 2008 R2 using SSIS.
Deployed SSIS packages and established jobs for efficient package execution.
Created ETL packages using SSIS to extract, transform, and load data from heterogeneous databases into the data mart.
Built cubes and dimensions with various architectures and data sources for Business Intelligence.
Created SSIS jobs to automate report generation and cube refresh packages.
Used SQL Server Reporting Services (SSRS) to author, manage, and deliver both paper-based and interactive web-based reports.
Worked in Agile Scrum methodology, participating in daily stand-up meetings; used Visual SourceSafe with Visual Studio 2010 and Trello for project tracking.
Environment: SQL Server technologies, Power BI, SSIS, SSRS, Agile Scrum methodology, Visual SourceSafe, Visual Studio 2010, and Trello.

Client: Portware, Hyderabad, India (June 2012 to July 2016)
Role: Data Warehouse Developer
Responsibilities:
Designed ETL data flows using SSIS, creating mappings and workflows to extract data from SQL Server and performing data migration and transformation from Access and Excel sheets using SQL Server SSIS.
Performed dimensional data modeling for data mart design, identifying facts and dimensions and developing fact and dimension tables using Slowly Changing Dimensions (SCD).
Handled errors and events using precedence constraints, breakpoints, checkpoints, and logging.
Built cubes and dimensions with different architectures and data sources for Business Intelligence and wrote MDX scripts.
Gained thorough knowledge of the features, structure, attributes, hierarchies, and Star and Snowflake schemas of data marts.
Developed SSAS cubes, aggregations, KPIs, measures, cube partitioning, and data mining models, and deployed and processed SSAS objects.
Created ad-hoc reports and reports with complex formulas, and queried the database for Business Intelligence.
Implemented OLAP (Online Analytical Processing) and OLTP (Online Transactional Processing) designs to optimize data storage, retrieval, and analysis for diverse business needs.
Developed parameterized, chart, graph, linked, dashboard, and scorecard reports on SSAS cubes using drill-down, drill-through, and cascading reports in SSRS.
Flexible, enthusiastic, and project-oriented team player with excellent written and verbal communication and leadership skills for developing creative solutions to challenging client needs.
Environment: MS SQL Server 2016, Visual Studio 2017/2019, SSIS, SharePoint, MS Access, Team Foundation Server.

Education:
Bachelor of Technology, Jawaharlal Nehru Technological University Kakinada, 2012.
