| 20,000+ Fresh Resumes Monthly | |
|
|
| | Click here or scroll down to respond to this candidateCandidate's Name
PH: PHONE NUMBER AVAILABLE mail: EMAIL AVAILABLE Website: LinkedIn Location: IllinoisPROFESSIONAL SUMMARYData Engineer with 3+ years of experience designing, implementing, and optimizing data solutions across AWS and Azure cloud platforms. Proven expertise in ETL processes, real-time pipelines, and scalable data lakes using Big Data technologies like Kafka, Spark, and Hadoop tools. Skilled in data governance, data quality management, and data visualization using Tableau and Power BI. Exceptional attention to detail, project management, and problem-solving skills, ensuring accurate and efficient data solutions. TECHNICAL SKILLS Programming: Python, SAS, Scala, SQL Big Data: HDFS, MapReduce, Hadoop, Hive, Kafka, Apache Spark, Data Lake Databases: MySQL, MSSQL, NoSQL, Azure SQL DB(T-SQL), Snowflake Cloud: Microsoft Azure and Amazon Web Services (AWS) Tools: Tableau, Power BI, Agile, Jira, Git, Scrum, Docker, Kubernetes, CI/CD, Excel, Terraform, VS code, A/B Testing WORK EXPERIENCEData Engineer, BNY Mellon Remote, USA November 2023 Present Migrated on-premises warehouse to Azure Synapse Analytics, reducing costs by 40% and improving performance Implemented scalable data lake on Azure Data Lake Storage Gen2 for efficient data processing. Created real-time streaming applications using Spark Streaming, Kafka, and Scala reducing latency by 50%. Implemented monitoring systems using Airflow, achieving 99.9% data availability, reducing downtime by 80%. Automated infrastructure management with Terraform, cutting deployment time by 30%. Implemented automated testing for ETL processes using Azure DevOps and SAS, increasing code reliability and reducing bugs in production by 25%. Developed Power BI dashboards for real-time monitoring of key business metrics. Led ML model deployment into production using Docker and Kubernetes for real-time insights. Implemented A/B testing framework with Python, Azure Databricks, and T-SQL Azure SQL DB improving product featuresData Engineer, Tech Mahindra Hyderabad, India March 2020 July 2022 Developed and optimized ETL processes and automated data quality checks using AWS Glue, Python, and Cassandra, boosting system performance and reducing errors in production data by 30%. Built real-time data pipelines with Kinesis and Lambda, improving the data quality Designed and deployed comprehensive data lake architecture on AWS S3, improving accessibility and reducing costs Engineered high-throughput data pipeline using Kafka, Spark Streaming, and HDFS, processing 1 million events/second. Managed Amazon EMR clusters, Hive tables, MapReduce jobs, improving the data processing speed Optimized Snowflake warehouse, reducing query execution time by 45% and cloud costs by 30%. Developed CI/CD pipeline using AWS DevOps tools, increasing deployment efficiency Implemented a Kubernetes cluster for containerized applications, improving scalability by 55%. EDUCATIONMaster of Science in Computer Science Missouri, USA August 2022 May 2024 University of Missouri-Kansas CityBachelor of Technology in Computer Science Hyderabad, India June 2017 May 2021 GITAM Deemed to be UniversityCERTIFICATIONS Microsoft Certified: Azure Developer Associate Cisco certified: Data Science Essentials Coursera certified: Programming for Everybody, Python Data Structures, Machine Learning |