| 20,000+ Fresh Resumes Monthly | |
|
|
| | Click here or scroll down to respond to this candidateCandidate's Name
EMAIL AVAILABLE PHONE NUMBER AVAILABLE Memphis, US LinkedInProfessional summaryExperienced Data Engineer with 3 years of experience, proficient in designing and implementing ETL processes and leveraging big data technologies such as Apache Spark and Hadoop. Experienced in stream processing with Apache Kafka, managing data warehousing solutions, and working with SQL and NoSQL databases such as PostgreSQL, MongoDB, and Cassandra. With extensive experience in deploying and managing solutions on cloud platforms such as AWS and Azure. SkillsLanguages: Python, C, SQL, PL/SQL, R, Shell Scripting Frameworks & Technology: Azure (ADF, Data Warehouse, Databricks, Data Lake, Blob Storage, Data Analytics), AWS (EC2, Lambda, S3, Redshift, SageMaker, IAM, EMR, Kinesis), GCP, Hadoop, Spark, Kafka, Airflow, Tableau, PowerBi, PyCharm, Jupyter, CI/CD, Pandas, NumPy, PySpark, Matplotlib, scikit-learn, TensorFlow, Jenkins, Kubernetes, Docker. Database and Tools: Oracle, PostgreSQL, MySQL, MongoDB, Cassandra, Amazon DynamoDB, HBase, Hive, Snowflake, Informatica, Glue, Databricks, Git, VSCodeSoft Skills: Analytical Thinking, Communication & Collaboration, Problem Solving, Critical Thinking, Decision making Professional ExperienceData Engineer, BNY Mellon 08/2023 Present Remote, USADesigned and implemented robust data integration pipelines, employing ETL processes to seamlessly collect and transform aviation data from diverse sources.Utilized advanced Python (Pandas) and SQL techniques for data cleaning and validation, ensuring data accuracy and reliability within the aviation dataset.Designed and implemented scalable and secure cloud-based storage solutions like Azure Blob Storage and Azure Functions for structured and unstructured aviation data.Utilized Apache Spark, Hadoop, and Python for large-scale data processing, and employed Tableau, Power BI, and Excel to create visually intuitive dashboards for real-time insights.Applied Python-based frameworks like TensorFlow, and sci-kit-learn for machine learning model development while ensuring robust security measures (HashiCorp Vault, Azure Key Vault) to protect sensitive aviation data.Fostered collaboration among aviation stakeholders through user-friendly interfaces, generating automated reports using SQL queries, Python scripts, and Excel for regulatory compliance and trend analysis. Data Engineer, Infosys 10/2019 12/2021 Hyderabad, IndiaGenerated monthly reports encompassing month-end revenue, profit, variance analysis, and project status.Evolved data models optimizing data processing pipelines in the AWS cloud, leading to a 30% increase in productivity.Produced and maintained monitoring dashboards, reports, and trends on AWS, reducing customer pain points by 28%.Established data transfer pipelines between AWS services and on-premises systems, achieving a throughput increase.Developed python Scripts for deploying the Pipeline in AWS Data Pipeline (ADP) for data processing using SQL Activity. Additionally Architected and implemented medium to large-scale BI solutions on AWS using AWS Data Platform services(Amazon S3, AWS Glue, Amazon Redshift, Amazon EMR, Amazon Athena, DynamoDB).Created scripts for loading data to Hive from HDFS and ingested data into the Data Warehouse using various data loading techniques, including batch processing and real-time ingestion.Analyzed existing systems and proposed improvements, integrating modern scheduling tools like Airflow and migrating legacy systems into an Enterprise data lake built on AWS Cloud.Enhanced data warehouses based on the STAR schema, executed data model updates, and conducted Tableau data analytics and reporting.EducationMaster of Science, University of Memphis 01/2022 12/2023 Memphis, TN Computer ScienceBachelor of Technology, Teegala Krishna Reddy Engineering College 08/2016 11/2020 Hyderabad, India Electronics and Communication EngineeringCertificatesAzure Data Engineer Associate Certification (DP 203) Data Science Foundations Certification from IBMHadoop Administration Certification from IBMHadoop Data Access Certification from IBMProjectsRetail Data Analysis and Visualization SystemEmployed Python, Pandas, and SQL for processing and analyzing extensive datasets from various retail sources.Devised and executed ETL workflows utilizing AWS Glue to extract, transform, and load retail transaction data, adhering to regulatory standards.Utilized PowerBi to craft dashboards and reports, providing insights into purchasing behaviors, product performance, and sales outcomes. Conducted analytics to detect trends, patterns, and irregularities within the retail data. |