Candidate's Name
Data Engineer

Email: EMAIL AVAILABLE
LinkedIn: LINKEDIN LINK AVAILABLE
Mobile: PHONE NUMBER AVAILABLE
United States

Professional Summary

Data Engineer with over 3 years of experience architecting and developing robust data pipelines and scalable solutions. Skilled in cloud services and several programming languages (Python, SQL, Scala) for data processing, storage, and analytics in both healthcare and enterprise environments. Achieved a 10% reduction in processing times by implementing optimized ETL processes and real-time data streaming solutions. Experienced with libraries such as NumPy, Pandas, and TensorFlow for advanced data processing and analysis. Proven track record of collaborating with cross-functional teams, including data scientists, analysts, and stakeholders, to deliver actionable insights and drive business outcomes.

Technical Skills

Programming Languages: Python, C/C++, SQL, JavaScript, TypeScript, Java
Web Technologies: HTML/HTML5, CSS3/CSS (Tailwind CSS, LESS, Bootstrap, Sass), JavaScript, TypeScript, ECMAScript 6, jQuery, AJAX, JSON, Web Services (REST, SOAP)
Frameworks: Django, Flask, FastAPI, React JS, Angular JS, Node JS, Express JS, Next JS
Databases: MySQL, PostgreSQL, SQL Server, MongoDB
Methodologies: SDLC, Agile (SCRUM), Waterfall
Cloud Technologies: AWS (EC2, S3, Lambda, ECS, ECR, CloudFront, CloudWatch, CloudFormation), Azure
CI/CD & DevOps: Git, GitHub, Jenkins, Docker, Kubernetes, Redis, Jest, Tableau, CircleCI
Libraries & ML: Pandas, NumPy, Matplotlib, Scikit-learn, SciPy, Plotly, Seaborn, Machine Learning, Deep Learning, Natural Language Processing, Data Analytics, Data Visualization (Tableau, Power BI)
Other Tools & Technologies: Apache Spark, Kafka, Hadoop

Professional Experience

Data Engineer
Molina Healthcare, USA
May 2023 - Present

- Designed and implemented high-throughput data pipelines using Apache Spark, significantly reducing processing time and enhancing population health management and cost analysis.
- Built scalable data warehouses on AWS Redshift, leading to a 15% improvement in data accessibility for generating reports and performing risk assessments.
- Collaborated with healthcare data scientists and analysts to develop data solutions that enhanced member risk profiling, care coordination, and fraud detection accuracy.
- Automated data quality checks and cleansing routines, achieving a high level of data accuracy to support reliable decision-making.
- Implemented Amazon Web Services (AWS), employing AWS Glue, EC2, and PySpark for computing and S3 as a storage mechanism, leading to a 20% improvement in data processing speed.
- Automated ETL logic development offshore, ensuring accurate understanding and enabling the creation of optimal ETL mappings.
- Designed and implemented complex, interactive dashboards in Tableau, incorporating dynamic visualizations and user-driven parameters to enhance data exploration capabilities.
- Implemented CI/CD pipelines using Jenkins, developed Ansible templates for automatic code deployment, and configured SonarQube to enhance code quality, resulting in a 30% reduction in deployment time.

Data Engineer
ISPARROW, India
Sep 2020 - Nov 2022

- Spearheaded end-to-end processes for data extraction, integration, modification, validation, analysis, management, and reporting, resulting in a streamlined and efficient data workflow.
- Employed advanced features of Scikit-learn and TensorFlow to optimize machine learning models, enhancing predictive accuracy and reducing inference time by 20%.
- Developed Python applications, scripts, and automation tools that automated repetitive tasks, reducing manual effort by 25%.
- Implemented AWS cloud technologies, including EC2 and S3, optimizing infrastructure and reducing operational costs by 15%.
- Executed SQL queries and optimized database performance, resulting in a 25% reduction in query execution time.
- Utilized Scala for data manipulation, contributing to improved data processing speed and accuracy.
- Collaborated with cross-functional teams, ensuring successful implementation of Waterfall methodology in project workflows.
- Employed Git for version control, enhancing collaboration and code management during software development.
- Executed data visualization using Tableau, providing actionable insights and improving decision-making processes.
- Implemented Hadoop and PySpark for efficient big data processing, reducing processing time by 20%.
- Employed SCRUM methodology, leading daily stand-ups and sprint planning, resulting in on-time project delivery.

Alma Mater

Master of Science in Computer Science
University of Missouri-Kansas City
Jan 2023 - May 2024

Bachelor of Technology in Computer Science & Engineering
University College of Engineering JNTUK, Narasaraopet
June 2019 - June 2022

Diploma in Computer Engineering
Aditya Engineering College
July 2016 - May 2019