Candidate's Name
DATA ENGINEER
Phone: PHONE NUMBER AVAILABLE
EMAIL AVAILABLE
TX, USA

SUMMARY
Results-driven Data Engineer with 5+ years of experience designing, developing, and maintaining large-scale data infrastructure at Credit Suisse and Draxo Infotech. Proficient in Python, SQL, Apache Spark, and Hadoop, with extensive experience on cloud platforms such as AWS and Azure. Implemented data pipelines that increased data processing speed by 30% and designed scalable database architectures that improved data retrieval efficiency by 25%. Adept at collaborating with cross-functional teams to translate business requirements into technical solutions, ensuring data integrity and optimizing performance.

SKILLS
Programming Languages: Scala, Python, Java, SQL, PL/SQL, R
IDEs: PyCharm, Jupyter Notebook
Big Data Ecosystem: Hadoop, MapReduce, Hive, DynamoDB, BigQuery, HDFS, Apache Spark, Apache Airflow, Storm
Machine Learning: Linear Regression, Logistic Regression, Decision Tree, K-means, Naive Bayes, Random Forest, Reconsolidation models, calculation models
Cloud Technologies: AWS (EC2, S3 Bucket, Amazon Redshift, Lambda, IAM, Kinesis, EMR), Kafka, Databricks, Microsoft Azure
Packages: NumPy, Pandas, Matplotlib, SciPy, Scikit-learn, Seaborn, TensorFlow, PySpark
Reporting Tools: Tableau, Power BI, SSRS
Databases: MS SQL Server, PostgreSQL, MongoDB, MySQL, Cassandra
Operating Systems: Windows, macOS

EDUCATION
Master of Science in Data Science, May 2023
University of Texas at Arlington, Texas
Bachelor's in Biotechnology, May 2018
Motilal Nehru National Institute of Technology, Allahabad, India

EXPERIENCE
Credit Suisse, TX
Data Engineer, January 2023 - Present
Engineered and automated ETL workflows using Apache Spark and Python, achieving a 30% reduction in data processing time and improving data accuracy.
Designed and implemented ETL processes using SQL to extract data from various sources, including transaction databases, online interactions, social media, and customer support systems.
Integrated Python with AWS Lambda for serverless data processing, decreasing infrastructure costs by 15%.
Led the development of a comprehensive customer analytics platform, using SQL for data collection, integration, transformation, and analysis.
Developed SQL queries and scripts for in-depth data analysis, calculating KPIs and deriving actionable insights for stakeholders.
Implemented CI/CD for data pipelines using AWS Data Pipeline and Airflow, reducing deployment time by 20%.
Leveraged AWS Glue to automate ETL processes for daily sales data feeds, reducing processing time by approximately 50%.

Draxo Infotech, India
Data Engineer, June 2018 - November 2021
Designed and automated data pipelines to parse and store raw data in partitioned Hive tables, improving data retrieval for reporting and analysis by 15%.
Built a scalable data lake using Databricks, ingesting and processing 2 terabytes of financial data, including historical market data, customer transactions, and risk assessments.
Collaborated with teams to integrate SQL-based visualization tools like Tableau, creating interactive dashboards that displayed customer behaviors, sales trends, and marketing campaign effectiveness.
Designed and implemented Hive tables to store website log data in a structured format, enabling efficient analysis with Apache Spark and reducing data processing time for marketing campaign performance analysis, which led to 35% faster identification of key insights.
Led a cross-functional team to implement automated dashboards and reporting using Power BI, empowering stakeholders with data-driven insights and achieving a 25% improvement in workforce performance.

CERTIFICATIONS
AWS Certified Solutions Architect