Candidate's Name
PHONE NUMBER AVAILABLE | EMAIL AVAILABLE | LINKEDIN LINK AVAILABLE | Candidate's Name 7.github.io/portfolio
EXPERIENCE
Clinical Research Data Scientist Jun 2024 - Present
University of Pennsylvania Philadelphia, PA
Analyzed MRI data from Multiple Sclerosis patients using machine learning models such as OC-SVM, Isolation Forest,
and a Variational Autoencoder resulting in better fits for complex nonlinear relations.
Data Engineer Jun 2024 - Sep 2024
Reveal Global Inc Remote, PA
Built an end-to-end data orchestration tool using the Dagster framework, AWS, Docker, and large language
models, significantly increasing workflow efficiency by automating the geocoding and unit group classification
processes.
Built a Streamlit app in Python to analyze and visualize U.S. Census permit data for identifying misaligned jurisdictions
and delayed reporting.
Data Science Intern Jun 2023 - Aug 2023
Constellation Energy Remote, PA
Redesigned and optimized an intern tool and its corresponding database for a customer-centric application serving
200,000 users by utilizing C# (C sharp), .NET, SQL and Agile design practices, resulting in a 44% reduction in database
size.
Identified performance and run-time inefficiencies in API endpoints by developing KQL queries and building an
end-to-end performance monitoring PowerBI dashboard, boosting application performance.
Designed and implemented ETL pipelines in Azure Data Factory to automate data ingestion, transformation, and
loading, resulting in the transfer of 128 entities.
Data Scientist Jan 2023 - Aug 2023
Penn State Advanced Vehicles Team State College, PA
Led Radar integration in a level 4 autonomous Chevy Bolt for a project partnered with Intel and General Motors.
Implemented a Python script in Docker to streamline the data collection and calibration processes for LiDAR and Radar
systems, resulting in accurate live outputs.
Enabled seamless integration with a YOLO machine learning model to detect pedestrians and objects from up to 175 m.
Data Scientist Mar 2022 - Apr 2022
Engineering Leadership Society Volunteer State College, PA
Collaborated with KCF Tech to collect survey data from major automotive companies regarding Industry 4.0 practices,
developed visualizations of the survey data using Tableau, and presented the findings to company executives.
EDUCATION
Pennsylvania State University University Park, PA
Bachelor's in Computational Data Science, Minor: Mathematics, Statistics
PROJECTS
Multivariate Property Sale Price Forecasting | Python, TensorFlow, sklearn, pandas, seaborn
Developed a machine learning pipeline to identify key features contributing to property tax value and performed time-series
analysis to predict sale price.
Population Density Analysis on High Income Metropolitan Regions in USA | R, ggplot
Used data cleaning, data aggregation, and regression techniques to identify statistically significant metrics of income
disparities in major metropolitan regions in California.
SKILLS
Languages: Python, R, Java, SQL, JavaScript, HTML/CSS, Scala, SAS
Frameworks: Dagster, AWS, Azure, Hadoop, Spark, Tableau, PowerBI, Azure DevOps
Developer Tools: Dagster, LocalStack/AWS, Git, Docker, Linux, Azure, Android Studio, VS Code
Libraries: NumPy, Sklearn, Pandas, Matplotlib, XGBoost, Dplyr, Tidyverse, Ggplot2, Pyplot, Seaborn
Techniques: Large Language Models (LLM), Data Orchestration, Agile, Artificial Intelligence (AI), ETL, Machine Learning
Pipelining, Statistical Modeling, Machine Learning, Data Visualizations, Data Analysis, Data Cleaning,
Object-Oriented Programming, CI/CD, DevOps, ANOVA, Clustering, Regression, Time Series, Hypothesis Testing, A/B
Testing, Logistic Regression, CART, Random Forests, SVM, Neural Networks