Quantcast

Data Scientist Resume Washington, VA
Resumes | Register

Candidate Information
Name Available: Register for Free
Title Data Scientist
Target Location US-VA-Washington
Email Available with paid plan
Phone Available with paid plan
20,000+ Fresh Resumes Monthly
    View Phone Numbers
    Receive Resume E-mail Alerts
    Post Jobs Free
    Link your Free Jobs Page
    ... and much more

Register on Jobvertise Free

Search 2 million Resumes
Keywords:
City or Zip:
Related Resumes

Graduate Teaching Assistant Student Data Scientist Fairfax, VA

Data Engineer Scientist Chantilly, VA

Data Management Environmental Monitoring Herndon, VA

Data Science Machine Learning Herndon, VA

Data Analyst Supply Chain Gainesville, VA

Sr. Principal Data Analytics Consultant (Alteryx/Tableau Develop Gainesville, VA

Google Cloud Data Processing Centreville, VA

Click here or scroll down to respond to this candidate
Candidate's Name
AI Scientist/ ML EngineerContact: PHONE NUMBER AVAILABLE Email: EMAIL AVAILABLEProfessional SummaryA highly accomplished Sr. Data Scientist with over 12 years of IT experience and 8 years of expertise in Data Science, AI (Artificial Intelligence), data mining, deep learning, predictive analysis, and machine learning. Proven ability to manage the entire data science project life cycle, extract insights from massive datasets, and develop innovative solutions.Technical Skills:Deployment: CI/CD, workflows, automation, Model Lifetime Management. Model registry.Python Libraries and Tools: Flask, Django, Neo4j, MongoDB, Boto 3, NumPy, Pandas, SciPy, SciKit-learn, Matplotlib, Seaborn, Plotly, TensorFlow, Keras, NLTK, PyTorch, BeautifulSoup4, PySpark, SQLAlchemy, dplyr, ggplot2, reshape2, tidyr, pinecone, tableau.Machine Learning Techniques: Auto Encoders, Generative AI, Nave Bayes Classifiers, Gaussian Mixtures, Imbalanced Learning (SMOTE), K Means Clustering, Unsupervised Machine Learning Algorithms (K Nearest Neighbors), Deep Learning Artificial Neural Networks, Support Vector Machines, Supervised Machine Learning Algorithms (Logistic Regression, Hidden Markov Models, XGBoost), Decision Trees and Random Forests, Linear Regression.Time Series Analysis: ARIMA, Sentiment Analysis, Data Visualization, Hypothesis Testing, Multivariate Analysis, Behavioral Modeling, Statistical Analysis, Pattern Recognition, Predictive Analysis, Linear Regression, Stochastic Optimization, Data Mining, Classification, Forecasting, ANOVA.NLP (Natural Language Processing): Bag of Words, Word2Vec, Processing Document Tokenization, LDA, Token Embedding, Fast Text, TF/IDF, Bert, RoBerta.Programming Languages: Python 2, Python 3, R, SQL, Matlab, C++, java, PhpProfessional ExperienceGen AI Scientist at General MillsSaint-Paul Minnesota January 2023  PresentGeneral Mills has several brands including Pillsbury, Post and several others. As an AI Scientist I created a Knowledge management system that responds to nutrition related questions in a conversational way. This was done using RAG on OpenAI and Pinecone.Some tasks include:Designed and Implemented Gen AI strategy for General Mills Involving Interactive Chat Agents, Retrieval Augmented Generation of Product data and Image Generation.Utilized Universal Sentence Encoder and Bert Uncased 768 embeddings for clustering and encoding.Employed K-Means and DB Scan algorithms to cluster embeddings and identify topics by locating centroids in the clustered text.Utilized OpenAI API and ADA-002 Embeddings to vectorize text into 1536 element tensors.Leveraged Pytest, Unittest, Django Frameworks, and Python Virtual Environments for testing purposes.Employed NLP techniques such as Tokenization, Lemmatization and Removal of Stop Words for corpus management.Created a document processing pipeline and utilized Langchain for chunking and document preparation.Applied Syntactic NLP techniques such as Synonyms, Entities, and Phrase Syntax/Semantics analysis.Conducted Linguistic Paraphrase testing simulations.Developed Test Plan Designs/Test Cases for Phrase-Service Matching.Automated NLP Features API and implemented Customer Query Service Department Classifications and Text Request Multi-Class Service Department Classifications.Utilized NLP Manual (Annotation/Correction Language Synonyms/Entities Relations) for API Integrations.Innovated with Large Language Models (LLMs), specifically GPT-3.5 and LLAMA 2, exploring in-context learning capabilities for topic identification, specifically using the Retrieval-Augmented Generation (RAG) approach.Upload and inserted embedded information into Pinecone Vector DB.Used Postman for API testing, ensuring seamless integration and functionality.Sr. Data Scientist/ ML-Ops Engineer at OptumBaltimore, Maryland Apr 2020  Dec 2022Optum is a Health-Tech company with a network of over 5 millions practitioners, my role as a Senior ML Engineer, I spearheaded a pivotal AI/ML initiative. The purpose of the project was to predict and forecast hospital stay times and discharge rates using time series analysis. I was also responsible for the Model lifetime Management and ML-Ops pipelines of our organization.Some of the tasks included:Implemented Time Series Analysis Modeling utilizing Sarimax and FB Prophet as well as RNN models.Leveraged Relational Database Management Systems (RDBMS) for structured data storage and retrieval.Specifically, utilized TensorFlow for designing intricate deep learning models and employed Python packages (NumPy, Pandas, and Tensorflow) to address computer vision and NLP-based OCR challenges.Optimized and automated the data pipeline with Directed Acyclic Graphs (DAGs) on Apache Airflow for efficient workflow orchestration and scheduling.Managed CI/CD deployment through Jenkins and designed a robust data quality framework using the AWS Data Quality Rules Engine, ensuring accuracy and consistency across diverse data sources.Acted as a mentor to junior data engineers, offering guidance on best practices in data engineering.Senior Data Scientist at SalesforceAtlanta, GA Sep 2019  Mar 2020As a Senior Data Scientist at Salesforce, I lead a cross-functional team comprising data engineers, modelers, and ML-ops experts. Our focus is on deploying forecasting and Natural Language Processing (NLP) models to elevate the Customer Experience Department. I authored and tested Transformer-based and Statistical Models to assess client reactions to upgrades and support sessions. Additionally, I integrated forecasting models to predict peak demand points.Processed and prepared text data through normalization, tokenization, stemming, and lemmatization using NLTK in Python.Customized solutions coded in Python, utilizing TensorFlow, Keras, and NumPy libraries, and tested various embedders, including BERT, Word2Vec, GloVe, and others.Employed statistical classifiers, random forests, and logistic regressions for sentiment analysis, constructing an Artificial Neural Networking solution for natural language processing.Implemented a model using BERT for embedding and classification, fine-tuned for specific data.Developed processes and tools for monitoring and analyzing performance and data accuracy, enhancing data collection procedures for analytics system optimization.Collaborated with IT to continuously improve business performance and processed, cleansed, and verified data integrity from various sources.Advised the leadership team and stakeholders with data-driven solutions, recommending strategies to address business challenges.Prototyped foundational data pipelines and collaborated with the data engineering team to establish canonical sources of truth for Customer Experience metrics.Wrote test classes to ensure code coverage for Apex classes and triggers, developed reports, and utilized them in dashboards.Supported the design of data models, user interfaces, business logic, and security for custom applications.Designed REST-based APIs for effective deployment and integration of NLP models into existing front ends.Data Scientist and Data Analyst at Enphase EnergyPetaluma California Jan 2018  Aug 2019 (Remote)As a Data Scientist and Data Analyst at Enphase, I contributed to a project involving sensitivity analysis on a numerical model simulating a solar generation plant. I designed and implemented a neural network to replicate the physics of the process model, optimizing computational efficiency for the sensitivity analysis. Created complex forecasting models to predict power generation in different locations. Used Time Series Analysis to predict electrical demand.Devised specialized algorithms for the storage and comparison of vectorized features and verifications, demonstrating a tailored approach to data analysis.Implemented Convolutional Neural Networks (CNNs) using PyTorch and Python, showcasing proficiency in cutting-edge deep learning technologies.Conducted meticulous data cleaning on both images and tabular data, ensuring the quality and reliability of the dataset.Designed and implemented statistical evaluation techniques to assess the performance of the model, emphasizing a rigorous validation process.Deployed the developed model using Flask and pickle, showcasing practical implementation skills.Quantified uncertainties associated with the ANN predictions, providing a nuanced perspective on predictive reliability.Data Analyst at Jones Lang LaSalle (JLL)Baltimore, Maryland May 2016-Dec 2017Examined housing and real estate data from 6 different sources.Analysed property location data using KNN and K-Means clustering.Worked in Agile MethodologyPerformed Univariate and Statistical Analysis.Created imputation rules to deal with missing data.Presented findings to stakeholders and internal clients.IT Support at Cygnus Systems Inc.Southfied, Michigan Jun 2012-Apr 2016 (Remote)Provide technical support and troubleshooting for hardware, software, and network issues.Perform regular maintenance on IT systems, including updates and patches.Assist users with IT-related issues, both remotely and on-site.Manage and support network infrastructure, ensuring reliable and secure connectivity.Identify and resolve technical problems in a timely manner.Maintain documentation of IT systems, processes, and support activities.Implement and manage backup and recovery solutions to protect data.Ensure the security of IT systems through monitoring and applying security measures.EducationMaster of Science in Computer Application / Data ScienceBachelor of Science in Computer Science

Respond to this candidate
Your Message
Please type the code shown in the image:

Note: Responding to this resume will create an account on our partner site postjobfree.com
Register for Free on Jobvertise