Candidate's Name
Boston, MA, Street Address PHONE NUMBER AVAILABLE EMAIL AVAILABLE LINKEDIN LINK AVAILABLE github.com/Candidate's Name anshul

Education

Northeastern University - Master of Science, Information Systems 3.Street Address Sep 2022 - May 2024
Courses: Database Management and Database Design, Design Data Architecture and Business Intelligence, Application Engineering and Development, Data Science Engineering Methods, Applied Machine Learning with Python in Finance

JSS Academy of Technical Education - Bachelor of Technology Aug Street Address - Jun 2018
Courses: Programming in C, Object Oriented Programming, Cyber Security

Work Experience

Graduate Teaching Assistant (Data Science), Northeastern University, Jan 2024 - May 2024
Conducted instructional sessions on Python and on supervised machine learning models such as Linear Regression, Logistic Regression, Decision Trees, and Random Forest.
Provided personalized support to 50 students in a Data Science course, answering questions to help them understand complex machine learning concepts.
Collaborated with faculty to identify individual strengths and weaknesses, tailoring teaching strategies to diverse learning needs and styles.

Senior Systems Engineer, Infosys Ltd, Sep 2020 - Jun 2022
Designed and implemented robust ETL pipelines using SQL and Python, leveraging AWS S3 and CloudFormation Templates (CFT). Managed large-scale data ingestion, transformation, and storage, resulting in a 27% increase in data processing efficiency.
Developed and maintained CI/CD pipelines using Jenkins and Artifactory, automating the deployment of data applications in a DevOps environment. Implemented thorough testing practices to achieve 100% code coverage, ensuring robust and reliable applications.
Optimized relational databases (Oracle) for storing customer and marketing data. Focused on efficient schema design, indexing, and query optimization to ensure high-performance data operations.
Collaborated with marketing and sales teams to identify data requirements and deliver comprehensive end-to-end data solutions, resulting in a 59% increase in profit.
Created intuitive dashboards using Google Sheets and Tableau, providing non-technical stakeholders with accessible and actionable insights to drive business decisions.

Systems Engineer, Infosys Ltd, Sep 2018 - Aug 2020
Automated the extraction, transformation, and loading of BOM data using PL/SQL. Ensured data accuracy and consistency, reducing manual intervention by 73% and improving data reliability for manufacturing processes.
Designed and implemented RESTful APIs for accessing and manipulating Bill of Material and Raw Material data. Ensured comprehensive API documentation and SOPs, facilitating seamless integration and collaboration with cross-functional teams.
Scheduled jobs in Control-M to load data from various databases into a central database. Wrote shell scripts to automate the data loading process at different stages of the database, reducing manual work by 40 hours.

Key Projects

Tesla Stock Price Analysis
Summarized key metrics such as mean, standard deviation, skewness, and kurtosis to understand the distribution and volatility of stock prices.
Developed and optimized a Kalman Filter and Smoother algorithm to predict stock prices, improving prediction accuracy as evidenced by a reduced Root Mean Square Error (RMSE).
Created visualizations, including Kernel Density Estimate (KDE) plots, to compare the performance and volatility of stocks from Tesla and its competitors.

Risk Analytics in Banking and Financial Services
Leveraged Python and libraries such as Pandas, NumPy, and Matplotlib to conduct comprehensive financial analytics on banking.
Implemented data cleaning techniques to handle missing values, ensuring high-quality datasets for analysis.
Developed predictive models using XGBoost and Logistic Regression to assess loan defaulters. Utilized GridSearchCV for hyperparameter tuning to optimize model performance.
Employed Seaborn and Matplotlib to create intuitive plots, facilitating strategic decision-making.

New York Motor Vehicle Collision
Developed efficient ETL (Extract, Transform, Load) pipelines for over 50 million rows on the Azure platform, utilizing Talend and Alteryx in conjunction with Azure SQL Database.
Ensured data precision for reporting by employing profiling, modeling, and transformation techniques.
Achieved a 39% reduction in data processing time, resulting in the delivery of impactful insights through Power BI and Tableau dashboards.

Honors and Certifications

Received the Infosys Insta award (thinking out of the box) for developing a SharePoint automation for the team.
Volunteered at Startup Boston Week 2023 (#SBW2023), assisting attendees, speakers, and investors with the registration process at the front desk.

Technical Skills

Programming Languages: SQL, PL/SQL, Python, Java
ETL, Data Visualizations & Business Intelligence Tools: Talend, Alteryx, Power BI, Tableau, MS Excel, ER-Studio
Data Processing & Analysis: Pandas, NumPy, Scikit-learn
Database Management & Cloud Platforms: MySQL, Oracle, SQL Server, PostgreSQL, Control-M, HIVE, AWS S3, Azure
Scripting, CI/CD Tools & Version Control: Shell Scripting, Jenkins, Git, GitHub