Vatsal Shah

Vatsal Shah

Graduate Assistant | Aspiring Data Analyst | Data Scientist

Resume

About Me

Currently pursuing Masters in Information Technology and Analytics at Rutgers University. Eager to explore new data and analyze them, find out what is the story behind such data.

An aspiring Data Analyst with equal interest in the field of Data Science, Software Engineering and Web Development. Hobbies like Reading, Hiking, Outdoor Sports, Coding and many more.

My Projects

Statistical Analysis of Height and Weight

Performed Statistical Analysis data of 10,000 individuals using Microsoft Excel.
Performed Linear Regression analysis to determine the relation between the Height and Weight.
Determined the gender of the individual based on the values of Height and Weight with an accuracy of 92%.

Excel, Regression

Predicting the future prices of houses on Zillow

Performed predictive analysis on the Zillow Rent Index dataset of last 7 years from Kaggle in order to predict the future prices per square feet houses across all the states of United States of America.
Determined the most and the least expensive states of USA by using Exploratory Data Analysis(EDA).
Performed steps such as Data Cleaning, Exploratory Data Analysis(EDA) and Polynomial Regression to analyze the data and predict the price of the future with an accuracy of 90%.

Python, Machine Learning, NumPy, Pandas

Forecasting Morgan Stanley's future stock price

Performed time series analysis on the stock prices of Morgan Stanley for last 5 years (January 2013 - March 2018) using R.
Forecasted the future price for next 6 months based on the historical data by applying supervised learning logic with an accuracy of 80% and using ARIMA model.

R, Machine Learning

Machine Learning Algorithm for Intelligent Email Sorting

Developed a machine learning algorithm using K-Means Clustering to sort emails into respective categories using Google App Script.
Performed sorting to categorize the mails in respective categories by giving labels dynamically

Machine Learning, Sorting, K-Means, Clustering

Work Experience

Graduate Assistant - Rutgers University (February 2018 - May 2018)

Working as a graduate assistant in the CS department for course Massive Data Mining with major focus on Hadoop, K-Means, KNN and other Machine Learning concepts.
Also given the task of correcting and grading papers.

Data Analyst - Capgemini (June 2016 - August 2017)

Analyzed massive datasets on Oracle Cloud and performed ad-hoc analysis and data manipulation to fetch 100% accurate data on an “as needed basis”.
Developed reports using OTBI and BI reporting system through Oracle Cloud and Tableau to extract data for analysis using filter based analysis which allowed the user to have access to all the transactions at one place.
Designed data marts that were used as the source of analytical reporting through Oracle Cloud that improved the end-user response time by allowing user to have access to specific type of data they need to view the most.

SQL Developer - Capgemini (July 2015 - May 2016)

Developed custom BI Publisher Reports using SQL queries and worked with business analyst groups to ascertain their reporting needs helping them to get all the necessary and required information at one place.
Created Data Models and Dashboards using SQL Server Analysis helping the user have easy access to the KPIs. Performed data loads and routine updates on ongoing basis maintaining accuracy of the data of upto 90%.
Developed database application that allowed the user to fetch the correct and accurate data from a computerized database.