I'm a dynamic Research Consultant driven by results, armed with a BS in Mathematics, an MA in Teaching, and an ongoing pursuit of an MS in Analytics at Georgia Tech. My teaching background exemplifies my fervor for mathematics, showcasing my adaptability in any environment. Now, I'm poised to channel these talents into the realm of data analytics, continuously augmenting my technical expertise.
My solid grounding in mathematics is complemented by adept problem-solving and communication abilities, enabling me to distill intricate concepts for stakeholders. Explore this platform for a glimpse into my professional journey and a curated collection of my projects outside of my work responsibilities.
Project description:
This is a capstone analysis project for the Google Data Analytics Course on Coursera. The following document details the analysis of Fitbit data using Excel to make business recommendations for a new and growing fitness technology company called Bellabeat. The Excel sheet and accompanying dashboard are available as well.
Bellabeat is a high-tech manufacturer of health-focused products for women. They are small but have potential to become a major player in the global smart device market. This analysis is designed to discover trends that will unlock growth opportunities for the company. Bellabeat offers wearable smart devices as a watch, water bottle, and bracelet, necklace, or clip that track activity, sleep, stress, and water intake. All devices connect to their Bellabeat app which provides users with the data collected, menstrual cycle, and recommendations based on user data.
This analysis will be done in Excel and focus on how trends and connections in the activity and wellness data can help advertise to consumer needs and possibly retool their app product to better meet consumer demands.
Identify trends in how consumers use non-Bellabeat smart devices and apply insights to Bellabeat's marketing strategy. Stakeholders
The data source used is Fitbit Fitness Tracker data. This dataset is stored in Kaggle and made available through Mobius.
The data is split up into 18 CSV documents. Each document either holds a different type of data collected or it holds data collected using different time frames. For example, there is a Daily Calories, Hourly Calories, and Minute Calories. Files being used are:
Data File Name | Description |
---|---|
dailyActivity_merged | CSV - Daily Activity over 31 days of 33 users. Tracking daily: Steps, Distance, Intensities, Calories |
dailyCalories_merged | CSV - Daily Calories over 31 days of 33 users |
dailyIntensities_merged | CSV - Daily Intensity over 31 days of 33 users. Measured in Minutes and Distance, dividing groups in 4 categories: Sedentary, Lightly Active, Fairly Active, Very Active |
dailySteps_merged | CSV - Daily Steps over 31 days of 33 users |
heartrate_seconds_merged | CSV - Exact day and time heartrate logs for just 7 users |
hourlyCalories_merged | CSV - Hourly Calories burned over 31 days of 33 users |
hourlyIntensities_merged | CSV - Hourly total and average intensity over 31 days of 33 users |
hourlySteps_merged | CSV - Hourly Steps over 31 days of 33 users |
minuteCaloriesNarrow_merged | CSV - Calories burned every minute over 31 days of 33 users (Every minute in single row) |
minuteCaloriesWide_merged | CSV - Calories burned every minute over 31 days of 33 users (Every minute in single column) |
minuteIntensitiesNarrow_merged | CSV - Intensity counted by minute over 31 days of 33 users (Every minute in single row) |
minuteIntensitiesWide_merged | CSV - Intensity counted by minute over 31 days of 33 users (Every minute in single column) |
minuteMETsNarrow_merged | CSV - Ratio of the energy you are using in a physical activity compared to the energy you would use at rest. Counted in minutes |
minuteSleep_merged | CSV - Log Sleep by Minute for 24 users over 31 days. Value column not specified |
minuteStepsNarrow_merged | CSV - Steps tracked every minute over 31 days of 33 users (Every minute in single row) |
minuteStepsWide_merged | CSV - Steps tracked every minute over 31 days of 33 users (Every minute in single row) |
sleepDay_merged | CSV - Daily sleep logs, tracked by: Total count of sleeps a day, Total minutes, Total Time in Bed |
weightLogInfo_merged | CSV - Weight track by day in Kg and Pounds over 30 days. Calculation of BMI.5 users report weight manually 3 users not. In total there are 8 users |
This data is open source as confirmed by the metadata. The work has been designated to the public domain by the owner through waiving all rights to the work under copyright law, including all relating neighboring rights, to the extent allowed by law. The data and work can be copied, modified, distributed, and used for commercial purposed without asking permission.
These datasets were generated by respondents to a survey distributed by Amazon Mechanical Turk between 03/12/2016 - 05/12/2016. Thirty eligible Fitbit users consented to the submission of personal tracker data. Variation in user data represents difference in devices or usage preferences.
The dataset is limited by its small size (30 users) and could be subject to sampling bias for the same reason. There is no certainty that this dataset is representative of the whole population. At the time of this analysis, the data (2016) is not current (2022) and habits or trends could have changed. The data’s collection period is also very short which does not allow for a complete analysis due to differences in behaviors throughout the year.
Dates and timestamps are inconsistent across datasets. All date and timestamp records will be separate date and timestamp into separate columns.
Check data for duplicates and remove any that exist.
Check data for Nulls and resolve any that exist.
DailyActivity table and SleepDay table were joined using Power Query using an Inner Join. This was done to connect sleep data to activity data and find connections between activity levels and amount of sleep.
As an overview analysis of general activity levels, we will look at user daily steps to estimate their lifestyle activity level. This can be classified using information from https://www.10000steps.org.au/articles/healthy-lifestyles/counting-steps/ .
Most people are either Sedentary (25%) or fall into the range of being moderately active (53%). Device reminders to get up and welk, or even schedule walks using the device, would be beneficial to increase basic activity levels.
Could provide data amongst all users in certain age brackets on their percentile score on steps/day. Example: 27 years old and 9,000 steps/day could fall at the 67th percentile. Not bad, but not great and could improve. This would incorporate a level of competitiveness if desired by the user.
Using data of the number of steps taken per hour throughout the day, a pivot table was created to allow aggregation of steps by hour. Below is the chart created.
There is a large dip in steps between 2pm – 4pm. Possible reasons:
Using Power Query, I combined the DailyActivity and SleepDay tables with an Inner Join.
The relationship between Steps and Sleep is shown graphically below. This graph with the included correlation coefficient shows that there is not enough evidence to show a connection between Time Asleep and Daily Steps taken.
There is a connection, albeit moderate, between the amount of sleep users are getting and the amount of the day they are sedentary. Less sleep correlates with more sedentary time.
This unhealthy trend could be detected by the user’s device and account. Reminders could be sent to the user on how to fix these issues and reasons why it is important to move and sleep more.
A similar analysis could be done for users on a rolling average basis. This would show users how trends in their sleep effects trends in their activity which in turn effects their overall health. The app could offer sleep reminders along with resources to either help fall asleep or track types of sleep under premium subscriptions similar to other popular sleep apps.
When looking for a connection between sleep and amount of activity at each level, almost no correlation was found. The exception is Sleep and Sedentary Time. This can be seen in the graph and r-value of -0.6011. While not a very strong correlation, it is enough to suggest a relationship. This finding can be used for app recommendations or reminders when users are either sedentary too often or not sleeping enough.
Users tend to wear their devices around 60% - 70% of the day while around 66%-80% of the time worn is spent being sedentary. Within the dataset, users either wore their device for the majority of days.
Users should try to be active for a larger portion of their day. Studies show the health benefits of staying active.
Light Activity Benefits
Staying Mobile
General Activity and Exercise
Bellabeat’s mission is to support women to live healthier lives and take control of their health data. The data collected and used is small in scale so further collection and analysis would need to be done for more concrete results.
Users would benefit from these analyses being reproduced on their personalized data as the main target is young and adult women. With larger amounts of data and more specific data being collected on the target audience, new and unique insights could be found for a more specific and targeted marketing strategy. Below are the recommendations for the Bellabeat app.
Recommendation | Description |
1. Daily Notifications and reminders on app and wearable devices to move and walk more frequently | Users were classified into 5 categories based on daily steps. Analysis showed 79% of users do not walk enough to get general health benefits from walking. Based on previous data (when user gets most steps in per day) the app would send recommendations and notifications when users are not on track to meet daily step goal. |
2. Sleep monitor with data analysis | Data showed a near strong negative correlation (-0.60107) between time asleep and time spent sedentary. This shows more time asleep means less time sedentary which has health benefits. Users could set sleep schedules and receive reminders or instructions on how to better their sleep (meditation, breathing techniques, device shut off reminders). Some of these sleep features could be wrapped into a premium subscription. |
3. Incentives to wear device and stay active | One third of users only wore their device between one and six days. Without data it would be hard to provide recommendations for the user’s health. In app incentives, points, could be given for every day the device was worn for 75% of the day or more. Points could also be given for achieving step goals or ranking on an overall or age specific leaderboard for step count. These points could all be redeemed for Bellabeat merchandise or products that would help in brand advertising. |
4. Device “wearability” | Users wore the device for an average of 67.5% of the day and only 29% of users wore their device more than 83% of the days recorded. The design of the device should be continuously improved to make sure the battery is long lasting, device can be worn for all occasions, water/damage resistant, and comfort. |