Repository logo
 

Application of data analytics and machine learning on data collected by smartphones to understand human behavioural patterns

dc.contributor.advisorOsgood, Nathaniel D
dc.contributor.committeeMemberMcQuillan, Ian
dc.contributor.committeeMemberNeufeld, Eric
dc.contributor.committeeMemberRamsden, Vivian
dc.creatorThomas Vattukalathil, Teena 1990-
dc.creator.orcid0000-0001-7330-154X
dc.date.accessioned2019-12-16T22:11:47Z
dc.date.available2019-12-16T22:11:47Z
dc.date.created2019-12
dc.date.issued2019-12-16
dc.date.submittedDecember 2019
dc.date.updated2019-12-16T22:11:47Z
dc.description.abstractA growing number of health studies seek to leverage smartphone-based recording to continuously monitor consenting participants’ health behaviours, including those related to mental health, mobility, and activity. So as to better understand health risks and the influence of the environment on human physical and mental health conditions, such studies commonly use smartphones to collect health behaviour relevant metrics such as screen state, app usage, location, activity level, browsing behaviour, etc. They also typically use survey instruments incorporating questionnaires, voice recordings, photos, multi-media content on which the user is asked to provide feedback, etc. When the data volume and variety grow substantially --- such as is common with sensed data --- then challenges associated with data quantity, quality, diversity, trustworthiness, etc. also increase significantly. Because most health scientists are unfamiliar with tools and concepts required for effective analysis of such high-volume and high-velocity data, it is challenging for health scientists alone to perform the computationally intensive analyses needed to secure certain types of insight from the collected data. The primary objective of this thesis is to provide computational mechanisms to support research teams associated with 3 distinct case studies utilizing smartphone-based data, so as to help obtain insights accessible to team health scientists. The data sets for these three studies were collected from participants using a pre-existing smartphone based application named Ethica. Such data was accumulated over a period ranging from 2 weeks to 6 months – with the study period differing across the three studies – through a set of surveys and mobile sensors such as those for the battery, screen state, GPS, etc. This thesis addresses three significant challenges associated with the extraction and processing of smartphone data. The first is the computational burden and intricacies associated with data extraction, preprocessing and analytic steps. The second consists of a need for handling omitted and missing data points with the help of machine learning and statistical methods. The final challenge covered here is to secure valuable findings from these data sets through exploratory analysis following examination of participant adherence patterns and evaluation of the quantity and quality of the data collected. The methods applied in this thesis are useful for other studies using the Ethica platform because of the shared structure of Ethica datasets and the capacity of the code to be reused and readily adapted for other such datasets.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/10388/12488
dc.subjectMachine learning
dc.subjectData analysis
dc.subjectbig data
dc.subjectsimulation modelling
dc.subjectcomputational statistics
dc.subjectbehavioural patterns
dc.subjectsmartphone data
dc.subjectdata analytics
dc.subjectstatistical analysis
dc.subjectexploratory analysis
dc.subjectquantitative analysis
dc.subjecthealthdata
dc.subjectScala and Spark
dc.subjecthuman behavior
dc.subjecthealthscientists
dc.subjectcomputerscience research
dc.titleApplication of data analytics and machine learning on data collected by smartphones to understand human behavioural patterns
dc.typeThesis
dc.type.materialtext
thesis.degree.departmentComputer Science
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Saskatchewan
thesis.degree.levelMasters
thesis.degree.nameMaster of Science (M.Sc.)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
THOMASVATTUKALATHIL-THESIS-2019.pdf
Size:
2.6 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.28 KB
Format:
Plain Text
Description: