Repository logo
 

Data Reduction and Deep-Learning Based Recovery for Geospatial Visualization and Satellite Imagery

dc.contributor.advisorMondal, Debajyoti
dc.contributor.committeeMemberStavness, Ian
dc.contributor.committeeMemberJin, Lingling
dc.contributor.committeeMemberLiu, Juxin
dc.creatorTasnim, Jarin
dc.date.accessioned2021-03-16T21:03:47Z
dc.date.available2021-03-16T21:03:47Z
dc.date.created2020-12
dc.date.issued2021-03-16
dc.date.submittedDecember 2020
dc.date.updated2021-03-16T21:03:47Z
dc.description.abstractThe storage, retrieval and distribution of data are some critical aspects of big data management. Data scientists and decision-makers often need to share large datasets and make decisions on archiving or deleting historical data to cope with resource constraints. As a consequence, there is an urgency of reducing the storage and transmission requirement. A potential approach to mitigate such problems is to reduce big datasets into smaller ones, which will not only lower storage requirements but also allow light load transfer over the network. The high dimensional data often exhibit high repetitiveness and paradigm across different dimensions. Carefully prepared data by removing redundancies, along with a machine learning model capable of reconstructing the whole dataset from its reduced version, can improve the storage scalability, data transfer, and speed up the overall data management pipeline. In this thesis, we explore some data reduction strategies for big datasets, while ensuring that the data can be transferred and used ubiquitously by all stakeholders, i.e., the entire dataset can be reconstructed with high quality whenever necessary. One of our data reduction strategies follows a straightforward uniform pattern, which guarantees a minimum of 75% data size reduction. We also propose a novel variance based reduction technique, which focuses on removing only redundant data and offers additional 1% to 2% deletion rate. We have adopted various traditional machine learning and deep learning approaches for high-quality reconstruction. We evaluated our pipelines with big geospatial data and satellite imageries. Among them, our deep learning approaches have performed very well both quantitatively and qualitatively with the capability of reconstructing high quality features. We also show how to leverage temporal data for better reconstruction. For uniform deletion, the reconstruction accuracy observed is as high as 98.75% on an average for spatial meteorological data (e.g., soil moisture and albedo), and 99.09% for satellite imagery. Pushing the deletion rate further by following variance based deletion method, the decrease in accuracy remains within 1% for spatial meteorological data and 7% for satellite imagery.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/10388/13285
dc.subjectData Reduction
dc.subjectReconstruction
dc.subjectDeep Learning
dc.subjectSRGAN
dc.subjectImage Inpainting
dc.subjectGeospatial Visualization
dc.titleData Reduction and Deep-Learning Based Recovery for Geospatial Visualization and Satellite Imagery
dc.typeThesis
dc.type.materialtext
thesis.degree.departmentComputer Science
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Saskatchewan
thesis.degree.levelMasters
thesis.degree.nameMaster of Science (M.Sc.)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TASNIM-THESIS-2020.pdf
Size:
58.44 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.27 KB
Format:
Plain Text
Description: