Efficient Exploratory Data Analysis with Spatio-temporal Aggregation over Polygonal Regions

dc.contributor.advisorRay, Suprio
dc.contributor.authorHiggins, Catherine
dc.date.accessioned2023-09-05T17:47:20Z
dc.date.available2023-09-05T17:47:20Z
dc.date.issued2022-04
dc.description.abstractStatistical analysis is at the heart of data science work-flows. With the rapid rise in spatio-temporal data volume, and popularity of Web and mobile mapping applications, exploratory data analysis with spatio-temporal data is becoming important. Such exploratory analysis often involves the user selecting an arbitrary polygon region to perform a statistical computation on the selected region. Existing approaches for spatio-temporal data aggregation support rectangular query regions only, and not arbitrary polygons. A recently proposed system called GeoBlocks supports polygonal queries, but GeoBlocks was designed for spatial data, not spatio-temporal data. Another aspect of exploratory data analysis is that the users often repeatedly perform similar statistical analyses over the same selected query region. Although the reuse of already computed answers can improve the response time, existing approaches do not support this reuse for statistical analysis. A recently proposed system called Data Canopy supports statistics synthesis by reusing basic aggregates, but Data Canopy does not support spatial or spatio-temporal analysis. To address the mentioned challenges, we introduce ScanCube, an exploratory statistical analysis system over any arbitrary polygonal query region for any time interval. ScanCube also supports statistics synthesis by reusing a small set of basic aggregates that are computed and stored a priori. We introduce two new techniques, ScanX1 and ScanX2, for providing a grid-based polygonal approximation, which offers distance-based bounded error. Experimental evaluation suggests that ScanCube significantly outperforms GeoBlocks and other existing approaches.
dc.description.copyright© Catherine Higgins, 2022
dc.format.extentxiv, 76
dc.format.mediumelectronic
dc.identifier.oclc(OCoLC)1417475628en
dc.identifier.otherThesis 11121en
dc.identifier.urihttps://unbscholar.lib.unb.ca/handle/1882/37334
dc.language.isoen
dc.publisherUniversity of New Brunswick
dc.rightshttp://purl.org/coar/access_right/c_abf2
dc.subject.disciplineComputer Science
dc.subject.lcshStatistics.en
dc.subject.lcshQuantitative research.en
dc.subject.lcshSet theory.en
dc.titleEfficient Exploratory Data Analysis with Spatio-temporal Aggregation over Polygonal Regions
dc.typemaster thesis
oaire.license.conditionother
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of New Brunswick
thesis.degree.levelmasters
thesis.degree.nameM.C.S.

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Catherine Higgins - Updated Thesis.pdf
Size:
2.55 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.13 KB
Format:
Item-specific license agreed upon to submission
Description: