Monday, June 20, 2016 at 10:55am.
pdxrlang meetup: Probabilistic Approaches to Multi-dimensional Fuzzy Joins: A GeoSpatial Example
Access Notes
You'll need to check in with the Mozilla office front desk, on the third floor. The elevators lock at 6 pm, but when there's an evening event scheduled, they should stay open until 7 pm.
Doors open after 6 pm. DO NOT SHOW UP BEFORE 6 PM. Talks start at 6:30 pm. Repeat: DO NOT SHOW UP BEFORE 6 PM. The doors at the bottom will be open; take the elevator to the 3rd floor, where the door to suite 320 should be open.
Description
Speaker: De'Mel Mojica
Abstract: This talk presents a general approach to automatically joining large-scale geospatial data across distinct data sets, using a mix of Levenshtein distance thresholds and Haversine distance thresholds. This approach permits joining multiple data sets without the need to provide ad hoc normalization conventions for each data resource. In addition, the approach can be generalized beyond the geospatial field and applied to any domain that requires joining across two or more non-identical dimensions.
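For readers who want a feel for the idea before the talk, here is a minimal sketch of a two-threshold fuzzy join. It is not the speaker's implementation: the function names, the record layout (name, lat, lon), the threshold values (max_edits, max_km), and the sample records are all assumptions made up for illustration. It is written in Python rather than R, compares every pair of records naively, and uses hard thresholds only, so it does not cover whatever probabilistic scoring the talk describes.

```python
import math


def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum inserts, deletes, and substitutions to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # delete from a
                               current[j - 1] + 1,       # insert into a
                               previous[j - 1] + cost))  # substitute
        previous = current
    return previous[-1]


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometres, assuming a spherical Earth of radius 6371 km."""
    radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    h = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(h))


def fuzzy_join(left, right, max_edits=2, max_km=0.5):
    """Pair records whose names AND locations are both within the given thresholds.

    The thresholds are hypothetical defaults, not values from the talk.
    """
    matches = []
    for l in left:
        for r in right:
            edits = levenshtein(l["name"].lower(), r["name"].lower())
            km = haversine_km(l["lat"], l["lon"], r["lat"], r["lon"])
            if edits <= max_edits and km <= max_km:
                matches.append((l["name"], r["name"], edits, km))
    return matches


# Made-up sample records, for illustration only.
left = [{"name": "Joe's Diner", "lat": 45.5200, "lon": -122.6800}]
right = [
    {"name": "Joes Diner", "lat": 45.5201, "lon": -122.6801},   # near, slightly different spelling
    {"name": "Joe's Diner", "lat": 47.6000, "lon": -122.3000},  # same spelling, far away
]

matches = fuzzy_join(left, right)
for left_name, right_name, edits, km in matches:
    print(left_name, "<->", right_name, "| edits:", edits, "| km:", round(km, 3))
```

Requiring both thresholds to pass is what lets the join tolerate spelling differences without pairing identically named places in different cities. On data of any real size, the all-pairs loop would need some form of spatial or string blocking first.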
We'll visit a local watering hole afterwards.