pdxrlang meetup: Probabilistic Approaches to Multi-dimensional Fuzzy Joins: A GeoSpatial Example

1120 NW Couch St., Suite 320
Portland, Oregon 97209, US
Public WiFi

Access Notes

If it's around regular business hours, you have to check in with the front counter on your way up.

Doors open after 6 pm. DO NOT SHOW UP BEFORE 6 PM. Talks start at 6:30 pm. Repeat: DO NOT SHOW UP BEFORE 6 PM. The doors at the bottom will be open; take the elevator to the 3rd floor, where the door to Suite 320 should be open.



Speaker: De'Mel Mojica

Abstract: This talk will cover a general approach to automatically joining large-scale geospatial data across distinct data sets, using a mix of Levenshtein Distance thresholds and Haversine Distance thresholds. This approach permits joining multiple data sets without the need to provide ad hoc normalization conventions for each data resource. In addition, the approach can be generalized beyond the geospatial field and applied to any domain that requires joining across two or more non-identical dimensions.
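
For anyone who wants a feel for the idea before the talk, here is a minimal sketch in Python of a two-threshold fuzzy join. It is not the speaker's implementation. The record layout (dicts with "name", "lat", "lon"), the function names, and the default thresholds (max_edits, max_km) are all assumptions made for illustration.

```python
import math


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (insert, delete, substitute)."""
    if len(a) < len(b):
        a, b = b, a
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometers, assuming a spherical Earth."""
    radius_km = 6371.0  # mean Earth radius
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    h = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(h))


def fuzzy_join(left, right, max_edits=3, max_km=0.5):
    """Pair records that are close in BOTH name and location.

    The thresholds are hypothetical defaults, not values from the talk.
    """
    matches = []
    for l in left:
        for r in right:
            # Cheap numeric check first, then the string comparison.
            if haversine_km(l["lat"], l["lon"], r["lat"], r["lon"]) > max_km:
                continue
            if levenshtein(l["name"].lower(), r["name"].lower()) <= max_edits:
                matches.append((l["name"], r["name"]))
    return matches
```

A pair is kept only when it passes both thresholds, which is what lets differently formatted names match without per-source normalization rules. The nested loop compares every pair, so it is quadratic in the number of records; a large-scale version would need some way of narrowing the candidate pairs first.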

We'll visit a local watering hole afterwards.