Viewing 0 current events matching “data engineering” by Date.

No events were found.

Viewing 4 past events matching “data engineering” by Date.

Thursday
Jun 19, 2014
Soren Macbeth: Data Processing and Machine Learning with Clojure
Puppet

Soren Macbeth, chief data scientist at Yieldbot, will discuss machine learning techniques using Cascalog, Storm, and Spark.

The session will begin with one or more lightning talks at 6:45pm followed by Soren's presentation at 7:00pm.

Website
Monday
Mar 2, 2015
PDX Data Engineering: Apache Spark
Walmart Labs

For our first meeting, let's get together and talk about using Spark in production. I can give a quick overview if necessary, and then we'll open it up for discussion.

We can also talk about plans for the group and what kinds of topics and discussion you'd like to see in the future.

Website
Monday
May 4, 2015
Intro to Cassandra
eBay Community Lounge

This meetup is a joint event with the DataStax Cassandra Portland Users group.

Archaic database technologies just don't scale under the always-on, distributed demands of modern IoT, mobile, and web applications. We'll start this Intro to Cassandra by discussing how its approach is different and why so many awesome companies have migrated from the cold clutches of the relational world into the warm embrace of peer-to-peer architecture. After this high-level opening discussion, we'll briefly unpack the following:

  • Cassandra's internal architecture and distribution model
  • Cassandra's Data Model
  • Reads and Writes

You'll go home with enough Cassandra basics to get your feet wet and some resources to get your hands dirty.
Website
Thursday
Mar 1, 2018
Datalog, Data Transformation, and Apache Beam w/ Austin Haas
Puppet

Datalog is a powerful recursive query language that can be implemented in a few hundred lines of Clojure. It's a great introduction to logic programming and it can be used to solve real problems. It is also the basis for Datomic's query language.

Apache Beam is a data processing library whose pipelines can run on Google Cloud Dataflow, a serverless distributed computing service.

In this presentation, I will explain Datalog, how it is used in Datomic, and how my team is using Datalog (in Clojure) to build flexible programs that transform large amounts of interdependent data in minutes with Apache Beam.

Bio: Austin is a senior engineer at Healthsparq, where he uses Clojure to transform healthcare data. He has been programming in Clojure for about 7 years. His interests include Logic Programming, Automated Planning, and Game Development.

Website