Viewing 0 current events matching “cascading” by Date.

Sort By: Date Event Name, Location , Default
No events were found.

Viewing 2 past events matching “cascading” by Date.

Sort By: Date Event Name, Location , Default
Tuesday
Jun 21, 2011
Portland Java User Group: Lean Mobile Data and Open Source: Storage, Messaging and Analysis
Oracle (Downtown Campus)

This month's topic: Lean Mobile Data and Open Source: Storage, Messaging and Analysis

This talk will provide a overview of Urban Airship's core data warehouse architecture - a system designed to handle capture, intake and analysis of data for 100s of millions of mobile devices with near real time precision. The talk will touch on Urban Airship's use of HBase, Hadoop Core, ZooKeeper, Kafka as well as home-grown services. Time permitting, the talk will also cover how Urban Airship takes a lean approach to working with volumes of data including the use of ad-hoc tools such as Pig and Cascading as well as how the company leverages the data architecture for fast customer discovery and innovations.


Speaker: Erik Onnen

Erik Onnen is the Hadoop and Analytics Lead at Urban Airship, the Portland-based leader in mobile application engagement services. He has over 10 years in distributed systems experience including the design and implementation of multiple "big data" systems. Erik joined Urban Airship in October of 2010, prior to that he was a Principal Engineer at Jive Software.


PJUG meetings start with some time to eat and socialize (pizza and beverages are provided), followed by the featured speaker, then Q&A, discussion, sometimes a drawing to give away swag. :)

Though we like knowing how many people to expect, you don't have to RSVP, on Upcoming or otherwise. Go ahead and just show up!

Many people also go for a drink and further discussion following the meeting, at a location determined ad hoc (lately, Trees restaurant in the same building).

http://twitter.com/pjug http://pjug.org/ (join our mailing list, linked from the website!)

Website
Thursday
Jul 25, 2013
Enterprise Data Workflows with Cascading - Hadoop
Widmer Brothers Gasthaus

Cascading is an open source workflow abstraction atop Hadoop and other Big Data frameworks, with a 5+ year history of large-scale Enterprise deployments. For example, half of Twitter's total compute uses this API, along with other large use cases at eBay, Etsy, Airbnb, LinkedIn, Apple, Climate, Nokia, Factual, Telefonica, etc. Cascading leverages some aspects of functional programming so that developers can create large-scale data pipelines which are robust and easier to operationalize. There are popular DSLs in Scala (Scalding) and Clojure (Cascalog), plus Jython, JRuby, etc. Recent support also implements DSLs for ANSI SQL (Lingual) and PMML (Pattern).

This talk will describe the technology and some of the large use cases for Cascading, plus show sample apps in Scalding, Cascalog, and go into examples of ANSI SQL with Lingual and PMML with Pattern. We'll also cover material about Mesos, a cluster scheduler akin to Google's "Borg" which is used at scale by Twitter, Airbnb, Box, etc.

Doors are open at 6:15, Talk starts at 7pm. Light snacks, beverages and brews will be available and will have a social hour following the talk.

Please RSVP via Eventbrite if you plan to attend: http://pdx-hadoop.eventbrite.com/

Talk will be led by Paco Nathan.

Paco Nathan is Chief Scientist at Mesosphere, a committer on Cascading, and an O'Reilly author for "Enterprise Data Workflows with Cascading". He is a recognized expert in Hadoop, R, cloud computing, distributed systems, machine learning, and predictive analytics, with 25+ years experience in the tech industry ranging from Bell Labs to early-stage start-ups. For the past 10+ years, Paco has led innovative Data teams building large-scale apps.

http://liber118.com/pxn/ @pacoid

Organized/Sponsored By: Aaron Betik, NIKE, Inc Global Technology Director, Consumer and Digital Analytics & BI

Website