

This item was added directly to Calagator on Friday, July 20, 2012 at 10:01am.

Hadoop and Data Science Meetup

Wednesday, July 25, 2012 from 7–8:30pm

Cloudability

334 NW 11th Avenue

Portland, OR 97209, US (map)

Access Notes

Front door on 11th; venues may have access through large garage door on Flanders

Description

This month's meetup will start with a discussion of news items related to data science and big data, led by William Taylor.

Presentation this month by Temese Szalai.

Title: Asking Questions About Big Data: A Basic How-To For Framing Problems When Working With (Unstructured Text) Data At Scale

Summary: Data is only as valuable as the questions we ask about it. The questions to ask are those that yield valuable insights and quantifiable results, and whose answers lead to actionable information, i.e., help make a decision or meet the requirements of the people and systems consuming the analysis or output. Identifying good questions to ask and deciding how to proceed with very large data sets is at the very heart of being a data scientist.

When working with large data sets, especially ones that are unstructured or semi-structured text data, asking questions and getting started is not always easy. In fact, it's sometimes the hardest part. Drawing on her experiences working with text data at scale, Temese will talk about strategies and methodologies for approaching this kind of data when doing initial discovery and analysis. She'll also cover some basic tools and techniques that are available and basic best practices. Although unstructured text data is a focus, the talk should be general enough to apply to analyzing other kinds of data as well.

Speaker Bio: Temese Szalai has worked as an industrial computational linguist/taxonomist for 13 years. Presently, she is the founder of Madarka, which leverages semantic analysis of large unstructured corpora for psychographic consumer segmentation.
