The Journey from ER Modeler to Data Scientist + The Basics of Text Analytics

200 SW Market Street
Portland, Oregon 97201, US

Building Conference Room (aka Propeller Room). At the top of the escalator, through the revolving doors, and immediately to the left.



We will be talking about what data science is and isn’t, with particular focus on the skills that a data scientist is expected to bring to bear on a problem. We will discuss the educational and skill expectations of a data scientist and how those relate to the skills we’ve developed as ER Modelers.

Going by job descriptions, data science can easily read like “jack of all trades, and master of them all too”. How do we break that down into something more reasonably achievable?

  • What is data science and what is a data scientist? What do they do?
  • Do ER Modelers make good Data Scientists?
  • Are the skill sets complementary?

And then we’ll look at some of the basics of text analytics, that is, extracting information from unstructured text. This won’t be nearly enough detail to make anybody a practitioner, but we will see several of the basic techniques at work and develop a functional mental model of how they operate. By the time we’re done, you’ll have been exposed to concepts like:

  • Term-Document Matrix (TDM)
  • Tokenizing
  • Stopword removal (Stopping)
  • Stemming
  • And as a bonus – we’ll see some Python code for extracting noun phrases
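
For readers who want a feel for these terms before the talk, here is a minimal sketch in Python of how tokenizing, stopword removal, stemming, and a term-document matrix fit together. It is not the speaker's code. The stopword list, the `naive_stem` suffix stripper (a crude stand-in for a real stemmer such as Porter's), and the two sample documents are all assumptions made up for illustration.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; real lists are much longer).
STOPWORDS = {"the", "a", "an", "is", "are", "of", "and", "to", "in"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def remove_stopwords(tokens):
    """Drop very common words that carry little meaning on their own."""
    return [t for t in tokens if t not in STOPWORDS]


def naive_stem(token):
    """Crude suffix stripping; a hypothetical stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "s"):
        # Only strip when at least three characters would remain.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Tokenize, remove stopwords, then stem each remaining token."""
    return [naive_stem(t) for t in remove_stopwords(tokenize(text))]


def term_document_matrix(docs):
    """Return (vocab, matrix) where matrix[i][j] counts term i in document j."""
    counts = [Counter(preprocess(d)) for d in docs]
    vocab = sorted(set().union(*counts))
    matrix = [[c[term] for c in counts] for term in vocab]
    return vocab, matrix


# Made-up sample documents, purely for illustration.
docs = [
    "The modeler is modeling the data",
    "Data scientists model data",
]
vocab, tdm = term_document_matrix(docs)
for term, row in zip(vocab, tdm):
    print(term, row)
```

Each row of the matrix is a term and each column a document, so "modeling" and "model" collapse into one row after stemming. Note that the naive stemmer leaves "modeler" as a separate term, which is the kind of gap a real stemming algorithm is designed to handle more carefully.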

Bring your questions around these topics and let’s get into them.

Our Speaker

Asoka Diggs is a Data Scientist with Intel Corp. in Hillsboro, OR. He has about 15 years of experience in a variety of data management disciplines, including database administration, ER modeling, ETL development, and data architecture. He recently transitioned to data analysis after completing the MS in Predictive Analytics degree at Northwestern University. It turns out that there IS something even more fun than ER modeling, and he now spends most of his work time practicing his analytic modeling skills and teaching others how to participate in and contribute to predictive analytics projects.


8:30 - 9:00 am - Sign In
9:00 - 10:15 am - Presentation
10:15 - 10:30 am - Break, Chapter Announcements
10:30 - 11:30 am - Presentation continued


Free for Members! See our corporate members at damapdx.org
$15 for Non-Members and $5 for Students with valid student ID
Covers speaker costs and refreshments

Upcoming DAMA PDX Metro Events

  • September: DAMA Day 2016
  • October: "Influencing Business Partners" with May Loomis
  • November: "The T-Shaped Data Professional – Achieving Data Management Goals by Other Means" with Alec Sharp (Clariteq Systems Consulting)
  • January 2017: "Advanced Data Visualization" with Amit P. Manghani

About DAMA Portland

The Portland Metro Chapter of the Data Administration Management Association has been serving the Portland data community since 1984. We are a not-for-profit, vendor-independent professional association dedicated to advancing the concepts and practices of enterprise information and data resource management.

The DAMA Portland Chapter is dedicated to delivering thought-provoking, data-centric presentations that will make you more successful in your job. Our primary purpose is to promote the understanding, development, and practice of managing data, information, and knowledge resources as key enterprise assets.