Presented by:

Helge Reikeras

Offerzen

In this talk I'll discuss two frequently held misconceptions about data science: (a) that data science requires data to be duplicated in a dedicated big data system like Hadoop, and (b) that you need a PhD in Computer Science or Mathematics to be a data scientist.

To this end, I'll show how data science can be performed on existing database infrastructure, using Postgres and Apache MADlib, without the additional complexity and overhead of maintaining separate big data infrastructure. I'll also cover how developers can get started with data science in their own work.

Date:
2018 October 9 13:00
Duration:
40 min
Room:
Baobab
Conference:
South Africa 2018
Language:
English
Track:
Development
Difficulty:
Medium