Silicon Valley Program

2018-10-15

08:00

Registration

presented by PostgresConf Organizers

Registration

08:00 - 09:00 Ballroom

08:30

Monday Morning Break

presented by PostgresConf Organizers

Monday morning break

08:30 - 08:50 Ballroom

09:00

Massively Parallel PostgreSQL For Analytics
Greenplum Database, Evolving Advanced Analytics on Postgres

presented by Tushar Pednekar

Greenplum brings massively parallel processing to the world of postgresql to enable analysis of Petabyte scale data using a standard PostgreSQL interface. This talk will explain how Greenplum enables massively parallel proecssing and how the future roadmap and integration with Kubernees will enable Greenplum to scale to new heights.

09:00 - 09:20 Ballroom Data

09:20

Update on Microsoft’s learning and progress on supporting PostgreSQL as a fully managed service on Azure

presented by Sunil Kamath

In this talk, Microsoft will provide an update on its own learnings supporting Postgres as a first class database on the Azure platform, the customer learnings from supporting Postgres over the last 1+ year, and discuss how these learnings is shaping the direction for our investments and innovations going forward.

09:20 - 09:40 Ballroom

09:40

What's new in Postgres 11

presented by Tom Kincaid

This is a keynote presentation as part of 2ndQuadrant's sponsorship. This presentation will cover the new and exciting features in PG11. Specific topics will be stored procedures, new Partitioning enhancements and JIT compilation. There will also be a quick look at some of the smaller features.

09:40 - 10:00 Ballroom Data

10:10

Helping to Solve World Hunger through AgensGraph

presented by Joshua Bae

There is a fundamental problem that AgensGraph DMBS solves for the World Food Program (WFP); Moreover, the solution is not just limited to the WFP, this solution can be universally applied. What we are hitting upon is the core problem of how we store data about humans when there is so much heterogeneity with how the data is collected while also preserving the relationships between variables. Th...

10:10 - 11:00 Ballroom Data

Best Practices for Running Postgres in Kubernetes
Operating a HA, performant Postgres Database on VMs or baremetal

presented by Jeff Silberman

Containers and Kubernetes are two big trends in DevOps. However, it is not always obvious to DevOps teams how to containerize their databases, and how to operate using container platforms like Kubernetes. In this talk, we will discuss the benefits of containerizing databases and present best practices for running PostgreSQL in containers using Kubernetes.

We will cover how to use Kubernetes...

10:10 - 11:00 Winchester 1 Ops

Data Integrity at Scale - lessons learned building Google Cloud SQL

presented by Alexis Guajardo and Brett Hesterberg

Keeping data safe is the top responsibility of anyone running a database. Learn how the Google Cloud SQL team protects against data loss. Cloud SQL is Google’s fully-managed database service that makes it easy to set up and maintain PostgreSQL databases in the cloud. In this session, we’ll dive into Cloud SQL’s storage architecture to learn how we check data down to the disk level. We will also...

10:10 - 11:00 Winchester 2 Data

I Didn't Know Postgres Could Do That!

presented by Robert Treat

PostgreSQL is often thought of as "The Database for DBAs", but what more and more developers are finding out is that what makes Postgres "The Worlds Most Advanced Open Source Database" is its wide array of features, many of which are really geared towards users who want to build applications on top of Postgres.

In this talk, we'll look at some of the features in Postgres you may not have see...

10:10 - 11:00 Market Dev

11:10

Improve Database Warehouse Performance 100x with a Database Proxy
Increase Amazon Redshift and Pivotal Greenplum Concurrency

presented by Lee Roland

Data Analytics/Warehouse solutions are often challenged with slow application response times caused by 1) High latency and 2) Frequent queries to the backend.

Heimdall Data is a Database Proxy that leverages Postgres for query optimization to improve the performance of Data Warehouse solutions (e.g. Amazon Redshift, Pivotal Greenplum). Techniques include batch processing and materialized vi...

11:10 - 12:00 Winchester 1 Ops

Postgres as a foundation for an operations data-science platform

presented by Maryam Sanglaji and Mark Chamness

Nutanix, an “Enterprise Cloud” software company collects an enormous amount of operational data in the customer’s cloud clusters, from storage traffic to compute performance to network patterns.

In addition to analyzing customer derived data, the data-science team also uses Postgres to report financials and internal operations data to every layer of management, from the C-suite to the produ...

11:10 - 11:30 Winchester 2 Data

How to run Stateful application on EC2 Spot Instances

presented by Amiram Shachar

The concept of data integrity and consistency is crucial when managing workloads. This aspect may be trivial when running with On-Demand instances, but it’s not so trivial while working with EC2 Spot Instances, which are conceptually ephemeral and can be revoked at any given moment. At Spotinst, we gave it some deep thought, on how you can leverage Spot, but still handle data concerns easily an...

11:10 - 11:30 Market Ops

11:40

Turning Data Teams Into Super Heroes

presented by Harry Glaser

The most innovative companies today are actually data companies masquerading as something else. The rise of the "data team" has made it possible to investigate questions that couldn't have even been asked just a few years ago. In this talk, Periscope Data CEO Harry Glaser will share his observations from working with more than 1,000 teams of data leaders, and explain why data leaders need a sea...

11:40 - 12:00 Winchester 2 Data

Condenser - A Simple PostgreSQL Subsetting Tool

presented by Adam Kamor and Andrew Colombi

Subsetting a database means to take a subset of its rows, across multiple tables, while still maintaining all of the constraints of the original database. Subsetting a database can be desirable for many reasons. One common use-case is to scale down a production database to a more reasonable size so that it can be used in staging, test, and development environments. This can be done to save cost...

11:40 - 12:00 Market Dev

12:00

Monday Lunch

presented by PostgresConf Organizers

Monday Lunch

12:00 - 13:00 Ballroom

13:00

Using an in-memory storage platform to bridge big-data systems

presented by Bin Fan

Today, the emerging cloud storages like Amazon S3, Azure’s Blob storage along with on-premises solutions like Swift, Minio or Ceph provide many enterprises with scalable, low-cost, fault-tolerant storage capacity. Founded at UC Berkeley’s AMPLab, Alluxio (formerly Tachyon) acts as a single source virtual data layer and enables applications to transparently access data using a consistent interf...

13:00 - 13:50 Winchester 2 Data

Understanding High availability options for PostgreSQL
ScaleGrid Postgres

presented by dharshan rangegowda and Madan Kumar K

Configuring High availability is critical for any production PostGres deployment. High availabilty goes beyond the setup of streaming replication. You need to understand the implications of process failures, node failures & network partitions on your cluster. In this talk we will present in detail three popular options to setup high availability using PostGres replication.

PostGres Automa...

13:00 - 13:50 Market Ops

Using TimescaleDB for time-series storage and analytics in Postgres
An introduction to TimescaleDB, a Postgres extension, and use cases

presented by Diana Hsieh, Matvey Arye, and Andrew Staller

An Introductory Training on TimescaleDB

TimescaleDB is an open-source time-series database, implemented as a Postgres extension, that improves insert rates by 20x over vanilla Postgres and offers much faster queries, while natively supporting full SQL (including JOINs). TimescaleDB achieves this by storing data on an individual server in a manner more common to...

13:00 - 16:00 Winchester 1 Dev

14:00

Dealing with Gigantic Tables
Coping strategies when you have 1+ TB of stuff in one place

presented by Linas Valiukas

One of the databases that I'm working on belongs to an academic project, and academia is notorious for their dislike of deleting data - in their eyes, every single byte has "future research potential" and so nothing is to be purged at any cost. Thus, research datasets have a tendency to grow to colossal sizes, and normal database management practices no longer apply - one has to put their own D...

14:00 - 14:20 Winchester 2 Data

14:30

Data pipelines with PostgreSQL and Kafka

presented by Oskari Saarenmaa

Apache Kafka is a high-performance open-source stream processing platform for collecting and processing large numbers of messages in real-time. It's used in an increasingly large number of data pipelines to handle events such as website click streams, transactions and other telemetry in real-time and at scale. Kafka's core benefit is clean decoupling of event producing and consuming logic which...

14:30 - 15:20 Winchester 2 Data

Architecting distributed data for GDPR
The What and the How of Making PostgreSQL GDPR Compliant

presented by Karthik Ranganathan

The General Data Protection Regulation, often referred to as GDPR, came into effect on 25 May 2018 across the European Union. This regulation has implications on many global businesses, given the fines imposed if the organization is be found to be non-compliant. Making sure that the app architecture continues to ensure regulatory compliance is an on-going challenge for ...

14:30 - 15:20 Market Data

15:40

Operating Machine Learning & Massively Parallel Postgres on Kubernetes
Agile Analytics. Everywhere.

presented by Derek Comingore

Pivotal’s Massively Parallel Postgres for Kubernetes introduces the world to container-native high-performance machine learning and analytics. The offering entails an end-to-end open-source software infrastructure stack including Bosh, Cloud Foundry, Ubuntu, Kubernetes, and the open-source Greenplum analytical database. Due to the inherent advantages that come with immutable infrastructure, use...

15:40 - 16:30 Winchester 2 Data

Building a DBaaS platform for Postgres

presented by Bala Kuchibhotla

You have a 20TB Postgres production database.. requests have come in for new compliance and business-continuity requirements. The new mobile app development team has requested access to a staging copy of the current data. Backup windows are getting longer and longer.

You start doing the math in your head.. 1, 2, 3, 4.. no 6 copies.. that will require an extra 120TB of space... how can you re...

15:40 - 16:30 Market Ops

16:40

Taking care of the elephant
The whys and hows of Elephant Shed

presented by Michael Meskes

The Elephant Shed is an Open Source PostgreSQL Appliance that bundles and integrates proven components, required for easy management of a PostgreSQL server. Proven tools for all relevant aspects are already preinstalled and preconfigured. The majority of these tools can be controlled via a comfortable web interface. Even experienced PostgreSQL administrators will hardly find an area not covered...

16:40 - 17:30 Winchester 2 Ops

Data Recovery in PostgreSQL on a damaged filesystem
A Case Study

presented by Christopher Travers

After a power outage took all our db servers offline suddenly, two of them failed to come back. We decided to do data recovery on these systems in order to reduce the amount of data loss. This talk will focus on both the technical and non-technical aspects of data recovery used, and what anyone should be aware of when trying to recover data from PostgreSQL.

We will discuss the causes of th...

16:40 - 17:30 Market Ops

How To Index Your Database
Indexing is simple, approachable, and useful

presented by Baron Schwartz

Do you know what database indexes are and how they work? Do they seem hard to understand? They don't have to be. The basic principles you need to know are simple and easy to remember. And developers need to know the basics of indexing: your DBAs can't save you, because discovering you need an index after you deploy a feature is often too little, too late. This talk will give you the fundamental...

16:40 - 17:30 Winchester 1 Dev

2018-10-16

09:00

How YugaByte DB implements distributed PostgreSQL
A hands-on introduction to YugaByte DB

presented by Karthik Ranganathan

Building applications on PostgreSQL that require automatic data sharding and replication, fault tolerance, distributed transactions and geographic data distribution has been hard. In this 3 hour workshop, we will look at how to do this using a real-world example running on top of YugaByte DB, a distributed database that is fully wire-compatible with PostgreSQL and NoSQL APIs (Apache Cassandra a...

09:00 - 12:00 Winchester 1 Data

Tips and Tricks with Amazon RDS for PostgreSQL

presented by Jignesh Shah

Managed database services are gaining in popularity. In this session we look at how best to configure Amazon RDS for PostgreSQL and also look at common user operations of using RDS for PostgreSQL. We will also look beyond common user operations and into some specific optimizations related to upgrade, logical replication, performance, and reducing downtime.

09:00 - 09:50 Winchester 2 Ops

Performant time-series data management and analytics with Postgres

presented by Matvey Arye

Time-series databases are one of the fasting growing segments of the database market, spreading across industries and use cases. Common requirements including ingesting high volumes of structured data; answering complex, performant queries for both recent and historical time int...

09:00 - 09:50 Market Data

10:00

HOT - UNDERSTANDING THIS IMPORTANT UPDATE OPTIMIZATION

presented by Grant McAlister

In this session we will dive deep into HOT (Heap Only Tuple) update optimization. Utilizing this optimization can result in improved writes rates, less index bloat and reduced vacuum effort but to enable PostgreSQL to use this optimization may require changing your application design and database settings. We will examine how the number of indexes, frequency of updates, fillfactor and vacuum se...

10:00 - 10:50 Winchester 2 Ops

Designing a data-access control architecture for GDPR
How to for Greenplum

presented by Pratik Verma

Financial, Insurance, Industrial, Healthcare and many other F500 orgs that use data lake containing Greenplum with Hadoop. With BlueTalon, one such company was able to implement a Data Democratization policy framework that allows end users to access the subset of data based on the business they belong to or their finance functional area through templated dynamic attribute based database, table,...

10:00 - 10:50 Market Ops

10:50

Tuesday Morning Break

presented by PostgresConf Organizers

Tuesday morning break

10:50 - 11:10 Ballroom

11:10

Top 10 Mistakes When Migrating From Oracle to PostgreSQL

presented by Jim Mlodgenski

As more and more people are moving to PostgreSQL from Oracle, a pattern of mistakes is emerging. They can be caused by the tools being used or just not understanding how PostgreSQL is different than Oracle. In this talk we will discuss the top mistakes people generally make when moving to PostgreSQL from Oracle and what the correct course of action.

11:10 - 12:00 Winchester 2 Dev

Viewing data at the intersection between roles

presented by Lloyd Albin

We ran across a use case where we needed to restrict people's access to the data by requiring them to belong to 2 or more groups. Most people would create a third group and make the two groups, members of this new group. The extra group was not wanted for various reasons but PostgreSQL by default does not support tying the permissions together for more than one role/group.

In our use case, w...

11:10 - 12:00 Market Dev

12:00

Tuesday Lunch

presented by PostgresConf Organizers

Tuesday Lunch

12:00 - 13:00 Ballroom

13:00

End-to-End Encryption in Modern Database Architectures

presented by David Szabo

In the evolution of databases becoming platforms, they naturally expand to clouds and client devices, increasing the scope of a database from a data store & query engine to elastic scaling and solving the challenge of mobile networking and cross-device data sync on client platforms. This new stretched architecture enables a new and improved way to protect data using End-to-End Encryption, enabl...

13:00 - 13:50 Winchester 2 Dev

Using PostgreSQL in Gobalto
We are using Amazon RDS for PostgreSQL.

presented by savita pandey

Gobalto speeds up clinical trials through site identification, feasibility, selection, activation and into conduct — getting needed therapies to patients sooner, like our namesake, Balto › goBalto is the industry leader in cloud-based study startup software for the global life sciences industry. Committed to accelerating clinical trials through innovation, product excellence, and customer succ...

13:00 - 13:20 Winchester 1 Dev

Building A Serverless Database Using Postgres

presented by Nicolae Rusan

Serverless infrastuctures open up the opportunity to reimagine how databases and code interact. In this talk we demo a new type of database we've built that is focused on serverless use-cases. The design goals being ease of use and enhanced powers at the database level. We outline our thoughts on the future of the data layer for applications, and how new design paradigms such as GraphQL, AI, an...

13:00 - 13:50 Market Dev

13:30

Hardware acceleration of Postgres using FPGAs
postgres acceleration using FPGAs on Amazon F1

presented by HK Verma

In this talk, we present a data analytics acceleration stack, using which users can easily execute their existing Postgres SQL queries on an accelerated FPGA platform. The integrated Xilinx library offloads scan and aggregate instructions from Postgres query plan to FPGA. Instruction code for the massively parallel SQL processing unit is generated on-the-fly for user query. Postgres storage pag...

13:30 - 13:50 Winchester 1 Dev

14:00

Managing PostgreSQL-as-a-Service at Large Scale in SAP Multi-Cloud platform
-

presented by Dinesh Kumar, Vinayak Jadhav, Abhijit Gharami, SUBHRO BHATTACHARYYA, and Shashank Jain

SAP Cloud Platform (SCP) is an open platform-as-a-service (PaaS) product that provides core services, for building and extending cloud applications on multiple cloud IAASs. SCP supports AWS, OpenStack, Azure and GCP.

One of the core services provided by SCP is PostgreSQL as a Service (PostgreSQL-as-a-Service). Each PostgreSQL-as-a-Service instance consists of 5 VMs - Postgres-Master, Pos...

14:00 - 14:50 Market Dev

The Art of Database Experiments

presented by Nikolay Samokhvalov

Future database administration will be highly automated. Until then, we still live in a world where extensive manual interactions are required from a skilled DBA. This will change soon as more "autonomous databases" reach maturity and enter the production environment.

Postgres-specific monitoring tools and systems continue to improve, detecting and analyzing performance issues and bottleneck...

14:00 - 14:50 Winchester 1 Ops

Modernizing the PostgreSQL developer experience with GraphQL

presented by Tanmai Gopal

GraphQL makes it easy for frontend developers to query their backend APIs. At Hasura, we've built a GraphQL engine that works on top of a Postgres database and gives automatically developers a GraphQL backend that they can use to perform "CRUD" and "watch" operations on. The GraphQL engine also adds an access control layer for application users, inspired by postgres's RLS.

In this talk, af...

14:00 - 14:50 Winchester 2 Dev

14:50

Conference Closing

presented by PostgresConf Organizers

Conference Closing

14:50 - 15:10 Ballroom