Silicon Valley Program
presented by PostgresConf Organizers
Registration
08:00 - 09:00 Ballroom
presented by PostgresConf Organizers
Monday morning break
08:30 - 08:50 Ballroom
Greenplum Database, Evolving Advanced Analytics on Postgres
presented by Tushar Pednekar
Greenplum brings massively parallel processing to the world of PostgreSQL to enable analysis of petabyte-scale data using a standard PostgreSQL interface. This talk will explain how Greenplum enables massively parallel processing and how the future roadmap and integration with Kubernetes will enable Greenplum to scale to new heights.
09:00 - 09:20 Ballroom Data
presented by Sunil Kamath
In this talk, Microsoft will provide an update on its own learnings from supporting Postgres as a first-class database on the Azure platform and the customer learnings from supporting Postgres over the last 1+ year, and discuss how these learnings are shaping the direction of our investments and innovations going forward.
09:20 - 09:40 Ballroom
presented by Tom Kincaid
This is a keynote presentation as part of 2ndQuadrant's sponsorship. This presentation will cover the new and exciting features in PG11. Specific topics will be stored procedures, new partitioning enhancements, and JIT compilation. There will also be a quick look at some of the smaller features.
09:40 - 10:00 Ballroom Data
Operating a HA, performant Postgres Database on VMs or baremetal
presented by Jeff Silberman
Containers and Kubernetes are two big trends in DevOps. However, it is not always obvious to DevOps teams how to containerize their databases, and how to operate using container platforms like Kubernetes. In this talk, we will discuss the benefits of containerizing databases and present best practices for running PostgreSQL in containers using Kubernetes.
We will cover how to use Kubernetes...
10:10 - 11:00 Winchester 1 Ops
presented by Joshua Bae
There is a fundamental problem that AgensGraph DBMS solves for the World Food Program (WFP). Moreover, the solution is not limited to the WFP; it can be applied universally. What we are hitting upon is the core problem of how we store data about humans when there is so much heterogeneity in how the data is collected, while also preserving the relationships between variables. Th...
10:10 - 11:00 Ballroom Data
presented by Robert Treat
PostgreSQL is often thought of as "The Database for DBAs", but what more and more developers are finding out is that what makes Postgres "The World's Most Advanced Open Source Database" is its wide array of features, many of which are really geared towards users who want to build applications on top of Postgres.
In this talk, we'll look at some of the features in Postgres you may not have see...
10:10 - 11:00 Market Dev
presented by Alexis Guajardo and Brett Hesterberg
Keeping data safe is the top responsibility of anyone running a database. Learn how the Google Cloud SQL team protects against data loss. Cloud SQL is Google’s fully-managed database service that makes it easy to set up and maintain PostgreSQL databases in the cloud. In this session, we’ll dive into Cloud SQL’s storage architecture to learn how we check data down to the disk level. We will also...
10:10 - 11:00 Winchester 2 Data
presented by Amiram Shachar
The concept of data integrity and consistency is crucial when managing workloads. This aspect may be trivial when running with On-Demand instances, but it’s not so trivial while working with EC2 Spot Instances, which are conceptually ephemeral and can be revoked at any given moment. At Spotinst, we gave some deep thought to how you can leverage Spot but still handle data concerns easily an...
11:10 - 11:30 Market Ops
presented by Maryam Sanglaji and Mark Chamness
Nutanix, an “Enterprise Cloud” software company, collects an enormous amount of operational data in the customer’s cloud clusters, from storage traffic to compute performance to network patterns.
In addition to analyzing customer-derived data, the data-science team also uses Postgres to report financials and internal operations data to every layer of management, from the C-suite to the produ...
11:10 - 11:30 Winchester 2 Data
Increase Amazon Redshift and Pivotal Greenplum Concurrency
presented by Lee Roland
Data Analytics/Warehouse solutions are often challenged with slow application response times caused by 1) High latency and 2) Frequent queries to the backend.
Heimdall Data is a Database Proxy that leverages Postgres for query optimization to improve the performance of Data Warehouse solutions (e.g. Amazon Redshift, Pivotal Greenplum). Techniques include batch processing and materialized vi...
11:10 - 12:00 Winchester 1 Ops
presented by Adam Kamor and Andrew Colombi
Subsetting a database means taking a subset of its rows, across multiple tables, while still maintaining all of the constraints of the original database. Subsetting a database can be desirable for many reasons. One common use-case is to scale down a production database to a more reasonable size so that it can be used in staging, test, and development environments. This can be done to save cost...
11:40 - 12:00 Market Dev
presented by Harry Glaser
The most innovative companies today are actually data companies masquerading as something else. The rise of the "data team" has made it possible to investigate questions that couldn't have even been asked just a few years ago. In this talk, Periscope Data CEO Harry Glaser will share his observations from working with more than 1,000 teams of data leaders, and explain why data leaders need a sea...
11:40 - 12:00 Winchester 2 Data
presented by PostgresConf Organizers
Monday Lunch
12:00 - 13:00 Ballroom
ScaleGrid Postgres
presented by Dharshan Rangegowda and Madan Kumar K
Configuring high availability is critical for any production Postgres deployment. High availability goes beyond the setup of streaming replication. You need to understand the implications of process failures, node failures and network partitions on your cluster. In this talk we will present in detail three popular options to set up high availability using Postgres replication.
- Postgres Automa...
presented by Bin Fan
Today, emerging cloud storage services like Amazon S3 and Azure’s Blob storage, along with on-premises solutions like Swift, Minio or Ceph, provide many enterprises with scalable, low-cost, fault-tolerant storage capacity. Founded at UC Berkeley’s AMPLab, Alluxio (formerly Tachyon) acts as a single source virtual data layer and enables applications to transparently access data using a consistent interf...
13:00 - 13:50 Winchester 2 Data
An introduction to TimescaleDB, a Postgres extension, and use cases
presented by Diana Hsieh, Matvey Arye, and Andrew Staller
An Introductory Training on TimescaleDB
TimescaleDB is an open-source time-series database, implemented as a Postgres extension, that improves insert rates by 20x over vanilla Postgres and offers much faster queries, while natively supporting full SQL (including JOINs). TimescaleDB achieves this by storing data on an individual server in a manner more common to...
13:00 - 16:00 Winchester 1 Dev
Coping strategies when you have 1+ TB of stuff in one place
presented by Linas Valiukas
One of the databases that I'm working on belongs to an academic project, and academia is notorious for their dislike of deleting data - in their eyes, every single byte has "future research potential" and so nothing is to be purged at any cost. Thus, research datasets have a tendency to grow to colossal sizes, and normal database management practices no longer apply - one has to put their own D...
14:00 - 14:20 Winchester 2 Data
The What and the How of Making PostgreSQL GDPR Compliant
presented by Karthik Ranganathan
The General Data Protection Regulation, often referred to as GDPR, came into effect on 25 May 2018 across the European Union. This regulation has implications for many global businesses, given the fines imposed if the organization is found to be non-compliant. Making sure that the app architecture continues to ensure regulatory compliance is an on-going challenge for ...
14:30 - 15:20 Market Data
presented by Oskari Saarenmaa
Apache Kafka is a high-performance open-source stream processing platform for collecting and processing large numbers of messages in real-time. It's used in an increasingly large number of data pipelines to handle events such as website click streams, transactions and other telemetry in real-time and at scale. Kafka's core benefit is clean decoupling of event producing and consuming logic which...
14:30 - 15:20 Winchester 2 Data
Agile Analytics. Everywhere.
presented by Derek Comingore
Pivotal’s Massively Parallel Postgres for Kubernetes introduces the world to container-native high-performance machine learning and analytics. The offering entails an end-to-end open-source software infrastructure stack including Bosh, Cloud Foundry, Ubuntu, Kubernetes, and the open-source Greenplum analytical database. Due to the inherent advantages that come with immutable infrastructure, use...
15:40 - 16:30 Winchester 2 Data
presented by Bala Kuchibhotla
You have a 20TB Postgres production database... requests have come in for new compliance and business-continuity requirements. The new mobile app development team has requested access to a staging copy of the current data. Backup windows are getting longer and longer.
You start doing the math in your head... 1, 2, 3, 4... no, 6 copies... that will require an extra 120TB of space... how can you re...
15:40 - 16:30 Market Ops
Indexing is simple, approachable, and useful
presented by Baron Schwartz
Do you know what database indexes are and how they work? Do they seem hard to understand? They don't have to be. The basic principles you need to know are simple and easy to remember. And developers need to know the basics of indexing: your DBAs can't save you, because discovering you need an index after you deploy a feature is often too little, too late. This talk will give you the fundamental...
16:40 - 17:30 Winchester 1 Dev
A Case Study
presented by Christopher Travers
After a power outage took all our db servers offline suddenly, two of them failed to come back. We decided to do data recovery on these systems in order to reduce the amount of data loss. This talk will focus on both the technical and non-technical aspects of data recovery used, and what anyone should be aware of when trying to recover data from PostgreSQL.
We will discuss the causes of th...
16:40 - 17:30 Market Ops
The whys and hows of Elephant Shed
presented by Michael Meskes
The Elephant Shed is an Open Source PostgreSQL Appliance that bundles and integrates proven components required for easy management of a PostgreSQL server. Proven tools for all relevant aspects are already preinstalled and preconfigured. The majority of these tools can be controlled via a comfortable web interface. Even experienced PostgreSQL administrators will hardly find an area not covered...
16:40 - 17:30 Winchester 2 Ops
presented by Matvey Arye
Time-series databases are one of the fastest-growing segments of the database market, spreading across industries and use cases. Common requirements include ingesting high volumes of structured data; answering complex, performant queries for both recent and historical time int...
09:00 - 09:50 Market Data
presented by Jignesh Shah
Managed database services are gaining in popularity. In this session we look at how best to configure Amazon RDS for PostgreSQL and also look at common user operations of using RDS for PostgreSQL. We will also look beyond common user operations and into some specific optimizations related to upgrade, logical replication, performance, and reducing downtime.
09:00 - 09:50 Winchester 2 Ops
A hands-on introduction to YugaByte DB
presented by Karthik Ranganathan
Building applications on PostgreSQL that require automatic data sharding and replication, fault tolerance, distributed transactions and geographic data distribution has been hard. In this 3-hour workshop, we will look at how to do this using a real-world example running on top of YugaByte DB, a distributed database that is fully wire-compatible with PostgreSQL and NoSQL APIs (Apache Cassandra a...
09:00 - 12:00 Winchester 1 Data
presented by Grant McAlister
In this session we will dive deep into the HOT (Heap Only Tuple) update optimization. Utilizing this optimization can result in improved write rates, less index bloat and reduced vacuum effort, but enabling PostgreSQL to use this optimization may require changing your application design and database settings. We will examine how the number of indexes, frequency of updates, fillfactor and vacuum se...
10:00 - 10:50 Winchester 2 Ops
How to for Greenplum
presented by Pratik Verma
Financial, insurance, industrial, healthcare and many other F500 organizations use data lakes containing Greenplum with Hadoop. With BlueTalon, one such company was able to implement a Data Democratization policy framework that allows end users to access the subset of data based on the business they belong to or their finance functional area through templated dynamic attribute based database, table,...
10:00 - 10:50 Market Ops
presented by PostgresConf Organizers
Tuesday morning break
10:50 - 11:10 Ballroom
presented by Lloyd Albin
We ran across a use case where we needed to restrict people's access to the data by requiring them to belong to 2 or more groups. Most people would create a third group and make the two groups members of this new group. The extra group was not wanted for various reasons, but PostgreSQL by default does not support tying the permissions together for more than one role/group.
In our use case, w...
11:10 - 12:00 Market Dev
presented by Jim Mlodgenski
As more and more people are moving to PostgreSQL from Oracle, a pattern of mistakes is emerging. They can be caused by the tools being used or just by not understanding how PostgreSQL is different from Oracle. In this talk we will discuss the top mistakes people generally make when moving to PostgreSQL from Oracle and what the correct course of action is.
11:10 - 12:00 Winchester 2 Dev
presented by PostgresConf Organizers
Tuesday Lunch
12:00 - 13:00 Ballroom
We are using Amazon RDS for PostgreSQL.
presented by Savita Pandey
goBalto speeds up clinical trials through site identification, feasibility, selection, activation and into conduct, getting needed therapies to patients sooner, like our namesake, Balto. goBalto is the industry leader in cloud-based study startup software for the global life sciences industry. Committed to accelerating clinical trials through innovation, product excellence, and customer succ...
13:00 - 13:20 Winchester 1 Dev
presented by Nicolae Rusan
Serverless infrastructures open up the opportunity to reimagine how databases and code interact. In this talk we demo a new type of database we've built that is focused on serverless use-cases. The design goals are ease of use and enhanced powers at the database level. We outline our thoughts on the future of the data layer for applications, and how new design paradigms such as GraphQL, AI, an...
13:00 - 13:50 Market Dev
presented by David Szabo
In the evolution of databases becoming platforms, they naturally expand to clouds and client devices, increasing the scope of a database from a data store & query engine to elastic scaling and solving the challenge of mobile networking and cross-device data sync on client platforms. This new stretched architecture enables a new and improved way to protect data using End-to-End Encryption, enabl...
13:00 - 13:50 Winchester 2 Dev
Postgres acceleration using FPGAs on Amazon F1
presented by HK Verma
In this talk, we present a data analytics acceleration stack, using which users can easily execute their existing Postgres SQL queries on an accelerated FPGA platform. The integrated Xilinx library offloads scan and aggregate instructions from Postgres query plan to FPGA. Instruction code for the massively parallel SQL processing unit is generated on-the-fly for user query. Postgres storage pag...
13:30 - 13:50 Winchester 1 Dev
presented by Nikolay Samokhvalov
Future database administration will be highly automated. Until then, we still live in a world where extensive manual interactions are required from a skilled DBA. This will change soon as more "autonomous databases" reach maturity and enter the production environment.
Postgres-specific monitoring tools and systems continue to improve, detecting and analyzing performance issues and bottleneck...
14:00 - 14:50 Winchester 1 Ops
presented by Dinesh Kumar, Vinayak Jadhav, Abhijit Gharami, Subhro Bhattacharyya, and Shashank Jain
SAP Cloud Platform (SCP) is an open platform-as-a-service (PaaS) product that provides core services for building and extending cloud applications on multiple cloud IaaS providers. SCP supports AWS, OpenStack, Azure and GCP.
One of the core services provided by SCP is PostgreSQL as a Service (PostgreSQL-as-a-Service). Each PostgreSQL-as-a-Service instance consists of 5 VMs - Postgres-Master, Pos...
14:00 - 14:50 Market Dev
presented by Tanmai Gopal
GraphQL makes it easy for frontend developers to query their backend APIs. At Hasura, we've built a GraphQL engine that works on top of a Postgres database and automatically gives developers a GraphQL backend that they can use to perform "CRUD" and "watch" operations on. The GraphQL engine also adds an access control layer for application users, inspired by Postgres's RLS.
In this talk, af...
14:00 - 14:50 Winchester 2 Dev
presented by PostgresConf Organizers
Conference Closing
14:50 - 15:10 Ballroom