Federated Queries Across Both Different Storage Mediums and Different Data Engines
Greenplum Database excels at analytical queries against the data it manages in its internal storage. However, not all of the interesting data lives there. It is essential for Greenplum Database to be able to query data residing in external systems, especially “big data” from Hadoop ecosystems, such as files in HDFS or data in Hive or HBase tables.
The Postgres Extensions Framework (PXF) has been adopted from the Apache HAWQ (incubating) project to run alongside Greenplum Database and provide the ability to read and write data from many different external systems.
The session will cover the history, architecture, and basic use cases of PXF with Greenplum Database. We will also briefly explain more complex topics such as column projection, predicate pushdown, and user impersonation. Additionally, we will share the plans for further development of advanced PXF features.
- 50 min
- PostgresConf US 2018
- Greenplum Summit