
Introducing Open Source WarehousePG


WarehousePG is an open source, massively parallel processing (MPP) database server for petabyte-scale analytic data warehouse and business intelligence workloads.

WarehousePG is based on open source PostgreSQL technology. It is essentially many PostgreSQL instances running in unison as one cohesive database management system (DBMS). Because it is PostgreSQL-based, users can interact with WarehousePG as they would with a standard PostgreSQL instance, using familiar third-party tools and SQL.

MPP refers to clusters with two or more segment hosts working in parallel to deliver a query's result set. User data is spread across the data segments using a distribution key. In an MPP shared-nothing environment, every segment host contains multiple segment instances (PostgreSQL processes) that operate independently of other segments. Each segment host has its own CPU, memory, and storage. User data is mirrored across the WarehousePG cluster, delivering high availability and fault tolerance.
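The idea of spreading rows across segments by a distribution key, and then combining per-segment results, can be illustrated with a small Python model. This is only a conceptual sketch: the hash function, the function names (`segment_for`, `distribute`, `parallel_count`), and the sample `order_id` column are assumptions made for illustration and do not reflect how WarehousePG actually hashes or stores data.

```python
import hashlib
from collections import defaultdict


def segment_for(key, num_segments):
    """Map a distribution-key value to a segment index using a stable hash.

    SHA-256 is an arbitrary choice for this sketch, not the hash the
    database uses internally.
    """
    digest = hashlib.sha256(str(key).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_segments


def distribute(rows, key_column, num_segments):
    """Assign every row to exactly one segment based on its key value."""
    segments = defaultdict(list)
    for row in rows:
        segments[segment_for(row[key_column], num_segments)].append(row)
    return segments


def parallel_count(segments):
    """Each segment counts only its own rows; the partial counts are summed."""
    partial_counts = [len(rows) for rows in segments.values()]
    return sum(partial_counts)


# Hypothetical sample data: 1000 rows keyed by order_id, spread over 4 segments.
rows = [{"order_id": i, "amount": i * 10} for i in range(1000)]
segments = distribute(rows, "order_id", num_segments=4)
total = parallel_count(segments)
```

Because the same key value always maps to the same segment, each segment can work on its own slice of the data without consulting the others, which is the essence of the shared-nothing model described above.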

WarehousePG Architecture

High-Level WarehousePG Architecture

Documentation

Release Notes

Install Guide

Admin Guide

Utility Guide

Analytics Guide

Reference Guide

Backup & Restore Guide