Introducing Crunchy Data Warehouse: A next-generation Postgres-native data warehouse. Crunchy Data Warehouse Learn more

Posts about Analytics

Analytics
8 min read
Marco SlotNov 20, 2024
Crunchy Data Warehouse: Postgres with Iceberg for High Performance Analytics
Marco SlotNov 20, 2024
PostgreSQL is the bedrock on which many of today’s organizations are built. The versatility, reliability, performance, and extensibility of PostgreSQL make it the perfect tool for a large variety of operational workloads. The one area in which PostgreSQL has historically been lacking is analytics, which involves queries that summarize, filter, or transform large amounts of data. Modern analytical databases are designed to query data in data lakes in formats like Parquet using a fast vectorized...
Read More
Analytics
5 min read
Elizabeth ChristensenNov 18, 2024
Easy Totals and Subtotals in Postgres with Rollup and Cube
Elizabeth ChristensenNov 18, 2024
Postgres is being used more and more for analytical workloads. There’s a few hidden gems I recently ran across that are really handy for doing SQL for data analysis, and . Rollup and cube don’t get a lot of attention, but follow along with me in this post to see how they can save you a few steps and enhance your date binning and summary reporting. We also have a web based tutorial that covers Postgres Functions for Rolling Up Data by Date if you want to try it yourself with a sample dat...
Read More
Analytics
8 min read
Christopher WinslettNov 8, 2024
8 Steps in Writing Analytical SQL Queries
Christopher WinslettNov 8, 2024
It is never immediately obvious how to go from a simple SQL query to a complex one -- especially if it involves intricate calculations. One of the “dangers” of SQL is that you can create an executable query but return the wrong data. For example, it is easy to inflate the value of a calculated field by joining to multiple rows. Use Crunchy Playground to follow allow with this blog post using a Postgres terminal: Postgres Playground w/ Sample Data Let’s take a look at a sample query. This appears...
Read More
Analytics
8 min read
Christopher WinslettOct 29, 2024
4 Ways to Create Date Bins in Postgres: interval, date_trunc, extract, and to_char
Christopher WinslettOct 29, 2024
You followed all the best practices, your sales dates are stored in perfect timestamp format …. but now you need to get reports by day, week, quarters, and months. You need to bin, bucket, and roll up sales data in easy to view reports. Do you need a BI tool? Not yet actually. Your Postgres database has hundreds of functions that let you query data analytics by date. By using some good old fashioned SQL - you have powerful analysis and business intelligence with date details on any data set. In...
Read More
Analytics
4 min read
Craig KerstiensOct 17, 2024
pg_parquet: An Extension to Connect Postgres and Parquet
Craig KerstiensOct 17, 2024
Today, we’re excited to release pg_parquet - an open source Postgres extension for working with Parquet files. The extension reads and writes parquet files to local disk or to S3 natively from Postgres. With pg_parquet you're able to: • Export tables or queries from Postgres to Parquet files • Ingest data from Parquet files to Postgres • Inspect the schema and metadata of existing Parquet files Export tables or queries from Postgres to Parquet files Ingest data from Parquet files to Postgres I...
Read More
Spatial Analytics
14 min read
Paul RamseySep 25, 2024
Vehicle Routing with PostGIS and Overture Data
Paul RamseySep 25, 2024
The Overture Maps collection of data is enormous, encompassing over 300 million transportation segments , 2.3 billion building footprints , 53 million points of interest , and a rich collection of cartographic features as well. It is a consistent global data set, but it is intimidatingly large -- what can a person do with such a thing? Building cartographic products is the obvious thing, but what about the less obvious. With an analytical engine like PostgreSQL and Crunchy Bridge for Analy...
Read More
Analytics Tutorials
8 min read
Elizabeth ChristensenSep 16, 2024
Window Functions for Data Analysis with Postgres
Elizabeth ChristensenSep 16, 2024
SQL makes sense when it's working on a single row, or even when it's aggregating across multiple rows. But what happens when you want to compare between rows of something you've already calculated? Or make groups of data and query those? Enter window functions. Window functions tend to confuse people - but they’re a pretty awesome tool in SQL for data analytics. The best part is that you don’t need charts, fancy BI tools or AI to get some actionable and useful data for your stakeholders. Window...
Read More
Spatial Analytics
8 min read
Marco SlotSep 9, 2024
PostGIS meets DuckDB: Crunchy Bridge for Analytics goes Spatial
Marco SlotSep 9, 2024
Crunchy Data is excited to announce the next major feature release for Crunchy Bridge for Analytics : Geospatial Analytics . We have developed a variety of features to connect Postgres and PostGIS to S3 and public web servers to make spatial data access easier than ever. This release includes: • Creating an analytics table directly from a geospatial data set by providing only the URL, for ad-hoc queries and data transformations. • Creating a regular PostGIS table directly from a URL. • Automat...
Read More
Analytics
4 min read
Marco SlotSep 4, 2024
Postgres Materialized Views from Parquet in S3 with Zero ETL
Marco SlotSep 4, 2024
Data pipelines for IoT applications often involve multiple different systems. First, raw data is gathered in object storage, then several transformations happen in analytics systems, and finally results are written into transactional databases to be accessed by low latency dashboards. While a lot of interesting engineering goes into these systems, things are much simpler if you can do everything in Postgres. Crunchy Bridge for Analytics is a managed PostgreSQL offering that integrates DuckDB...
Read More
Analytics
12 min read
Marco SlotAug 16, 2024
Postgres Powered by DuckDB: The Modern Data Stack in a Box
Marco SlotAug 16, 2024
Postgres for analytics has always been a huge question mark. By using PostgreSQL's extension APIs, integrating DuckDB as a query engine for state-of-the-art analytics performance without forking either project could Postgres be the analytics database too? Bringing an analytical query engine into a transactional database system raises many interesting possibilities and questions. In this blog post I want to reflect on what makes these workloads and system architectures so different and what br...
Read More

1 2 3