Introducing Crunchy Data Warehouse: A next-generation Postgres-native data warehouse. Crunchy Data Warehouse Learn more

  • 3 min read

    Enhanced Postgres Release Notes

    Greg Sabino Mullane

    There is something new you may not have seen in the release notes for Postgres 17. No, not a new feature - I mean inside the actual release notes themselves! The Postgres project uses the git program to track commits to the project, and now each item in the release notes has a link to the actual commit (or multiple commits) that enabled it.

    Read More
  • 14 min read

    Vehicle Routing with PostGIS and Overture Data

    Paul Ramsey

    The Overture Maps collection of data is enormous, encompassing over 300 million transportation segments, 2.3 billion building footprints, 53 million points of interest, and a rich collection of cartographic features as well. It is a consistent global data set, but it is intimidatingly large -- what can a person do with such a thing?

    Building cartographic products is the obvious thing, but what about the less obvious. With an analytical engine like PostgreSQL and Crunchy Bridge for Analytics, what is possible? Well turns out, a lot of things.

    Read More
  • 8 min read

    Real World Performance Gains With Postgres 17 B-tree Bulk Scans

    Brandur Leach

    With RC1 freshly cut, the release of Postgres 17 is right on the horizon, giving us a host of features, improvements, and optimizations to look forward to.

    As a backend developer, one in particular pops off the page, distinguishing itself amongst the dozens of new release items:

    Allow btree indexes to more efficiently find a set of values, such as those supplied by IN clauses using constants (Peter Geoghegan, Matthias van de Meent)

    The B-tree is Postgres' overwhelmingly most common and best optimized index, used for lookups on a table's primary key or secondary indexes, and undoubtedly powering all kinds of applications all over the world, many of which we interact with on a daily basis.

    Read More
  • Window Functions for Data Analysis with Postgres

    Elizabeth Christensen

    SQL makes sense when it's working on a single row, or even when it's aggregating across multiple rows. But what happens when you want to compare between rows of something you've already calculated? Or make groups of data and query those? Enter window functions.

    Window functions tend to confuse people - but they’re a pretty awesome tool in SQL for data analytics. The best part is that you don’t need charts, fancy BI tools or AI to get some actionable and useful data for your stakeholders. Window functions let you quickly:

    • Calculate running totals
    • Provide summary statistics for groups/partitions of data
    • Create rankings
    • Perform lag/lead analysis, ie comparing two separate sets of data with each other
    • Compute moving/rolling averages
    Read More
  • PostGIS meets DuckDB: Crunchy Bridge for Analytics goes Spatial

    Marco Slot

    Crunchy Data is excited to announce the next major feature release for Crunchy Bridge for Analytics: Geospatial Analytics.

    We have developed a variety of features to connect Postgres and PostGIS to S3 and public web servers to make spatial data access easier than ever.

    This release includes:

    • Creating an analytics table directly from a geospatial data set by providing only the URL, for ad-hoc queries and data transformations.
    • Creating a regular PostGIS table directly from a URL.
    • Automatic mapping of geospatial columns into PostGIS geometry type.
    • Support for GeoParquet, GeoJSON, Shapefile (zip), Geopackage, WKT in CSV, and more.
    • Delegate PostGIS functions and operators to DuckDB spatial for fast queries on GeoParquet.

    Together, these make Crunchy Bridge for Analytics an easy-to-use and powerful platform for working with geospatial data.

    Read More
  • 6 min read

    PostgreSQL Snapshots and Backups with pgBackRest in Kubernetes

    Brian Pace

    Backups are dead. Now that I have your attention, let me clarify. Traditional backups have earned a solid reputation for their reliability over time. However, they are dead in the sense that a backup is essentially useless until it's restored—essentially "resurrected." In this post, we'll explore best practices for managing PostgreSQL snapshots and backups using pgBackRest. We will then provide some guidance of how you apply these techniques in Kubernetes using the Postgres Operator (PGO) from Crunchy Data. Whether you're overseeing a production environment, handling replicas, or refreshing lower environments, understanding how to effectively manage snapshots is key.

    Read More
  • 4 min read

    Postgres Materialized Views from Parquet in S3 with Zero ETL

    Marco Slot

    Data pipelines for IoT applications often involve multiple different systems. First, raw data is gathered in object storage, then several transformations happen in analytics systems, and finally results are written into transactional databases to be accessed by low latency dashboards. While a lot of interesting engineering goes into these systems, things are much simpler if you can do everything in Postgres.

    Crunchy Bridge for Analytics

    Read More
  • Understanding the Postgres Hackers Mailing List Language

    Greg Sabino Mullane

    The Postgres hackers mailing list (pgsql-hackers@postgresql.org) is an invaluable resource for anyone wanting to contribute to the PostgreSQL code. The Postgres project does not use PRs (pull requests) or GitHub issues. So if you want to contribute an idea, or help with code reviews, the hackers mailing list is the canonical way to do so. More information on contributing is on the Postgres wiki at: https://wiki.postgresql.org/wiki/So,_you_want_to_be_a_developer

    Read More
  • Announcing an Open Source Monitoring Extension for Postgres with pgMonitor

    Keith Fiske

    Crunchy Data is pleased to announce a new open source pgMonitor Extension. Crunchy Data has worked on a pgMonitor tool for several years as part of our Kubernetes

    Read More
  • 12 min read

    Postgres Powered by DuckDB: The Modern Data Stack in a Box

    Marco Slot

    Postgres for analytics has always been a huge question mark. By using PostgreSQL's extension APIs, integrating DuckDB as a query engine for state-of-the-art analytics performance without forking either project could Postgres be the analytics database too?

    Bringing an analytical query engine into a transactional database system raises many interesting possibilities and questions. In this blog post I want to reflect on what makes these workloads and system architectures so different and what bringing them together means.

    OLAP & OLTP: Never the twain shall meet

    Read More