In some cases, especially for fewer columns with many duplicated values, one row group may have tons of records, thus causes extremely bad performance on downstream Spark queries. I propose to make ...
The One Billion Row Challenge (1BRC) is a fun exploration of how far modern Java can be pushed for aggregating one billion rows from a text file. Later the community created a dedicated @1brc ...