About 50 results
Open links in new tab
  1. What are the pros and cons of the Apache Parquet format compared …

    Apr 24, 2016 · 30,36,2 Parquet files are most commonly compressed with the Snappy compression algorithm. Snappy compressed files are splittable and quick to inflate. Big data systems want to …

  2. What file extension is the correct way to name parquet files?

    Jun 3, 2025 · <file-name>.parquet : 1) This is the standard and most widely accepted naming convention. 2) The compression codec is stored in the Parquet file metadata, not in the filename. 3) …

  3. Is it possible to read parquet files in chunks? - Stack Overflow

    Nov 29, 2019 · The Parquet format stores the data in chunks, but there isn't a documented way to read in it chunks like read_csv. Is there a way to read parquet files in chunks?

  4. Extension of Apache parquet files, is it '.pqt' or '.parquet'?

    Oct 19, 2021 · I wonder if there is a consensus regarding the extension of parquet files. I have seen a shorter .pqt extension, which has typical 3-letters (like in csv, tsv, txt, etc) and then there is a rather …

  5. Inspect Parquet from command line - Stack Overflow

    How do I inspect the content of a Parquet file from the command line? The only option I see now is $ hadoop fs -get my-path local-file $ parquet-tools head local-file | less I would like to avoid

  6. How to read a Parquet file into Pandas DataFrame?

    How to read a modestly sized Parquet data-set into an in-memory Pandas DataFrame without setting up a cluster computing infrastructure such as Hadoop or Spark? This is only a moderate amount of data …

  7. How to view Apache Parquet file in Windows? [closed]

    Jun 19, 2018 · 99 What is Apache Parquet? Apache Parquet is a binary file format that stores data in a columnar fashion. Data inside a Parquet file is similar to an RDBMS style table where you have …

  8. Spark parquet partitioning : Large number of files

    Jun 28, 2017 · I am trying to leverage spark partitioning. I was trying to do something like data.write.partitionBy ("key").parquet ("/location") The issue here each partition creates huge number …

  9. Power Query Editor - Import .Parquet File - Stack Overflow

    Sep 18, 2023 · How can I convert an imported .parquet file correctly, using Excel's Power Query Editor? Are there drivers available for this in Excel?

  10. Read multiple parquet files in a folder and write to single csv file ...

    Aug 5, 2018 · I need to read these parquet files starting from file1 in order and write it to a singe csv file. After writing contents of file1, file2 contents should be appended to same csv without header.