CSV to Parquet converter

Trusted by over 20,000 every month

Convert CSV to Parquet online

With our online CSV to Parquet converter you can convert your files without downloading any software or writing code. Unlike other services, you can make graphs from your converted data or perform analysis. Just click the navigation on the left hand side.

Convert CSV to Parquet online

Works with large CSV files that have millions of rows

View your converted Parquet data before downloading it

CSV

CSV (Comma Separated Values) files are the most common format for storing tabular data. Values in a row are separated by commas and rows are separated by newlines.

CSV files often start with a header row that has column names, but this is not required.

Each row in a CSV file must have the same number of values as the header row.

CSV files do no enforce types or a schema. This means that each column can have multiple types, which can make analysis difficult and compression inefficient.

Parquet files can be easier to analyze and compress better than CSV files.

Parquet

Apache Parquet (.parquet) is a format that was designed for storing tabular data on disk. It was designed based on the format used in Google's Dremel paper (Dremel later became Big Query).

Parquet files store data in a binary format, which means that they can be efficiently read by computers but are difficult for people to read.

Parquet files have a schema, so means that every value in a column must have the same type. The schema makes Parquet files easier to analyse than CSV files and also helps them to have better compression so they are smaller on disk.

How to convert CSV to Parquet

  1. Upload your CSV file
  2. Your CSV file will be converted to Parquet
  3. Download your Parquet file
  4. Click the view button to view your file

How to convert CSV to Parquet in Python

We can convert CSV to Parquet in Python using Pandas or DuckDB

How to Convert CSV to Parquet using Pandas

First, we need to install pandas

pip install pandas

Then we can load the CSV file into a dataframe

df = pd.read_csv('path/to/file.csv')

Finally, we can export the dataframe to the Parquet format

df.to_parquet('path/to/file.parquet', index=False)

How to Convert CSV to Parquet using DuckDB

First, we need to install duckdb for Python

 pip install duckdb

The following DuckDB query will read a CSV file and output a Parquet file

duckdb.sql("""COPY (select * from 'path/to/file.csv') TO 'path/to/file.parquet' (FORMAT 'parquet')""")

CSV viewer

View and explore CSV files.

CSV graph maker

Make graphs from CSV files

Merge CSV

Combine CSV files.

Filter CSV

Filter CSV files online

Sort CSV

Sort CSV files online

CSV bar graph maker

Make bar graphs from CSV files

CSV scatter plot maker

Make scatter plots from CSV files

MT cars

Motor Trends Car Road Tests dataset.

filename

mtcars.parquet

rows

32

Flights 1m

1 Million flights including arrival and departure delays.

filename

flights-1m.parquet

rows

1000000

Iris

Iris plant species data set.

filename

iris.parquet

rows

50

House price

Housing price dataset.

filename

house-price.parquet

rows

545

Weather

Weather dataset with temperature, rainfall, sunshine and wind measurements.

filename

weather.parquet

rows

366