Read .csv-file to dataframe
Read .csv-file in an S3 bucket into a dataframe
Pandas dataframe
The following code reads a .csv-file in an S3 bucket directly into a pandas
dataframe
import pandas as pd
import s3fs
key = 'key_name' # name of key
bucket = 'bucket_name' # name of bucket
# read .csv-file in S3-bucket into a pandas dataframe
df = pd.read_csv('s3://{}/{}'.format(bucket, key))
Dask dataframe
The following code reads a .csv-file in an S3 bucket directly into a dask
dataframe
import dask.dataframe as dd
key = 'key_name' # name of key
bucket = 'bucket_name' # name of bucket
# read .csv-file in S3-bucket into a dask dataframe
ddf = dd.read_csv('s3://{}/{}'.format(bucket, key))
Last updated
Was this helpful?