This simple template shows the content of a Prepfile.toml that lets you read the a dataset called data.csv. In this Prepfile I decide to rename column1 to rename_column1.
I also choose to fill the missing values of the column age with the mean of the column.

Then I export the result into a new dataset called data_cleaned.csv

  [data]
filename = 'data.csv'
csv_separator = ','
decimal_separator = '.'
encoding = 'utf-8'
missing_identifier = ''

[preprocess]
[[preprocess.columns]]
name = 'column1'
type = 'string'
new_name = 'rename_column1'

[[preprocess.columns]]
name = 'age'
type = 'int'
operations = [{op = "fillna", method = "mean"}]



[postprocess]
format = 'csv'
filename = 'data_cleaned.csv'
  

Last updated 01 Jun 2025, 20:24 +0200 . history