The Goal of this project is to try different language and library to preprocess a dataset in a pipeline.
- duckdb_sql.sql : preprocessing only in SQL
https://duckdb.org/docs/stable/sql/expressions/star
https://duckdb.org/docs/stable/sql/query_syntax/from
https://duckdb.org/2025/08/15/ml-data-preprocessing
https://www.markhneedham.com/blog/
https://www.datawithbaraa.com/wiki/sql