Data source:

> Khazaei, Bahram; Read, Laura K; Casali, Matthew; Sampson, Kevin M; Yates, David N (2022): GLOBathy, the Global Lakes Bathymetry Dataset. figshare. Collection. https://doi.org/10.6084/m9.figshare.c.5243309.v1

To re-create the `lakes.parquet` file use DuckDB:

```sql
COPY (SELECT Hylak_id, Lake_name, Country, Dmax_use_m AS Depth_m
        FROM read_csv('~/Downloads/GLOBathy_basic_parameters/GLOBathy_basic_parameters(1-100K LAKES).csv', AUTO_DETECT=TRUE))
  TO 'lakes.parquet' (FORMAT 'PARQUET', CODEC 'ZSTD');
```
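
To sanity-check the result, the generated file can be queried directly from DuckDB. This is a minimal sketch, assuming `lakes.parquet` was written to the current working directory by the `COPY` statement above. The column names come from that statement; the checks themselves are only suggestions and are not part of the original procedure.

```sql
-- Confirm that the four selected columns are present, with the types DuckDB inferred.
DESCRIBE SELECT * FROM 'lakes.parquet';

-- Count the rows that were written.
SELECT count(*) AS lake_count FROM 'lakes.parquet';

-- Spot-check a few rows, deepest lakes first.
SELECT Hylak_id, Lake_name, Country, Depth_m
  FROM 'lakes.parquet'
 ORDER BY Depth_m DESC
 LIMIT 5;
```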