Skip to content

Tags: aphp/spark-etl

Tags

v2.5.0

Toggle v2.5.0's commit message
Improve pgpass handling

add scd by hash

v2.4.0

Toggle v2.4.0's commit message
Add sqlExec and inputBulkCsv methods

They allow low level sql and loading any csv into postgres

v2.3.0

Toggle v2.3.0's commit message
Hangle arrays

v2.2.0

Toggle v2.2.0's commit message
Change constructor

improve temporary tables

v2.1.0

Toggle v2.1.0's commit message
Add more parallelism for multiline csv

Add a splitFactor argument for inputBulk

v2.0.1

Toggle v2.0.1's commit message
Upgrade documentation and project version

v2.0.0

Toggle v2.0.0's commit message
Change the API to something like builder

- Optimize the parallelism
- make use of csv interface for both input/output bulk

v1.2.0

Toggle v1.2.0's commit message
Implement parallel "copy from".

This produces as many csv as the number of partition needed

v1.1.0

Toggle v1.1.0's commit message
Add hdfs handling

v1.0.3

Toggle v1.0.3's commit message
First release