I have a Django application where users can upload large datasets into a PostgreSQL database. I would like to use an ETL tool such as Pentaho Kettle to handle the upload job. Later, I would also like to use Kettle for data mining and transformations.
However, I have no idea how to connect my Django app to Kettle. I am not sure which version of Kettle I need, or whether I need Spoon or Carte.
How can I trigger the job from my Django (Python) code, and how do I pass the files in request.FILES from my app to Kettle?
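To make the question concrete, here is a rough sketch of what I imagine, not something I have working. It assumes the upload is first written to a temporary file and that the job is then started through Kettle's command-line runner (kitchen.sh). The install path, the job file, the INPUT_FILE parameter name, and all function names are placeholders I made up for illustration.

```python
import subprocess
import tempfile


def save_upload_to_temp(uploaded_file, suffix=".csv"):
    """Write an upload to disk and return its path.

    Works with any object exposing a Django-style .chunks() method,
    such as an entry from request.FILES.
    """
    with tempfile.NamedTemporaryFile(delete=False, suffix=suffix) as tmp:
        for chunk in uploaded_file.chunks():
            tmp.write(chunk)
        return tmp.name


def build_kitchen_command(job_path, input_path,
                          kitchen="/opt/data-integration/kitchen.sh"):
    """Build the argument list for running a Kettle job from the shell.

    The kitchen path and the INPUT_FILE parameter are assumptions; the
    job itself would have to declare a parameter with that name.
    """
    return [
        kitchen,
        f"-file={job_path}",
        f"-param:INPUT_FILE={input_path}",
        "-level=Basic",
    ]


def run_kettle_job(uploaded_file, job_path):
    """Save the upload, run the job, and return the process exit code."""
    input_path = save_upload_to_temp(uploaded_file)
    cmd = build_kitchen_command(job_path, input_path)
    result = subprocess.run(cmd, capture_output=True, text=True)
    return result.returncode
```

In a view I would presumably call something like run_kettle_job(request.FILES["dataset"], "/path/to/load_dataset.kjb"), where "dataset" is a hypothetical form field name. I do not know whether this is a sensible approach for large files, or whether a Carte server would be the better way to do it.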
Can you explain the general steps or point to documentation or examples?