etl - Downloading files with pentaho kettle -


i'm trying create job can download several files via http. list of these files in mysql table. create main job these steps in line: start, set variables, filelist (a transformation created), download (a job i've created) , success.

the transformation filelist contains following steps: table input , copy rows result (this transformation communicates database , gives list of urls main task). task download contains following steps: start, http, success (this task should download files computer).

all doesn't work, why? know better way same thing?

i expect have basic knowledge of kettle. so, getting list of db not issue. guess stuck @ having kettle download , save of files - running loop.

the step downloading file "http" , available in jobs. trick have job (containing http step download) executed every file - or use kettle-lingo "executed every row". url passed down download-job parameter set field.

if didn't you, check out following link go more detail how accomplish feat (it kind of feat - shouldn't 1 though):

http://www.joyofdata.de/blog/batch-downloading-files-with-pentaho-kettle/


Comments

Popular posts from this blog

PySide and Qt Properties: Connecting signals from Python to QML -

c# - DevExpress.Wpf.Grid.InfiniteGridSizeException was unhandled -

scala - 'wrong top statement declaration' when using slick in IntelliJ -