• by RubenSandwich on 10/2/2017, 7:08:10 PM

    ETL stands for Extract, Transform and Load. So this is a data pipeline framework. Nextdoor please put that in your blog post as I do not believe it is a very common acronym.

  • by john_teller02 on 10/2/2017, 8:51:53 PM

    Its nice but AWS released GLUE for similar use case a couple of weeks back

  • by slagfart on 10/2/2017, 11:13:43 PM

    Gobblin (https://gobblin.apache.org/), which looks like it does something similar, can be packaged up as a single .jar, and run on AWS Lambda.

    Anyone had working experience with either?

  • by throwaway2016a on 10/3/2017, 1:28:18 AM

    Great work. I'm in the process of writing an ETL and while I don't think this will suite our needs (so unfortunately I need to keep writing) this article provides a lot of great detail that helped me see the process more clearly.

  • by throway_foo_bar on 10/2/2017, 10:08:08 PM

    Link to the source in the document doesn't work.