by RubenSandwich on 10/2/2017, 7:08:10 PM
by john_teller02 on 10/2/2017, 8:51:53 PM
It's nice, but AWS released Glue for a similar use case a couple of weeks back.
by slagfart on 10/2/2017, 11:13:43 PM
Gobblin (https://gobblin.apache.org/), which looks like it does something similar, can be packaged up as a single .jar and run on AWS Lambda.
Has anyone had working experience with either?
by throwaway2016a on 10/3/2017, 1:28:18 AM
Great work. I'm in the process of writing an ETL, and while I don't think this will suit our needs (so unfortunately I need to keep writing), this article provides a lot of great detail that helped me see the process more clearly.
by throway_foo_bar on 10/2/2017, 10:08:08 PM
The link to the source in the document doesn't work.
ETL stands for Extract, Transform and Load, so this is a data pipeline framework. Nextdoor, please put that in your blog post, as I do not believe it is a very common acronym.