Skip to content
Merged
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
docker+airflow
  • Loading branch information
sejalv committed Jan 25, 2022
commit 969e6b45ccead701b4072c38aac6b9e7e094072e
8 changes: 5 additions & 3 deletions week_2_data_ingestion/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,14 +4,14 @@
* What is a Data Lake
* ELT vs. ETL
* Alternatives to components (S3/HDFS, Redshift, Snowflake etc.)
* [Video]()
* [Video](https://www.youtube.com/watch?v=W3Zm6rjOq70&list=PL3MmuxUbc_hJed7dXYoJw8DoCuVHhGEQb&index=14)
* [Slides](https://docs.google.com/presentation/d/1RkH-YhBz2apIjYZAxUz2Uks4Pt51-fVWVN9CcH9ckyY/edit?usp=sharing)


### Orchestration (Airflow)
* What is an Orchestration Pipeline?
* What is a DAG?
* [Video]()
* [Video](https://www.youtube.com/watch?v=0yK7LXwYeD0&list=PL3MmuxUbc_hJed7dXYoJw8DoCuVHhGEQb&index=15)

### Workshop:
* Setting up Docker with Airflow: -- 15 mins
Expand All @@ -26,4 +26,6 @@


### Further Enhancements
* Transfer Service (AWS -> GCP)
* Transfer Service (AWS -> GCP)
* [Video 1](https://www.youtube.com/watch?v=rFOFTfD1uGk&list=PL3MmuxUbc_hJed7dXYoJw8DoCuVHhGEQb&index=16)
* [Video 2](https://www.youtube.com/watch?v=VhmmbqpIzeI&list=PL3MmuxUbc_hJed7dXYoJw8DoCuVHhGEQb&index=17)