Preparing to scrape and populate the DB so that no page is visited more than once. Restructured folders and removed some extraneous pages that are no longer needed.
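A minimal sketch of the "visit each page only once" idea, assuming a single table keyed by URL. The table name `visited_pages`, the helper names, and the use of SQLite are all hypothetical. They are not taken from `docker/scripts/schema.sql`, whose contents are not shown here.

```python
import sqlite3


def open_db(path=":memory:"):
    """Open the database and create the (hypothetical) visited_pages table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS visited_pages ("
        "url TEXT PRIMARY KEY, "
        "visited_at TEXT DEFAULT CURRENT_TIMESTAMP)"
    )
    return conn


def mark_visited(conn, url):
    """Record url; return True if it was new, False if already recorded."""
    # The PRIMARY KEY on url makes a repeated insert a no-op under OR IGNORE.
    cur = conn.execute(
        "INSERT OR IGNORE INTO visited_pages (url) VALUES (?)", (url,)
    )
    conn.commit()
    return cur.rowcount == 1


if __name__ == "__main__":
    conn = open_db()
    # Illustrative URL only; the scraper would call this before fetching.
    for url in ["https://cloud.google.com/compute/docs"] * 2:
        if mark_visited(conn, url):
            print("fetch", url)
        else:
            print("skip", url)
```

Letting the uniqueness constraint reject duplicates keeps the check and the write in one statement, so a crawl that is restarted does not need a separate lookup before each fetch.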
.gitignore |
---|
docker/docker-compose.yaml 0 → 100644 |
---|
docker/scripts/schema.sql 0 → 100644 |
---|
gcp_pages/html/compute_docs_html.txt 0 → 100644 |
---|
gcp_pages/html/compute_docs_images_create-custom_html.txt 0 → 100644 |
---|
gcp_pages/html/compute_docs_instances_html.txt 0 → 100644 |
---|
gcp_pages/html/compute_docs_overview_html.txt 0 → 100644 |
---|
gcp_pages/links/compute_docs_images_create-custom_links.txt 0 → 100644 |
---|
gcp_pages/links/compute_docs_instances_links.txt 0 → 100644 |
---|
gcp_pages/links/compute_docs_links.txt 0 → 100644 |
---|
gcp_pages/links/compute_docs_overview_links.txt 0 → 100644 |
---|
gcp_pages/links/gcp_docs_page_links.txt 0 → 100644 |
---|
gcp_products.ipynb |
---|
page_content/compute_engine_overview.txt 100644 → 0 |
---|
page_content/deploy_a_function.txt 100644 → 0 |
---|
page_content/deploy_to_compute_engine.txt 100644 → 0 |
---|
page_content/troubleshooting_using_the_serial_console.txt 100644 → 0 |
---|
raw_soup/deploy_a_function_RAW.txt 100644 → 0 |
---|
sitemap_data/raw_data/sitemap_1_of_390.txt 0 → 100644 |
---|
sitemap_data/raw_data/sitemap_2_of_390.txt 0 → 100644 |
---|
sitemap_data/raw_data/sitemap_3_of_390.txt 0 → 100644 |
---|
sitemap_data/raw_data/sitemap_4_of_390.txt 0 → 100644 |
---|
sitemap_data/raw_data/sitemap_5_of_390.txt 0 → 100644 |
---|
sitemap_data/raw_data/sitemap_6_of_390.txt 0 → 100644 |
---|
sitemap_data/raw_data/sitemap_7_of_390.txt 0 → 100644 |
---|
sitemap_data/sitemap_scrape.ipynb 0 → 100644 |
---|