diff --git a/solutions/system_design/web_crawler/README.md b/solutions/system_design/web_crawler/README.md
index d95dc1071c67eedd8290ab07e23762ad79422bd7..08daa606bb12cc7a7307c95493786f3603445ff2 100644
--- a/solutions/system_design/web_crawler/README.md
+++ b/solutions/system_design/web_crawler/README.md
@@ -77,7 +77,7 @@ Handy conversion guide:
 
 ### Use case: Service crawls a list of urls
 
-We'll assume we have an initial list of `links_to_crawl` ranked initially based on overall site popularity. If this is not a reasonable assumption, we can seed the crawler with popular sites that link to outside content such as [Yahoo](https://www.yahoo.com/), [DMOZ](http://www.dmoz.org/), etc
+We'll assume we have an initial list of `links_to_crawl` ranked initially based on overall site popularity. If this is not a reasonable assumption, we can seed the crawler with popular sites that link to outside content such as [Yahoo](https://www.yahoo.com/), [DMOZ](https://dmoz-odp.org/), etc
 
 We'll use a table `crawled_links` to store processed links and their page signatures.
 