Task 4 - Beepboop - Robots.txt

Task 4 - Beepboop - Robots.txt

Where would "robots.txt" be located on the domain "ablog.com"

HINT: full path!

Reveal Flag 🚩

🚩ablog.com/robots.txt

If a website was to have a sitemap, where would that be located?

Reveal Flag 🚩

🚩/sitemap.xml

How would we only allow "Bingbot" to index the website?

Reveal Flag 🚩

🚩User-agent: Bingbot

How would we prevent a "Crawler" from indexing the directory "/dont-index-me/"?

Reveal Flag 🚩

🚩Disallow: /dont-index-me/

What is the extension of a Unix/Linux system configuration file that we might want to hide from "Crawlers"?

HINT: system files are usually 3/4 characters!

Reveal Flag 🚩

🚩.conf

PreviousTask 2 - Let's Learn About Crawlers NextTask 5 - Sitemaps

Last updated 3 years ago