Task 4 - Beepboop - Robots.txt

Task 4 - Beepboop - Robots.txt

Where would "robots.txt" be located on the domain "ablog.com"

HINT: full path!

Reveal Flag ๐Ÿšฉ

๐Ÿšฉablog.com/robots.txt

If a website was to have a sitemap, where would that be located?

Reveal Flag ๐Ÿšฉ

๐Ÿšฉ/sitemap.xml

How would we only allow "Bingbot" to index the website?

Reveal Flag ๐Ÿšฉ

๐ŸšฉUser-agent: Bingbot

How would we prevent a "Crawler" from indexing the directory "/dont-index-me/"?

Reveal Flag ๐Ÿšฉ

๐ŸšฉDisallow: /dont-index-me/

What is the extension of a Unix/Linux system configuration file that we might want to hide from "Crawlers"?

HINT: system files are usually 3/4 characters!

Reveal Flag ๐Ÿšฉ

๐Ÿšฉ.conf

Last updated