An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: robots-txt

php-middleware/block-robots

PSR-7 middleware that prevents search engine indexing using robots.txt and X-Robots-Tag

Language: PHP - Size: 13.7 KB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 5 - Forks: 0

useflyyer/robots Fork of ArcaneDigital/parse-robots

Super lightweight plain TypeScript parser for robots.txt with 0 dependencies.

Language: TypeScript - Size: 320 KB - Last synced at: 29 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

slemarchand/no-robots

🚫🤖 Override /robots.txt to disallow all web crawlers, regardless of settings stored in the database. Compatible with Liferay 7.0, 7.1, 7.2, 7.3 and 7.4.

Language: Java - Size: 55.7 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

tekord/robots-txt-provider

Provides various framework-agnostic ways to generate the contents of the robots.txt file

Language: PHP - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

josecarneiro/mr-roboto

🤖 Handle and parse a site's robots.txt file and extract actionable information

Language: JavaScript - Size: 36.1 KB - Last synced at: 16 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

benjaminestes/robots

Package robots implements robots.txt file parsing and matching based on Google's specification.

Language: Go - Size: 38.1 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 2

VorticonCmdr/robotstxt

Chrome extension that blocks URLs based on robots.txt (compatible with Chrome 41)

Language: JavaScript - Size: 200 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

hgruniaux/robotstxt Fork of google/robotstxt

The repository contains a Google-based robots.txt parser and matcher as a C++ library (compliant with C++17).

Language: C++ - Size: 80.1 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

rimiti/robotstxt

Robots.txt parser and generator - Work in progress

Language: Go - Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 0

infinityloop-dev/robots

🔧 Robots.txt generator component for Nette framework.

Language: PHP - Size: 19.5 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 1

amandeepmittal/robotize

Generates a robots.txt

Language: JavaScript - Size: 13.7 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

honzahommer/request-robots

An Express.js middleware for handling noisy robots.txt

Language: JavaScript - Size: 1.76 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

pierre-pvln/robots_creator

Scripts to create a robots.txt file from building blocks

Language: Batchfile - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

thomasleveil/pyrobots

Python binding for Google's robots.txt parser C++ library

Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

eghuro/crawlcheck

Extensible web crawler

Language: Python - Size: 3.01 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

hrbrmstr/robotify

🤖 Browser extension to check for and preview a site's robots.txt in a new tab (if it exists)

Language: JavaScript - Size: 14.6 KB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

GeorgeA93/crawley

Node.js web crawler

Language: JavaScript - Size: 70.3 KB - Last synced at: 2 months ago - Pushed at: over 8 years ago - Stars: 3 - Forks: 0

mounicmadiraju/robot.txt-changes

💧 Test your robots.txt with this testing tool. Check if a URL is blocked, which statement is blocking it, and for which user agent. You can also check if the resources for the page (CSS and JavaScript) are disallowed! Robots.txt files help you guide how search engines crawl your site, and can be an integral part of your SEO strategy.

Language: Python - Size: 10.7 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

barryceelen/wp-robositemap

Manage robots.txt and sitemap.xml via the WordPress admin

Language: PHP - Size: 5.86 KB - Last synced at: 6 days ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

aankur/robot-webpack-plugin

A Webpack 3 plugin for generating a robots.txt file

Language: JavaScript - Size: 23.4 KB - Last synced at: 25 days ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

josecarneiro/robotic

A robots.txt-generating Express middleware

Language: JavaScript - Size: 29.3 KB - Last synced at: 18 days ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

james-see/random-robots-txt

Generates a random robots.txt deny list to throw script kiddies off the scent.

Language: Python - Size: 9.77 KB - Last synced at: 8 months ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

FlorianWendelborn/robogen

🤖 Robots.txt generator done right.

Language: JavaScript - Size: 3.91 KB - Last synced at: 4 months ago - Pushed at: over 8 years ago - Stars: 3 - Forks: 0

mets634/robots-scanner

A program to scan a website for hidden files using the robots.txt file.

Language: Python - Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0