Generate env specific robots.txt by sarangj · Pull Request #507 · NYPL/digital-collections

sarangj · 2026-01-14T19:47:55Z

Ticket:

JIRA ticket DR-3986

This PR does the following:

Use the nextJS robots.ts setup to create separate robots.txt files per environment. For non-prod environments, set a disallow on everything to prevent google from indexing qa.

Open questions

How has this been tested? How should a reviewer test this?

See the generated robots file on vercel: https://digital-collections-git-robots-ts-nypl.vercel.app/robots.txt

Accessibility concerns or updates

Checklist:

I have added relevant accessibility documentation for this pull request.
All new and existing tests passed.
I have updated the CHANGELOG.md.

Use the nextJS `robots.ts` setup to create separate `robots.txt` files per environment. For non-prod environments, set a disallow on everything to prevent google from indexing qa.

vercel · 2026-01-14T19:48:00Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Review	Updated (UTC)
digital-collections	Ready	Preview, Comment	Jan 14, 2026 7:50pm

Copilot

Pull request overview

This PR implements environment-specific robots.txt generation using Next.js's robots.ts functionality to prevent search engine indexing of non-production environments while maintaining selective bot blocking in production.

Changes:

Removed static robots.txt file from public directory
Added dynamic robots.ts handler that generates environment-specific rules
Configured complete disallow for non-production environments to prevent indexing

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
public/robots.txt	Removed static robots.txt file to enable dynamic generation
app/robots.ts	Added dynamic robots handler with environment-specific rules

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-01-15T15:29:51Z

+    return {
+      rules: [
+        {
+          userAgent: "Bytedance",
+          disallow: "/",
+        },
+        {
+          userAgent: "Bytespider",
+          disallow: "/",
+        },
+        {
+          userAgent: "GPTBot",
+          disallow: "/",
+        },
+      ],
+    };


The production rules block specific bots but don't specify behavior for other crawlers. This means bots like Googlebot will have unrestricted access by default. Consider adding an explicit rule for '*' user agent with 'allow: /' to make the intent clear, or add 'allow' rules for the blocked bots if they should only be blocked from specific paths.

gonna ignore this to maintain the exact existing robots.txt

Copilot · 2026-01-15T15:29:51Z

+    rules: {
+      userAgent: "*",
+      disallow: "/",
+    },


The rules property should be an array to match the MetadataRoute.Robots type definition and maintain consistency with the production configuration. Change 'rules: {' to 'rules: [{' and close with '}]'.

Suggested change

rules: {

userAgent: "*",

disallow: "/",

},

rules: [

{

userAgent: "*",

disallow: "/",

},

],

The Robots type can be an array or single object, gonna ignore this

Generate env specific robots.txt

61145bf

Use the nextJS `robots.ts` setup to create separate `robots.txt` files per environment. For non-prod environments, set a disallow on everything to prevent google from indexing qa.

vercel Bot deployed to Preview January 14, 2026 19:50 View deployment

sarangj marked this pull request as ready for review January 14, 2026 19:52

sarangj requested review from avertrees, chrisb87, emu47 and keithbauer as code owners January 14, 2026 19:52

emu47 approved these changes Jan 15, 2026

View reviewed changes

emu47 requested a review from Copilot January 15, 2026 15:29

Copilot AI reviewed Jan 15, 2026

View reviewed changes

sarangj merged commit f178f08 into qa Jan 15, 2026
5 checks passed

sarangj deleted the robots-ts branch January 15, 2026 18:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Generate env specific robots.txt#507

Generate env specific robots.txt#507
sarangj merged 1 commit into
qafrom
robots-ts

sarangj commented Jan 14, 2026 •

edited by atlassian Bot

Loading

Uh oh!

vercel Bot commented Jan 14, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jan 15, 2026

Uh oh!

sarangj Jan 15, 2026

Uh oh!

Copilot AI Jan 15, 2026

Uh oh!

sarangj Jan 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

sarangj commented Jan 14, 2026 • edited by atlassian Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Ticket:

This PR does the following:

Open questions

How has this been tested? How should a reviewer test this?

Accessibility concerns or updates

Checklist:

Uh oh!

vercel Bot commented Jan 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

sarangj Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

sarangj Jan 15, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sarangj commented Jan 14, 2026 •

edited by atlassian Bot

Loading

vercel Bot commented Jan 14, 2026 •

edited

Loading