Check-in [3bbd18a284]

Overview
Comment: Improve the recommended value for robot-exception.
SHA3-256: 3bbd18a284c8bd6a8c1d46a6faa7ee33142e5fec53bb64792aab53d3c84e57fd
User & Date: drh 2025-10-03 17:27:09.423
Context
2025-10-03
17:31
Fix spelling errors found by the Debian QA tools. check-in: 7db623e5eb user: danield tags: trunk
17:27
Improve the recommended value for robot-exception. check-in: 3bbd18a284 user: drh tags: trunk
16:05
Allow URLs of the form /tarball/TAG/VERSION/NAME.tar.gz for the tarball page, where TAG must be a non-branch tag associated with VERSION. This is so that the robot-exception pattern can match things like /tarball/release while still allowing access to historical releases via the VERSION section of the URL. check-in: 57c8bec816 user: drh tags: trunk
Changes
Changes to src/setup.c.
  @ (Property: robot-restrict)
  @ <br>
  textarea_attribute("", 2, 80,
      "robot-restrict", "rbrestrict", robot_restrict_default(), 0);

  @ <hr>
  @ <p><b>Exceptions to anti-robot restrictions</b><br>
  @ The entry below is a list of regular expressions, one per line.

  @ If any of these regular expressions match the input URL, then the
  @ request is exempt from anti-robot defenses.  Use this, for example,
  @ to allow scripts to download release tarballs using a pattern
  @ like:</p>
  @ <p>
  @ &emsp;&emsp;<tt>^/tarball\b.*\b(version-|release)\b</tt>
  @ <p>The pattern should match against the REQUEST_URI with the
  @ SCRIPT_NAME prefix removed, and with QUERY_STRING appended following
  @ a "?" if QUERY_STRING exists.  (Property: robot-exception)<br>
  textarea_attribute("", 3, 80,
      "robot-exception", "rbexcept", "", 0);

  @ <hr>







  @ (Property: robot-restrict)
  @ <br>
  textarea_attribute("", 2, 80,
      "robot-restrict", "rbrestrict", robot_restrict_default(), 0);

  @ <hr>
  @ <p><b>Exceptions to anti-robot restrictions</b><br>
  @ The entry below is a list of 
  @ <a href="%R/re_rules">regular expressions</a>, one per line.
  @ If any of these regular expressions match the input URL, then the
  @ request is exempt from anti-robot defenses.  Use this, for example,
  @ to allow scripts to download release tarballs using a pattern
  @ like:</p>
  @ <p>
  @ &emsp;&emsp;<tt>^/tarball/(version-[0-9.]+|release)/</tt>
  @ <p>The pattern should match against the REQUEST_URI with the
  @ SCRIPT_NAME prefix removed, and with QUERY_STRING appended following
  @ a "?" if QUERY_STRING exists.  (Property: robot-exception)<br>
  textarea_attribute("", 3, 80,
      "robot-exception", "rbexcept", "", 0);

  @ <hr>
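
The help text above describes two steps: building the string to match (REQUEST_URI with the SCRIPT_NAME prefix removed, plus "?" and QUERY_STRING when one exists) and testing it against a list of regular expressions, one per line. The following is a minimal sketch of that logic, not Fossil's actual implementation. It assumes POSIX extended regular expressions from <regex.h> as a stand-in for Fossil's own regular expression engine, assumes the REQUEST_URI passed in does not already contain the query string, and uses hypothetical helper names (build_match_input, example_exception_match) and hypothetical URLs.

```c
#include <regex.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper: build the string the patterns are matched against.
** Strips the SCRIPT_NAME prefix from zUri (assumed to carry no query
** string) and appends "?" plus zQuery when zQuery is non-empty.
** Returns 0 on success, -1 if the result does not fit in zOut. */
static int build_match_input(const char *zUri, const char *zScript,
                             const char *zQuery, char *zOut, size_t nOut){
  size_t nScript = strlen(zScript);
  int n;
  if( strncmp(zUri, zScript, nScript)==0 ) zUri += nScript;
  if( zQuery && zQuery[0] ){
    n = snprintf(zOut, nOut, "%s?%s", zUri, zQuery);
  }else{
    n = snprintf(zOut, nOut, "%s", zUri);
  }
  return (n>=0 && (size_t)n<nOut) ? 0 : -1;
}

/* Hypothetical helper: return 1 if any line of zPatterns (one POSIX ERE
** per line) matches zUrl, else 0.  Empty lines, overlong lines, and
** patterns that fail to compile are skipped. */
static int example_exception_match(const char *zPatterns, const char *zUrl){
  const char *z = zPatterns;
  while( *z ){
    const char *zEnd = strchr(z, '\n');
    size_t n = zEnd ? (size_t)(zEnd - z) : strlen(z);
    char zRe[256];
    if( n>0 && n<sizeof(zRe) ){
      regex_t re;
      memcpy(zRe, z, n);
      zRe[n] = 0;
      if( regcomp(&re, zRe, REG_EXTENDED|REG_NOSUB)==0 ){
        int rc = regexec(&re, zUrl, 0, 0, 0);
        regfree(&re);
        if( rc==0 ) return 1;
      }
    }
    z += n;
    if( *z=='\n' ) z++;
  }
  return 0;
}
```

With the new recommended pattern, a request such as /tarball/release/NAME.tar.gz or /tarball/version-2.26/NAME.tar.gz would be exempt under this sketch, while /tarball/trunk/NAME.tar.gz would not, because the pattern now requires the tag to appear as the path segment immediately after /tarball/.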