Description
Currently, lychee faces challenges with rate limiting and cache effectiveness when checking links, particularly when dealing with multiple requests to the same hosts. This leads to several issues that need to be addressed:
Current Problems
- Multiple concurrent requests to the same host trigger rate limits (429 errors); see #989 ("Add custom delay inbetween requests (prevent ban)")
- The cache is ineffective with high concurrency due to race conditions; see #1593 (comment) ("The cache is ineffective with the default concurrency, for links in a website's theme")
- Global concurrency settings are too coarse-grained
- Different hosts have different rate limit requirements
- Headers are applied to all hosts, causing potential security issues; see #1298 ("Security: restrict custom HTTP request headers to specific URL patterns") and #1441 (comment) ("custom Header not sent")
Proposed Solution
We should implement a smart per-host rate limiting and caching system that would:
- Track rate limits per host using a concurrent HashMap:

```rust
use std::collections::HashMap;
use std::time::Duration;
use time::OffsetDateTime;

// Per-host settings, kept in a map keyed by host name,
// e.g. HashMap<String, HostConfig> behind a concurrency-safe wrapper.
struct HostConfig {
    rate_limit_reset: Option<OffsetDateTime>,
    request_delay: Option<Duration>,
    max_concurrent_requests: Option<u32>,
}
```

- Implement smarter caching:
- Maintain separate cache states per host
- Stretch goal: Add configuration options per host:

```sh
lychee --max-concurrency-per-host github.com=10 --delay-per-host github.com=100ms
```

- Stretch goal II: Add support for per-host headers
The idea would be to maintain a per-host `HeaderMap`.
See #1297 for details.
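To make the proposal above concrete, here is a minimal sketch of per-host state tracking behind a shared map. The names (`HostState`, `HostLimiter`, `set_delay`, `delay_for`) are hypothetical, not part of lychee; a real implementation might prefer a concurrent map such as `DashMap` over `Mutex<HashMap<..>>` to reduce lock contention.

```rust
use std::collections::HashMap;
use std::sync::Mutex;
use std::time::Duration;

// Hypothetical per-host state; fields mirror the HostConfig sketch above
// (rate_limit_reset omitted to keep this std-only).
#[derive(Default)]
struct HostState {
    request_delay: Option<Duration>,
    max_concurrent_requests: Option<u32>,
}

// Shared map keyed by host name.
struct HostLimiter {
    hosts: Mutex<HashMap<String, HostState>>,
}

impl HostLimiter {
    fn new() -> Self {
        Self { hosts: Mutex::new(HashMap::new()) }
    }

    // Record a delay learned for a host (e.g. after a 429 response).
    fn set_delay(&self, host: &str, delay: Duration) {
        let mut hosts = self.hosts.lock().unwrap();
        hosts.entry(host.to_string()).or_default().request_delay = Some(delay);
    }

    // Look up the delay to apply before the next request to this host.
    fn delay_for(&self, host: &str) -> Option<Duration> {
        self.hosts.lock().unwrap().get(host).and_then(|s| s.request_delay)
    }
}

fn main() {
    let limiter = HostLimiter::new();
    limiter.set_delay("github.com", Duration::from_millis(100));
    assert_eq!(limiter.delay_for("github.com"), Some(Duration::from_millis(100)));
    assert_eq!(limiter.delay_for("example.com"), None);
}
```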
Implementation Notes
- Use the existing `rate-limits` crate, which is mostly useful for APIs
- Handle 429 responses with proper backoff using response headers when available
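As a sketch of the backoff note above: RFC 9110 allows a 429 response to carry a `Retry-After` header with a delay in seconds (an HTTP-date form also exists, omitted here for brevity). The function name and the fallback parameters are illustrative assumptions, not lychee's actual behavior.

```rust
use std::time::Duration;

// Choose a backoff for a 429 response. Honor Retry-After (seconds form)
// when present; otherwise fall back to capped exponential backoff.
fn backoff_for_429(retry_after: Option<&str>, attempt: u32) -> Duration {
    match retry_after.and_then(|v| v.parse::<u64>().ok()) {
        // The server told us how long to wait: respect it.
        Some(secs) => Duration::from_secs(secs),
        // No usable header: 100ms * 2^attempt, capped at 2^6.
        None => Duration::from_millis(100 * 2u64.pow(attempt.min(6))),
    }
}

fn main() {
    assert_eq!(backoff_for_429(Some("30"), 0), Duration::from_secs(30));
    assert_eq!(backoff_for_429(None, 2), Duration::from_millis(400));
}
```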
Benefits
- Prevents IP bans from aggressive checking
- More efficient resource usage
- Better compliance with API rate limits
- Improved cache effectiveness: with a separate cache per host, there would be no cross-host synchronization issues
- Faster overall execution by avoiding unnecessary retries
Examples
```toml
[hosts."github.com"]
max_concurrent_requests = 10
request_delay = "100ms"
headers = { Authorization = "token ghp_xxxx", "User-Agent" = "my-bot" }

[hosts."api.example.com"]
max_concurrent_requests = 1
request_delay = "1s"
headers = { "X-API-Key" = "secret", Accept = "application/json" }
```

CLI usage example:
```sh
lychee --max-concurrency-per-host github.com=10 --delay-per-host github.com=100ms
```

And when adding headers:
```sh
lychee \
  --max-concurrency-per-host github.com=10 \
  --delay-per-host github.com=100ms \
  --headers-per-host 'github.com=Authorization:token ghp_xxxx,User-Agent:my-bot'
```

This is just a proposal. I'm not 100% certain about the naming yet.
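Whatever the final option names, each of the proposed flags takes a `host=value` pair, so they could share one small parser. This is a hypothetical sketch, not lychee's actual argument handling:

```rust
// Parse a "host=value" CLI pair such as "github.com=10".
// Returns None for malformed input (missing '=' or empty parts).
fn parse_host_pair(arg: &str) -> Option<(&str, &str)> {
    let (host, value) = arg.split_once('=')?;
    if host.is_empty() || value.is_empty() {
        return None;
    }
    Some((host, value))
}

fn main() {
    assert_eq!(parse_host_pair("github.com=10"), Some(("github.com", "10")));
    assert_eq!(parse_host_pair("github.com=100ms"), Some(("github.com", "100ms")));
    assert_eq!(parse_host_pair("no-equals"), None);
}
```

The header flag's value (`Authorization:token ghp_xxxx,User-Agent:my-bot`) would then be split further on `,` and `:` per pair.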