Dealing with good crawlers #88

@jl-678

Hi,

I am running the latest version and see the dashboard overflowing with traffic from "good crawlers." I see 50,000 hits from Microsoft and AWS crawlers, which skews the dashboards. What is the best way to manage this? I know that there are many new parameters in v1, but I cannot figure out how to use them most effectively to address something like this.

Ideally, I would want to configure Krawl to limit the traffic served to "good crawlers" and maximize the resources wasted by bad ones. Is there a way to do that with the new parameters?

As an aside, the table in the repo lists all the parameters, but I think we could use further clarification about what exactly they mean and why someone would change them, maybe even with some examples. I would be willing to help with something like that, but I am not sure where to start in understanding them.

TIA!
