Focused crawls are collections of frequently-updated webcrawl data from narrow (as opposed to broad or wide) web crawls, often focused on a single domain or subdomain.
I think we should explore both: a Rust-backed implementation under a feature flag, and Python as an optional dependency. I can imagine that it would increase binary size quite a bit.
We can reduce friction by figuring out how to load data into Polars memory as efficiently as possible.