We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
利用 Kubernetes,应该可以很轻易地启动一个分布式爬虫任务。而如果将其实现成 crd 的方式,会变得非常 Kubernetes native,有更好的 scalability。可以基于 https://github.com/rmax/scrapy-redis 去尝试
skillset: 熟悉 Kubernetes,熟悉 scrapy,了解 redis
The text was updated successfully, but these errors were encountered:
Crawler as a service 🤔
Sorry, something went wrong.
这样的 craweler service 允许用户不写代码仅仅写yaml就能开启分布式爬虫任务了?比如在yaml中可以指定transformation规则?
🤔
yaml 里制定规则我觉得不太可行,毕竟不是图灵完备
No branches or pull requests
利用 Kubernetes,应该可以很轻易地启动一个分布式爬虫任务。而如果将其实现成 crd 的方式,会变得非常 Kubernetes native,有更好的 scalability。可以基于 https://github.com/rmax/scrapy-redis 去尝试
skillset: 熟悉 Kubernetes,熟悉 scrapy,了解 redis
The text was updated successfully, but these errors were encountered: