-
Notifications
You must be signed in to change notification settings - Fork 1.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Git Sync improvement #1190
Comments
I think this idea is quite making sense and doable. What we can implement is something like "virtual spider", a spider that links to some subdirectories of a git repo. Welcome to hear your feedback and ideas if you have any suggestion. |
Hello. It is also doable. No idea how complex is following this or that way to make it work, actually. The way I suggested was the way "I did" with bash scripting. Thanks, |
Perhaps I didn't convey my idea clearly. You can follow this issue and a proposed model/process on file management in Crawlab will be implemented later this or the month after. |
There is also a common use case in Scrapy where multiple spiders are hosted in a single folder. This allows spiders to reuse common pipelines and middlewares in a shared code base:
If we have a single codebase with 100 spiders in them, at the moment we need to clone the same repository 100 times, and if there is an update to the repository, we need to pull changes 100x. It would be really nice if we could point spiders to the same code base, but this would mean to decouple the concept of files / git repos from spiders. |
请描述该需求尝试解决的问题
Hello,
I'd like to suggest to improve git sync functionality in order to make it possible for scenarios where there are dozens (or even hundreds) of spiders. Currently the functionality requires that each spiders has its own repository. In such scenarios mentioned before, I would have too much repositories to make it doable.
Currently I'm using as workaround bash script + crontab job to pull data from one single repository on github, where each spider has its own folder, same structure that is found on
/app/spiders
path.Best,
Eliton
The text was updated successfully, but these errors were encountered: