-
-
Notifications
You must be signed in to change notification settings - Fork 115
Add download archiving system with LocalFilesystem provider #1815
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
…iders Signed-off-by: Varsha U N <[email protected]>
@VarshaUN thanks for the PR, you need to address a few overall issues before we can start reviewing the code in more details, see comments below for these:
Couple of issues with general direction of the PR as discussed in #1685 (comment):
Presently input archives are downloaded with scancode.io/scanpipe/pipelines/__init__.py Line 130 in 9d41ad3
/input/ directory for each project as specified in scancode.io/scanpipe/models.py Line 573 in 9d41ad3
|
Signed-off-by: Varsha U N <[email protected]>
Signed-off-by: Varsha U N <[email protected]>
Signed-off-by: Varsha U N <[email protected]>
Signed-off-by: Varsha U N <[email protected]>
This reverts commit 87c81bd.
This reverts commit cd04f3f1062f3ac8c78af3a7b0ed042633f5b375.
This reverts commit b6d2342873168e53865e8f39185a9602de191b7f.
This reverts commit ca2f49f505bd5c951b5f270d4b218a69848a6de9.
Signed-off-by: Varsha U N <[email protected]>
Signed-off-by: Varsha U N <[email protected]>
Signed-off-by: Varsha U N <[email protected]>
Signed-off-by: Varsha U N <[email protected]>
Signed-off-by: Varsha U N <[email protected]>
Signed-off-by: Varsha U N <[email protected]>
As per discussed with mentors , I have added the following ,
archiving.py
Added DownloadStore abstract base class and implementations (LocalFilesystemProvider, S3LikeProvider, SftpProvider) for storing downloads with SHA256-based deduplication and metadata.
settings.py
Initialized download_store based on DOWNLOAD_ARCHIVING_PROVIDER (localstorage, s3, sftp) with configuration validation and error logging.
input.py
Added add_input_from_url and add_input_from_upload to archive URL downloads and uploaded files using download_store, with fallback to project input directory when archiving is disabled. Integrate with InputSource model for metadata storage.
Enhances input handling for pipelines, supporting deduplicated storage and retrieval of inputs across local, S3, and SFTP backends.
Still in progress ,
Signed-off-by : Varsha U N [email protected]