You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Problem
The current implementation in aws_services.py creates temporary audio file
in S3 but only deletes them after successful transcription. If transcription
fails or the process is interrupted, these files remain in S3 indefinitely,
which can:
Lead to unnecessary storage costs
Create potential security risks with stored audio files
Violate data retention policies
Current Behavior
Audio files are uploaded to audiotranscribetemp bucket
Files are only deleted after successful transcription
Failed transcriptions leave orphaned files
The text was updated successfully, but these errors were encountered:
Implement automatic cleanup of temporary audio files in S3:
- Add 24-hour lifecycle policy for automatic file deletion
- Configure cleanup of incomplete multipart uploads
- Improve error handling and logging around file management
- Create new S3Service class for centralized S3 operations
- Add fallback to lifecycle policy when manual deletion fails
FixesGroupLang#166
Problem
The current implementation in aws_services.py creates temporary audio file
in S3 but only deletes them after successful transcription. If transcription
fails or the process is interrupted, these files remain in S3 indefinitely,
which can:
Lead to unnecessary storage costs
Create potential security risks with stored audio files
Violate data retention policies
Current Behavior
Audio files are uploaded to audiotranscribetemp bucket
Files are only deleted after successful transcription
Failed transcriptions leave orphaned files
The text was updated successfully, but these errors were encountered: