Problem Description
The nexus deployment is consistently failing with Out of Memory (OOM) errors during the container startup process. The failures occur when downloading and installing Python dependencies.
Error Details
- Error Type: Out of Memory (OOM)
- Frequency: Multiple consecutive failures (20+ attempts in ~5 minutes)
- Stage: During dependency installation in container startup
- Total Dependencies: 237 packages being resolved
Large Dependencies Identified
From the build logs, several large packages are being downloaded:
- speechrecognition (31.3MB)
- pymupdf (22.9MB)
- pythonmonkey-fork (20.6MB)
- numpy (17.1MB)
- onnxruntime (15.6MB)
- magika (14.4MB)
- pandas (12.1MB)
- botocore (13.0MB)
- ruff (11.0MB)
Proposed Solution
Phase 1: Verify CI Feasibility
Phase 2: Implement Containerized Approach
Expected Benefits
- Predictable Resource Usage: Containerized builds provide better memory management
- Faster Deployments: Pre-built containers reduce startup time
- Better Debugging: Container logs provide clearer error tracking
- Scalability: Easier to adjust resource limits as needed
Current Impact
- Deployment success rate: 0% (all recent attempts failing)
- Service availability: Severely impacted
- Development workflow: Blocked
Environment
- Platform: Current hosting environment
- Python Version: 3.12.7
- Deployment Region: asia-southeast1-eqsg3a
- Container Status: Crashed due to OOM
Problem Description
The nexus deployment is consistently failing with Out of Memory (OOM) errors during the container startup process. The failures occur when downloading and installing Python dependencies.
Error Details
Large Dependencies Identified
From the build logs, several large packages are being downloaded:
Proposed Solution
Phase 1: Verify CI Feasibility
Phase 2: Implement Containerized Approach
Expected Benefits
Current Impact
Environment