Troubleshooting

This page covers common problems and their solutions.

Installation Issues

orchcli init fails with “Docker not found”

Cause: Docker is not installed or not in your PATH.

Solution:

# Verify Docker is installed
docker --version

# If not installed, orchcli can auto-install it
# Or install manually: https://docs.docker.com/get-docker/

# Ensure Docker daemon is running
docker info

orchcli init hangs during repository cloning

Cause: Network issues or GitHub authentication problems.

Solution:

# Test GitHub connectivity
git ls-remote https://github.com/KubeOrch/core.git

# If using SSH, verify your key
ssh -T git@github.com

# If behind a proxy, configure git
git config --global http.proxy http://proxy:port

npm install fails for @kubeorch/cli

Cause: Node.js version too old or npm registry issues.

Solution:

# Requires Node.js 18+
node --version

# Clear npm cache and retry
npm cache clean --force
npm install -g @kubeorch/cli

Startup Issues

Services fail to start with “port already in use”

Cause: Another process is using port 8080 (core) or 3000 (ui).

Solution:

# Find what's using the port
# Linux/macOS
lsof -i :8080
lsof -i :3000

# Windows
netstat -ano | findstr :8080

# Kill the process or change the port in .env
PORT=8081 orchcli start

MongoDB connection refused

Cause: MongoDB container hasn’t finished starting or crashed.

Solution:

# Check MongoDB container status
docker compose ps mongo

# View MongoDB logs
docker compose logs mongo

# If the container keeps restarting, check disk space
df -h

# Remove corrupted data and restart (WARNING: data loss)
# docker compose down -v && docker compose up -d

Core backend crashes on startup with “JWT_SECRET required”

Cause: Missing required environment variables.

Solution:

# Generate and set required secrets
export JWT_SECRET=$(openssl rand -hex 32)
export ENCRYPTION_KEY=$(openssl rand -hex 16)

# Or add to your .env file
echo "JWT_SECRET=$(openssl rand -hex 32)" >> .env
echo "ENCRYPTION_KEY=$(openssl rand -hex 16)" >> .env

Cluster Connection Issues

”Unable to connect to cluster” error

Cause: Invalid credentials, network issues, or expired tokens.

Solutions:

Verify the cluster is reachable:

kubectl cluster-info

Check token validity:

# If using service account token
kubectl get secret <sa-secret> -o jsonpath='{.data.token}' | base64 -d

Test from the KubeOrch container:

docker compose exec core wget -qO- --no-check-certificate \
  https://<cluster-api-server>:6443/healthz

For minikube users: Ensure the API server is accessible from Docker:

# Get minikube IP
minikube ip

# Use the IP instead of localhost when adding the cluster in KubeOrch

Cluster shows “Unhealthy” status

Cause: Health check failing (runs every 60 seconds).

Solution:

# Check cluster health manually
kubectl get nodes
kubectl get componentstatuses

# Check KubeOrch logs for health check errors
docker compose logs core | grep -i "health"

Workflow Issues

Workflow deployment fails with “namespace not found”

Cause: The target namespace doesn’t exist on the cluster.

Solution:

kubectl create namespace <namespace-name>

Or use the default namespace when deploying workflows.

Nodes show “Error” state after deployment

Cause: Kubernetes resource creation failed.

Solution:

Click the node to view the error details in the diagnostics panel
Use the auto-fix suggestions if available
Check the Kubernetes events:

kubectl get events -n <namespace> --sort-by=.metadata.creationTimestamp

Real-time status not updating

Cause: SSE connection dropped or resource watcher not running.

Solution:

Refresh the browser page to re-establish SSE connection
Check core logs for watcher errors:

docker compose logs core | grep -i "watcher\|sse"

Build Issues

Container build fails with “Nixpacks error”

Cause: Nixpacks cannot detect the project type or build configuration.

Solution:

Ensure the Git repository has a recognized project structure
Check that the repository URL is accessible from the KubeOrch container
View build logs for specific errors in the Build section

Build logs not streaming

Cause: WebSocket connection issue.

Solution:

Check browser console for WebSocket errors
Ensure no proxy is blocking WebSocket connections
If using nginx, add WebSocket support:

location /ws {
    proxy_pass http://core:8080;
    proxy_http_version 1.1;
    proxy_set_header Upgrade $http_upgrade;
    proxy_set_header Connection "upgrade";
}

Authentication Issues

Cause: Wrong email/password or account doesn’t exist.

Solution:

Verify the email is correct
If first time, register a new account at /register
The first registered user automatically gets admin privileges

Cause: OAuth provider misconfiguration.

Solution:

Verify the OAuth callback URL matches your deployment URL
Check the provider configuration in environment variables:

# Example for GitHub OAuth
OAUTH_GITHUB_CLIENT_ID=your-client-id
OAUTH_GITHUB_CLIENT_SECRET=your-client-secret
OAUTH_GITHUB_CALLBACK_URL=https://your-domain.com/v1/api/auth/oauth/github/callback

Check core logs for OAuth errors:

docker compose logs core | grep -i "oauth"

JWT token expired errors

Cause: Token TTL exceeded (default: 24 hours).

Solution: The UI should automatically refresh tokens. If issues persist:

Clear browser local storage
Log out and log back in
Check that the TOKEN_TTL environment variable is set correctly

Performance Issues

UI is slow or unresponsive

Possible causes and solutions:

Too many nodes on canvas: Simplify workflows or split into smaller ones
Browser memory: Close other tabs, check browser task manager
API latency: Check core response times in browser network tab

API responses are slow

Possible causes and solutions:

MongoDB queries: Check MongoDB logs for slow queries

docker compose logs mongo | grep -i "slow"

Resource watcher overload: Too many clusters being monitored
Container resources: Increase Docker memory/CPU limits

Getting Help

If your issue isn’t listed here:

Search existing issues on GitHub
Open a new issue with:
- KubeOrch version
- Steps to reproduce
- Error logs (docker compose logs)
- Environment details (OS, Docker version, Kubernetes version)
Join the community on Slack: #kubeorch

Troubleshooting

Installation Issues

orchcli init fails with “Docker not found”

orchcli init hangs during repository cloning

npm install fails for @kubeorch/cli

Startup Issues

Services fail to start with “port already in use”

MongoDB connection refused

Core backend crashes on startup with “JWT_SECRET required”

Cluster Connection Issues

”Unable to connect to cluster” error

Cluster shows “Unhealthy” status

Workflow Issues

Workflow deployment fails with “namespace not found”

Nodes show “Error” state after deployment

Real-time status not updating

Build Issues

Container build fails with “Nixpacks error”

Build logs not streaming

Authentication Issues

”Invalid credentials” on login

OAuth login redirects to error page

JWT token expired errors

Performance Issues

UI is slow or unresponsive

API responses are slow

Getting Help