AI Chat
Master conversational server management with Hive's AI Chat interface.
Starting a Conversation
Example Conversations
Investigating High CPU
PID USER CPU% COMMAND
12847 www 89% php-fpm: pool www
The process is making excessive database calls. The /api/search endpoint is running an unoptimized query scanning 2 million rows.
Recommendations:
- Add database index on products.name column
- Implement pagination on search results
- Consider caching frequent searches
Disk Space Emergency
/var/log → 45 GB (67% of used space)
/var/lib → 12 GB
/home → 8 GB
Found: /var/log/nginx/access.log → 38 GB (single file!)
This nginx access log hasn't been rotated and grew to 38GB.
I can help you:
- Safely truncate this log file
- Set up log rotation to prevent this
Service Won't Start
mysql.service - MySQL Server
Status: failed (Result: exit-code)
[ERROR] InnoDB: Unable to lock ./ibdata1 error: 11
Another MySQL process is holding the lock file.
Found: Zombie MySQL process (PID 8834) still holding the lock
To fix this, I need to:
- Kill the zombie process
- Remove stale socket file
- Start MySQL service
Kubernetes Pod Troubleshooting
NAME READY STATUS RESTARTS
payment-service-7d4b8c9f6-x2k4m 0/1 CrashLoopBackOff 5
FATAL: Unable to connect to database at db.internal:5432
Connection refused
Found the issue: The service selector doesn't match the pod labels.
Service has selector "app: database" but pod has label "app: postgres"
Fix: Update the service selector to match the pod label.
Docker Container Investigation
CONTAINER ID IMAGE STATUS NAMES
a3b4c5d6e7f8 redis:7 Exited (137) 2 minutes ago redis-cache
Out of memory trying to allocate 16384 bytes
Root cause: Redis has 128MB memory limit but dataset exceeded this.
Recommendations:
- Increase container memory limit to 512MB
- Configure Redis maxmemory policy
- Restart the container
Nginx 502 Bad Gateway
[error] connect() failed (111: Connection refused) while connecting to upstream, upstream: "http://127.0.0.1:3000"
Error: Cannot find module '/var/www/app/server.js'
Root cause: The Node.js app file /var/www/app/server.js is missing.
Last modified: 2 hours ago (possible bad deployment?)
Recommendations:
- Check your deployment - server.js may not have been deployed
- Roll back to previous version if available
Full-Stack Investigation
Found: "PaymentService: Connection timeout to payment-api:8080"
Error: Redis connection failed - max retries exceeded
Redis hit memory limit → Payment service couldn't write →
Payment service crashed → Checkout API returns 500
Fix sequence:
- Increase Redis memory limit to 4GB
- Restart Redis container
- Restart payment-api container
Security Investigation
192.168.1.105 → 312 attempts (internal IP - concerning!)
45.33.32.156 → 245 attempts (external)
167.99.123.45 → 180 attempts (external)
- admin from 10.0.0.5 (3 times) ← Your office IP
- deploy from 10.0.0.10 (12 times) ← CI/CD server
✓ No unauthorized access detected
⚠️ Brute force attempts ongoing (847 in 24h)
⚠️ Internal IP 192.168.1.105 needs investigation
Recommendations:
- Install fail2ban to block brute force attempts
- Investigate the internal IP
- Consider disabling password auth (use keys only)
Capabilities
Hive helps with a wide range of DevOps, SRE, and system administration tasks. Use it to investigate issues, inspect systems, and assist with guided operational work.
Server Health & Monitoring
| Task | What Hive Does |
|---|---|
| Health check | Checks CPU, memory, disk, services, network |
| Performance analysis | Identifies bottlenecks and resource hogs |
| Service status | Checks if critical services are running |
| Uptime report | Shows system uptime and recent reboots |
Troubleshooting
| Task | What Hive Does |
|---|---|
| Error investigation | Finds errors in logs, traces root causes |
| Service failures | Diagnoses why services won't start |
| Performance issues | Finds slow queries, memory leaks, CPU hogs |
| Connectivity problems | Tests network, DNS, ports, firewalls |
Log Analysis
| Task | What Hive Does |
|---|---|
| Find errors | Searches logs for errors and exceptions |
| Track patterns | Identifies recurring issues |
| Filter by time | Shows logs from specific time periods |
| Correlate events | Connects related events across log files |
System Administration
| Task | What Hive Does |
|---|---|
| Service management | Start, stop, restart services (with approval) |
| Process control | Find and manage running processes |
| File operations | Find, view, and manage files |
| Configuration | Check and update config files |
Kubernetes & Containers
| Task | What Hive Does |
|---|---|
| Pod management | Check pod status, logs, describe pods, restart pods |
| Deployments | View deployments, rollout status, scale replicas |
| Services & Ingress | Check service endpoints, ingress configurations |
| Docker containers | List containers, view logs, inspect, exec into containers |
| Container debugging | Investigate why containers crash, check exit codes |
Web Servers & Proxies
| Task | What Hive Does |
|---|---|
| Nginx/Apache | Check config syntax, view access/error logs, reload |
| SSL/TLS | Check certificate expiry, debug SSL issues |
| Load balancers | Verify upstream health, check configurations |
| Reverse proxy | Debug routing issues, check headers |
Databases
| Task | What Hive Does |
|---|---|
| MySQL/PostgreSQL | Check status, show slow queries, connection issues |
| Redis/MongoDB | Check memory usage, connection count, replication |
| Query analysis | Find expensive queries, check locks, deadlocks |
| Backup verification | Check backup status, verify dump files |
Networking & Cloud
| Task | What Hive Does |
|---|---|
| DNS debugging | Check resolution, dig, nslookup |
| Connectivity | Test ports, trace routes, check firewalls |
| Cloud CLIs | Run aws, gcloud, az commands for cloud resources |
| VPN/Tunnels | Check tunnel status, debug connectivity |
Security
| Task | What Hive Does |
|---|---|
| Login audit | Shows who logged in and when |
| Failed attempts | Identifies brute force attacks |
| Open ports | Lists network services and ports |
| File permissions | Checks for insecure permissions |
| Vulnerability scanning | Check for outdated packages, exposed services |
Multi-Step Investigation
Hive automatically continues investigating until it finds the root cause:
Context Awareness
Hive remembers your conversation and understands context:
✓ Nginx is running, listening on ports 80 and 443
Hive understands you mean "nginx config"
Server blocks: 3, SSL: Enabled, Gzip: Enabled, Config test: ✓ OK
Hive understands you want to compare nginx between servers
db-01 doesn't have nginx installed. Did you mean MySQL?
Approval Workflow
When Hive needs to make changes, you're always in control:
Learn more about Safety & Security.