Academic & Research
This guide is for academic institutions and research teams deploying rbee for on-premises AI infrastructure with data residency guarantees and compliance-focused auditing.
⚠️ Note on Premium Features: This guide describes using Premium Queen and GDPR Auditing Module for multi-user deployments. These modules are planned for M2 launch (target Q2 2026). The current M0 release supports basic single-user and multi-machine deployments without quotas, auditing, or advanced user management. Commands shown are planned CLI syntax and subject to change.
Who this is for
- University research labs - Multi-user AI infrastructure for research projects
- Medical research institutions - GDPR-compliant AI for sensitive data
- Government agencies - On-premises deployment with audit trails
- Corporate R&D teams - Internal AI infrastructure with compliance requirements
Why rbee for academic & research
Data sovereignty
All data stays on your infrastructure. No external API calls, no cloud dependencies, no data leaving your network.
GDPR compliance
Optional GDPR Auditing module provides:
- Complete audit trails of all inference requests
- Data lineage tracking
- Automated compliance reports
- Right-to-erasure support
Multi-user support
- Quota management per research group or project
- Usage tracking and cost allocation
- Fair-share scheduling across competing workloads
Heterogeneous hardware
Use what you have:
- Mix NVIDIA and Apple Silicon GPUs (AMD/ROCm planned for future release)
- Combine workstation GPUs with HPC clusters
- Utilize idle compute during off-hours
Prerequisites
- On-premises servers or HPC cluster access
- Network isolation (air-gapped or VPN)
- rbee installed on all nodes (see Installation)
- (Recommended) GDPR Auditing module for compliance features
Architecture for academic deployment
┌─────────────────────┐
│ Research Network │
│ (Isolated/VPN) │
└──────────┬──────────┘
│
┌──────────▼──────────┐
│ Queen + GDPR │
│ Auditing Module │
└──────────┬──────────┘
│
┌──────────────────────┼──────────────────────┐
│ │ │
┌────▼─────┐ ┌────▼─────┐ ┌────▼─────┐
│ Lab A │ │ Lab B │ │ HPC Node │
│ (Hive) │ │ (Hive) │ │ (Hive) │
│ │ │ │ │ │
│ ┌──────┐ │ │ ┌──────┐ │ │ ┌──────┐ │
│ │Worker│ │ │ │Worker│ │ │ │Worker│ │
│ └──────┘ │ │ └──────┘ │ │ └──────┘ │
└──────────┘ └──────────┘ └──────────┘Step 1: Network setup
Ensure all nodes are on an isolated network:
# Verify network isolation
ping 10.0.0.1 # Should work (internal)
ping 8.8.8.8 # Should fail (external, if air-gapped)
# Set up VPN if needed (example with WireGuard)
sudo apt install wireguard
# Configure WireGuard for secure inter-lab communicationStep 2: Install rbee on all nodes
For air-gapped environments, use manual installation:
# On a machine with internet access, download binaries
wget https://github.com/veighnsche/llama-orch/releases/download/v0.1.0/rbee-linux-x64.tar.gz
# Transfer to air-gapped network (USB, secure file transfer)
# Then on each node:
tar -xzf rbee-linux-x64.tar.gz
sudo mv rbee-* /usr/local/bin/Step 3: Configure the queen with GDPR Auditing
Start the queen with auditing enabled:
# Install GDPR Auditing module (requires license)
sudo dpkg -i rbee-gdpr-auditing_0.1.0_amd64.deb
# Start queen with auditing
rbee queen start \\
--host 0.0.0.0 \\
--port 7833 \\
--enable-gdpr-auditing \\
--audit-log-path /var/log/rbee/audit \\
--audit-retention-days 2555 # 7 years for GDPRNote: GDPR Auditing is a paid module. See GDPR compliance for details.
Step 4: Configure research groups
Set up quotas per research group:
# Create research group quotas
queen-rbee quota create \\
--group neuroscience-lab \\
--max-gpu-hours-per-month 1000 \\
--max-concurrent-workers 5
queen-rbee quota create \\
--group genomics-lab \\
--max-gpu-hours-per-month 500 \\
--max-concurrent-workers 3
queen-rbee quota create \\
--group physics-lab \\
--max-gpu-hours-per-month 2000 \\
--max-concurrent-workers 10Step 5: Set up hives across labs
Configure hives for each lab or HPC node:
# ~/.rbee/hives.conf
[[hive]]
alias = "neuroscience-gpu-01"
host = "10.0.1.10"
ssh_user = "rbee"
research_group = "neuroscience-lab"
[[hive]]
alias = "genomics-hpc-01"
host = "10.0.2.10"
ssh_user = "rbee"
research_group = "genomics-lab"
[[hive]]
alias = "physics-cluster-01"
host = "10.0.3.10"
ssh_user = "rbee"
research_group = "physics-lab"Install and start all hives:
# Install on all nodes
for hive in neuroscience-gpu-01 genomics-hpc-01 physics-cluster-01; do
queen-rbee hive install $hive
queen-rbee hive start $hive
doneStep 6: Deploy models for research
Download models appropriate for research use cases:
# Medical imaging models
queen-rbee model download meditron-70b --hive neuroscience-gpu-01
# Genomics models
queen-rbee model download biogpt-large --hive genomics-hpc-01
# General-purpose LLMs for literature review
queen-rbee model download llama-3.1-70b --hive physics-cluster-01Step 7: User authentication and access control
Set up user authentication:
# Create user accounts per researcher
queen-rbee user create \\
--username alice.smith \\
--group neuroscience-lab \\
--role researcher
queen-rbee user create \\
--username bob.jones \\
--group genomics-lab \\
--role researcher
# Create admin accounts for lab managers
queen-rbee user create \\
--username lab.manager \\
--group neuroscience-lab \\
--role adminStep 8: Enable audit logging
Configure comprehensive audit logs:
# Enable detailed logging
queen-rbee audit configure \\
--log-level detailed \\
--log-requests true \\
--log-responses true \\
--log-user-actions true \\
--anonymize-pii false # Keep full logs for internal use
# Set up automated compliance reports
queen-rbee audit schedule-report \\
--frequency monthly \\
--format pdf \\
--email compliance@university.eduStep 9: Fair-share scheduling
Configure fair-share scheduling to prevent one group from monopolizing resources:
# Enable fair-share scheduling
queen-rbee scheduling set-policy fair-share \\
--weight-by-quota true \\
--preemption-enabled true \\
--preemption-grace-period 300 # 5 minutesStep 10: Researcher access
Researchers can now use the API with their credentials:
# Researcher authenticates and sends request
curl -X POST https://rbee.university.edu/v1/chat/completions \\
-H "Authorization: Bearer alice.smith:api-key-here" \\
-H "Content-Type: application/json" \\
-d '{
"model": "meditron-70b",
"messages": [
{"role": "user", "content": "Analyze this medical image..."}
]
}'All requests are logged with:
- User identity
- Timestamp
- Model used
- Input/output data (if configured)
- GPU hours consumed
GDPR compliance features
Audit trail
Every inference request is logged:
# View audit logs
queen-rbee audit logs --user alice.smith --date 2024-01-15
# Export audit logs for compliance review
queen-rbee audit export \\
--start-date 2024-01-01 \\
--end-date 2024-12-31 \\
--format csv \\
--output audit-2024.csvRight to erasure
Support GDPR right-to-erasure requests:
# Delete all data for a specific user
queen-rbee audit erase-user-data \\
--user alice.smith \\
--confirm
# Delete specific requests by ID
queen-rbee audit erase-request \\
--request-id req-abc-123 \\
--confirmData lineage
Track where data has been processed:
# View data lineage for a request
queen-rbee audit lineage --request-id req-abc-123Output shows:
- Which hive processed the request
- Which worker ran the inference
- Which GPU was used
- Timestamps and duration
Cost allocation
Track GPU usage per research group:
# View usage report
queen-rbee billing report \\
--group neuroscience-lab \\
--period last-month
# Export for internal billing
queen-rbee billing export \\
--period 2024-Q1 \\
--format csv \\
--output billing-Q1-2024.csvMulti-tenant isolation
Ensure research groups can’t see each other’s data:
# Enable strict isolation
queen-rbee security set-isolation strict \\
--enforce-group-boundaries true \\
--prevent-cross-group-access trueNext steps
- GDPR compliance - Deep dive into auditing features
- User management - Advanced user and group configuration
- Monitoring - Set up monitoring for research infrastructure
- Premium modules - Overview of Premium Queen and GDPR Auditing
Troubleshooting
Quota exceeded errors
Check current usage:
queen-rbee quota status --group neuroscience-labAdjust quotas if needed:
queen-rbee quota update \\
--group neuroscience-lab \\
--max-gpu-hours-per-month 1500Audit logs not appearing
Verify auditing is enabled:
queen-rbee audit statusCheck log file permissions:
ls -la /var/log/rbee/audit/Fair-share scheduling not working
Check scheduling policy:
queen-rbee scheduling get-policyVerify quotas are set for all groups:
queen-rbee quota listNetwork isolation issues
Test connectivity between nodes:
# From queen, test hive connectivity
ssh rbee@10.0.1.10 "echo 'Connected'"Verify firewall rules allow internal traffic but block external.