HTTP Input Benchmark Comparison

Benchmark comparison of HTTP log ingestion performance across Edge Delta, Cribl, the OpenTelemetry Collector, and Fluentd. Each platform is tested under identical conditions (pass-through, filter, mask, and lookup pipeline types) using synthetic nginx-style logs. The OpenTelemetry Collector runs pass-through, filter, and mask only — its contrib distribution ships no CSV lookup processor, so lookup is reported as N/A; Fluentd runs all four.

Latest Benchmark Results

📊 View Latest Benchmark Report

📈 Interactive Benchmark Charts — throughput, efficiency, and historical trend across vendors.

Purpose

This repository helps developers evaluate and compare HTTP input throughput for four observability pipeline platforms. Benchmarks run on a single EC2 instance with consistent load profiles (80, 100, and 120 workers) and a 1-minute test duration per run.

Prerequisites

Terraform >= 1.0 (for AWS infrastructure)
AWS credentials configured (e.g., AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, or aws configure)
jq and curl (for API scripts)
SSH access from your machine (Terraform restricts EC2 SSH to your public IP)

Required Environment Variables

Set these before running ./run.sh:

Edge Delta

Variable	Description
`ED_ORG_ID`	Edge Delta organization ID
`ED_API_TOKEN`	Edge Delta API token

Cribl

Variable	Description
`CRIBL_WORKSPACE`	Cribl Cloud workspace name
`CRIBL_ORG`	Cribl Cloud organization ID
`CRIBL_WORKER_GROUP`	Cribl worker group (e.g., `default`)
`CRIBL_CLIENT_ID`	Cribl API client ID
`CRIBL_CLIENT_SECRET`	Cribl API client secret
`CRIBL_LEADER_TOKEN`	Cribl leader token (for agent install)

Directory Structure

.
├── aws_resources/          # Terraform: EC2, S3, IAM
├── benchmark_scripts/      # Load generation and trigger scripts (run on EC2)
├── pipelines/              # Pipeline configs per platform
│   ├── cribl/              # Cribl JSON configs and API helper
│   ├── edgedelta/          # Edge Delta YAML configs and API helper
│   ├── otelcol/            # OpenTelemetry Collector YAML configs
│   └── fluentd/            # Fluentd .conf configs
├── scripts/                # Agent install scripts (generated/dynamic)
├── benchmark_results/      # Downloaded results (gitignored)
├── run.sh                  # Main entry point
└── functions.sh            # Shared utilities

Setup

Clone the repository:
```
git clone <repo-url>
cd benchmark
```
Set all required environment variables (see above).
Ensure Terraform and AWS credentials are ready. The script will create:
- EC2 instance (c8i.2xlarge, Ubuntu 24.04, 50 GB gp3)
- S3 bucket for log output
- IAM resources for S3 access
- SSH key pair (stored under aws_resources/)

How to Run

From the repository root:

./run.sh

What run.sh does:

Checks prerequisites – Validates env vars for the selected vendors
Creates AWS resources – Runs terraform apply in aws_resources/
Prepares EC2 – Uploads benchmark scripts and lookup CSV
Runs benchmarks – For each selected platform (Edge Delta → Cribl → OpenTelemetry Collector → Fluentd):
- Installs or configures the agent
- For each pipeline type (pass-through, filter, mask, lookup):
  - Applies the pipeline config
  - Runs loadgen with 80, 100, and 120 workers (1 min each)
  - Captures logs
Downloads results – Saves to benchmark_results/<YYYYMMDD_HHMMSS>/
Cleans up – Deletes pipelines and runs terraform destroy

Running a subset

By default ./run.sh runs every case for every vendor. For faster local iteration you can restrict the run with two optional flags:

# Only the pass-through and mask cases (all vendors):
./run.sh --cases pass-through,mask

# Only Edge Delta and Cribl (all cases):
./run.sh --vendors edgedelta,cribl

# Edge Delta, filter case only:
./run.sh --vendors edgedelta --cases filter

--cases accepts pass-through, filter, mask, lookup (passthrough is accepted as an alias for pass-through). Values may be comma- or space-separated.
--vendors accepts edgedelta, cribl, otelcol, fluentd.
Prerequisite checks (env vars) only run for the selected vendors, so you don't need Cribl credentials to run an Edge Delta–only pass.
otelcol has no lookup case; it is skipped automatically if lookup is the only selected case.
Run ./run.sh --help for the full usage.

Note: Run from the repository root. The script sources functions.sh and expects to execute from that directory.

Benchmark Types

Type	Description
pass-through	Minimal processing; baseline throughput
filter	Exclude events where `attributes["color"] == "Green"`
mask	PII masking (IP, email, credit card, etc.)
lookup	CSV lookup to enrich events (e.g., ip → region)

Results

Results are written to benchmark_results/<timestamp>/ with one log file per platform and pipeline type. File prefixes map to products: edgedelta = Edge Delta, cribl = Cribl, otelcol = OpenTelemetry Collector, fluentd = Fluentd. The OpenTelemetry Collector has no lookup file (lookup is N/A).

benchmark_results/
└── 20260226_135850/
    ├── edgedelta_pass-through.log
    ├── edgedelta_filter.log
    ├── edgedelta_mask.log
    ├── edgedelta_lookup.log
    ├── cribl_pass-through.log
    ├── cribl_filter.log
    ├── cribl_mask.log
    ├── cribl_lookup.log
    ├── otelcol_pass-through.log
    ├── otelcol_filter.log
    ├── otelcol_mask.log
    ├── fluentd_pass-through.log
    ├── fluentd_filter.log
    ├── fluentd_mask.log
    └── fluentd_lookup.log

Each log contains loadgen output with throughput (logs/sec), CPU/memory usage, and error counts.

Cost Considerations

Running ./run.sh creates billable AWS resources (EC2 c8i.2xlarge, S3, etc.). The script tears everything down at the end. Expect roughly 30–60 minutes of runtime; any interruption may leave resources running until you manually run terraform destroy in aws_resources/.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.claude		.claude
.github		.github
aws_resources		aws_resources
benchmark_scripts		benchmark_scripts
pipelines		pipelines
scripts		scripts
site		site
tools		tools
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
functions.sh		functions.sh
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

HTTP Input Benchmark Comparison

Latest Benchmark Results

Purpose

Prerequisites

Required Environment Variables

Edge Delta

Cribl

Directory Structure

Setup

How to Run

Running a subset

Benchmark Types

Results

Cost Considerations

About

Uh oh!

Releases 18

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

HTTP Input Benchmark Comparison

Latest Benchmark Results

Purpose

Prerequisites

Required Environment Variables

Edge Delta

Cribl

Directory Structure

Setup

How to Run

Running a subset

Benchmark Types

Results

Cost Considerations

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 18

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages