Skip to content

nebula crawler chart#121

Merged
coryschwartz merged 7 commits into
masterfrom
feat/nebula
May 25, 2022
Merged

nebula crawler chart#121
coryschwartz merged 7 commits into
masterfrom
feat/nebula

Conversation

@coryschwartz
Copy link
Copy Markdown

helm chart to run the nebula crawler

secretKeyRef:
name: {{ .Values.postgres.user }}.{{.Values.postgres.teamid }}-{{ .Release.Name }}-crawldb.credentials.postgresql.acid.zalan.do
key: password
- name: NEBULA_DATABASE_SSL
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If you changed the env name on the other repo this var needed to be NEBULA_DATABASE_SSL

- containerPort: 6666
name: metrics
command: [ "nebula" ]
args: [ "crawl" ]
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If you also want to collect neighbor information you may consider to add the --neighbors flag. This will, however, roughly double the data it collects on each crawl.

@dennis-tra
Copy link
Copy Markdown

dennis-tra commented Jan 6, 2022

I may suggest to incorporate the network name that nebula is crawling/monitoring into the naming schemes. AFAICT This Helm chart only considers IPFS. If you also want to crawl/monitor Filecoin in the future you may include the network name in the Kubernetes resource names as well.

Like

apiVersion: batch/v1beta1
kind: CronJob
metadata:
  # name: {{ .Release.Name }}-crawl
  name: {{ .Release.Name }}-ipfs-crawl

@dennis-tra
Copy link
Copy Markdown

dennis-tra commented Jan 6, 2022

Just leaving a few numbers here:

IPFS (from 26th of November 2021 until today):

  • db size: 36 GB
  • crawls: 1962
  • -> ~18 MB/crawl
  • just the neighbors table: 18 GB
  • If you exclude the neighbors collection -> ~9MB/crawl.
    The IPFS network doubled in size during December so the MB/crawl may actually be higher starting from now on as the above numbers are averaged over the period of December.

Filecoin (from 26th of November 2021 until today):

  • db size: 6586 MB
  • crawls: 1963
  • -> 3.4 MB/crawl
  • just the neighbors table: 3761 MB
  • If you exclude the neighbors collection -> 1.4 MB/crawl

@coryschwartz coryschwartz marked this pull request as ready for review May 25, 2022 22:06
@coryschwartz coryschwartz merged commit 907f2fd into master May 25, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants