brianseeders / kibana

This project forked from elastic/kibana

Your window into the Elastic Stack

Home Page: https://www.elastic.co/products/kibana

License: Other

Shell 0.20% JavaScript 4.81% Batchfile 0.01% TypeScript 94.51% CSS 0.15% HTML 0.01% Dockerfile 0.01% Python 0.01% Perl 0.01% Groovy 0.05% SCSS 0.21% Handlebars 0.01% ANTLR 0.01% Starlark 0.01% EJS 0.01% Makefile 0.01% Mustache 0.01% Standard ML 0.01%

kibana's People

Contributors

bargs, bleskes, cjcenizal, epixa, flash1293, frankhassanabad, jbudz, jgowdyelastic, jloleysens, kibanamachine, lcawl, lukasolson, mistic, mshustov, nchaulet, nreese, pgayvallet, ppisljar, renovate[bot], shahzad31, simianhacker, sorenlouv, spalger, stacey-gammon, stormpython, stratoula, thomasneirynck, tsullivan, w33ble, walterra

kibana's Issues

[Buildkite] Misc

Misc

  • Create Buildkite service account for API tokens
    • Replace token for Agent Manager
    • Replace token for PR bot

[Buildkite] Monitoring and Alerting

Monitoring and Alerting

Implement monitoring and alerting for Buildkite and Buildkite-related GCP resources.

  • GCP
    • Instances
    • Quotas
  • Buildkite
    • Agent Queues (jobs waiting for agents, current agents per queue, etc.)
    • Jobs (status, duration)

[Buildkite] Agent Manager

Agent Manager

  • Change configuration values from seconds to minutes
  • Documentation
  • Test coverage
  • Balance agents across multiple GCP zones
  • Hard and soft time limits for GCP instances
  • APM instrumentation
  • Delete orphaned GCP instances (instance online, but not connected to an agent)
  • Scaling idle timeouts
    • e.g. the first agent for a configuration might have an idle timeout of 1 hour, but the 200th might be 5 minutes
  • Automatically replace agents if disk usage is too high?
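
One possible shape for the scaling idle timeouts item, as a rough sketch only: a linear ramp from the first agent's timeout down to the 200th agent's timeout, using the 1 hour and 5 minute figures from the example above. The function name, parameter names, and the choice of a linear ramp are all assumptions for illustration, not something the Agent Manager currently does.

```python
def scaled_idle_timeout_mins(
    agent_index: int,
    first_timeout_mins: float = 60.0,  # "1 hour" for the first agent (from the example)
    last_timeout_mins: float = 5.0,    # "5 minutes" for the 200th agent (from the example)
    ramp_agents: int = 200,            # agent number at which the minimum is reached
) -> float:
    """Return a hypothetical idle timeout, in minutes, for the Nth agent of a configuration.

    The timeout shrinks linearly from first_timeout_mins (agent 1) to
    last_timeout_mins (agent ramp_agents) and stays at the minimum after that.
    """
    if agent_index < 1:
        raise ValueError("agent_index is 1-based and must be >= 1")
    if ramp_agents < 1:
        raise ValueError("ramp_agents must be >= 1")
    if agent_index >= ramp_agents:
        return last_timeout_mins
    # Fraction of the way from the first agent to the end of the ramp (0.0 to 1.0).
    fraction = (agent_index - 1) / (ramp_agents - 1)
    return first_timeout_mins - fraction * (first_timeout_mins - last_timeout_mins)
```

A stepped or exponential curve would work just as well; the linear ramp is only the simplest thing that matches the two data points in the example.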

[Buildkite] Infrastructure

Infrastructure

  • New GCP project for Kibana CI
  • IAM + Roles
  • Migrate current Buildkite infra to new GCP project

Notes:

  • Separate GCP project for CI
  • Things that will live here:
    • GCS Buckets
    • Artifact Registry
    • Cloud Build
    • Cloud Run
      • PR bot, Slack bot?
    • GKE Autopilot
      • Agent Manager
    • Agent Instances / Images
    • Cloud NAT + Cloud Router (for agents)
  • Cloud NAT
    • To scale beyond about 1,800 agents, we will likely need to implement Cloud NAT and remove external IP addresses from our agent instances
    • Needs Cloud Router set up as well
    • Automatic IP allocation works like this:
      • Each VM is reserved a specified number of ports per IP address in Cloud NAT
      • When the ports are maxed, a new IP address is added to the pool, which counts against static IP quota (confirm static IP quota?)
      • Each IP can have about 64K ports
      • Each VM can reserve up to 1024 ports
      • So, with automatic IP allocation, at least 64 agents (about 64K ports / 1024 ports per VM) will share each IP
  • Allow SSH via IAP
    • IAP allows authorized users (auth via GCP/Google IAM) to SSH into agent instances that don't have public IPs
    • This is already done, except for assigning user/group permissions
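
The Cloud NAT port arithmetic in the notes above can be sketched roughly as follows. The figures are the approximate ones from the notes ("about 64K" ports per IP, up to 1024 ports per VM); the exact usable port count per NAT IP and the relevant quotas should be confirmed against the Cloud NAT documentation. The function and constant names are made up for illustration.

```python
import math

# Approximate figure taken from the notes ("about 64K ports" per NAT IP).
# This is an assumption, not a confirmed Cloud NAT limit.
APPROX_PORTS_PER_NAT_IP = 64 * 1024


def vms_per_nat_ip(ports_per_vm: int, ports_per_ip: int = APPROX_PORTS_PER_NAT_IP) -> int:
    """How many VMs can share one NAT IP if each VM reserves ports_per_vm ports."""
    if not 0 < ports_per_vm <= ports_per_ip:
        raise ValueError("ports_per_vm must be between 1 and ports_per_ip")
    return ports_per_ip // ports_per_vm


def nat_ips_needed(agent_count: int, ports_per_vm: int,
                   ports_per_ip: int = APPROX_PORTS_PER_NAT_IP) -> int:
    """Estimate how many NAT IPs a fleet of agent_count agents would consume."""
    if agent_count < 0:
        raise ValueError("agent_count must be >= 0")
    return math.ceil(agent_count / vms_per_nat_ip(ports_per_vm, ports_per_ip))


print(vms_per_nat_ip(1024))        # 64 agents per IP at the maximum reservation
print(nat_ips_needed(1800, 1024))  # 29 IPs for 1,800 agents under these assumptions
```

Reserving fewer ports per VM raises the number of agents per IP, so the 64-agents-per-IP figure is the worst case under these assumed numbers.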

[Buildkite] Agent Images

Agent Images

Work related to what's contained in the agent images, rather than the management of the images.

  • Beats

[Buildkite] Agent Image Management

Agent Image Management

  • Periodically create any necessary images
  • Test images after creation
  • Promote images after test
  • Rollback image after promotion (e.g. promote old image or pin an image)
  • Delete old images

What's the best way to "promote" and "pin" images? Do we need separate families that we copy images between?
