Datalake Query DB Consumer

datalake-query-db-consumer is a Python microservice that consumes datalake query events from a Kafka topic and stores them in a relational database.

Rationale

The purpose of this service is to provide a way of consuming Kafka messages produced by datalake-query-ingester and storing them in a relational database for long-term storage and analysis.

This is part of a datalake query metadata ingestion and analysis pipeline.

Quick Start

To run the service locally, together with the supporting services it needs for testing, run docker-compose up datalakequerydbconsumer. To run the tests, run docker-compose run tests.

Building

⚠️ WARNING: This project follows SQLAlchemy's approach of not packaging DBAPIs, instead letting the user install the appropriate ones for their use case. The DBAPIs to install are specified in the Dockerfile through the SQLALCHEMY_DEPENDENCIES build argument.

To build, run docker build --build-arg SQLALCHEMY_DEPENDENCIES=psycopg2-binary -f Dockerfile -t bloomberg/datalakequerydbconsumer:latest-postgresql .

OR

Run docker-compose build datalakequerydbconsumer
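
The DBAPI installed through SQLALCHEMY_DEPENDENCIES has to match the driver named in the database URL. As a minimal sketch, assuming the URL follows SQLAlchemy's dialect+driver://... form, the helpers below split the URL scheme and check whether a driver module can be imported. Both function names are hypothetical and are not part of this project. Note that the importable module name can differ from the package name (psycopg2-binary installs the module psycopg2).

```python
import importlib.util
from typing import Optional, Tuple


def split_db_url(db_url: str) -> Tuple[str, Optional[str]]:
    """Return (dialect, driver) from an SQLAlchemy-style database URL.

    Hypothetical helper for illustration only. The driver is None when
    the URL does not name one (for example "postgresql://...").
    """
    scheme, sep, _ = db_url.partition("://")
    if not sep or not scheme:
        raise ValueError("not a dialect[+driver]://... URL")
    dialect, _, driver = scheme.partition("+")
    return dialect, driver or None


def driver_is_installed(module_name: str) -> bool:
    """Report whether a top-level module can be found without importing it."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # Example URL only; host, credentials, and database name are made up.
    dialect, driver = split_db_url("postgresql+psycopg2://user:secret@localhost:5432/queries")
    print(dialect, driver)
    if driver is not None and not driver_is_installed(driver):
        print("DBAPI module", driver, "is not installed in this environment")
```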

Installation

This service is meant to be used with Trino and models its data on Trino's query metrics. It has been tested with Trino 363; backward or forward compatibility is not guaranteed.

The service is meant to be deployed on Kubernetes (k8s). Configuration is passed through environment variables:

  • KAFKA_BROKERS
  • DATALAKEQUERYDBCONSUMER_KAFKA_TOPIC
  • DATALAKEQUERYDBCONSUMER_KAFKA_GROUP_ID
  • DATALAKEQUERYDBCONSUMER_DB_URL

An example config can be found in docker-compose.yaml > datalakequerydbconsumer.
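
As a rough sketch of how a process could read the four variables listed above and fail early when one is missing, consider the following. The ConsumerConfig class and load_config function are hypothetical names, not the service's actual code, and treating KAFKA_BROKERS as a comma-separated list is an assumption.

```python
import os
from dataclasses import dataclass
from typing import List, Mapping

# Variable names taken from the list above.
REQUIRED_VARS = (
    "KAFKA_BROKERS",
    "DATALAKEQUERYDBCONSUMER_KAFKA_TOPIC",
    "DATALAKEQUERYDBCONSUMER_KAFKA_GROUP_ID",
    "DATALAKEQUERYDBCONSUMER_DB_URL",
)


@dataclass(frozen=True)
class ConsumerConfig:
    """Hypothetical container for the service settings."""
    kafka_brokers: List[str]
    kafka_topic: str
    kafka_group_id: str
    db_url: str


def load_config(env: Mapping[str, str] = os.environ) -> ConsumerConfig:
    """Build a ConsumerConfig, raising if any variable is unset or empty."""
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError(
            "Missing required environment variables: " + ", ".join(missing)
        )
    # Assumption: brokers are given as "host1:9092,host2:9092".
    brokers = [b.strip() for b in env["KAFKA_BROKERS"].split(",") if b.strip()]
    return ConsumerConfig(
        kafka_brokers=brokers,
        kafka_topic=env["DATALAKEQUERYDBCONSUMER_KAFKA_TOPIC"],
        kafka_group_id=env["DATALAKEQUERYDBCONSUMER_KAFKA_GROUP_ID"],
        db_url=env["DATALAKEQUERYDBCONSUMER_DB_URL"],
    )
```

Passing a plain dict instead of os.environ makes the loader easy to exercise in tests without touching the real environment.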

Contributions

We ❤️ contributions.

Have you had a good experience with this project? Why not share some love and contribute code, or just let us know about any issues you had with it?

We welcome issue reports here; be sure to choose the proper issue template for your issue, so that we can be sure you're providing the necessary information.

Before sending a Pull Request, please make sure you read our Contribution Guidelines.

License

Please read the LICENSE file.

Code of Conduct

This project has adopted a Code of Conduct. If you have any concerns about the Code, or behavior which you have experienced in the project, please contact us at [email protected].

Security Vulnerability Reporting

If you believe you have identified a security vulnerability in this project, please send email to the project team at [email protected], detailing the suspected issue and any methods you've found to reproduce it.

Please do NOT open an issue in the GitHub repository, as we'd prefer to keep vulnerability reports private until we've had an opportunity to review and address them.
