Topic: lakehouse Goto Github
Some thing interesting about lakehouse
Some thing interesting about lakehouse
lakehouse,The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
Organization: adidas
Home Page: https://adidas.github.io/lakehouse-engine-docs/
lakehouse,The Goal of this project is to provide documentation for the Lakehouse Engine framework.
Organization: adidas
Home Page: https://adidas.github.io/lakehouse-engine-docs/
lakehouse,A comprehensive educational resource hub dedicated to mastering Microsoft Fabric, offering in-depth tutorials, real-world use cases, and hands-on guides for seamless end-to-end analytics
User: anthonybyansi
Home Page: https://jolly-coast-0e280d310.5.azurestaticapps.net
lakehouse,Stream Loader for Apache Doris
Organization: apache
Home Page: https://doris.apache.org
lakehouse,ByConity is an open source cloud data warehouse
Organization: byconity
Home Page: https://byconity.github.io/
lakehouse,Supercharge Your Compute for Analytics & AI
Organization: computeai
Home Page: https://compute.ai/
lakehouse,Use SQL to build ELT pipelines on a data lakehouse.
Organization: cuebook
Home Page: https://cuelake.cuebook.ai
lakehouse,A 1 hour workshop running through the data lakehouse and deep dive into delta lake
Organization: cuusoo-labs
lakehouse,A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
Organization: data-dot-all
Home Page: https://data-dot-all.github.io/dataall/
lakehouse,Leverage the Databricks Solution Accelerator for DNS analytics to accelerate time to detection and response across petabytes of data. Tap into DNS traffic logs, enrich streaming threat intelligence, and apply advanced analytics to detect DNS abnormalities and prevent malicious attacks.
Organization: databricks-industry-solutions
Home Page: https://www.databricks.com/solutions/accelerators/threat-detection
lakehouse,Overall Equipment Effectiveness: Performant and Scalable End-to-End Equipment Monitoring
Organization: databricks-industry-solutions
Home Page: https://www.databricks.com/solutions/accelerators/overall-equipment-effectiveness
lakehouse,From FHIR ingestion to patient outcomes analysis
Organization: databricks-industry-solutions
lakehouse,Unlocking the Power of Health Data With a Modern Data Lakehouse
Organization: databricks-industry-solutions
lakehouse,From display to video, the value of an impression can only be realized if an ad is viewed by a user. Therefore, when using programmatic advertising to buy inventory, it’s important to take viewability into account. In this Solution Accelerator, learn how to predict ad viewability to optimize your real-time bidding strategy.
Organization: databricks-industry-solutions
Home Page: https://www.databricks.com/solutions/accelerators/real-time-bidding-optimization
lakehouse,Burning Through Electronic Health Records in Real Time With Smolder
Organization: databricks-industry-solutions
lakehouse,Examples of using Terraform to deploy Databricks resources
Organization: databricks
Home Page: https://registry.terraform.io/modules/databricks/examples/databricks/latest
lakehouse,DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics for your Delta Lakehouse. Unity Catalog supported in the v0.7.0-rc1 release.Documentation here - https://databrickslabs.github.io/delta-oms/v0.7.0-rc1/
Organization: databrickslabs
Home Page: https://databrickslabs.github.io/delta-oms/
lakehouse,Automated provisioning of an industry Lakehouse with enterprise data model
Organization: databrickslabs
lakehouse,World's most powerful data catalog service with providing a high-performance, geo-distributed and federated metadata lake.
Organization: datastrato
Home Page: https://datastrato.ai/docs/
lakehouse,Automated setup of Apache Iceberg on Amazon S3 using Terraform and AWS Glue Data Catalog. Explore the power of a Lakehouse architecture for data management and analysis, featuring schema discovery, metadata management, and efficient querying with Amazon Athena.
User: davidvanegas2
lakehouse,Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
User: dominikhei
lakehouse, Build Your First End-to-End Lakehouse Solution (aka.ms/fabconlake)
User: ekote
Home Page: https://azuredataconf.com/#!/workshop/Build%20Your%20First%20End-to-End%20Lakehouse%20Solution/6194
lakehouse,Run an open-source data LakeHouse locally using Docker Compose
User: fraibacas
lakehouse,Unified storage framework for the entire machine learning lifecycle
Organization: google
lakehouse,An open-source storage framework that enables building a Lakehouse architecture
Organization: guinsoolab
Home Page: https://guinsoolab.github.io/glab/#/app/mortalmesh
lakehouse,Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize and recommend app.
User: harrydevforlife
lakehouse,Pure Rust Iceberg Implementation
Organization: icelake-io
lakehouse,LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Organization: lakesoul-io
Home Page: https://lakesoul-io.github.io/
lakehouse,Open source stack lakehouse
User: leehuwuj
lakehouse,Lakehouse storage system benchmark
Organization: lhbench
Home Page: https://lhbench.cs.berkeley.edu
lakehouse,a curated list of awesome lakehouse frameworks, applications, etc
User: manuzhang
lakehouse,Tool for deploying a lakehouse in a kubernetes cluster.
User: maximechartier
lakehouse,Microsoft Fabric Real-time Analytics flight streaming
Organization: microsoft
Home Page: https://aka.ms/fabric-trial
lakehouse,A curated list of open source tools used in analytical stacks and data engineering ecosystem
Organization: pracdata
lakehouse,Tutorials and examples of how to deploy Presto and connect it to different data sources
Organization: prestodb
Home Page: https://prestodb.io/
lakehouse,A curated list of awesome Online Analytical Processing databases, frameworks, ressources and other awesomeness.
User: samber
lakehouse,Connect FastAPI to a Databricks Lakehouse
User: sjrusso8
lakehouse,A Data engineering based Proof of Concept demonstrating cutting-edge logistics solutions for a US-based Grocery Delivery Platform
User: soorajpazeekal
Home Page: https://linkedin.com/in/soorajpazeekal
lakehouse,StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.
Organization: starrocks
Home Page: https://starrocks.io
lakehouse,FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
User: tspannhw
Home Page: https://datainmotion.dev/
lakehouse,Genomic BigData Warehousing with Apache Spark and LakeHouse Architecture
User: victorskl
lakehouse,Repositório dedicado a Workshop de Data Lakehouse com Delta Lake
User: vvalcristina
lakehouse,Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
User: ysfesr
lakehouse,YTsaurus is a scalable and fault-tolerant open-source big data platform.
Organization: ytsaurus
Home Page: https://ytsaurus.tech
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.