Topic: hdfs Goto Github
Some thing interesting about hdfs
Some thing interesting about hdfs
hdfs,Hadoop splittable InputFormat for ROS. Process rosbag with Hadoop Spark and other HDFS compatible systems.
Organization: autovia
Home Page: https://autovia.de
hdfs,HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Organization: avast
hdfs,A distributed storage benchmark for file systems, object stores & block devices with support for GPUs
User: breuner
hdfs,Ceph is a distributed object, block, and file storage platform
Organization: ceph
Home Page: https://ceph.io
hdfs,Mirror of Linkedin's Camus
Organization: confluentinc
hdfs,Kafka Connect HDFS connector
Organization: confluentinc
hdfs,🎉🎉🐳 Datawhale大数据处理导论教程 | 大数据技术方向的开篇课程🎉🎉
Organization: datawhalechina
Home Page: https://datawhalechina.github.io/juicy-bigdata/
hdfs,Divolte Collector
Organization: divolte
Home Page: https://divolte.io/
hdfs,CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source big data platform. This allows you to reduce your focus on underlying resource management and maintenance.
Organization: dromara
Home Page: https://www.cloudeon.top/
hdfs,A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
User: eugene-mark
hdfs,Big Data Ecosystem Docker
User: fabiogjardim
hdfs,25+ DevOps CLI Tools - Anonymizer, SQL ReCaser (MySQL, PostgreSQL, AWS Redshift, Snowflake, Apache Drill, Hive, Impala, Cassandra CQL, Microsoft SQL Server, Oracle, Couchbase N1QL, Dockerfiles), Hadoop HDFS & Hive tools, Solr/SolrCloud CLI, Nginx stats & HTTP(S) URL watchers for load-balanced web farms, Linux tools etc.
User: harisekhon
Home Page: https://www.linkedin.com/in/HariSekhon
hdfs,80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
User: harisekhon
Home Page: https://www.linkedin.com/in/HariSekhon
hdfs,大数据入门指南 :star:
User: heibaiying
hdfs,seaweedfs implemented in pure Rust
Organization: helyim
hdfs,A tool and library for easily deploying applications on Apache YARN
User: jcrist
Home Page: https://jcristharif.com/skein/
hdfs,Python HDFS client
User: jingw
Home Page: https://pyhdfs.readthedocs.io/en/latest/
hdfs,JuiceFS is a distributed POSIX file system built on top of Redis and S3.
Organization: juicedata
Home Page: https://juicefs.com
hdfs,Web tool for Kafka Connect |
Organization: lensesio
Home Page: http://lenses.io/product/features
hdfs,A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Organization: linkedin
hdfs,基于分布式的云笔记(参考某道云笔记),数据存储在redis与hbase中
User: luckyzxl2016
hdfs,Exports Hadoop HDFS content statistics to Prometheus
User: marcelmay
hdfs,Megvii FILE Library - Working with Files in Python same as the standard library
Organization: megvii-research
Home Page: http://megvii-research.github.io/megfile
hdfs,DC/OS SDK is a collection of tools, libraries, and documentation for easy integration of technologies such as Kafka, Cassandra, HDFS, Spark, and TensorFlow with DC/OS.
Organization: mesosphere
Home Page: https://mesosphere.github.io/dcos-commons/
hdfs,Kafka Connect FileSystem Connector
User: mmolimar
hdfs,ElasticCTR,即飞桨弹性计算推荐系统,是基于Kubernetes的企业级推荐系统开源解决方案。该方案融合了百度业务场景下持续打磨的高精度CTR模型、飞桨开源框架的大规模分布式训练能力、工业级稀疏参数弹性调度服务,帮助用户在Kubernetes环境中一键完成推荐系统部署,具备高性能、工业级部署、端到端体验的特点,并且作为开源套件,满足二次深度开发的需求。
Organization: paddlepaddle
hdfs,Utils for streaming large files (S3, HDFS, gzip, bz2...)
User: piskvorky
hdfs,⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Organization: rumbledb
Home Page: http://rumbledb.org/
hdfs,SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
Organization: seaweedfs
hdfs,Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model which can express both batch and stream transformations.
Organization: seznam
hdfs,The Universal Storage Engine
Organization: tiledb-inc
Home Page: https://tiledb.com
hdfs,Python interface to the TileDB storage engine
Organization: tiledb-inc
hdfs,R interface to TileDB: The Modern Database
Organization: tiledb-inc
Home Page: https://tiledb-inc.github.io/TileDB-R
hdfs,Fundamentals of Spark with Python (using PySpark), code examples
User: tirthajyoti
hdfs,StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Organization: uber
hdfs,专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
User: wangzhiwubigdata
hdfs,Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
User: wgzhao
Home Page: https://wgzhao.github.io/Addax/
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.