seshendranath
Type: User
Mirror of Apache Hadoop common
A bunch of Hadoop related demos
Lily HBase Indexer - indexing HBase, one row at a time
Spark RDD to read and write from HBase
Code and setup information for Introduction to Real time Spark session at http://www.meetup.com/Bangalore-Apache-Spark-Meetup/events/221645049/
Code and setup information for Introduction to Machine Learning with Spark
Java Projects
Examples for using the scala-kafka-client
High Performance Kafka Consumer for Spark Streaming. Compatible with all Spark and Kafka versions, including the latest Spark 2.2.0 and Kafka 0.11.0. Offset management in ZooKeeper. Reliable no-data-loss guarantee. No dependency on HDFS, checkpointing, or WAL. Built-in PID rate controller. Supports Message Interceptor. Offset lag checker.
KillrWeather is a reference application (work in progress) showing how to easily integrate streaming and batch data processing with Apache Spark Streaming, Apache Cassandra, Apache Kafka and Akka for fast, streaming computations on time series data in asynchronous event-driven environments.
Code base for the Learning PySpark book (in preparation)
Supplementary materials for the "Learning Scala" book from O'Reilly Media
Scala examples for learning to use Spark
Real-time log event processing using Spark, Kafka & Cassandra
Contains implementations of various machine learning projects/algorithms using Python, NumPy, SciPy, Matplotlib, Scikit-Learn, Scala, Spark, Breeze
Collection of custom I/O formats, file formats, log processing, IP lookup, secondary sort, and custom partitioner examples.
Set of MapReduce applications used for teaching purposes
Repository for MapReduce Design Patterns (O'Reilly 2012) example source code
Mastering Apache Spark 2.x by Packt
A general-purpose metrics monitor implemented with Apache Spark. Kafka source, Elastic sink, aggregate metrics, different analyses, notifications, live configuration updates, missing-metrics detection, ...
The Python code of the book: Machine Learning for Spark
Because it's never too late to start taking notes and make them public...
All my projects in one place
Oozie - workflow engine for Hadoop
Columnar file format for Hadoop