chabane / bigdata-playground

A complete example of a big data application using: Kubernetes (kops/AWS), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter API, MongoDB, Node.js, Angular, GraphQL

License: Apache License 2.0

TypeScript 33.03% JavaScript 3.54% HTML 4.69% CSS 0.47% Scala 29.54% Python 5.21% Java 21.17% Dockerfile 2.34%
docker spark-sql scala kafka hbase parquet avro nodejs angular graphql

bigdata-playground's Issues

admin.tableExists(tableName)

In HBaseOutputFormat, the admin.tableExists(tableName) check should be placed in the open method, not in configure.
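The suggestion can be illustrated with a minimal, language-neutral sketch. It assumes the usual split in which configure only reads settings and open acquires per-task resources. FakeAdmin, OutputFormatSketch, the "table.name" key, and the choice to raise when the table is missing are all hypothetical stand-ins, not the repository's code or the HBase API.

```python
class FakeAdmin:
    """Hypothetical stand-in for an HBase admin client (not the real API)."""

    def __init__(self, existing_tables):
        self._tables = set(existing_tables)

    def table_exists(self, name):
        return name in self._tables


class OutputFormatSketch:
    """Hypothetical output format showing where the existence check lives."""

    def __init__(self, admin_factory):
        self._admin_factory = admin_factory  # called lazily, in open()
        self.table_name = None
        self.admin = None

    def configure(self, parameters):
        # Only read settings here; no connection is created yet.
        self.table_name = parameters["table.name"]

    def open(self, task_number, num_tasks):
        # Acquire the connection and run the check when the task starts.
        self.admin = self._admin_factory()
        if not self.admin.table_exists(self.table_name):
            # Assumption: failing fast is the desired behaviour.
            raise RuntimeError("table not found: " + self.table_name)
```

With this layout, calling configure never touches the cluster, and a missing table surfaces only when a task actually opens the format.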

How to import data into the app?

We have deployed the application and it's running on port 3000. Can you please help us import data for the application? How are the other projects integrated with the web app?

multi-stage build / simplify setup

Hi,

Hope you are all well!

I was looking for a playground for big data and found your great repository.

Is it possible to create a multi-stage Dockerfile to simplify the setup process?

Thanks in advance for your insights and inputs on that.

Cheers,
X.

Error installing on macOS 10.14.x

The installation fails on the latest macOS 10.14.x with the following issues, listed in order of execution:

Step 8/11 : ADD target/scala-2.11/search-flight-spark-streaming-assembly-0.1.0.jar $SPARK_HOME/libs/search-flight-streaming.jar
ERROR: Service 'streaming-spark-dev' failed to build: ADD failed: stat /var/lib/docker/tmp/docker-builder288103139/target/scala-2.11/search-flight-spark-streaming-assembly-0.1.0.jar: no such file or directory

Step 3/6 : ADD target/search-flight-hadoop-batch-0.1.0-jar-with-dependencies.jar $HADOOP_HOME/libs/search-flight-batch.jar
ERROR: Service 'batch-hadoop-dev' failed to build: ADD failed: stat /var/lib/docker/tmp/docker-builder770657281/target/search-flight-hadoop-batch-0.1.0-jar-with-dependencies.jar: no such file or directory

Step 4/6 : ADD target/search-flight-storm-streaming-0.1.0-jar-with-dependencies.jar $STORM_HOME/libs/search-flight-streaming.jar
ERROR: Service 'streaming-storm-dev' failed to build: ADD failed: stat /var/lib/docker/tmp/docker-builder383930762/target/search-flight-storm-streaming-0.1.0-jar-with-dependencies.jar: no such file or directory
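All three failures are ADD steps that cannot find a jar under target/, which typically means the packaging step had not produced the jar before the image build started. The thread does not confirm this. As a minimal sketch, a pre-flight check like the one below could report which jars are missing. The relative paths are copied from the log; the helper name and the idea of passing each service's build directory are assumptions.

```python
from pathlib import Path

# Jar paths copied from the ADD steps in the log above. Each one is
# relative to the build context of its own service.
EXPECTED_JARS = [
    "target/scala-2.11/search-flight-spark-streaming-assembly-0.1.0.jar",
    "target/search-flight-hadoop-batch-0.1.0-jar-with-dependencies.jar",
    "target/search-flight-storm-streaming-0.1.0-jar-with-dependencies.jar",
]


def missing_artifacts(build_context, relative_paths):
    """Return the relative paths that do not exist as files under build_context."""
    root = Path(build_context)
    return [p for p in relative_paths if not (root / p).is_file()]
```

Running missing_artifacts against a service's build directory with the single jar that service expects returns an empty list once the jar has been built.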

error TS2307: Cannot find module 'rxjs/rx'.

Hi,
I tried a few fixes without success. Can you help?

[root@swarm1 docker]# docker-compose up -d
WARNING: The Docker Engine you're using is running in swarm mode.
......

[email protected] build:prod /usr/src/app/client
ng build --prod

Date: 2019-01-30T22:35:04.797Z
Hash: 1fda7bef43e9c167b28b
Time: 11596ms
chunk {0} styles.01b4fd32fd4135b8fcb3.bundle.css (styles) 44.9 kB [initial] [rendered]
chunk {1} polyfills.3bc34265385d52184eab.bundle.js (polyfills) 86 bytes [initial] [rendered]
chunk {2} main.e402deade8b026b7d50e.bundle.js (main) 84 bytes [initial] [rendered]
chunk {3} inline.22b7623ed7c5ac6f9a35.bundle.js (inline) 1.45 kB [entry] [rendered]

ERROR in src/app/dashboard/search-flight/search-flight.component.ts(3,28): error TS2307: Cannot find module 'rxjs/rx'.
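A likely cause, not confirmed in this thread, is the letter case of the import path: RxJS 5 exposes its umbrella module as rxjs/Rx with a capital R, and a lowercase path can resolve on a case-insensitive filesystem yet fail inside a Linux container. The sketch below scans TypeScript source text for the lowercase form; the function name and the sample line in the usage note are hypothetical.

```python
import re

# Matches an import source written as 'rxjs/rx' or "rxjs/rx" (lowercase only).
LOWERCASE_RX_IMPORT = re.compile(r"""from\s+['"]rxjs/rx['"]""")


def find_lowercase_rx_imports(source):
    """Return the 1-based line numbers that import from the lowercase path."""
    return [
        number
        for number, line in enumerate(source.splitlines(), start=1)
        if LOWERCASE_RX_IMPORT.search(line)
    ]
```

For a file whose third line is import { Observable } from 'rxjs/rx'; the function returns [3]; the same line written with 'rxjs/Rx' is not reported.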

Deployment Challenge

Hi,

We are trying to install the project on a local Ubuntu system. We followed the steps in the documentation, and the web app is running perfectly on port 3000.

We are new to big data technologies and are having trouble understanding the details of the project. Can you help us with the following items:

  1. How do we run the other projects? No data is available. How do we import the data?
  2. How are Kafka and Spark being used?
  3. Which IDE, if any, can be used to run this application?

Please share any additional documentation you have; it would be very helpful.
Kindly let me know if we can have a call about this.

Thanks
Rohit

com.fasterxml.jackson.databind.exc.InvalidFormatException: Can not construct instance of long from String value '2018-06-13': not a valid Long value

After following the steps in the README.MD instructions, I was able to build the network and start the containers successfully, but when saving flight info I got the exception above (it shows up in the logs of streaming_flink_dev).

Sending flight info to HBase through Flink streaming (the streaming_flink_dev container) fails with this exception. After changing the mappings to String, I could see the record being saved in HBase. I have provided a fix as a PR.
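The exception says that a field mapped as a long received the text '2018-06-13'. Besides mapping the field as a String, as the reporter did, another option is to convert the date to epoch milliseconds before it reaches the long field. The sketch below shows that conversion only; it is not the fix submitted in the PR, and the function name and the choice of UTC midnight are assumptions.

```python
import calendar
from datetime import datetime


def date_to_epoch_millis(value):
    """Convert a 'YYYY-MM-DD' string to epoch milliseconds at UTC midnight."""
    parsed = datetime.strptime(value, "%Y-%m-%d")
    # timegm interprets the time tuple as UTC and returns whole seconds.
    return calendar.timegm(parsed.timetuple()) * 1000
```

For example, date_to_epoch_millis("2018-06-13") returns 1528848000000, which fits a long field, whereas the raw string does not parse as an integer.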

Question about framework selection

Hello,
I'm a learner in the data engineering field, and I'm glad to have found your great project.
This is just a question: it looks like Flink and Spark are both used in this project. Can I choose just one of them, or do I need both to run the project? As far as I know, the two projects offer quite similar functionality.

Thanks.

Dev mode

[error] Not a valid key: assembly

WebApp search flights issue

Hi, I have deployed your application, but the search is not working. The application continuously throws the error "Firefox can’t establish a connection to the server at ws://webapp.vnet:5000/gql-ws." Can you please help me resolve this, or let me know if I'm missing anything? I am new to the Docker environment. I checked port 5000, and nothing is deployed on it.
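Before debugging the WebSocket layer, it can help to confirm whether anything is listening on the host and port from the error message. The probe below is a minimal sketch using only the standard library; the function name and the two-second timeout are assumptions, and an open port does not by itself prove that the /gql-ws endpoint is served.

```python
import socket


def port_is_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolved host names.
        return False
```

Calling port_is_open("webapp.vnet", 5000) from the machine running the browser shows whether the address in the error is reachable at all; False points to a service that is not running, an unpublished port, or a host name that does not resolve.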
