Code Monkey home page Code Monkey logo

spark-pit's Introduction

Hello there, I'm Axel ๐Ÿ‘‹

Who am I

I am a Swedish software developer consultant that likes to work on hobby projects in my free time. Also a Nintendo nerd ๐ŸŽฎ.

What I do

I mostly work on open source web development with JavaScript, but also other types of development!

My primary hobby/side project is Partiguiden, which is a web platform used for getting up-to-date with the politics in Sweden, new features are added constantly, check it out!

Top Languages

spark-pit's People

Contributors

ackuq avatar dependabot[bot] avatar github-actions[bot] avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

spark-pit's Issues

Questions about maintaining this repo

First, thank you for building this and open sourcing it. My current estimate is that the early stop sort merge provides a 25X speed up on one of our workloads compared to the union plus window function approach we were doing before. This makes make extremely happy!

This is kind of a strange github issue because I just want to ask some questions. I'm interested in potentially contributing to this and I don't want to just start bombarding you with stuff.

  1. If I make PRs would they get reviewed and merged? Likely PRs:
    1. Upgrade to Spark 3.5.0
    2. Switch to using SparkSession instead of SQLContext which is deprecated.
    3. Some little edgecase bug fixes.
  2. I'm confused why there is both doExecute and doProduce, since both seem to contain very similar logic. It seems like doExecute only gets used when joining more than 2 datframes or if the input dataframes have >= 98 columns (this part confuses me even more). Probably I'm just missing some context but I think its quite likely that all the little bugs I found would not have happened if there was a single implementation. I would be very interested to know why its like this.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.