owencwl Goto Github PK
Name: owen
Type: User
Bio: Master of Computer Science. cloud | big data | java | python | AI
Location: Changsha, China
Name: owen
Type: User
Bio: Master of Computer Science. cloud | big data | java | python | AI
Location: Changsha, China
A port of Snappy, LZO, LZ4, and Zstandard to Java
A collection of algorithms and data structures
本项目集成了全网优秀的攻防武器工具项目,包含自动化利用,子域名、目录扫描、端口扫描等信息收集工具,各大中间件、cms漏洞利用工具,爆破工具、内网横向及免杀、社工钓鱼以及应急响应等资料。
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.
A hadoop archiving tool that can reduce file counts and compress inputs using XZ compression
ARES Stduio
Docker images for building hadoop3.2, hive 3.1, hbase2.3, presto 0.247, flink1.11.3 on yarn, etc.
Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.
Some notes on things I find interesting and important.
A high performance caching library for Java
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
CloudMoe Windows 10/11 Activation Toolkit get digital license, the best open source Win 10/11 activator in GitHub. GitHub 上最棒的开源 Win10/Win11 数字权利(数字许可证)激活工具!
车辆分析之区域碰撞
简单易用的随机数据生成器。生成各种比较真实的假数据。一般用于开发和测试阶段的数据填充模拟。支持各类**特色本地化的数据格式。An easy-to use random data generator. Generally used for data filling, simulation, demonstration and other scenarios in the development and test phase.
CrateDB is a distributed SQL database that makes it simple to store and analyze massive amounts of machine data in real-time.
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
数据仓库和用户画像
为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能
machine learning and deep learning
《设计模式就该这样学,基于经典框架源码和真实业务场景》随书代码示例工程
协同开发规范
Distributed File System server implemented using RAFT Algorithm.
DorisDB SQL解析器Java实现;Clickhouse SQL解析器Java实现
Official electron build of diagrams.net
🦉Data Version Control | Git for Data & Models | ML Experiments Management
open source data lake build on top of apache iceberg
elasticsearch-custom-query-demo
在es中,使用RoaringBitmap精确去重,能够在秒级返回
elasticsearch max speed aggragator plugin by timestamp(long) and geo_point
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.