Here you can find all my discoveries on Github, projects I starred and liked or you can visit my personal Github profile.
fast-data-dev
Kafka Docker for development. Kafka, Zookeeper, Schema Registry, Kafka-Connect, , 20+ connectors
Check on Githubxgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Check on Githubtokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Check on Githubminio
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
Check on Githubprophet
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
Check on Github