fast-data-dev
Kafka Docker for development. Kafka, Zookeeper, Schema Registry, Kafka-Connect, , 20+ connectors
Check on Githubxgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Check on Githubtokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Check on Githubminio
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
Check on Githubmlflow
The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.
Check on Githubprophet
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
Check on Github