SQL stream processing, analytics, and management. We decouple storage and compute to offer efficient joins, instant failover, dynamic scaling, speedy bootstrapping, and concurrent query serving.
-
Updated
Jul 26, 2024 - Rust
SQL stream processing, analytics, and management. We decouple storage and compute to offer efficient joins, instant failover, dynamic scaling, speedy bootstrapping, and concurrent query serving.
Spark log analyser, merging Apache with Application logs to analyse users' request to Apache and response from App
An open source framework for building data analytic applications.
This repo contains implementations of PySpark for real-world use cases for batch data processing, streaming data processing sourced from Kafka, sockets, etc., spark optimizations, business specific bigdata processing scenario solutions, and machine learning use cases.
Ophelian On Mars! More than a simple framework.
🏆 Spark4You Design patterns
This project demonstrates data cleaning, processing with Apache Spark and Apache Flink, both locally and on AWS EMR.
Big Data Applications from different fields
The U.S. Department of Transportation's (DOT) Bureau of Transportation Statistics tracks the on-time performance of domestic flights operated by large air carriers. Summary information on the number of on-time, delayed, canceled, and diverted flights is published in DOT's monthly Air Travel Consumer Report and in this dataset of 2015 flight dela…
Spark and all things Streaming
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
Big Data projects for beginners
ETL With Apache Spark Deployed on K8s
This repository is dedicated to my participation in Datatalks Mlzoomcamp
Explore real-time temperature data analysis using Apache Spark Streaming. This repository provides a sample solution for processing streaming data, performing analytics, and visualizing insights from temperature sensor data.
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Real-Time Monitor Panel for Systems Infected by a Keylogger.
Add a description, image, and links to the spark-streaming topic page so that developers can more easily learn about it.
To associate your repository with the spark-streaming topic, visit your repo's landing page and select "manage topics."