Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.
-
Updated
Dec 18, 2023 - Java
Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.
Visual, interactive queries against big databases
a suite of benchmark applications for distributed data stream processing systems
Eskimo is a state of the art Big Data Infrastructure and Management Web Console to build, manage and operate Big Data 2.0 Analytics clusters on Kubernetes. This is the git repository of Eskimo Community Edition.
Workflow management system for the automated and distributed analysis of large-scale experimental data.
Real-time visual analytics for soccer matches, leveraging Apache Flink, Apache Kafka and the Elastic stack. Solution to DEBS 2013 Grand Challenge. Coursework in Systems and Architectures for Big Data 2016/2017.
Real-time visual analytics application for link prediction and link detection in criminal networks. Research work accepted to the 5th IEEE International Conference on Future of Internet of Things and Cloud (FiCloud 2017).
A general-purpose data analysis engine radically changing the way batch and stream data is processed
《DNA元基催化与肽计算》 在进化计算中, 软件函数文件进行 DNA 语义元基索引编码的 PDE 新陈代谢优化方式, 是一种有效的进化方式.
BigFuzz: Efficient Fuzz Testing for Data Analytics using Framework Abstraction (ASE 2020)
CS6240 - Large Scale Parallel Processing Course at Northeastern University
Reservoir Sampling for Group-By Queries in Flink Platform. Answering effectively Single Aggregate.
Big data analytics using Hadoop on GDELT global news dataset.
Adaptive Decision Forest(ADF) is an incremental machine learning framework called to produce a decision forest to classify new records. ADF is capable to classify new records even if they are associated with previously unseen classes. ADF also is capable of identifying and handling concept drift; it, however, does not forget previously gained kn…
Real-time social media analytics application that monitors posts and users popularity, leveraging Apache Flink. Research work accepted to the 10th ACM International Conference on Distributed and Event-Based Systems (DEBS 2015).
White-Box Testing of Big Data Analytics with Complex User-Defined Function (FSE 2019)
Map/Reduce application that analyzes movie ratings collected by Movielens, leveraging Hadoop MapReduce, Hadoop Distributed File System and Apache Flume. Coursework in Structures and Architectures for Big Data 2016/2017.
Big Data Hadoop framework project for analysis of superstore sales data to find insights.
Add a description, image, and links to the big-data-analytics topic page so that developers can more easily learn about it.
To associate your repository with the big-data-analytics topic, visit your repo's landing page and select "manage topics."