Big Data ramp-up
Updated Nov 22, 2016 - Java
Computed truck mileage and driver risk factors using Hive and Pig to assess the company's exposure to risk from driver fatigue and overused trucks, then visualized the sensor data in Tableau to observe how these factors affect driver performance.
Dojo for Scala and Hadoop
Bash interface to HDFS through the WebHDFS API
Advanced Topics Databases, NTUA 2019-2020
Kookmin Univ. Computer Science, Practices in Big Data, Homework
HQuery codebase. HQuery provides an easy and effective interface through which business users can interact with Hadoop: submit jobs, check their status, and export the results in the format they prefer.
File merge action to merge files in HDFS or local filesystem
A TF-IDF (Term Frequency-Inverse Document Frequency) based search algorithm for a small subset of Wikipedia data, running on a 3-node Apache Spark cluster on top of HDFS, hosted on AWS, with a Django web UI.
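For readers unfamiliar with TF-IDF ranking, the idea can be sketched in a few lines of plain Python. This is only an illustration under simple assumptions (whitespace tokenization, unsmoothed `log(N / df)` weighting, an in-memory dictionary of documents); the function names `build_index` and `search` are hypothetical and are not taken from the repository, which runs on Spark and HDFS rather than a single process.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase + whitespace split is enough for this sketch.
    return text.lower().split()


def build_index(docs):
    """docs maps doc_id -> text. Returns (term counts per doc, idf per term)."""
    tf = {doc_id: Counter(tokenize(text)) for doc_id, text in docs.items()}
    # Document frequency: in how many documents each term appears.
    df = Counter()
    for counts in tf.values():
        df.update(counts.keys())
    n = len(docs)
    idf = {term: math.log(n / d) for term, d in df.items()}
    return tf, idf


def search(query, tf, idf):
    """Return doc ids ordered by summed TF-IDF score over the query terms."""
    terms = tokenize(query)
    scores = {}
    for doc_id, counts in tf.items():
        total = sum(counts.values())
        if total == 0:
            continue
        score = sum((counts[t] / total) * idf.get(t, 0.0) for t in terms)
        if score > 0:
            scores[doc_id] = score
    return sorted(scores, key=scores.get, reverse=True)
```

A term that appears in every document gets an IDF of zero under this weighting, so it contributes nothing to the ranking; production systems often use a smoothed variant instead.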
Twitter feed analysis using Hive to find the most influential people, the time zones in which most users are available, and the most common hashtags.
Google PageRank algorithm implementation using Hadoop MapReduce on the Simple English Wikipedia corpus
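As a rough illustration of how PageRank maps onto map and reduce phases, here is a single-process Python sketch. It is not the repository's Hadoop code: the `pagerank` function, the damping factor of 0.85, the fixed iteration count, and the even redistribution of rank from pages without outgoing links are all assumptions chosen for the example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """links maps page -> list of pages it links to. Returns page -> rank."""
    # Collect every page, including ones that only appear as link targets.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)
    ranks = {p: 1.0 / n for p in pages}

    for _ in range(iterations):
        # "Map" step: each page emits an equal share of its rank per out-link.
        contributions = {p: 0.0 for p in pages}
        dangling = 0.0
        for page in pages:
            targets = links.get(page, [])
            if targets:
                share = ranks[page] / len(targets)
                for target in targets:
                    contributions[target] += share
            else:
                # Assumption: rank of pages with no out-links is spread evenly.
                dangling += ranks[page]

        # "Reduce" step: sum incoming shares per page and apply damping.
        ranks = {
            p: (1 - damping) / n + damping * (contributions[p] + dangling / n)
            for p in pages
        }
    return ranks
```

In a Hadoop job each iteration would typically be one MapReduce pass over the link graph, with the ranks written back to HDFS between passes.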
A scalable video sharing platform backed by HDFS.