Skip to content

sidd1990/Big-Data-Exercises

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Big-Data-Exercises

The following are assignments of Big Data Course that I took at UTA.

Project 1

The purpose of this project is to develop a simple Map-Reduce program on Hadoop to multiply two sparse matrices.

Project 2

The goal is to finding connected users in social media datasets(Hadoop Java).

Develop a graph analysis program using Map-Reduce. An undirected graph is represented in the input text file using one line per graph vertex. For example, the line 1,2,3,4,5,6,7 represents the vertex with ID 1, which is connected to the vertices with IDs 2, 3, 4, 5, 6, and 7. My task was to write a Map-Reduce program that finds the connected components of any undirected graph and prints the size of these connected components. A connected component of a graph is a subgraph of the graph in which there is a path from any two vertices in the subgraph. For the above graph, there are two connected components: one 0,8,9 and another 1,2,3,4,5,6,7. The goal was to print the sizes of these connected components: 3 and 7.

Project 3

This project is about multiplying two matrices using Spark and Scala as development tools.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors