Skip to content

Big Data project using Apache Beam with Java SDK to develop Google Page Rank System.

Notifications You must be signed in to change notification settings

pramod096/beam-java-s02g02

Repository files navigation

Google PageRank

         PageRank is an algorithm used by Google Search to display websites in their search engine results based on the rank of each website. PageRank is a way of measuring the importance of website pages by counting the number and quality of links to a page to determine a rough estimate of how important the website is. The underlying assumption is that more important websites are likely to receive more links from other websites.

Algorithm

         Consider a page A has pages T1…Tn which point to it. The parameter d is a damping factor which can be set between 0 and 1. We usually set d to 0.85. Also C(A) is defined as the number of links going out of page A. The PageRank of a page A can be determined as:

PR(A) = (1-d) + d (PR(T1)/C(T1) + … + PR(Tn)/C(Tn))

Team Members

Pramod Gonegari
Pramod Gonegari

SaiKiran Gangidi
SaiKiran Gangidi

Ramu Vallapurapu
Ramu Vallapurapu

Narendra Gunturu
Narendra Gunturu

Vivek Drakshapally
Vivek Drakshapally

Venkatesh Vemula
Venkatesh Vemula

Implementaion

         Each Team Member will implement Google PageRank using Apache Beam with their preferred language SDK, in their unique folders and will communicate with all Team Members about the individual approach and suggestions. The development is planned using our Project Board, Issues, Milestones and our weekly progress is communicated through Wiki.

Member Comments

Pramod Gonegari

         The mini internet I have implemented to build Google PageRank, contains web pages of different books and contain links to other related books.

          I have implemented Google PageRank using Apache Beam with Java SDK, in my own WorkSpace and my weekly progress is communicated through my wiki page.

Books Graph

Execution Command
mvn compile exec:java -D exec.mainClass=edu.nwmissouri.s2g2.pramod.PageRankJobOnePramod

Saikiran Reddy Gangidi

The Cricket Web I implemented Cricket web For which contains interconnection between different webpages like cricket,worldcup and etc Where I Build the Google PageRank for calculating the PageRank of different webpages in the web And I implemeting the Google PageRank Using Apache Beam with Java SDK In my own Workspace

The Weekly Work will be Updated or communicated by using the Wiki

Here is the Interconnection Diagram

CricketWeb

Here is the Execution Command

mvn compile exec:java -D exec.mainClass=edu.nwmissouri.s2g2.gangidi.JobOnePageRankGangidi

Ramu Vallapurapu

Link to Repository
link to Wiki
link to Individual Code Repo

Here is the Interconnection Diagram for social media pages.

socialmediaweb

  • In this project I am implementing Google PageRank on the social media pages.
  • It will show the most ranked social media page based on the usage on the internet
  • Google PageRank using Apache Beam with Java SDK
  • We can run the application using
mvn compile exec:java -D exec.mainClass=nwmissouri.s2g2.vallapurapu.PageRankJobOneVallapurapu 

Here is the Result after successful execution

CricketWeb

Venkatesh Vemula

Repository Link Wiki link Individual Code Folder Link

  • The Films Web is the project i have implemented using the different film industry data which contains data about the multiple insurtries.
  • Here I'm using Apache Beam with Java SDK

Here is the Execution Command

mvn compile exec:java -D exec.mainClass=edu.nwmissouri.s2g2.vemula.JobOneVemula 

Vivek Drakshapally

Repository Link Wiki link Individual Code Folder Link

  • In this project i started implementing Google PageRank on the Sports Web Pages.
  • Google PageRank using Apache Beam with Java SDK
  • We can run the application using mvn compile exec:java -D exec.mainClass=edu.nwmissouri.s2g2.drakshapally.JobOnePageRankDrakshapally

Narendra Gunturu

Repository Link Wiki link Individual Code Link

  • In This project I worked on Google page rank using Apache Beam with Different types of Vehicle's data.
  • Here I'm using Apache Beam with Java SDK
  • For Executing this Application mvn compile exec:java -D exec.mainClass=edu.nwmissouri.s2g2.gunturu.PageRankJobOneGunturu

About

Big Data project using Apache Beam with Java SDK to develop Google Page Rank System.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages