Exploring PageRank Algorithms: Power Iteration & Monte Carlo Methods

Brian Vargas

Masters Thesis

Exploring PageRank Algorithms: Power Iteration & Monte Carlo Methods

A search engine is intended to crawl the world wide web and retrieve a list of sites that match the user's search terms. Listing the search results in a proper order is crucial but it is far from being a trivial task. Prior to the rise of Google, search engines were notorious for presenting irrelevant information, and in turn, were not very optimal tools. It wasn't until around 1998 that Google co-founders Lawrence Page and Sergey Brin invented the PageRank algorithm for estimating the importance of a webpage which ultimately revolutionized internet searches. The PageRank algorithm is used to estimate the importance of a webpage based on the interconnection of the web [28]. Google's superiority came in the form of arranging the more important webpages to appear early in the search results. This way, it significantly reduced the amount of time a user had to sift through search results to find the sought-after information. Although PageRank is not the only algorithm for organizing search results today, it was the first algorithm used by Google. Today there are multiple algorithms working together to prioritize search results. The concept behind PageRank is similar to that of a democracy: each webpage can vote for the importance of other webpages by providing a link to that webpage. However, not every vote is worth the same! Because a webpage with more incoming links is more important than a webpage with less incoming links, the pages with a link from a page of high importance also holds importance. Since PageRank is a variation of an eigenvector problem, we begin with a review of key concepts from linear algebra in Chapter 1. Using this linear algebra theory, we show that the appropriate model will yield the existence and uniqueness of the ranking of webpages. In Chapter 2 we describe the data structures necessary for implementing the algorithm on a computer. In Chapter 3 we describe a model that can be used to synthesize graphs which closely resemble the properties of the world wide web. In Chapter 4, we explore the power iteration, a deterministic numerical algorithm to solve such an eigenvector problem. In Chapter 5, we explore and compare Monte Carlo methods used to solve the eigenvector problem from a probabilistic approach.

Date

2020-04-28

Resource Type

Masters Thesis

Creator

Brian Vargas

Advisor

Hansen, Olaf

Committee Member

Campus

San Marcos

College

Science, Technology, Engineering & Math

Department

Mathematics

Publisher

California State University, San Marcos

Degree Level

Masters

Degree Name

M.S.

Degree Program

Mathematics

Subjects

Date Submitted

2020-04-28

Date Accessioned

2020-04-28T19:30:23Z

Handle

http://hdl.handle.net/10211.3/215708

["Made available in DSpace on 2020-04-28T19:30:23Z (GMT). No. of bitstreams: 1 VargasBrian_Spring2020.pdf: 1036093 bytes, checksum: acb616aecaad9848626ee101ff6df708 (MD5)", "Submitted by Carmen Mitchell (cmitchell@csusm.edu) on 2020-04-28T18:31:41Z No. of bitstreams: 1 VargasBrian_Spring2020.pdf: 1036093 bytes, checksum: acb616aecaad9848626ee101ff6df708 (MD5)", "Approved for entry into archive by Carmen Mitchell (cmitchell@csusm.edu) on 2020-04-28T19:30:23Z (GMT) No. of bitstreams: 1 VargasBrian_Spring2020.pdf: 1036093 bytes, checksum: acb616aecaad9848626ee101ff6df708 (MD5)"]

Language

English

Thumbnail	Title	Date Uploaded	Visibility	Actions
	VargasBrian_Spring2020.pdf	2021-02-23	Public	Download

Downloadable Content

Exploring PageRank Algorithms: Power Iteration & Monte Carlo Methods