Tags pagerank-ermitteln-gratis-dokumen

Pig Latin: A Not-So-Foreign Language for Data Processing

spam_urls = FILTER urls BY isSpam(url);
culprit_urls = FILTER spam_urls BY pagerank > 0.8;

where culprit_urls contains the final set of urls that the user is interested in. The above Pig Latin fragment suggests that we first find the spam urls through the function isSpam, and then filter them by page
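The two-step filter in the fragment above can be mimicked in plain Python to make the data flow concrete. This is only a sketch under stated assumptions: the sample records, the `(url, pagerank)` tuple layout, and the body of `is_spam` are hypothetical stand-ins, since the snippet does not show how `urls` is loaded or how `isSpam` is defined.

```python
# Hypothetical input: (url, pagerank) records standing in for the `urls` relation.
urls = [
    ("http://example.com/a", 0.95),
    ("http://spam.example/buy-now", 0.90),
    ("http://spam.example/cheap", 0.30),
    ("http://example.org/b", 0.10),
]


def is_spam(url: str) -> bool:
    """Placeholder for the snippet's isSpam (assumed here: a substring check)."""
    return "spam." in url


# Step 1, mirrors: spam_urls = FILTER urls BY isSpam(url);
spam_urls = [(url, rank) for url, rank in urls if is_spam(url)]

# Step 2, mirrors: culprit_urls = FILTER spam_urls BY pagerank > 0.8;
culprit_urls = [(url, rank) for url, rank in spam_urls if rank > 0.8]

print(culprit_urls)
```

Each statement names an intermediate result, which is the step-by-step style the fragment illustrates: the second filter runs only over records that passed the first.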

Design Patterns for Scientific Applications in DryadLINQ

Performance, Design, Languages.

Keywords: Dryad, DryadLINQ, MapReduce, Design Pattern, SW-G, Matrix Multiply, PageRank

1. INTRODUCTION

We are in a Big Data era. The rapid growth of information in science requires the processing of large amounts of scientific data.

Unifying Data-Parallel and Graph-Parallel Analytics

We express the Pregel and GraphLab abstractions using the GraphX operators in less than 50 lines of code. By composing these operators we can construct entire graph-analytics pipelines. ... [Figure: pipeline diagram with the labels ETL, Slice, Compute, Repeat, Initial, Subgraph, PageRank, Graph, Analyze]
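To give a feel for the vertex-centric (Pregel-style) abstraction mentioned above, here is a minimal PageRank sketch in plain Python. It does not use GraphX or reproduce its operators. The function name `pregel_style_pagerank`, the edge-list input, the damping factor of 0.85, and the fixed number of supersteps are all assumptions made for illustration.

```python
def pregel_style_pagerank(edges, num_supersteps=20, damping=0.85):
    """Run PageRank as repeated supersteps over a directed edge list.

    Assumption: every vertex has at least one outgoing edge; rank held by
    dangling vertices would simply be dropped in this simplified sketch.
    """
    vertices = sorted({v for edge in edges for v in edge})
    out_neighbors = {v: [] for v in vertices}
    for src, dst in edges:
        out_neighbors[src].append(dst)

    n = len(vertices)
    rank = {v: 1.0 / n for v in vertices}

    for _ in range(num_supersteps):
        # Message phase: each vertex sends rank / out_degree along its out-edges.
        inbox = {v: 0.0 for v in vertices}
        for v in vertices:
            if out_neighbors[v]:
                share = rank[v] / len(out_neighbors[v])
                for dst in out_neighbors[v]:
                    inbox[dst] += share

        # Compute phase: each vertex updates its rank from the summed messages.
        rank = {v: (1.0 - damping) / n + damping * inbox[v] for v in vertices}

    return rank


# Hypothetical three-vertex cycle: a -> b -> c -> a
ranks = pregel_style_pagerank([("a", "b"), ("b", "c"), ("c", "a")])
print(ranks)
```

On a symmetric cycle like this one every vertex ends up with the same rank, which makes it a convenient sanity check before trying less regular graphs.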