selected_top_l2


"/home/yossef/notes/Su/selected_top/selected_top_l2.md"

path: Su/selected_top/selected_top_l2.md

- **fileName**: selected_top_l2
- **Created on**: 2025-04-12 14:59:08

Big Data Challenges: The V’s:

  1. Volume: amount of data generated
  2. Variety: all kinds of data are generated (text,
    image, voice, time series, etc.)
  3. Velocity: rate at which data are produced and
    should be processed
  4. Veracity: noise/anomalies in data, truthfulness
  5. Value: how we extract/learn valuable
    knowledge from the data
  6. Various: data come from all kinds of sources

what is Vertical Scaling (Scaling Up)?

Increasing processing power by adding more resources to an existing node

what is Vertical Scaling (pros and cons)?

what is Horizontal Scaling (Scaling Out)?

Increasing processing power by adding more nodes to the system

what is Horizontal Scaling (pros and cons)?

what is scaling up, scaling out?

Scaling up: a single powerful computer is upgraded with more CPU cores,
memory, and hard disk.
Scaling out: the task is divided among a large number of less powerful machines,
each with a slower CPU, less memory, and less hard disk space.
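
A minimal sketch of the scaling-out idea, assuming the "task" is summing a list of numbers and that each "node" is just a function call (in a real cluster the chunks would be processed on separate machines in parallel). The names `split_into_chunks`, `node_work`, and `scale_out_sum` are hypothetical and used only for illustration.

```python
def split_into_chunks(items, num_nodes):
    """Divide the items into num_nodes chunks, one per (hypothetical) node."""
    return [items[i::num_nodes] for i in range(num_nodes)]


def node_work(chunk):
    """Work done by a single node: here, summing its own chunk."""
    return sum(chunk)


def scale_out_sum(items, num_nodes):
    """Split the task, let every node compute a partial result, then combine."""
    chunks = split_into_chunks(items, num_nodes)
    # Assumption: on a real cluster these calls would run in parallel.
    partial_results = [node_work(chunk) for chunk in chunks]
    return sum(partial_results)


if __name__ == "__main__":
    data = list(range(1, 101))
    print(scale_out_sum(data, num_nodes=4))
```

Adding more nodes only changes `num_nodes`; each node still handles a small share of the data, which is why less powerful machines can be used.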

A hierarchical infrastructure

what are the 3 major challenges posed by cluster architecture?

what is cloud computing?

A model for enabling convenient, on-demand network access to a shared pool of
configurable computing resources (e.g., networks, servers, storage,
applications, and services) that can be rapidly provisioned and released with
minimal management effort or service provider interaction.

what are the essential characteristics of cloud computing?

what are the service models of cloud computing?

what are the deployment models of cloud computing?

what is MapReduce?

A programming model for processing big data sets in parallel.

Consists of three main components:

what is MapReduce used for or designed for?

what are the 3 steps of MapReduce?

  1. Map:
    - The Mapper applies the Map function to a single input element
      and emits key-value pairs
  2. Group by key: sort and shuffle
    - The system sorts all the key-value pairs by key and outputs
      key-(list of values) pairs
  3. Reduce:
    - The user-written Reduce function is applied to each
      key-(list of values) pair
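
The three steps above can be sketched as a small word count in plain Python. This is only an in-memory illustration, assuming the input is a list of text lines. It is not how a real MapReduce framework is called, and the names `map_fn`, `group_by_key`, `reduce_fn`, and `map_reduce` are hypothetical.

```python
from collections import defaultdict


def map_fn(line):
    """Map: emit a (word, 1) pair for every word in one input element."""
    return [(word.lower(), 1) for word in line.split()]


def group_by_key(pairs):
    """Group by key: sort the pairs by key and collect the values per key."""
    grouped = defaultdict(list)
    for key, value in sorted(pairs):
        grouped[key].append(value)
    return dict(grouped)


def reduce_fn(key, values):
    """Reduce: combine the list of values for one key into a single result."""
    return key, sum(values)


def map_reduce(lines):
    """Run Map, then Group by key, then Reduce over the input lines."""
    mapped = [pair for line in lines for pair in map_fn(line)]
    grouped = group_by_key(mapped)
    return dict(reduce_fn(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    docs = ["big data is big", "data is valuable"]
    print(map_reduce(docs))
    # prints {'big': 2, 'data': 2, 'is': 2, 'valuable': 1}
```

Only `map_fn` and `reduce_fn` correspond to code the user writes; the grouping step stands in for the sort and shuffle that the system performs.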

what does the MapReduce environment take care of?