The Hadoop Distributed File System: Architecture and Design (Summary)

This is a summary of the original document at the link below, condensed around its key points.


Original link:
http://hadoop.apache.org/core/docs/current/hdfs_design.html



■ Introduction
    - HDFS is highly fault-tolerant and is designed to be deployed on low-cost hardware.
    - HDFS provides high-throughput access to application data and is suitable for applications that have large data sets.

■ Assumptions and Goals
   - Hardware Failure : detection of faults and quick, automatic recovery
   - Streaming Data Access : It is designed more for batch processing than for interactive use by users.
   - Large Data Sets : It provides high aggregate data bandwidth and scales to hundreds of nodes in a single cluster.
   - Simple Coherency Model : write-once-read-many access model for files
   - "Moving Computation is Cheaper than Moving Data" : It is often better to migrate the computation closer to where the data is located rather than moving the data to where the application is running. HDFS provides interfaces for applications to move themselves closer to where the data is located.
   - Portability Across Heterogeneous Hardware and Software Platforms : HDFS is easily portable from one platform to another.
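
One concrete interface for data locality is the FileSystem client API, which can report which DataNodes hold each block of a file. The following is a minimal sketch under that assumption; the path /data/input.txt is hypothetical, and a standard Hadoop client configuration (core-site.xml / hdfs-site.xml on the classpath) is assumed.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.BlockLocation;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    // Sketch: ask HDFS where the blocks of a file live, so an application
    // can run its work on those hosts instead of pulling data over the network.
    public class BlockLocality {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();   // reads the Hadoop client config
            FileSystem fs = FileSystem.get(conf);

            Path file = new Path("/data/input.txt");    // hypothetical path, for illustration only
            FileStatus status = fs.getFileStatus(file);

            // One BlockLocation per block; each lists the DataNodes holding a replica.
            BlockLocation[] blocks =
                    fs.getFileBlockLocations(status, 0, status.getLen());

            for (BlockLocation block : blocks) {
                System.out.printf("offset=%d length=%d hosts=%s%n",
                        block.getOffset(), block.getLength(),
                        String.join(",", block.getHosts()));
            }
            fs.close();
        }
    }

Frameworks built on HDFS, such as MapReduce, use this same block-location information to schedule tasks on or near the DataNodes that hold the relevant data.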



'Development > Hadoop, NoSQL, BigData' 카테고리의 다른 글

Redis 설치 방법 세 가지  (0) 2021.04.21
Docker기반 Spark Cluster 설치하기  (6) 2020.12.15
Local AirFlow 설치하기  (0) 2020.12.09
brew로 local zeppelin 설치하기  (0) 2020.11.30
HBase에서 HQL 사용하기  (0) 2008.07.25
HBase 설치/설정하기  (0) 2008.07.25
HDFS 설정하기  (0) 2008.07.25
Hadoop Installation on Ubuntu Linux 7.10  (0) 2008.07.17

댓글

Designed by JB FACTORY