Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
distributed_computing:data_processing:hadoop:hdfs:small_files [2019/10/25 19:05] – [Solutions] phreazer | distributed_computing:data_processing:hadoop:hdfs:small_files [2019/10/25 19:55] (current) – [Solutions] phreazer | ||
---|---|---|---|
Line 2: | Line 2: | ||
Memory overhead | Memory overhead | ||
- | * File/ | + | * File/ |
* Namenode is limited by main memory | * Namenode is limited by main memory | ||
Line 11: | Line 11: | ||
* Consolidator | * Consolidator | ||
* HBase: Stores data in indexed SequenceFiles (HBase) | * HBase: Stores data in indexed SequenceFiles (HBase) | ||
+ | * Spark compaction: https:// | ||
+ | * Filecrush: https:// |