Gluster’s got it wrong
“GlusterFS replication can happen on just 2 nodes as a minimum, as opposed to 3 with HDFS.” So this little tidbit was tucked into the Gluster marketing material for 3.3 Note that we use Gluster internally and it’s been a pretty solid system. That said, they need to do a little more research before they […]
Greenplum vs Hadoop Disk Space
I’ve been spending a whole lot of time calculating Greenplum vs Hadoop disk usage. So here the general equation (MaxAllocFactor * DiskSize * ( #Disk – RaidDisks ) ) / ReplicationFactor MaxAllocFactor = Max recommended allocation. 70% for Greenplum and 75% for Hadoop DiskSize = Size of your drive #Disk = Number of drives RaidDisks […]